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PREFACE 


or several generations of students and teachers, 
Biochemistry has been an invaluable resource, pre- 
senting the concepts and details of molecular structure, 
metabolism, and laboratory techniques in a streamlined 
and engaging way. Biochemistry’s success in helping 
students learn the subject for the first time is built on a 


number of hallmark features: 


¢ Clear writing and simple illustrations. The lan- 
guage of biochemistry is made as accessible as possi- 


ble for students learning the subject for the first time. 


To complement the straightforward language and 
organization of concepts in the text, figures illustrate 
a single concept at a time to help students see main 
points without the distraction of excess detail. 


¢ Physiological relevance. It has always been our 
goal to help students connect biochemistry to their 
own lives on a variety of scales. Pathways and pro- 
cesses are presented in a physiological context so 
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Figure 27.12 An idealized representation of fuels use as a function 


of aerobic exercise intensity. (A) With increased exercise intensity, the 
use of fats as fuels falls as the utilization of glucose increases. (B) The 
respiratory quotient (RQ) measures the alteration in fuel use. 


students can see how biochemistry works in the 
body and under different conditions, and Clinical 
Application sections in every chapter show students 
how the concepts they are studying impact human 
health. The eighth edition includes a number of new 
Clinical Application sections based on recent dis- 
coveries in biochemistry and health. (For a full list, 
see p. x1) 


¢ Evolutionary perspective. Discussions of evolution 


are woven into the narrative of the text, just as evolu- 
tion shapes every pathway and molecular structure 
described in the text. Molecular Evolution sections 
highlight important milestones in the evolution of 
life as a way to provide context for the processes and 
molecules being discussed. (For a full list, see p. x) 


Problem-solving practice. Every chapter of 
Biochemistry provides numerous opportunities for 
students to practice problem-solving skills and apply 
the concepts described in the text. End-of-chapter 
problems are divided into three categories to address 
different problem-solving skills: Mechanism prob- 
lems ask students to suggest or describe a chemical 
mechanism; Data interpretation problems ask stu- 
dents to draw conclusions from data taken from real 
research papers; and chapter integration problems 
require students to connect concepts from across 
chapters. Further problem-solving practice is pro- 
vided online, on the Biochemistry LaunchPad. (For 
more details on LaunchPad resources, see p. viii) 


A variety of molecular structures. All molecular 
structures in the book, with few exceptions, have been 
selected and rendered by Jeremy Berg and Gregory 
Gatto to emphasize the aspect of structure most impor- 
tant to the topic at hand. Students are introduced to 
realistic renderings of molecules through a molecular 
model “primer” in the appendices to Chapters 1 and 2 
so they are well-equipped to recognize and interpret 
the structures throughout the book. Figure legends 
direct students explicitly to the key features of a model, 
and often include PDB numbers so the reader can 
access the file used in generating the structure from the 
Protein Data Bank website (www.pdb.org). Students 
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Figure 9.48 Single molecule motion. (A) A trace of the position of a single dimeric myosin V molecule as it moves across a 
surface coated with actin filaments. (B) A model of how the dimeric molecule moves in discrete steps with an average size of 
74 + 5 nm. [Data from A. Yildiz et al., Science 300(5628)206 1-2065, 2003.] 


can explore molecular structures further online through 


the Living Figures, in which they can rotate 3D models 


of molecules and view alternative renderings. 


In this revision of Biochemistry, we focused on build- 
ing on the strengths of the previous editions to present 
biochemistry in an even more clear and streamlined 
manner, as well as incorporating exciting new advances 
from the field. Throughout the book, we have updated 
explanations of basic concepts and bolstered them with 
examples from new research. Some new topics that we 
present in the eighth edition include: 


Environmental factors that influence human 
biochemistry (Chapter 1) 


Genome editing (Chapter 5) 


Horizontal gene transfer events that may explain unex- 
pected branches of the evolutionary tree (Chapter 6) 


Penicillin irreversibly inactivating a key enzyme in 
bacterial cell-wall synthesis (Chapter 8) 


e 
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Scientists watching single molecules of myosin 
move (Chapter 9) 


Glycosylation functions in nutrient sensing 
(Chapter 11) 


The structure of aSNARE complex (Chapter 12) 
The mechanism of ABC transporters (Chapter 13) 
The structure of the gap junction (Chapter 13) 


The structural basis for activation of the B-adrenergic 
receptor (Chapter 14) 


Excessive fructose consumption can lead to patho- 
logical conditions (Chapter 16) 


Alterations in the glycolytic pathway by cancer cells 
(Chapter 16) 


Regulation of mitochondrial ATP synthase 
(Chapter 18) 


Control of chloroplast ATP synthase (Chapter 19) 


Activation of rubisco by rubisco activase (Chapter 20) 


Figure 12.39 SNARE complexes initiate membrane fusion. The SNARE protein synaptobrevin 
(yellow) from one membrane forms a tight four-helical bundle with the corresponding SNARE 
proteins syntaxin-1 (blue) and SNAP25 (red) from a second membrane. The complex brings the 
membranes close together, initiating the fusion event. [Drawn from 1SFC.pdb.] 


¢ The role of the pentose phosphate pathway in rapid 
cell growth (Chapter 20) 


* Biochemical characteristics of muscle fiber types 
(Chapter 21) 


¢ Alteration of fatty acid metabolism in tumor cells 
(Chapter 22) 

* Biochemical basis of neurological symptoms of 
phenylketonuria (Chapter 24) 

* Ribonucleotide reductase as a chemotherapeutic 
target (Chapter 25) 
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The role of excess choline in the development of 
heart disease (Chapter 26) 

Cycling of the LDL receptor is regulated (Chapter 26) 
The role of ceramide metabolism in stimulating 
tumor growth (Chapter 26) 

The extraordinary power of DNA repair systems 
illustrated by Deinococcus radiodurans (Chapter 28) 


The structural details of ligand binding by TLRs 
(Chapter 34) 


MEDIA AND ASSESSMENT 


data, developing critical thinking skills, connecting 


je LaunchPad topics, and applying knowledge to real scenarios. 
We also provide instructional guidance with each 

All of the new media resources for Biochemistry will be case study (with suggestions on how to use the case 

available in our new system. in the classroom) and aligned assessment questions 


www.macmillanhighered.com/launchpad/berg8e for quizzes and exams. 


Newly Updated Clicker Questions allow instruc- 


LaunchPad is a dynamic, fully integrated learning tors to integrate active learning in the classroom and 


environment that brings together all of our teaching and to assess students’ understanding of key concepts 


learning resources in one place. It also contains the fully during lectures. Available in Microsoft Word and 
interactive e-Book and other newly updated resources PowerPoint (PPT). 


for students and instructors, including the following: 


Newly Updated Lecture PowerPoints have been 

developed to minimize preparation time for new 

users of the book. These files offer suggested lectures 
including key illustrations and summaries that 
instructors can adapt to their teaching styles. 


* Updated Layered PPTs deconstruct key 
concepts, sequences, and processes from the 
textbook images, allowing instructors to pres- 
ent complex ideas step-by-step. 


Updated Textbook Images and Tables are 
offered as high-resolution JPEG files. Each 
image has been fully optimized to increase 
type sizes and adjust color saturation. These 
images have been tested in a large lecture hall 
to ensure maximum clarity and visibility. 


* NEW Case Studies are a series of biochemistry 
case studies you can integrate into your course. Each 
case study gives students practice in working with 


The Clinical Companion, by Gregory 
Raner, The University of North Carolina at 
Greensboro and Douglas Root, University of 
North Texas, applies concepts that students 
have learned in the book to novel medical situ- 
ations. Students read clinical case studies and 
use basic biochemistry concepts to solve the 
medical mysteries, applying and reinforcing 
what they learn in lecture and from the book. 


Hundreds of self-graded practice prob- 
lems allow students to test their understanding 
of concepts explained in the text, with immedi- 
ate feedback. 


The Metabolic Map helps students under- 
stand the principles and applications of the 
core metabolic pathways. Students can work 
through guided tutorials with embedded 
assessment questions, or explore the Metabolic 
Map on their own using the dragging and 
zooming functionality of the map. 


CS 


Figure 34.3 Recognition of a PAMP by a Toll-like receptor. The structure 
of TLR3 bound to its PAMP, a fragment of double-stranded RNA, as seen from 
the side (top) and from above (bottom). Notice that the PAMP induces receptor * Jmol tutorials by Jeffrey Cohlberg, California 
dimerization by binding the surfaces on the side of each of the extracellular State University at Long Beach, teach students 
domains. [Drawn from 3ClY.pdb]. 
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how to create models of proteins in Jmol based 

on data from the Protein Data Bank. By working 
through the tutorial and answering assessment ques- 
tions at the end of each exercise, students 

learn to use this important database and fully 

realize the relationships between the structure 

and function of enzymes. 


Living figures allow students to explore protein 
structure in 3-D. Students can zoom and rotate the 
“live” structures to get a better understanding of 
their three-dimensional nature and can experiment 
with different display styles (space-filling, ball-and- 
stick, ribbon, backbone) by means of a user-friendly 
interface. 


Concept-based tutorials by Neil D. Clarke help 
students build an intuitive understanding of some of 
the more difficult concepts covered in the textbook. 


Animated techniques help students grasp experi- 
mental techniques used for exploring genes and 
proteins. 


NEW animations show students biochemical pro- 
cesses in motion. The eighth edition includes many 
new animations. 


Online end-of-chapter questions are assignable 
and self-graded multiple-choice versions of the 


end-of-chapter questions in the book, giving stu- 
dents a way to practice applying chapter content in 
an online environment. 


¢ Flashcards are an interactive tool that allows 
students to study key terms from the book. 


¢ LearningCurve is a self-assessment tool that helps 
students evaluate their progress. Students can test 
their understanding by taking an online multiple- 
choice quiz provided for each chapter, as well as a 
general chemistry review. 


Updated Student Companion 


[1-4641-8803-3] 
For each chapter of the textbook, the Student Companion 
includes: 


* Chapter Learning Objectives and Summary 


¢ Self-Assessment Problems, including multiple- 
choice, short-answer, matching questions, and chal- 
lenge problems, and their answers 


* Expanded Solutions to end-of-chapter problems 
in the textbook 


MOLECULAR EVOLUTION 


This icon signals the start of the many discussions that highlight protein 
commonalities or other molecular evolutionary insights. 


Only L amino acids make up proteins (p. 29) 

Why this set of 20 amino acids? (p. 35) 

Sickle-cell trait and malaria (p. 206) 

Additional human globin genes (p. 208) 

Catalytic triads in hydrolytic enzymes (p. 258) 
Major classes of peptide-cleaving enzymes (p. 260) 


Common catalytic core in type II restriction enzymes 
(p. 275) 


P-loop NTPase domains (p. 280) 

Conserved catalytic core in protein kinases (p. 298) 

Why do different human blood types exist? (p. 331) 
Archaeal membranes (p. 346) 

Ion pumps (p. 370) 

P-type ATPases (p. 374) 

ATP-binding cassettes (p. 374) 

Sequence comparisons of Na* and Ca’* channels (p. 382) 
Small G proteins (p. 414) 

Metabolism in the RNA world (p. 444) 

Why is glucose a prominent fuel? (p. 451) 

NAD* binding sites in dehydrogenases (p. 465) 

Isozymic forms of lactate dehydrogenase (p. 487) 
Evolution of glycolysis and gluconeogenesis (p. 487) 

The a-ketoglutarate dehydrogenase complex (p. 505) 
Domains of succinyl CoA synthetase (p. 507) 

Evolution of the citric acid cycle (p. 516) 

Mitochondrial evolution (p. 525) 

Conserved structure of cytochrome c (p. 541) 

Common features of ATP synthase and G proteins (p. 548) 


Pigs lack uncoupling protein 1 (UCP-1) and brown fat 
(p. 556) 


Related uncoupling proteins (p. 556) 
Chloroplast evolution (p. 568) 

Evolutionary origins of photosynthesis (p. 584) 
Evolution of the C4 pathway (p. 601) 


The relationship of the Calvin cycle and the pentose 
phosphate pathway (p. 610) 

Increasing sophistication of glycogen phosphorylase 
regulation (p. 629) 


Glycogen synthase is homologous to glycogen 
phosphorylase (p. 631) 


A recurring motif in the activation of carboxyl groups 
(p. 649) 


Prokaryotic counterparts of the ubiquitin pathway and the 
proteasome (p. 686) 


A family of pyridoxal-dependent enzymes (p. 692) 
Evolution of the urea cycle (p. 696) 
The P-loop NTPase domain in nitrogenase (p. 716) 


Conserved amino acids in transaminases determine amino 
acid chirality (p. 721) 


Feedback inhibition (p. 731) 

Recurring steps in purine ring synthesis (p. 749) 
Ribonucleotide reductases (p. 755) 

Increase in urate levels during primate evolution (p. 761) 


Deinococcus radiodurans illustrates the power of DNA 
repair systems (p. 828) 


DNA polymerases (p. 829) 
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Sigma factors in bacterial transcription (p. 865) 
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(p. 876) 
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Composition of the primordial ribosome (p. 903) 
Homologous G proteins (p. 908) 


A family of proteins with common ligand-binding domains 
(p. 930) 


The independent evolution of DNA-binding sites of 
regulatory proteins (p. 931) 


Key principles of gene regulation are similar in bacteria and 
archaea (p. 937) 


CpG islands (p. 949) 

Iron-response elements (p. 955) 

miRNAs in gene evolution (p. 957) 
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The immunoglobulin fold (p. 988) 
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CLINICAL APPLICATIONS 


This icon signals the start of a clinical application in the text. Additional, briefer 


clinical correlations appear in the text as appropriate. 
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Biochemistry: An Evolving Science 


Chemistry in action. Human activities require energy. The 
interconversion of different forms of energy requires large 
biochemical machines comprising many thousands of atoms such 
as the complex shown above. Yet, the functions of these elaborate 
assemblies depend on simple chemical processes such as the 
protonation and deprotonation of the carboxylic acid groups shown 
on the right. The photograph is of Nobel Prize winners Peter Agre, M.D., 
and Carol Greider, Ph.D., who used, respectively, biochemical 
techniques to reveal key mechanisms of how water is transported 
into and out of cells, and how chromosomes are replicated faithfully. 
[Keith Weller for Johns Hopkins Medicine.] 


iochemistry is the study of the chemistry of life processes. Since the OUTLINE 


discovery that biological molecules such as urea could be synthesized 
from nonliving components in 1828, scientists have explored the chemistry 1.1 Biochemical Unity Underlies 
of life with great intensity. Through these investigations, many of the most Biological Diversity 
fundamental mysteries of how living things function at a biochemical level 
have now been solved. However, much remains to be investigated. As is 
often the case, each discovery raises at least as many new questions as it 
answers. Furthermore, we are now in an age of unprecedented opportunity 1.3 Concepts from Chemistry 
for the application of our tremendous knowledge of biochemistry to Explain the Properties of 
problems in medicine, dentistry, agriculture, forensics, anthropology, envi- lero ice INeiyeu es 


1.2 DNA Illustrates the Interplay 
Between Form and Function 


ronmental sciences, alternative energy, and many other fields. We begin 1.4 The Genomic Revolution Is 
our journey into biochemistry with one of the most startling discoveries Transforming Biochemistry, 
of the past century: namely, the great unity of all living things at the Medicine, and Other Fields 


biochemical level. 


1.1 Biochemical Unity Underlies Biological Diversity 


The biological world is magnificently diverse. The animal kingdom is 
rich with species ranging from nearly microscopic insects to elephants 
and whales. The plant kingdom includes species as small and relatively 
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simple as algae and as large and complex as giant sequoias. This diversity 
extends further when we descend into the microscopic world. Organisms 
such as protozoa, yeast, and bacteria are present with great diversity in 
water, in soil, and on or within larger organisms. Some organisms can 
survive and even thrive in seemingly hostile environments such as hot 
springs and glaciers. 

The development of the microscope revealed a key unifying feature that 
underlies this diversity. Large organisms are built up of cells, resembling, to 
some extent, single-celled microscopic organisms. The construction of ani- 
mals, plants, and microorganisms from cells suggested that these diverse 
organisms might have more in common than is apparent from their outward 
appearance. With the development of biochemistry, this suggestion has 
been tremendously supported and expanded. At the biochemical level, all 
organisms have many common features (Figure 1.1). 

As mentioned earlier, biochemistry is the study of the chemistry of life 
processes. These processes entail the interplay of two different classes of 
molecules: large molecules such as proteins and nucleic acids, referred to as 
biological macromolecules, and low-molecular-weight molecules such as glu- 
cose and glycerol, referred to as metabolites, that are chemically transformed 
in biological processes. 

Members of both these classes of molecules are common, with minor 
variations, to all living things. For example, deoxyribonucleic acid (DNA) 
stores genetic information in all cellular organisms. Proteins, the macromol- 
ecules that are key participants in most biological processes, are built from 
the same set of 20 building blocks in all organisms. Furthermore, proteins 
that play similar roles in different organisms often have very similar three- 
dimensional structures (Figure 1.1). 


Sulfolobus archaea 


FIGURE 1.1 Biological diversity and similarity. The shape of a key molecule in gene 
regulation (the TATA-box-binding protein) is similar in three very different organisms that are 
separated from one another by billions of years of evolution. [(Left) Eye of Science/Science 
Source; (middle) Holt Studios/Photo Researchers; (right) Time Life Pictures/Getty Images. ] 
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FIGURE 1.2 A possible time line for biochemical evolution. Selected key events are 
indicated. Note that life on Earth began approximately 3.5 billion years ago, whereas human 
beings emerged quite recently. 


Key metabolic processes also are common to many organisms. For 
example, the set of chemical transformations that converts glucose and oxy- 
gen into carbon dioxide and water is essentially identical in simple bacteria 
such as Escherichia coli (E. coli) and human beings. Even processes that 
appear to be quite distinct often have common features at the biochemical 
level. Remarkably, the biochemical processes by which plants capture light 
energy and convert it into more-useful forms are strikingly similar to steps 
used in animals to capture energy released from the breakdown of glucose. 

These observations overwhelmingly suggest that all living things on 
Earth have a common ancestor and that modern organisms have evolved 
from this ancestor into their present forms. Geological and biochemical find- 
ings support a time line for this evolutionary path (Figure 1.2). On the basis 
of their biochemical characteristics, the diverse organisms of the modern 
world can be divided into three fundamental groups called domains: Eukarya 
(eukaryotes), Bacteria, and Archaea. Domain Eukarya comprises all multicel- 
lular organisms, including human beings as well as many microscopic unicel- 
lular organisms such as yeast. The defining characteristic of eukaryotes is the 
presence of a well-defined nucleus within each cell. Unicellular organisms 
such as bacteria, which lack a nucleus, are referred to as prokaryotes. The pro- 
karyotes were reclassified as two separate domains in response to Carl 
Woese’s discovery in 1977 that certain bacteria-like organisms are biochemi- 
cally quite distinct from other previously characterized bacterial species. 
These organisms, now recognized as having diverged from 
bacteria early in evolution, are the archaea. Evolutionary 
paths from a common ancestor to modern organisms can BACTERIA 
be deduced on the basis of biochemical information. One 
such path is shown in Figure 1.3. 

Much of this book will explore the chemical reactions 
and the associated biological macromolecules and metab- 
olites that are found in biological processes common to all 
organisms. The unity of life at the biochemical level 
makes this approach possible. At the same time, different 
organisms have specific needs, depending on the particu- 
lar biological niche in which they evolved and live. By 
comparing and contrasting details of particular biochemi- 
cal pathways in different organisms, we can learn how 
biological challenges are solved at the biochemical level. 
In most cases, these challenges are addressed by the adap- 
tation of existing macromolecules to new roles rather than 
by the evolution of entirely new ones. 

Biochemistry has been greatly enriched by our ability 
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FIGURE 1.3 The tree of life. A possible evolutionary path from a 


Halobacterium 


to examine the three-dimensional structures of biological common ancestor approximately 3.5 billion years ago at the bottom 
macromolecules in great detail. Some of these structures _ of the tree to organisms found in the modern world at the top. 
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are simple and elegant, whereas others are incredibly complicated. In any 
case, these structures provide an essential framework for understanding 
function. We begin our exploration of the interplay between structure and 
function with the genetic material, DNA. 


1.2 DNA Illustrates the Interplay Between 
Form and Function 


A fundamental biochemical feature common to all cellular organisms is the 
use of DNA for the storage of genetic information. The discovery that DNA 
plays this central role was first made in studies of bacteria in the 1940s. This 
discovery was followed by a compelling proposal for the three-dimensional 
structure of DNA in 1953, an event that set the stage for many of the 
advances in biochemistry and many other fields, extending to the present. 

The structure of DNA powerfully illustrates a basic principle common 
to all biological macromolecules: the intimate relation between structure 
and function. The remarkable properties of this chemical substance allow it 
to function as a very efficient and robust vehicle for storing information. We 
start with an examination of the covalent structure of DNA and its exten- 
sion into three dimensions. 


DNA is constructed from four building blocks 


DNA isa linear polymer made up of four different types of monomers. It has 
a fixed backbone from which protrude variable substituents, referred to as 
bases (Figure 1.4). The backbone is built of repeating sugar—phosphate 
units. The sugars are molecules of deoxyribose from which DNA receives its 
name. Each sugar is connected to two phosphate groups through different 
linkages. Moreover, each sugar is oriented in the same way, and so each 
DNA strand has directionality, with one end distinguishable from the other. 
Joined to each deoxyribose is one of four possible bases: adenine (A), cyto- 
sine (C), guanine (G), and thymine (T). 
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FIGURE 1.4 Covalent structure of 
DNA. Each unit of the polymeric structure 
is composed of a sugar (deoxyribose), a 
phosphate, and a variable base that 
protrudes from the sugar—-phosphate 
backbone. 


Cytosine (C) Guanine (G) Thymine (T) 


These bases are connected to the sugar components in the DNA backbone 
through the bonds shown in black in Figure 1.4. All four bases are planar 
but differ significantly in other respects. Thus, each monomer of DNA 
consists of a sugar—phosphate unit and one of four bases attached to the 
sugar. These bases can be arranged in any order along a strand of DNA. 
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Two single strands of DNA combine to form a double helix 


Most DNA molecules consist of not one but two strands (Figure 1.5). In 
1953, James Watson and Francis Crick deduced the arrangement of these 
strands and proposed a three-dimensional structure for DNA molecules. 
This structure is a double helix composed of two inter- 

twined strands arranged such that the sugar—phosphate 

backbone lies on the outside and the bases on the inside. 

The key to this structure is that the bases form specific base 

pairs (bp) held together by hydrogen bonds (Section 1.3): 

adenine pairs with thymine (A—T) and guanine pairs with 

cytosine (G—C), as shown in Figure 1.6. Hydrogen bonds “ 
are much weaker than covalent bonds such as the carbon— 
carbon or carbon-—nitrogen bonds that define the struc- 
tures of the bases themselves. Such weak bonds are crucial 


1.2 DNA: Form and Function 


FIGURE 1.5 The double helix. The double-helical structure of 
DNA proposed by Watson and Crick. The sugar-phosphate 


to biochemical systems; they are weak enough to be backbones of the two chains are shown in red and blue, and the 
reversibly broken in biochemical processes, yet they are bases are shown in green, purple, orange, and yellow. The two 
strong enough, particularly when many form simultane- strands are antiparallel, running in opposite directions with respect 


ously, to help stabilize specific structures such as the 
double helix. 
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FIGURE 1.6 Watson-Crick base pairs. Adenine pairs with thymine (A—T), and guanine with 
cytosine (G—C). The dashed green lines represent hydrogen bonds. 


DNA structure explains heredity and the storage of information 


The structure proposed by Watson and Crick has two properties of central 
importance to the role of DNA as the hereditary material. First, the structure 
is compatible with any sequence of bases. While the bases are distinct in 
structure, the base pairs have essentially the same shape (Figure 1.6) and 
thus fit equally well into the center of the double-helical structure of any 
sequence. Without any constraints, the sequence of bases along a DNA 
strand can act as an efficient means of storing information. Indeed, the 
sequence of bases along DNA strands is how genetic information is stored. 
The DNA sequence determines the sequences of the ribonucleic acid (RNA) 
and protein molecules that carry out most of the activities within cells. 

Second, because of base-pairing, the sequence of bases along one strand 
completely determines the sequence along the other strand. As Watson and 
Crick so coyly wrote: “It has not escaped our notice that the specific pairing 
we have postulated immediately suggests a possible copying mechanism for 
the genetic material.” Thus, if the DNA double helix is separated into two 
single strands, each strand can act as a template for the generation of its 
partner strand through specific base-pair formation (Figure 1.7). The three- 
dimensional structure of DNA beautifully illustrates the close connection 
between molecular form and function. 


to the axis of the double helix, as indicated by the arrows. 
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FIGURE 1.7 DNA replication. If a DNA 
molecule is separated into two strands, 
each strand can act as the template for 
the generation of its partner strand. 
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FIGURE 1.8 Formation of a double helix. When 
two DNA strands with appropriate, complementary 


sequences are mixed, they spontaneously assemble 


to form a double helix. 


1.3 Concepts from Chemistry Explain the Properties 
of Biological Molecules 


We have seen how a chemical insight into the hydrogen-bonding capabili- 
ties of the bases of DNA led to a deep understanding of a fundamental 
biological process. To lay the groundwork for the rest of the book, we begin 
our study of biochemistry by examining selected concepts from chemistry 
and showing how these concepts apply to biological systems. The concepts 
include the types of chemical bonds; the structure of water, the solvent in 
which most biochemical processes take place; the First and Second Laws of 
Thermodynamics; and the principles of acid—base chemistry. 


The formation of the DNA double helix as a key example 


We will use these concepts to examine an archetypical biochemical process— 
namely, the formation of a DNA double helix from its two component 
strands. The process is but one of many examples that could have been 
chosen to illustrate these topics. Keep in mind that, although the specific 
discussion is about DNA and double-helix formation, the concepts consid- 
ered are quite general and will apply to many other classes of molecules and 
processes that will be discussed in the remainder of the book. In the course 
of these discussions, we will touch on the properties of water and the 
concepts of pK, and buffers that are of great importance to many aspects 
of biochemistry. 


The double helix can form from its component strands 


The discovery that DNA from natural sources exists in a double-helical 
form with Watson—Crick base pairs suggested, but did not prove, that such 
double helices would form spontaneously outside biological systems. 
Suppose that two short strands of DNA were chemically synthesized to 
have complementary sequences so that they could, in principle, form 
a double helix with Watson—Crick base pairs. Two such sequences are 
CGATTAAT and ATTAATCG. The structures of these molecules 
in solution can be examined by a variety of techniques. In isolation, 
each sequence exists almost exclusively as a single-stranded molecule. 
However, when the two sequences are mixed, a double helix with 
Watson—Crick base pairs does form (Figure 1.8). This reaction pro- 
ceeds nearly to completion. 

What forces cause the two strands of DNA to bind to each other? 
To analyze this binding reaction, we must consider several factors: the 
types of interactions and bonds in biochemical systems and the ener- 
getic favorability of the reaction. We must also consider the influence 
of the solution conditions—in particular, the consequences of acid— 
base reactions. 


Covalent and noncovalent bonds are important for the structure 
and stability of biological molecules 


Atoms interact with one another through chemical bonds. These bonds 
include the covalent bonds that define the structure of molecules as well as 
a variety of noncovalent bonds that are of great importance to biochemistry. 


Covalent bonds. The strongest bonds are covalent bonds, such as the 
bonds that hold the atoms together within the individual bases shown on 
page 4. A covalent bond is formed by the sharing of a pair of electrons 
between adjacent atoms. A typical carbon-carbon (C—C) covalent bond has 


a bond length of 1.54 A and bond energy of 355 kJ mol! (85 kcal mol~!). 
Because covalent bonds are so strong, considerable energy must be expended 
to break them. More than one electron pair can be shared between two 
atoms to form a multiple covalent bond. For example, three of the bases in 
Figure 1.6 include carbon—oxygen (C==O) double bonds. These bonds are 
even stronger than C—C single bonds, with energies near 730 kJ mol7! 
(175 kcal mol~!) and are somewhat shorter. 

For some molecules, more than one pattern of covalent bonding can be 
written. For example, adenine can be written in two nearly equivalent ways 
called resonance structures. 
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These adenine structures depict alternative arrangements of single and 
double bonds that are possible within the same structural framework. 
Resonance structures are shown connected by a double-headed arrow. 
Adenine’s true structure is a composite of its two resonance structures. The 
composite structure is manifested in the bond lengths such as that for 
the bond joining carbon atoms C-4 and C-5. The observed bond length of 
1.40 A is between that expected for a C—C single bond (1.54 A) and a 
C=C double bond (1.34 A). A molecule that can be written as several 
resonance structures of approximately equal energies has greater stability 
than does a molecule without multiple resonance structures. 


Noncovalent bonds. Noncovalent bonds are weaker than covalent bonds 
but are crucial for biochemical processes such as the formation of a double 
helix. Four fundamental noncovalent bond types are ionic interactions, 
hydrogen bonds, van der Waals interactions, and hydrophobic interactions. 
They differ in geometry, strength, and specificity. Furthermore, these 
bonds are affected in vastly different ways by the presence of water. Let us 
consider the characteristics of each type: 


1. Ionic Interactions. A charged group on one molecule can attract an oppo- 
sitely charged group on the same or another molecule. The energy of an 
ionic interaction (sometimes called an electrostatic interaction) is given by 
the Coulomb energy: 


E = kqiqo/Dr 


where F is the energy, q, and q, are the charges on the two atoms (in units of 
the electronic charge), r is the distance between the two atoms (in ang- 
stroms), D is the dielectric constant (which decreases the strength of the 
Coulomb depending on the intervening solvent or medium), and k is a pro- 
portionality constant (k = 1389, for energies in units of kilojoules per mole, 
or 332 for energies in kilocalories per mole). 

By convention, an attractive interaction has a negative energy. The 
ionic interaction between two ions bearing single opposite charges sepa- 
rated by 3 A in water (which has a dielectric constant of 80) has an energy 
of —5.8 kJ mol~! (—1.4 kcal mol7!). Note how important the dielectric 
constant of the medium is. For the same ions separated by 3 A ina nonpolar 
solvent such as hexane (which has a dielectric constant of 2), the energy of 
this interaction is —232 kJ mol~! (—55 kcal mol7!). 
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Distance and energy units 


Interatomic distances and bond lengths are 
usually measured in angstrom (A) units: 


1A = 10719 m = 107-8 cm = 0.1 nm 


Several energy units are in common use. 


One joule (J) is the amount of energy required 
to move 1 meter against a force of 1 newton. 


A kilojoule (kJ) is 1000 joules. One calorie is 
the amount of energy required to raise the 
temperature of 1 gram of water 1 degree 
Celsius. A kilocalorie (kcal) is 1000 calories. 


One joule is equal to 0.239 cal. 
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FIGURE 1.9 Hydrogen bonds. Hydrogen 
bonds are depicted by dashed green lines. 
The positions of the partial charges (8* 
and 87) are shown. 
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FIGURE 1.10 Energy of a van der Waals 
interaction as two atoms approach each 
other. The energy is most favorable at the 
van der Waals contact distance. Owing to 
electron-electron repulsion, the energy 
rises rapidly as the distance between the 
atoms becomes shorter than the contact 
distance. 
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2. Hydrogen Bonds. These interactions are largely ionic interactions, with 
partial charges on nearby atoms attracting one another. Hydrogen bonds are 
responsible for specific base-pair formation in the DNA double helix. The 
hydrogen atom in a hydrogen bond is partially shared by two electronega- 
tive atoms such as nitrogen or oxygen. The hydrogen-bond donor is the group 
that includes both the atom to which the hydrogen atom is more tightly 
linked and the hydrogen atom itself, whereas the hydrogen-bond acceptor is 
the atom less tightly linked to the hydrogen atom (Figure 1.9). The electro- 
negative atom to which the hydrogen atom is covalently bonded pulls elec- 
tron density away from the hydrogen atom, which thus develops a partial 
positive charge (8*). Thus, the hydrogen atom with a partial positive charge 
can interact with an atom having a partial negative charge (6~ ) through an 
ionic interaction. 

Hydrogen bonds are much weaker than covalent bonds. They have ener- 
gies ranging from 4 to 20 kJ mol~! (from 1 to 5 kcal mol~'). Hydrogen 
bonds are also somewhat longer than covalent bonds; their bond lengths 
(measured from the hydrogen atom) range from 1.5 A to 2.6 A; hence, a 
distance ranging from 2.4 A to 3.5 A separates the two nonhydrogen atoms 
in a hydrogen bond. 

The strongest hydrogen bonds have a tendency to be approximately 
straight, such that the hydrogen-bond donor, the hydrogen atom, and the 
hydrogen-bond acceptor lie along a straight line. This tendency toward lin- 
earity can be important for orienting interacting molecules with respect to 
one another. Hydrogen-bonding interactions are responsible for many of 
the properties of water that make it such a special solvent, as will be 
described shortly. 


3. van der Waals Interactions. The basis of a van der Waals interaction is 
that the distribution of electronic charge around an atom fluctuates with 
time. At any instant, the charge distribution is not perfectly symmetric. 
This transient asymmetry in the electronic charge about an atom acts 
through ionic interactions to induce a complementary asymmetry in the 
electron distribution within its neighboring atoms. The atom and its 
neighbors then attract one another. This attraction increases as two atoms 
come closer to each other, until they are separated by the van der Waals 
contact distance (Figure 1.10). At distances shorter than the van der 
Waals contact distance, very strong repulsive forces become dominant 
because the outer electron clouds of the two atoms overlap. 


Energies associated with van der Waals interactions are quite small; 
typical interactions contribute from 2 to 4 kJ mol7! (from 0.5 to 1 kcal 
mol~!) per atom pair. When the surfaces of two large molecules come 
together, however, a large number of atoms are in van der Waals contact, 
and the net effect, summed over many atom pairs, can be substantial. 

We will cover the fourth noncovalent interaction, the hydrophobic inter- 
action, after we examine the characteristics of water; these characteristics 
are essential to understanding the hydrophobic interaction. 


Properties of water. Water is the solvent in which most biochemical reac- 
tions take place, and its properties are essential to the formation of macro- 
molecular structures and the progress of chemical reactions. Two properties 
of water are especially relevant: 


1. Water is a polar molecule. The water molecule is bent, not linear, and so 
the distribution of charge is asymmetric. The oxygen nucleus draws elec- 
trons away from the two hydrogen nuclei, which leaves the region around 


each hydrogen atom with a net positive charge. The water molecule is thus 
an electrically polar structure. 


2. Water is highly cohesive. Water molecules interact strongly with one 
another through hydrogen bonds. These interactions are apparent in 
the structure of ice (Figure 1.11). Networks of hydrogen bonds hold the 
structure together; similar interactions link molecules in 
liquid water and account for many of the properties of 
water. In the liquid state, approximately one in four of the 
hydrogen bonds present in ice are broken. The polar nature 
of water is responsible for its high dielectric constant of 80. 
Molecules in aqueous solution interact with water mole- 
cules through the formation of hydrogen bonds and through 
ionic interactions. These interactions make water a versatile 
solvent, able to readily dissolve many species, especially 
polar and charged compounds that can participate in these 
interactions. 


The hydrophobic effect. A final fundamental interaction 
called the hydrophobic effect is a manifestation of the proper- 
ties of water. Some molecules (termed nonpolar molecules) 
cannot participate in hydrogen bonding or ionic interac- 
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tions. The interactions of nonpolar molecules with water FIGURE 1.11 Structure of ice. Hydrogen bonds (shown as 
molecules are not as favorable as are interactions between dashed green lines) are formed between water molecules to 
the water molecules themselves. The water molecules in produce a highly ordered and open structure. 


contact with these nonpolar molecules form “cages” around 

them, becoming more well ordered than water molecules free in solution. 
However, when two such nonpolar molecules come together, some of the 
water molecules are released, allowing them to interact freely with bulk 
water (Figure 1.12). The release of water from such cages is favorable for 
reasons to be considered shortly. The result is that nonpolar molecules 
show an increased tendency to associate with one another in water 
compared with other, less polar and less self-associating, solvents. This 
tendency is called the hydrophobic effect and the associated interactions 
are called hydrophobic interactions. 


The double helix is an expression of the rules of chemistry 


Let us now see how these four noncovalent interactions work together in 
driving the association of two strands of DNA to form a double helix. First, 
each phosphate group in a DNA strand carries a negative charge. These 
negatively charged groups interact unfavorably with one another over dis- 
tances. Thus, unfavorable ionic interactions take place when two strands of 


FIGURE 1.12 The hydrophobic 

effect. The aggregation of nonpolar 
groups in water leads to the release of 
water molecules, initially interacting with 
the nonpolar surface, into bulk water. The 
release of water molecules into solution 
makes the aggregation of nonpolar groups 
favorable. 


Be 


FIGURE 1.13 lonic interactions in 

DNA. Each unit within the double helix 
includes a phosphate group (the 
phosphorus atom being shown in purple) 
that bears a negative charge. The 
unfavorable interactions of one phosphate 
with several others are shown by red lines. 
These repulsive interactions oppose the 
formation of a double helix. 


van der Waals 
contacts 


FIGURE 1.14 Base stacking. In the DNA 
double helix, adjacent base pairs are 
stacked nearly on top of one another, and 
so many atoms in each base pair are 
separated by their van der Waals contact 
distance. The central base pair is shown in 
dark blue and the two adjacent base pairs 
in light blue. Several van der Waals 
contacts are shown in red. 
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DNA come together. These phosphate groups are far apart in the double 
helix with distances greater than 10 A, but many such interactions take 
place (Figure 1.13). Thus, ionic interactions oppose the formation of the 
double helix. The strength of these repulsive ionic interactions is dimin- 
ished by the high dielectric constant of water and the presence of ionic 
species such as Na* or Mg?" ions in solution. These positively charged 
species interact with the phosphate groups and partly neutralize their 
negative charges. 

Second, as already noted, hydrogen bonds are important in determining 
the formation of specific base pairs in the double helix. However, in single- 
stranded DNA, the hydrogen-bond donors and acceptors are exposed to 
solution and can form hydrogen bonds with water molecules. 
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When two single strands come together, these hydrogen bonds with water 
are broken and new hydrogen bonds between the bases are formed. Because 
the number of hydrogen bonds broken is the same as the number formed, 
these hydrogen bonds do not contribute substantially to driving the overall 
process of double-helix formation. However, they contribute greatly to the 
specificity of binding. Suppose two bases that cannot form Watson—Crick 
base pairs are brought together. Hydrogen bonds with water must be bro- 
ken as the bases come into contact. Because the bases are not complemen- 
tary in structure, not all of these bonds can be simultaneously replaced by 
hydrogen bonds between the bases. Thus, the formation of a double helix 
between noncomplementary sequences is disfavored. 

Third, within a double helix, the base pairs are parallel and stacked 
nearly on top of one another. The typical separation between the planes of 
adjacent base pairs is 3.4 A, and the distances between the most closely 
approaching atoms are approximately 3.6 A. This separation distance cor- 
responds nicely to the van der Waals contact distance (Figure 1.14). Bases 
tend to stack even in single-stranded DNA molecules. However, the base 
stacking and associated van der Waals interactions are nearly optimal in a 
double-helical structure. 

Fourth, the hydrophobic effect also contributes to the favorability of 
base stacking. More-complete base stacking moves the nonpolar surfaces of 
the bases out of water into contact with each other. 

The principles of double-helix formation between two strands of DNA 
apply to many other biochemical processes. Many weak interactions con- 
tribute to the overall energetics of the process, some favorably and some 
unfavorably. Furthermore, surface complementarity is a key feature: when 
complementary surfaces meet, hydrogen-bond donors align with hydrogen- 
bond acceptors and nonpolar surfaces come together to maximize van der 
Waals interactions and minimize nonpolar surface area exposed to the aque- 
ous environment. The properties of water play a major role in determining 
the importance of these interactions. 


The laws of thermodynamics govern the behavior of 
biochemical systems 


We can look at the formation of the double helix from a different perspec- 
tive by examining the laws of thermodynamics. These laws are general 


principles that apply to all physical (and biological) processes. They are of 
great importance because they determine the conditions under which spe- 
cific processes can or cannot take place. We will consider these laws from a 
general perspective first and then apply the principles that we have devel- 
oped to the formation of the double helix. 

The laws of thermodynamics distinguish between a system and its 
surroundings. A system refers to the matter within a defined region of 
space. The matter in the rest of the universe is called the surroundings. The 
First Law of Thermodynamics states that the total energy of a system and its 
surroundings is constant. In other words, the energy content of the uni- 
verse is constant; energy can be neither created nor destroyed. Energy 
can take different forms, however. Heat, for example, is one form of 
energy. Heat is a manifestation of the kinetic energy associated with the 
random motion of molecules. Alternatively, energy can be present as 
potential energy—energy that will be released on the occurrence of some 
process. Consider, for example, a ball held at the top of a tower. The ball 
has considerable potential energy because, when it is released, the ball 
will develop kinetic energy associated with its motion as it falls. Within 
chemical systems, potential energy is related to the likelihood that atoms 
can react with one another. For instance, a mixture of gasoline and oxy- 
gen has a large potential energy because these molecules may react to 
form carbon dioxide and water and release energy as heat. The First Law 
requires that any energy released in the formation of chemical bonds 
must be used to break other bonds, released as heat or light, or stored in 
some other form. 

Another important thermodynamic concept is that of entropy, a measure 
of the degree of randomness or disorder in a system. The Second Law of 
Thermodynamics states that the total entropy of a system plus that of its sur- 
roundings always increases. For example, the release of water from nonpolar 
surfaces responsible for the hydrophobic effect is favorable because water 
molecules free in solution are more disordered than they are when they are 
associated with nonpolar surfaces. At first glance, the Second Law appears 
to contradict much common experience, particularly about biological sys- 
tems. Many biological processes, such as the generation of a leaf from car- 
bon dioxide gas and other nutrients, clearly increase the level of order and 
hence decrease entropy. Entropy may be decreased locally in the formation 
of such ordered structures only if the entropy of other parts of the universe 
is increased by an equal or greater amount. The local decrease in entropy is 
often accomplished by a release of heat, which increases the entropy of the 
surroundings. 

We can analyze this process in quantitative terms. First, consider the 
system. The entropy (S) of the system may change in the course of a 
chemical reaction by an amount AS, .... If heat flows from the system to 
its surroundings, then the heat content, often referred to as the enthalpy (H), 
of the system will be reduced by an amount AH sistem? To apply the 
Second Law, we must determine the change in entropy of the surround- 
ings. If heat flows from the system to the surroundings, then the entropy 
of the surroundings will increase. The precise change in the entropy of the 
surroundings depends on the temperature; the change in entropy is greater 
when heat is added to relatively cold surroundings than when heat is added 
to surroundings at high temperatures that are already in a high degree of 
disorder. To be even more specific, the change in the entropy of the sur- 
roundings will be proportional to the amount of heat transferred from the 
system and inversely proportional to the temperature (T) of the surround- 
ings. In biological systems, T [in kelvins (K), absolute temperature] is 
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usually assumed to be constant. Thus, a change in the entropy of the 
surroundings is given by 


ASsurroundings = ~ AFsystem/T (1) 
The total entropy change is given by the expression 
AStort = ASsysien * ASsurecondings (2) 
Substituting equation 1 into equation 2 yields 
AStotal = ASsystem a AA system/ T (3) 
Multiplying by —T gives 
—TASioa = Ab geen — TASsystem (4) 


The function — TAS has units of energy and is referred to as free energy or 
Gibbs free energy, after Josiah Willard Gibbs, who developed this function 
in 1878: 


AG = B Aosta ~~ TASgystem (5) 


The free-energy change, AG, will be used throughout this book to describe 
the energetics of biochemical reactions. The Gibbs free energy is essentially 
an accounting tool that keeps track of both the entropy of the system 
(directly) and the entropy of the surroundings (in the form of heat released 
from the system). 

Recall that the Second Law of Thermodynamics states that, for a process 
to take place, the entropy of the universe must increase. Examination of 
equation 3 shows that the total entropy will increase if and only if 


ASsystem = AA system / T (6) 


Rearranging gives TAS AH or, in other words, entropy will increase if 


and only if 


system’ 


AG = AF system — TAS system <0 (7) 


Thus, the free-energy change must be negative for a process to take place 
spontaneously. There is negative free-energy change when and only when the 
overall entropy of the universe is increased. Again, the free energy represents 
a single term that takes into account both the entropy of the system and the 
entropy of the surroundings. 


Heat is released in the formation of the double helix 


Let us see how the principles of thermodynamics apply to the formation of 
the double helix (Figure 1.15). Suppose solutions containing each of the two 
single strands are mixed. Before the double helix forms, each of the single 
strands is free to translate and rotate in solution, whereas each matched pair 
of strands in the double helix must move together. Furthermore, the free 
single strands exist in more conformations than possible when bound 
together in a double helix. Thus, the formation of a double helix from two 
single strands appears to result in an increase in order for the system, that is, 
a decrease in the entropy of the system. 

On the basis of this analysis, we expect that the double helix cannot form 
without violating the Second Law of Thermodynamics unless heat is released 
to increase the entropy of the surroundings. Experimentally, we can measure 
the heat released by allowing the solutions containing the two single strands to 
come together within a water bath, which here corresponds to the surround- 
ings. We then determine how much heat must be absorbed by the water bath 
or released from it to maintain it at a constant temperature. This experiment 


reveals that a substantial amount of heat is 
released—namely, approximately 250 kJ mol~! 
(60 kcal mol~'). This experimental result reveals 
that the change in enthalpy for the process is 
quite large, —250 kJ mol~!, consistent with our 
expectation that significant heat would have to 
be released to the surroundings for the process 
not to violate the Second Law. We see in quanti- 
tative terms how order within a system can be 
increased by releasing sufficient heat to the sur- 
roundings to ensure that the entropy of the uni- 
verse increases. We will encounter this general 
theme again and again throughout this book. 


Acid-base reactions are central in many 
biochemical processes 


Throughout our consideration of the formation 
of the double helix, we have dealt only with the 
noncovalent bonds that are formed or broken in 
this process. Many biochemical processes entail 
the formation and cleavage of covalent bonds. A 
particularly important class of reactions promi- 
nent in biochemistry is acid—base reactions. 

In acid and base reactions, hydrogen ions 
are added to molecules or removed from them. 
Throughout the book, we will encounter many 
processes in which the addition or removal of 
hydrogen atoms is crucial, such as the meta- 
bolic processes by which carbohydrates are 
degraded to release energy for other uses. Thus, 
a thorough understanding of the basic princi- 
ples of these reactions is essential. 

A hydrogen ion, often written as H*, cor- 
responds to a proton. In fact, hydrogen ions 
exist in solution bound to water molecules, thus 
forming what are knownas hydronium ions, H,O*. 
For simplicity, we will continue to write H*, 
but we should keep in mind that H* is short- 
hand for the actual species present. 

The concentration of hydrogen ions in solu- 
tion is expressed as the pH. Specifically, the pH 
of a solution is defined as 


pH = —log{H*] 
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FIGURE 1.15 Double-helix formation and entropy. When solutions containing 
DNA strands with complementary sequences are mixed, the strands react to 
orm double helices. This process results in a loss of entropy from the system, 
indicating that heat must be released to the surroundings to prevent a violation 
of the Second Law of Thermodynamics. 


where [H*] is in units of molarity. Thus, pH 7.0 refers to a solution for 
which —log[H*] = 7.0, and so log[H*] = —7.0 and [Ht] = 10/eslH+] = 


107-79 = 1.0 X 10-7M 


The pH also indirectly expresses the concentration of hydroxide ions, 
[OH], in solution. To see how, we must realize that water molecules can 
dissociate to form H*t and OH 7 ions in an equilibrium process. 


H,O = H* + OH™ 


The equilibrium constant (K) for the dissociation of water is defined as 


K = [H*][OH~ ]/[H,0] 
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Fraction of molecules in 
double-helical form 


FIGURE 1.16 DNA denaturation by the 
addition of a base. The addition of a base 
to a solution of double-helical DNA initially 
at pH 7 causes the double helix to separate 
into single strands. The process is half 
complete at slightly above pH 9. 


and has a value of K = 1.8 X 107!°. Note that an equilibrium constant does 
not formally have units. Nonetheless, the value of the equilibrium constant 
given assumes that particular units are used for concentration (sometimes 
referred to a standard states); in this case and in many others, units of molar- 
ity (M) are assumed. 

The concentration of water, [HO], in pure water is 55.5 M, and this 
concentration is constant under most conditions. Thus, we can define a new 
constant, Ky;: 


Kw = K[H,0] = [H*][OH7] 
K[H,O] = 1.8 x 10° 1° x 55.5 


= 1.0 x 10° 
Because Ky, = [H*][OH7] = 1.0 x 107"*, we can calculate 
[OH] = 10° '*/[H*] and [H*] = 10° !*/[OH 7] 


With these relations in hand, we can easily calculate the concentration of 
hydroxide ions in an aqueous solution, given the pH. For example, at pH = 
7.0, we know that [Ht] = 107’ Mand so [OH7] = 10714/1077 = 10-7M. 
In acidic solutions, the concentration of hydrogen ions is higher than 1077 
and, hence, the pH is below 7. For example, in 0.1 M HCl, [Ht] = 107!M 
and so pH = 1.0 and [OH] = 107!4/107! = 1078 M. 


Acid-base reactions can disrupt the double helix 


The reaction that we have been considering between two strands of DNA to 
form a double helix takes place readily at pH 7.0. Suppose that we take the 
solution containing the double-helical DNA and treat it with a solution of 
concentrated base (i.e., with a high concentration of OH 7). As the base is 
added, we monitor the pH and the fraction of DNA in double-helical form 
(Figure 1.16). When the first additions of base are made, the pH rises, but 
the concentration of the double-helical DNA does not change significantly. 
However, as the pH approaches 9, the DNA double helix begins to dissoci- 
ate into its component single strands. As the pH continues to rise from 9 to 
10, this dissociation becomes essentially complete. Why do the two strands 
dissociate? The hydroxide ions can react with bases in DNA base pairs to 
remove certain protons. The most susceptible proton is the one bound to the 
N-1 nitrogen atom in a guanine base. 
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Proton dissociation for a substance HA (such as that bound to N-1 on gua- 
nine) has an equilibrium constant defined by the expression 


K, = [H"][A”]/[HA] 


The susceptibility of a proton to removal by reaction with a base is often 
described by its pK, value: 


pK, = —log(K,) 


When the pH is equal to the pK,, we have 

pH = pK, 
and so 

—logf{H*] = —log({[H*][A~]/[HA]) 
and 
[H*] = [H™][A7]/[HA] 
Dividing by [H*] reveals that 
L= A] 7 [EA] 
and so 
LA” ] = [HA] 


Thus, when the pH equals the pK,, the concentration of the deprotonated 
form of the group or molecule is equal to the concentration of the proton- 
ated form; the deprotonation process is halfway to completion. 

The pK, for the proton on N-1 of guanine is typically 9.7. When the pH 
approaches this value, the proton on N-1 is lost (Figure 1.16). Because this 
proton participates in an important hydrogen bond, its loss substantially 
destabilizes the DNA double helix. The DNA double helix is also destabi- 
lized by low pH. Below pH 5, some of the hydrogen bond acceptors that 
participate in base-pairing become protonated. In their protonated forms, 
these bases can no longer form hydrogen bonds and the double helix sepa- 
rates. Thus, acid—base reactions that remove or donate protons at specific 
positions on the DNA bases can disrupt the double helix. 


Buffers regulate pH in organisms and in the laboratory 


These observations about DNA reveal that a significant change in pH can 
disrupt molecular structure. The same is true for many other biological 
macromolecules; changes in pH can protonate or deprotonate key groups, 
potentially disrupting structures and initiating harmful reactions. Thus, 
systems have evolved to mitigate changes in pH in biological systems. 
Solutions that resist such changes are called buffers. Specifically, when acid 
is added to an unbuffered aqueous solution, the pH drops in proportion to 
the amount of acid added. In contrast, when acid is added to a buffered solu- 
tion, the pH drops more gradually. Buffers also mitigate the pH increase 
caused by the addition of base and changes in pH caused by dilution. 

Compare the result of adding a 1 M solution of the strong acid HCl drop 
by drop to pure water with adding it to a solution containing 100 mM of the 
buffer sodium acetate (Na*CH,COO-; Figure 1.17). The process of 
gradually adding known amounts of reagent to a solution with which the 
reagent reacts while monitoring the results is called a titration. For pure 
water, the pH drops from 7 to close to 2 on the addition of the first few 
drops of acid. However, for the sodium acetate solution, the pH first falls 
rapidly from its initial value near 10, then changes more gradually until the 
pH reaches 3.5, and then falls more rapidly again. Why does the pH 
decrease so gradually in the middle of the titration? The answer is that, 
when hydrogen ions are added to this solution, they react with acetate ions 
to form acetic acid. This reaction consumes some of the added hydrogen 
ions so that the pH does not drop. Hydrogen ions continue reacting with 
acetate ions until essentially all of the acetate ion is converted into acetic 
acid. After this point, added protons remain free in solution and the pH 
begins to fall sharply again. 
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FIGURE 1.17 Buffer action. The addition 
of a strong acid, 1 M HCl, to pure water 
results in an immediate drop in pH to near 
2. In contrast, the addition of the acid to a 
0.1 M sodium acetate (Nat CH,COO7) 
solution results in a much more gradual 
change in pH until the pH drops below 3.5. 
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FIGURE 1.18 Buffer protonation. When 
acid is added to sodium acetate, the 
added hydrogen ions are used to convert 
acetate ion into acetic acid. Because the 
proton concentration does not increase 
significantly, the pH remains relatively 
constant until all of the acetate has been 
converted into acetic acid. 


100% 


We can analyze the effect of the buffer in quantitative terms. The equi- 
librium constant for the deprotonation of an acid is 


K, = [H*][A ]/[HA] 
Taking logarithms of both sides yields 
log(K,) = log([H™]) + log([A~ ]/[HA]) 
Recalling the definitions of pK, and pH and rearranging gives 
pH = pK, + log([A’ ]/[HA]) 


This expression is referred to as the Henderson—Hasselbalch equation. 

We can apply the equation to our titration of sodium acetate. The pK, of 
acetic acid is 4.75. We can calculate the ratio of the concentration of acetate 
ion to the concentration of acetic acid as a function of pH by using the 
Henderson—Hasselbalch equation, slightly rearranged. 


[Acetate ion] /[Acetic acid] = [A~ ]/[HA] = 10PH~ PK: 


At pH 9, this ratio is 10?~+-7° = 10+2° = 17,800; very little acetic acid 
has been formed. At pH 4.75 (when the pH equals the pK,), the ratio is 
10-4 10 = 1, At pH 3, the ratio is 107-4? = = 19-125 = = 0.02; 
almost all of the acetate ion has been converted into acetic acid. We can fol- 
low the conversion of acetate ion into acetic acid over the entire titration 
(Figure 1.18). The graph shows that the region of relatively constant pH 
corresponds precisely to the region in which acetate ion is being protonated 
to form acetic acid. 

From this discussion, we see that a buffer functions best close to the 
pK, value of its acid component. Physiological pH 1s typically about 7.4. 
An important buffer in biological systems is based on phosphoric 
acid (H,PO,). The acid can be deprotonated in three steps to form a 
phosphate ion. 


pos 
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At about pH 7.4, inorganic phosphate exists primarily as a nearly equal 
mixture of H,PO4, and HPO,” . Thus, phosphate solutions function as 
effective buffers near pH 7.4. The concentration of inorganic phosphate in 
blood is typically approximately 1 mM, providing a useful buffer against 
processes that produce either acid or base. We can examine this utility in 
quantitative terms with the use of the Henderson—Hasselbalch equation. 
What concentration of acid must be added to change the pH of 1 mM phos- 
phate buffer from 7.4 to 7.3? Without buffer, this change in [H*] corre- 
sponds to a change of 10773 — 10774 M = (5.0 x 1078 — 4.0 x 1078) 
M = 1.0 X 1078 M. Let us now consider what happens to the buffer com- 
ponents. At pH 7.4, 


[HPO,* ]/(HaPOxg | = 1077-7" = 


The total concentration of phosphate, [HPO,? 7] 
1mM, Thus, 


[HPO,?7] = (1.55/2.55) X 1mM = 0.608 mM 


10°19 = 1.55 
+ [H2PO, ], 1s 


and 


[H»PO,47] = (1/2.55) X 1mM = 0.392 mM 


At pH 7.3, 17 
(HPO, ~]/[H POs] = 1073-77-21 = 49:09 ~123 1.4 The Genomic Revolution 


and so 
(HPO, ] = (1.23/2.23) = 0.552 mM 
and 
[H»PO4 ] = (1/2.23) = 0.448mM 


Thus, (0.608 — 0.552) = 0.056 mM HPO,’ is converted into HPO, , 
consuming 0.056 mM = 5.6 X 10~> M[H™]. Thus, the buffer increases the 
amount of acid required to produce a drop in pH from 7.4 to 7.3 by a factor 
of 5.6 X 107°/1.0 X 1078 = 5600 compared with pure water. 


1.4 The Genomic Revolution Is Transforming 
Biochemistry, Medicine, and Other Fields 


Watson and Crick’s discovery of the structure of DNA suggested the 
hypothesis that hereditary information is stored as a sequence of bases 
along long strands of DNA. This remarkable insight provided an entirely 
new way of thinking about biology. However, at the time that it was 
made, Watson and Crick’s discovery, though full of potential, remained 
to be confirmed and many features needed to be elucidated. How is the 
sequence information read and translated into action? What are the 
sequences of naturally occurring DNA molecules and how can such 
sequences be experimentally determined? Through advances in bio- 
chemistry and related sciences, we now have essentially complete answers 
to these questions. Indeed, in the past decade or so, scientists have deter- 
mined the complete genome sequences of hundreds of different organ- 
isms, including simple microorganisms, plants, animals of varying 
degrees of complexity, and human beings. Comparisons of these genome 
sequences, with the use of methods introduced in Chapter 6, have been 
sources of many insights that have transformed biochemistry. In addition 
to its experimental and clinical aspects, biochemistry has now become an 
information science. 


Genome sequencing has transformed biochemistry 
and other fields 


The sequencing of a human genome was a daunting task because it contains 
approximately 3 billion (3 X 10°) base pairs. For example, the sequence 


ACATTTGCTTCTGACACAACTGTGTTCACTAGCAACCTC 
AAACAGACACCATGGTGCATCTGACTCCTGAGGAGAAGT 
CTGCCGTTACTGCCCTGTGGGGCAAGGTGAACGTGGA ... 


is a part of one of the genes that encodes hemoglobin, the oxygen carrier in 
our blood. This gene is found on the end of chromosome 9 of our 24 distinct 
chromosomes. If we were to include the complete sequence of our entire 
genome, this chapter would run to more than 500,000 pages. The sequenc- 
ing of our genome is truly a landmark in human history. This sequence 
contains a vast amount of information, some of which we can now extract 
and interpret, but much of which we are only beginning to understand. For 
example, some human diseases have been linked to particular variations in 
genomic sequence. Sickle-cell anemia, discussed in detail in Chapter 7, 
is caused by a single base change of an A (noted in boldface type in the 
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preceding sequence) to a T. We will encounter many other examples of dis- 
eases that have been linked to specific DNA sequence changes. 
Determining the first human genome sequences was a great challenge. It 
required the efforts of large teams of geneticists, molecular biologists, bio- 
chemists, and computer scientists, as well as billions of dollars, because 
there was no previous framework for aligning the sequences of various 
DNA fragments. One human genome sequence can serve as a reference for 
other sequences. The availability of such reference sequences enables much 
more rapid characterization of partial or complete genomes from other indi- 
viduals. As we will discuss in Chapter 5, arrays containing millions of target 
single-stranded DNA molecules with sequences from the reference genome 
and known or potential variants are powerful tools. These arrays can 
be exposed to mixtures of DNA fragments for a particular individual and 
those single-stranded targets that bind to their complementary strands can 
be determined. This allows many positions within the genome of the indi- 
vidual to be simulaneously interrogated. 
Methods for sequencing DNA have also been improv- 


@ $100M ing rapidly, driven by a deep understanding of the 
5 biochemistry of DNA replication and other processes. 
=. 2M This has resulted in both dramatic increases in the DNA 
z sequencing rate and decreases in cost (Figure 1.19). The 
a FM availability of such powerful sequencing technology is 
8 er transforming many fields, including medicine, dentistry, 
£ microbiology, pharmacology, and ecology, although a 
= Hee great deal remains to be done to improve the accuracy 
a and precision of the interpretation of these large genomic 
B $1K | | | pf Lf | L | | and related data sets. 
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vest pairs. How different are we from one another at the 
FIGURE 1.19 Decreasing costs of DNA sequencing. Through the genomic level? Examination of genomic variation 
Human Genome Project, the cost of DNA sequencing dropped reveals that, on average, each pair of individuals has a 
steadily. With the advent of new methods, these costs dropped different base in one position per 200 bases; that is, 
dramatically and are now approaching $1000 for a complete human the difference is approximately 0.5%. This variation 


genome sequence. [National Human Genome Research Institute. 


www.genome.gov/sequencingcosts] 


between individuals who are not closely related is 
quite substantial compared with differences in popu- 
lations. The average difference between two people within one ethnic 
group Is greater than the difference between the averages of two differ- 
ent ethnic groups. 

The significance of much of this genetic variation is not understood. 
As noted earlier, variation in a single base within the genome can lead to a 
disease such as sickle-cell anemia. Scientists have now identified the 
genetic variations associated with hundreds of diseases for which the cause 
can be traced to a single gene. For other diseases and traits, we know that 
variation in many different genes contributes in significant and often 
complex ways. Many of the most prevalent human ailments such as heart 
disease are linked to variations in many genes. Furthermore, in most cases, 
the presence of a particular variation or set of variations does not inevita- 
bly result in the onset of a disease but, instead, leads to a predisposition to 
the development of the disease. 

Our own genes are not the only ones that can contribute to health and 
disease. Our bodies, including our skin, mouth, digestive tract, genito- 
urinary tract, respiratory tract, and other areas, contain large number of 
microorganisms. These complex communities have been characterized 
through powerful methods that allow DNA isolated from these biologi- 
cal samples to be sequenced without any previous knowledge of the 


organisms present. Many of these organisms had not previously been 
discovered because they can only grow as part of complex communities 
and thus cannot be isolated through standard microbiological techniques. Gastrointestinal 
Remarkably, it appears that we are outnumbered in our own bodies! 
Each of us contains approximately ten times more microbial cells than 
human cells and these microbial cells include many more genes than do 
our own genomes. These microbiomes differ from site to site, from one 
person to another and over time in the same individual. They appear to 
play roles in health and in diseases such as obesity and dental caries 
(Figure 1.20). 

In addition to the implications for understanding human health and Skin 
disease, the genome sequence is a source of deep insight into other aspects 
of human biology and culture. For example, by comparing the sequences of 
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FIGURE 1.20 The human microbiome. 
Microorganisms cover the human body. 


different individuals and populations, we can learn a great deal about Examination of the microbial communities 
human history. On the basis of such analysis, a compelling case can be made using DNA sequencing methods revealed 
that the human species originated in Africa, and the occurrence and even many previously uncharacterized species. 


The Venn diagrams represent populations 
of related species as determined by DNA 
sequence comparisons. The populations 


the timing of important migrations of groups of human beings can be dem- 
onstrated (Figure 1.21). Finally, comparisons of the human genome with 


the genomes of other organisms are confirming the tremendous unity that present on different body surfaces are 
exists at the level of biochemistry and are revealing key steps in the course largely distinct. [Adapted from www.nature. 
of evolution from relatively simple, single-celled organisms to complex, com/nature/journal/v486/n7 402/fig_tab/ 


multicellular organisms such as human beings. For example, many genes nature! 1234_F1 html] 


that are key to the function of the human brain and nervous system have 
evolutionary and functional relatives in the genomes of bacteria. Because 
many studies that are possible in model organisms are difficult or unethical 
to conduct in human beings, these discoveries have many practical implica- 
tions. Comparative genomics has become a powerful science, linking evolu- 
tion and biochemistry. 


46,000-50,000 20,000-30,000 


years ago years ago 15,000 years ago 


150,000 
years ago 


50,000-60,000 
years ago 


12,500 years ago 


FIGURE 1.21 Human migrations supported by DNA sequence comparisons. Modern 
human beings originated in Africa, migrated first to Asia, and then to Europe, Australia, and 
North and South America. [Adapted from S. Oppenheimer, “Out-of-Africa, the peopling of 
continents and islands: tracing uniparental gene trees across the map.” Philos. Trans. R. Soc. 
Lond. B. Biol. Sci. 367(1590):770-784] 
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Fruits 


Vegetables 


X 


FIGURE 1.22 Nutrition. Proper health depends 
on an appropriate combination of food groups 
(fruits, vegetables, protein, grains, dairy) (left) 
to provide an optimal mix of biochemicals 
(carbohydrates, proteins, fats, vitamins, 

and minerals) (right). [Adapted from www. 
choosemyplate.gov] 


Environmental factors influence human biochemistry 


Although our genetic makeup (and that of our microbiomes) is an impor- 
tant factor that contributes to disease susceptibility and to other traits, 
factors in a person’s environment also are significant. What are these envi- 
ronmental factors? Perhaps the most obvious are chemicals that we eat or 
are exposed to in some other way. The adage “you are what you eat” has 
considerable validity; it applies both to substances that we ingest in sig- 
nificant quantities and to those that we ingest in only trace amounts. 
Throughout our study of biochemistry, we will encounter vitamins and 
trace elements and their derivatives that play crucial roles in many pro- 
cesses. In many cases, the roles of these chemicals were first revealed 
through investigation of deficiency disorders observed in people who do not 
take in a sufficient quantity of a particular vitamin or trace element. 
Despite the fact that the most important essential dietary factors have been 
known for some time, new roles for them continue to be discovered. 
A healthful diet requires a balance of major food 
Vitamins groups. In addition to providing vitamins and trace 
and minerals elements, food provides calories in the form of sub- 
; stances that can be broken down to release energy 
that drives other biochemical processes. Proteins, 
fats, and carbohydrates provide the building blocks 
used to construct the molecules of life (Figure 1.22). 
Finally, it is possible to get too much of a good 
thing. Human beings evolved under circumstances 
in which food, particularly rich foods such as meat, 
was scarce. With the development of agriculture 
and modern economies, rich foods are now plentiful 
in parts of the world. Some of the most prevalent 
diseases in the so-called developed world, such as 
heart disease and diabetes, can be attributed to the large quantities of fats 
and carbohydrates present in modern diets. We are now developing a deeper 
understanding of the biochemical consequences of these diets and the inter- 
play between diet and genetic factors. 

Chemicals are only one important class of environmental factors. Our 
behaviors also have biochemical consequences. Through physical activ- 
ity, we consume the calories that we take in, ensuring an appropriate bal- 
ance between food intake and energy expenditure. Activities ranging 
from exercise to emotional responses such as fear and love may activate 
specific biochemical pathways, leading to changes in levels of gene 
expression, the release ofhormones, and other consequences. Furthermore, 
the interplay between biochemistry and behavior is bidirectional. Just as 
our biochemistry is affected by our behavior, so, too, our behavior is 
affected, although certainly not completely determined, by our genetic 
makeup and other aspects of our biochemistry. Genetic factors associated 
with a range of behavioral characteristics have been at least tentatively 
identified. 

Just as vitamin deficiencies and genetic diseases have revealed funda- 
mental principles of biochemistry and biology, investigations of variations 
in behavior and their linkage to genetic and biochemical factors are potential 
sources of great insight into mechanisms within the brain. For example, 
studies of drug addiction have revealed neural circuits and biochemical 
pathways that greatly influence aspects of behavior. Unraveling the inter- 
play between biology and behavior is one of the great challenges in modern 
science, and biochemistry is providing some of the most important concepts 
and tools for this endeavor. 


Genome sequences encode proteins and patterns of expression 


The structure of DNA revealed how information is stored in the base 
sequence along a DNA strand. But what information is stored and how 
is it expressed? The most fundamental role of DNA is to encode the 
sequences of proteins. Like DNA, proteins are linear polymers. However, 
proteins differ from DNA in two important ways. First, proteins are 
built from 20 building blocks, called amino acids, rather than just four, 
as in DNA. The chemical complex- 

ity provided by this variety of build- 


ing blocks enables proteins to per- C06 = 660 . 
form a wide range of functions. 


Second, proteins spontaneously fold Amino acid sequence 1 
into elaborate three-dimensional 
structures, determined only by their 
amino acid sequences (Figure 1.23). 
We have explored in depth how 
solutions containing two appropri- 
ate strands of DNA come together 
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to form a solution of double-helical Coe = 660 —<—<— 


molecules. A similar spontaneous 
folding process gives proteins their 
three-dimensional structure. A bal- 
ance of hydrogen bonding, van der 
Waals interactions, and hydrophobic interactions overcomes the entropy 
lost in going from an unfolded ensemble of proteins to a homogenous set 
of well-folded molecules. Proteins and protein folding will be discussed 
extensively in Chapter 2. 

The fundamental unit of hereditary information, the gene, is becom- 
ing increasingly difficult to precisely define as our knowledge of the com- 
plexities of genetics and genomics increases. The genes that are simplest 
to define encode the sequences of proteins. For these protein-encoding 
genes, a block of DNA bases encodes the amino acid sequence of a spe- 
cific protein molecule. A set of three bases along the DNA strand, called 
a codon, determines the identity of one amino acid within the protein 
sequence. The set of rules that links the DNA sequence to the encoded 
protein sequence is called the genetic code. One of the biggest surprises 
from the sequencing of the human genome is the small number of pro- 
tein-encoding genes. Before the genome-sequencing project began, the 
consensus view was that the human genome would include approximately 
100,000 protein-encoding genes. The current analysis suggests that the 
actual number is between 20,000 and 25,000. We shall use an estimate of 
21,000 throughout this book. However, additional mechanisms allow 
many genes to encode more than one protein. For example, the genetic 
information in some genes is translated in more than one way to produce 
a set of proteins that differ from one another in parts of their amino acid 
sequences. In other cases, proteins are modified after they have been syn- 
thesized through the addition of accessory chemical groups. Through 
these indirect mechanisms, much more complexity is encoded in our 
genomes than would be expected from the number of protein-encoding 
genes alone. 

On the basis of current knowledge, the protein-encoding regions 
account for only about 3% of the human genome. What 1s the function of 
the rest of the DNA? Some of it contains information that regulates the 
expression of specific genes (i.e., the production of specific proteins) in 
particular cell types and physiological conditions. Essentially every human 


Amino acid sequence 2 


FIGURE 1.23 Protein folding. Proteins 
are linear polymers of amino acids that 
fold into elaborate structures. The 
sequence of amino acids determines 
the three-dimensional structure. Thus, 
amino acid sequence 1 gives rise only 
to a protein with the shape depicted in 
blue, not the shape depicted in red. 
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cell contains the same DNA genome, yet cell types differ considerably in 
the proteins that they produce. For example, hemoglobin is expressed only 
in precursors of red blood cells, even though the genes for hemoglobin are 
present in essentially every cell. Specific sets of genes are expressed in 
response to hormones, even though these genes are not expressed in the 
same cell in the absence of the hormones. The control regions that regulate 
such differences account for only a small amount of the remainder of our 
genomes. The truth is that we do not yet understand all of the function of 
much of the remainder of the DNA. Some of it is sometimes referred to as 
“junk”—stretches of DNA that were inserted at some stage of evolution 
and have remained. In some cases, this DNA may, in fact, serve important 
functions. In others, it may serve no function but, because it does not cause 
significant harm, it has remained. 


APPENDIX: Visualizing Molecular Structures I: Small Molecules 


The authors of a biochemistry textbook face the problem of 
trying to present three-dimensional molecules in the two 
dimensions available on the printed page. The interplay 
between the three-dimensional structures of biomolecules 
and their biological functions will be discussed extensively 
throughout this book. Toward this end, we will frequently 
use representations that, although of necessity are rendered 
in two dimensions, emphasize the three-dimensional struc- 
tures of molecules. 


Stereochemical Renderings 

Most of the chemical formulas in this book are drawn to 
depict the geometric arrangement of atoms, crucial to 
chemical bonding and reactivity, as accurately as possi- 
ble. For example, the carbon atom of methane is tetra- 
hedral, with H-C-—H angles of 109.5 degrees, whereas 
the carbon atom in formaldehyde has bond angles of 
120 degrees. 


H H i 
we I 

H~ ~H HH” ~H 

Methane Formaldehyde 


To illustrate the correct stereochemistry about tetra- 
hedral carbon atoms, wedges will be used to depict the 
direction of a bond into or out of the plane of the page. A 
solid wedge with the broad end away from the carbon atom 
denotes a bond coming toward the viewer out of the plane. 
A dashed wedge, with its broad end at the carbon atom, 
represents a bond going away from the viewer behind the 
plane of the page. The remaining two bonds are depicted as 
straight lines. 


Fischer Projections 

Although representative of the actual structure of a com- 
pound, stereochemical structures are often difficult to draw 
quickly. An alternative, less-representative method of 
depicting structures with tetrahedral carbon centers relies 
on the use of Fischer projections. 


a 
2x a 
: yo Sx 
Y Y 
Fischer Stereochemical 
projection rendering 


In a Fischer projection, the bonds to the central carbon 
are represented by horizontal and vertical lines from the 
substituent atoms to the carbon atom, which is assumed to 
be at the center of the cross. By convention, the horizontal 
bonds are assumed to project out of the page toward the 
viewer, whereas the vertical bonds are assumed to project 
behind the page away from the viewer. 


Molecular Models for Small Molecules 

For depicting the molecular architecture of small molecules 
in more detail, two types of models will often be used: space 
filling and ball and stick. These models show structures at 
the atomic level. 


1. Space-Filling Models. The space-filling models are the 
most realistic. The size and position of an atom in a space- 
filling model are determined by its bonding properties and 
van der Waals radius, or contact distance. A van der Waals 
radius describes how closely two atoms can approach each 
other when they are not linked by a covalent bond. The 
colors of the model are set by convention. 


Carbon, black Hydrogen, white Nitrogen, blue 
Oxygen, red Sulfur, yellow Phosphorus, purple 


Space-filling models of several simple molecules are shown 
in Figure 1.24. 


2. Ball-and-Stick Models. Ball-and-stick models are not as 
realistic as space-filling models, because the atoms are depicted 
as spheres of radii smaller than their van der Waals radi. 
However, the bonding arrangement is easier to see because the 
bonds are explicitly represented as sticks. In an illustration, 
the taper of a stick, representing parallax, tells which of a pair 
of bonded atoms is closer to the reader. A ball-and-stick 


model reveals a complex structure more clearly than a space- 
filling model does. Ball-and-stick models of several simple 
molecules are shown in Figure 1.24. 


Water Acetate 


Formamide 


O H 
H,0 H3C ra HN a 
2 36 2N 
\ \ + 
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Molecular models for depicting large molecules will be 
discussed in the appendix to Chapter 2. 


FIGURE 1.24 Molecular representations. 
Structural formulas (bottom), ball-and-stick 
models (top), and space-filling representations 
(middle) of selected molecules are shown. 
Black = carbon, red = oxygen, white = 
hydrogen, yellow = sulfur, blue = nitrogen. 


Cysteine 


biological macromolecule (p. 2) 
metabolite (p. 2) 

deoxyribonucleic acid (DNA) (p. 2) 
protein (p. 2) 

Eukarya (p. 3) 

Bacteria (p. 3) 

Archaea (p. 3) 

eukaryote (p. 3) 


double helix (p. 5) 


prokaryote (p. 3) entropy (p. 11) 


PROBLEMS 


covalent bond (p. 5) 

resonance structure (p. 7) 

ionic interaction (p. 7) 
hydrogen bond (p. 8) 

van der Waals interaction (p. 8) 
hydrophobic effect (p. 9) 
hydrophobic interaction (p. 9) 


enthalpy (p. 11) 

free energy (Gibbs free energy) (p. 12) 
pH (p. 13) 

pK, value (p. 14) 

buffer (p. 15) 

predisposition (p. 18) 

microbiome (p. 19) 

amino acid (p. 21) 

genetic code (p. 21) 


1. Donors and acceptors. Identify the hydrogen-bond 
donors and acceptors in each of the four bases on page 4. 


2. Resonance structures. The structure of an amino acid, tyro- 
sine, is shown here. Draw an alternative resonance structure. 


H 
Ho. 
H 
H 
os 
*H3N COO- 


3. It takes all types. What types of noncovalent bonds hold 
together the following solids? 


(a) Table salt (NaCl), which contains Na* and Cl7 ions. 


(b) Graphite (C), which consists of sheets of covalently 
bonded carbon atoms. 


4. Don’t break the law. Given the following values for the 
changes in enthalpy (AH) and entropy (AS), which of the 
following processes can take place at 298 K without violat- 
ing the Second Law of Thermodynamics? 
(a) AH = —84kJ mol~! (—20 kcal mol7!), 

AS = +125 J mol! K~! (+30 cal mol~! K~*) 
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(b) AH = —84kJ mol~! (—20 kcal mol), 

AS = —125J mol7! K7! (—30 cal mol! K~!) 
(c) AH = +84kJ mol~! (+20 kcal mol), 

AS = +125 J mol~! K~! (+30 cal mol7! K~!) 
(d) AH = +84kJ mol~! (+20 kcal mol~‘), 

AS = —125J mol7! K~! (—30 cal mol! K~1) 


5. Double-helix-formation entropy. For double-helix forma- 
tion, AG can be measured to be —54 kJ mol~! (—13 kcal 
mol~!) at pH 7.0 in 1 M NaCl at 25°C (298 K). The heat 
released indicates an enthalpy change of —251 kJ mol7! 
(—60 kcal mol~!). For this process, calculate the entropy 
change for the system and the entropy change for the 
surroundings. 


6. Find the pH. What are the pH values for the following 
solutions? 


(a) 0.1 M HCl 
(b) 0.1 M NaOH 
(c) 0.05 M HCl 
(d) 0.05 M NaOH 


7. A weak acid. What is the pH of a 0.1 M solution of acetic 
acid (pK, = 4.75)? 


(Hint: Let x be the concentration of Ht ions released 
from acetic acid when it dissociates. The solutions to a 
quadratic equation of the form ax? + bx + c = 0 are 


x =(—-b + Vb? — 4ac)/2a.) 


8. Substituent effects. What is the pH ofa 0.1 M solution of 
chloroacetic acid (CICH, COOH, pK, = 2.86)? 


9. Water in water. Given a density of 1 g/ml and a molecu- 
lar weight of 18 g/mol, calculate the concentration of water 
in water. 


10. Basic fact. What is the pH of a 0.1 M solution of 
ethylamine, given that the pK, of ethylammonium ion 
(CH, CH, NH,") is 10.70? 


11. Comparison. A solution is prepared by adding 0.01 M 
acetic acid and 0.01 M ethylamine to water and adjusting 
the pH to 7.4. What is the ratio of acetate to acetic acid? 
What is the ratio of ethylamine to ethylammonium ion? 


12. Concentrate. Acetic acid is added to water until the pH 
value reaches 4.0. What is the total concentration of the 
added acetic acid? 


13. Dilution. 100 mL of a solution of hydrochloric acid 
with pH 5.0 is diluted to 1 L. What is the pH of the diluted 
solution? 


14. Buffer dilution. 100 mL of a 0.1 mM buffer solution 
made from acetic acid and sodium acetate with pH 5.0 is 
diluted to 1 L. What is the pH of the diluted solution? 


15. Find the pK. For an acid HA, the concentrations of 
HA and A™ are 0.075 and 0.025, respectively, at pH 6.0. 
What is the pK, value for HA? 


16. pH indicator. A dye that is an acid and that appears 
as different colors in its protonated and deprotonated 
forms can be used as a pH indicator. Suppose that you 
have a 0.001 M solution of a dye with a pK, of 7.2. From 
the color, the concentration of the protonated form is 
found to be 0.0002 M. Assume that the remainder of 
the dye is in the deprotonated form. What is the pH of 
the solution? 


17. What’s the ratio? An acid with a pK, of 8.0 is present in 
a solution with a pH of 6.0. What is the ratio of the proton- 
ated to the deprotonated form of the acid? 


18. Phosphate buffer. What is the ratio of the concentra- 
tions of H,PO, and HPO?" at (a) pH 7.0; (b) pH 7.5; 
(c) pH 8.0? 


19. Neutralization of phosphate. Given that phosphoric 
acid (H,PO,) can give up three protons with different pK, 
values, sketch a plot of pH as a function of added drops of 
sodium hydroxide solution, starting with a solution of 
phosphoric acid at pH 1.0. 


20. Buffer capacity. Two solutions of sodium acetate are 
prepared, one with a concentration of 0.1 M and the other 
with a concentration of 0.01 M. Calculate the pH values 
when the following concentrations of HCI have been added 
to each of these solutions: 0.0025 M, 0.005 M, 0.01 M, 
and 0.05 M. 


21. Buffer preparation. You wish to prepare a buffer con- 
sisting of acetic acid and sodium acetate with a total acetic 
acid plus acetate concentration of 250 mM anda pH of 5.0. 
What concentrations of acetic acid and sodium acetate 
should you use? Assuming you wish to make 2 liters of this 
buffer, how many moles of acetic acid and sodium acetate 
will you need? How many grams of each will you need 
(molecular weights: acetic acid 60.05 g mol™!, sodium 
acetate, 82.03 g mol !)? 


22. An alternative approach. When you go to prepare the 
buffer described in Problem 21, you discover that your 
laboratory is out of sodium acetate, but you do have sodium 
hydroxide. How much (in moles and grams) acetic acid and 
sodium hydroxide do you need to make the buffer? 


23. Another alternative. Your friend from another labora- 
tory was out of acetic acid, so tries to prepare the buffer in 
Problem 21 by dissolving 41.02 g of sodium acetate in 
water, carefully adding 180.0 ml of 1 M HCl, and adding 
more water to reach a total volume of 2 liters. What is the 
total concentration of acetate plus acetic acid in the solu- 
tion? Will this solution have pH 5.0? Will it be identical 
with the desired buffer? If not, how will it differ? 


24. Blood substitute. As noted in this chapter, blood con- 
tains a total concentration of phosphate of approximately 
1 mM and typically has a pH of 7.4. You wish to make 100 
liters of phosphate buffer with a pH of 7.4 from NaH,PO, 
(molecular weight, 119.98 g mol~') and Na,HPO, (molec- 
ular weight, 141.96 g mol~!). How much of each (in grams) 
do you need? 


25. A potential problem. You wish to make a buffer with pH 
7.0. You combine 0.060 grams of acetic acid and 14.59 
grams of sodium acetate and add water to yield a total vol- 
ume of 1 liter. What is the pH? Will this be the useful pH 
7.0 buffer you seek? 


26. Charge! Suppose two phosphate groups in DNA (each 
with a charge of —1) are separated by 12 A. What is the 
energy of the ionic interaction between these two phos- 
phates assuming a dielectric constant of 80? Repeat the 
calculation assuming a dielectric constant of 2. 


27. Vive la différence. On average, estimate how many base 
differences there are between two human beings. 
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28. Epigenomics. The human body contains many distinct 
cell types yet almost all human cells contain the same 
genome with 21,000 genes. The distinct cell types are pri- 
marily due to differences in gene expression. Assume that 
one set of 1000 genes is expressed in all cell types and that 
the remaining 20,000 genes can be divided into sets of 1000 
genes that are either all expressed or all not expressed in a 
given cell type. How many different cell types are possible 
if each cell type expresses 10 sets of these genes? Note that 
the number of combinations of n objects into m sets is given 
by n!/(m!(n-m)!) where n! = 1*2*---*(n — 1)*n. 


29. Predispositions in populations. Assume that 10% of the 
members of a population will get a particular disease over 
the course of their lifetime. Genomic studies reveal that 5% 
of the population have sequences in their genomes such 
that their probability of getting the disease over the course 
of their lifetimes is 50%. What is the average lifetime risk of 
this disease for the remaining 95% of the population with- 
out these sequences? 


CHAPTER 


Protein Composition 
and Structure 


Insulin is a protein hormone, crucial for maintaining blood sugar at appropriate 
levels. (Below) Chains of amino acids in a specific sequence (the primary 
structure) define a protein such as insulin. Amino acids close to one another 
within this sequence can fold into regular structures (the secondary structure), 
such as the a-helix. Entire chains fold into well-defined structures (the tertiary 
structure)—in this case, a single insulin molecule. Such structures assemble with 
other chains to form arrays such as the complex of six insulin molecules shown 
at the far right (the quaternary structure). These arrays can often be induced to 
form well-defined crystals (photograph at left), which allows a determination of 
these structures in detail. [Photograph from Christo Nanev.] 


Tertiary 


Quarternary 
structure structure 


OUTLINE 
Ba are the most versatile macromolecules in living systems and 
serve crucial functions in essentially all biological processes. They func- 2-1 Piteing Are Bull fone Reperons 
tion as catalysts, transport and store other molecules such as oxygen, pro- a anne 
vide mechanical support and immune protection, generate movement, 2.2 Primary Structure: Amino Acids 
transmit nerve impulses, and control growth and differentiation. Indeed, Are Linked by Peptide Bonds to 
much of this book will focus on understanding what proteins do and how Form Polypeptide Chains 
they perform these functions. 2.3 Secondary Structure: Polypeptide 
Several key properties enable proteins to participate in a wide range of Chains Can Fold into Regular 
functions. Structures Such As the Alpha 
Helix, the Beta Sheet, and Turns 
1. Proteins are linear polymers built of monomer units called amino acids, and Loops 
which are linked end to end. The sequence of linked amino acids is called 2.4 Tertiary Structure: Water-Soluble 
the primary structure. Remarkably, proteins spontaneously fold up into Proteins Fold into Compact 
three-dimensional structures that are determined by the sequence of amino Structures with Nonpolar Cores 


acids in the protein polymer. Three-dimensional structure formed by 
hydrogen bonds between amino acids near one another is called secondary 
structure, whereas tertiary structure is formed by long-range interactions 
between amino acids. Protein function depends directly on this three- 
dimensional structure (Figure 2.1). Thus, proteins are the embodiment of the 
transition from the one-dimensional world of sequences to the three-dimensional 
world of molecules capable of diverse activities. Many proteins also display 


2.5 Quaternary Structure: Polypeptide 
Chains Can Assemble into 
Multisubunit Structures 


2.6 The Amino Acid Sequence of a 
Protein Determines Its Three- 
Dimensional Structure 
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hk FIGURE 2.1 Structure dictates 
function. A protein component of the DNA 
replication machinery surrounds a section 
of DNA double helix depicted as a cylinder. 
The protein, which consists of two identical 
subunits (shown in red and yellow), acts as 
a clamp that allows large segments of DNA 
to be copied without the replication 
machinery dissociating from the DNA. 
[Drawn from 2POL.pdb.] 


FIGURE 2.2 A complex protein 
assembly. (A) A single muscle cell contains 
multiple myofibrils, each of which is 
comprised of numerous repeats of a 
complex protein assembly known as the 
sarcomere. (B) The banding pattern of a 
sarcomere, evident by electron microscopy, 
is caused by (C) the interdigitation of 
filaments made up of many individual 
proteins. [(B) Courtesy of Dr. Hugh Huxley.] 
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Single muscle fiber (cell) 


Plasma membrane 


quaternary structure, in which the functional protein is 
composed of several distinct polypeptide chains. 


2. Proteins contain a wide range of functional groups. These 
functional groups include alcohols, thiols, thioethers, carbox- 
ylic acids, carboxamides, and a variety of basic groups. Most 
of these groups are chemically reactive. When combined in 
various sequences, this array of functional groups accounts 
for the broad spectrum of protein function. For instance, their 
reactive properties are essential to the function of enzymes, 
the proteins that catalyze specific chemical reactions in bio- 
logical systems (Chapters 8 through 10). 


3. Proteins can interact with one another and with other bio- 
logical macromolecules to form complex assemblies. The proteins 
within these assemblies can act synergistically to generate 
capabilities that individual proteins may lack. Examples of 
these assemblies include macromolecular machines that repli- 
cate DNA, transmit signals within cells, and enable muscle 
cells to contract (Figure 2.2). 


Myofibrils 


Sarcomere 


| band -- A band ——————_>+—_ | band — 


Z line <— H zone —> Z line 


Thick Thin 
filaments filaments 


4. Some proteins are quite rigid, whereas others display considerable flexibility. 
Rigid units can function as structural elements in the cytoskeleton (the internal 
scaffolding within cells) or in connective tissue. Proteins with some flexibility 
may act as hinges, springs, or levers. In addition, conformational changes 
within proteins enable the regulated assembly of larger protein complexes as 
well as the transmission of information within and between cells (Figure 2.3). 


2.1 Proteins Are Built from a Repertoire of 20 Amino Acids 


Amino acids are the building blocks of proteins. An a-amino acid consists of a 
central carbon atom, called the a carbon, linked to an amino group, a carboxylic 
acid group, a hydrogen atom, and a distinctive R group. The R group is often 
referred to as the side chain. With four different groups connected to the tetrahe- 
dral a-carbon atom, a-amino acids are chiral: they may exist in one or the other 
of two mirror-image forms, called the L isomer and the D isomer (Figure 2.4). 


Only L amino acids are constituents of proteins. For almost all amino acids, 

the L isomer has S (rather than R) absolute configuration (Figure 2.5). 

What is the basis for the preference for L amino acids? The answer has been lost 

to evolutionary history. It is possible that the preference for L over D amino acids 

was a consequence of a chance selection. However, there is evidence that 

Lamino acids are slightly more soluble than a racemic mixture of D and L amino 

acids, which tend to form crystals. This small solubility difference could have 
been amplified over time so that the L isomer became dominant in solution. 

Amino acids in solution at neutral pH exist predominantly as dipolar ions 

(also called zwitterions). In the dipolar form, the amino group is protonated 


D isomer 


L isomer 


FIGURE 2.4 The L and D isomers of amino acids. The letter R refers to the side chain. The 
Land D isomers are mirror images of each other. 
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hk FIGURE 2.3 Flexibility and 
function. On binding iron, the protein 
lactoferrin undergoes a substantial change 
in conformation that allows other molecules 
to distinguish between the iron-free and the 
iron-bound forms. [Drawn from 1LFH.pdb 
and 1LFG.pdb.] 


Notation for distinguishing stereoisomers 
The four different substituents of an asymmetric 
carbon atom are assigned a priority according 
to atomic number. The lowest-priority 
substituent, often hydrogen, is pointed away 
from the viewer. The configuration about the 
carbon atom is called S (from the Latin sinister, 
left’) if the progression from the highest to the 
lowest priority is counterclockwise. The 
configuration is called RF (from the Latin rectus, 
“right”) if the progression is clockwise. 


FIGURE 2.5 Only L amino acids are 
found in proteins. Almost all L amino 
acids have an S absolute configuration. 
The counterclockwise direction of the arrow 
from highest- to lowest-priority substituents 
indicates that the chiral center is of the 

S configuration. 
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FIGURE 2.6 lonization state as a 
function of pH. The ionization state of 


amino acids is altered by a change in pH. 


The zwitterionic form predominates near 
physiological pH. 


(—NH,*)and the carboxyl group is deprotonated (—COO° ). The ionization 
state of an amino acid varies with pH (Figure 2.6). In acid solution (e.g., 
pH 1), the amino group is protonated (—NH,”) and the carboxyl group is 
not dissociated (—COOH). As the pH is raised, the carboxylic acid is the 
first group to give up a proton, inasmuch as its pK, is near 2. The dipolar 
form persists until the pH approaches 9, when the protonated amino group 
loses a proton. 


he Zwitterionic form ~— Both groups 
deprotonated 
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pH 


Twenty kinds of side chains varying in size, shape, charge, hydrogen- 
bonding capacity, hydrophobic character, and chemical reactivity are com- 
monly found in proteins. Indeed, all proteins in all species—bacterial, 
archaeal, and eukaryotic—are constructed from the same set of 20 amino 
acids with only a few exceptions. This fundamental alphabet for the con- 
struction of proteins is several billion years old. The remarkable range of 
functions mediated by proteins results from the diversity and versatility of 
these 20 building blocks. Understanding how this alphabet is used to create 
the intricate three-dimensional structures that enable proteins to carry out 
so many biological processes is an exciting area of biochemistry and one that 
we will return to in Section 2.6. 

Although there are many ways to classify amino acids, we will sort these 
molecules into four groups, on the basis of the general chemical characteris- 
tics of their R groups: 


1. Hydrophobic amino acids with nonpolar R groups 


2. Polar amino acids with neutral R groups but the charge is not evenly 
distributed 


3. Positively charged amino acids with R groups that have a positive charge 
at physiological pH 


4. Negatively charged amino acids with R groups that have a negative 
charge at physiological pH 


Hydrophobic amino acids. The simplest amino acid is glycine, which has a 
single hydrogen atom as its side chain. With two hydrogen atoms bonded to the 
a-carbon atom, glycine is unique in being achiral. Alanine, the next simplest 
amino acid, has a methyl group (—CH)) as its side chain (Figure 2.7). 
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FIGURE 2.7 Structures of hydrophobic amino acids. For each amino acid, a 
ball-and-stick model (top) shows the arrangement of atoms and bonds in space. 
A stereochemically realistic formula (middle) shows the geometric arrangement of 
bonds around atoms, and a Fischer projection (bottom) shows all bonds as being 
perpendicular for a simplified representation (see the Appendix to Chapter 1). 
The additional chiral center in isoleucine is indicated by an asterisk. 
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Larger hydrocarbon side chains are found in valine, leucine, and isoleucine. 
“s Methionine contains a largely aliphatic side chain that includes a thioether 
y Sey (—S—) group. The side chain of isoleucine includes an additional chiral 
ya ° center; only the isomer shown in Figure 2.7 is found in proteins. The larger 
+UGN : coo" aliphatic side chains are especially hydrophobic; that is, they tend to cluster 
3 . ’ 
together rather than contact water. The three-dimensional structures of water- 
SH soluble proteins are stabilized by this tendency of hydrophobic groups to come 
| together, which is called the hydrophobic effect (p. 9). The different sizes and 
CH2 shapes of these hydrocarbon side chains enable them to pack together to form 
| Pp yi P g' 
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FIGURE 2.8 Structures of the polar 
amino acids. The additional chiral center 
in threonine is indicated by an asterisk. 


2i 


Indole 


32 


compact structures with little empty space. Proline also has an aliphatic side 
chain, but it differs from other members of the set of 20 in that its side chain is 
bonded to both the nitrogen and the a-carbon atoms, yielding a pyrrolidine 
ring. Proline markedly influences protein architecture because its cyclic struc- 
ture makes it more conformationally restricted than the other amino acids. 
Two amino acids with relatively simple aromatic side chains are part of the 
fundamental repertoire. Phenylalanine, as its name indicates, contains a phe- 
nyl ring attached in place of one of the hydrogen atoms of alanine. Tryptophan 
has an indole group joined to a methylene (—CH_.,—) group; the indole group 
comprises two fused rings containing an NH group. Phenylalanine is purely 
hydrophobic, whereas tryptophan is less so because of its NH group. 


Polar amino acids. Six amino acids are polar but uncharged. Three amino 
acids, serine, threonine, and tyrosine, contain hydroxyl groups (—OH) attached 
to a hydrophobic side chain (Figure 2.8). Serine can be thought of as a version 
of alanine with a hydroxyl group attached, threonine resembles valine with a 


hydroxyl group in place of one of valine’s 
methyl groups, and tyrosine is a version of 
phenylalanine with the hydroxyl group 
replacing a hydrogen atom on the aromatic 
ring. The hydroxyl group makes these 
amino acids much more hydrophilic (water 
loving) and reactive than their hydrophobic 
analogs. Threonine, like isoleucine, con- 
tains an additional asymmetric center; 
again, only one isomer is present in proteins. 

In addition, the set includes asparagine 
and glutamine, two amino acids that con- 
tain a terminal carboxamide. The side 
chain of glutamine is one methylene group 
longer than that of asparagine. 

Cysteine is structurally similar to ser- 
ine but contains a sulfhydryl, or thiol 
(—SH), group in place of the hydroxyl 
(—OH) group. The sulfhydryl group is 
much more reactive. Pairs of sulfhydryl 
groups may come together to form disul- 
fide bonds, which are particularly impor- 
tant in stabilizing some proteins, as will 
be discussed shortly. 


Positively charged amino acids. We turn 
now to amino acids with complete posi- 
tive charges that render them highly 
hydrophilic. Lysine and arginine have long 
side chains that terminate with groups 
that are positively charged at neutral pH. 
Lysine is capped by a primary amino 
group and arginine by a guanidinium 
group. Histidine contains an imidazole 
group, an aromatic ring that also can be 
positively charged (Figure 2.9). 
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FIGURE 2.9 Positively charged amino 
acids lysine, arginine, and histidine. 


With a pK, value near 6, the imidazole group can be uncharged or posi- 
tively charged near neutral pH, depending on its local environment 
(Figure 2.10). Histidine is often found in the active sites of enzymes, where 


the imidazole ring can bind and release protons in the course of enzymatic Ve n eA NCH 
reactions. C 
HN SNH, we 

Negatively charged amino acids. This set of amino acids contains two with eae a, selgaoela 
acidic side chains: aspartic acid and glutamic acid (Figure 2.11). These amino 
acids are charged derivatives of asparagine and glutamine (Figure 2.8), with 
a carboxylic acid in place of a carboxamide. Aspartic acid and glutamic acid 
are often called aspartate and glutamate to emphasize that, at physiological 
pH, their side chains usually lack a proton that is present in the acid form H H 
and hence are negatively charged. Nonetheless, these side chains can accept ss a 
protons in some proteins, often with functionally important consequences. ae + ‘cu a ‘cu 

Seven of the 20 amino acids have readily ionizable side chains. These Wd i nf 
7 amino acids are able to donate or accept protons to facilitate reactions as \ ats \ 
well as to form ionic bonds. Table 2.1 gives equilibria and typical pK, val- H r He * a H r He 
ues for ionization of the side chains of tyrosine, cysteine, arginine, lysine, so cae we a a . 

H uN H 4 


FIGURE 2.10 Histidine ionization. Histidine can bind or release protons near physiological pH. 
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FIGURE 2.11 Negatively charged 


amino acids. 
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TABLE 2.1 Typical pK, values of ionizable groups in proteins 


Group Acid Base Typical pK,* 
: fe) 
Terminal a-carboxyl group I — 321. 
Cl UH OS " 
“~No- “~So 
Aspartic acid : = i 
Glutamic acid C —— m Aril 
FO ae 
i 
Histidine a ns 6.0 
+ 
<i Aol 
H H 
+H 
Terminal a-amino group = NCH —— rn Ne H 8.0 
H H 
Cysteine = a — _—s- 8.3 
; H 
Tyrosine o———— iF 10.9 
+ H 
Lysine Nowy — Te H 10.8 
H H 
H H 
\ \ 
H + N~ H } 
Arginine N= ———s N—C 12S 
\ \ 
wh WH 


*pK, values depend on temperature, ionic strength, and the microenvironment of the ionizable group 


histidine, and aspartic and glutamic acids in proteins. Two other groups in 
proteins—the terminal a-amino group and the terminal a-carboxyl 
group—can be ionized, and typical pK, values for these groups also are 
included in Table 2.1. 

Amino acids are often designated by either a three-letter abbreviation or 
a one-letter symbol (Table 2.2). The abbreviations for amino acids are the 
first three letters of their names, except for asparagine (Asn), glutamine 
(Gln), isoleucine (Ile), and tryptophan (Trp). The symbols for many amino 
acids are the first letters of their names (e.g., G for glycine and L for leucine); 


TABLE 2.2 Abbreviations for amino acids 


Three-letter One-letter Three-letter One-letter 
Aminoacid abbreviation abbreviation} Aminoacid abbreviation abbreviation 
Alanine Ala A Methionine Met M 
Arginine Arg R Phenylalanine Phe F 
Asparagine Asn N Proline Pro P 
Aspartic acid Asp D Serine Ser S 
Cysteine Cys Cc Threonine Thr T 
Glutamine Gln Q Tryptophan Trp W 
Glutamic acid Glu E Tyrosine Tyr Y 
Glycine Gly G Valine Val Vv 
Histidine His H Asparagine or 
Isoleucine Tle I aspartic acid Asx B 
Leucine Leu L Glutamine or 
Lysine Lys K glutamic acid Glx Z 


H2 H, He FIGURE 2.12 Undesirable 


He on UH c—C reactivity in amino 
H J 0 H | \ acids. Some amino acids are 
NK UGE — \_ + 2X unsuitable for proteins because 
, a i of undesirable cyclization. 


Homoserine can cyclize to 
Homoserine form a stable, five-membered 
ring, potentially resulting in 
peptide-bond cleavage. The 


Ho P H> cyclization of serine would form 
H va ao H Cw a strained, four-membered ring 
Ke X LL - and is thus disfavored. X can 
7 Se Ny “se OF HX be an amino group from a 
H tl \\ neighboring amino acid or 
fe) H ) ; 
another potential leaving group. 
Serine 


the other symbols have been agreed on by convention. These abbreviations 
and symbols are an integral part of the vocabulary of biochemists. 


How did this particular set of amino acids become the building 

blocks of proteins? First, as a set, they are diverse: their structural 
and chemical properties span a wide range, endowing proteins with the 
versatility to assume many functional roles. Second, many of these amino 
acids were probably available from prebiotic reactions; that is, from reac- 
tions that took place before the origin of life. Finally, other possible amino 
acids may have simply been too reactive. For example, amino acids such as 
homoserine and homocysteine tend to form five-membered cyclic forms 
that limit their use in proteins; the alternative amino acids that are found in 
proteins—serine and cysteine—do not readily cyclize, because the rings in 
their cyclic forms are too small (Figure 2.12). 


2.2 Primary Structure: Amino Acids Are Linked by 
Peptide Bonds to Form Polypeptide Chains 


Proteins are linear polymers formed by linking the a-carboxyl group of one 
amino acid to the a-amino group of another amino acid. This type of linkage 
is called a peptide bond or an amide bond. The formation of a dipeptide from 
two amino acids is accompanied by the loss ofa water molecule (Figure 2.13). 
The equilibrium of this reaction lies on the side of hydrolysis rather than 
synthesis under most conditions. Hence, the biosynthesis of peptide bonds 
requires an input of free energy. Nonetheless, peptide bonds are quite stable 
kinetically because the rate of hydrolysis is extremely slow; the lifetime of a 
peptide bond in aqueous solution in the absence of a catalyst approaches 
1000 years. 

A series of amino acids joined by peptide bonds form a polypeptide 
chain, and each amino acid unit in a polypeptide is called a residue. A poly- 
peptide chain has directionality because its ends are different: an w-amino group 


R R R fe) 
va ie ip Oe 
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Peptide bond 


FIGURE 2.13 Peptide-bond formation. The linking of two amino acids is accompanied by 
the loss of a molecule of water. 
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Dalton 

A unit of mass very nearly equal to that of a 
hydrogen atom. Named after John Dalton 
(1766-1844), who developed the atomic 
theory of matter. 


Kilodalton (kDa) 
A unit of mass equal to 1000 daltons 
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FIGURE 2.14 Amino acid sequences have direction. This illustration of the pentapeptide 
Tyr-Gly-Gly-Phe-Leu (YGGFL) shows the sequence from the amino terminus to the carboxy! 
terminus. This pentapeptide, Leu-enkephalin, is an opioid peptide that modulates the 
perception of pain. The reverse pentapeptide, Leu-Phe-Gly-Gly-Tyr (LFGGY), is a different 
molecule and has no such effects. 


is present at one end and an a-carboxyl group at the other. The amino 
end is taken to be the beginning of a polypeptide chain; by convention, 
the sequence of amino acids in a polypeptide chain is written starting with 
the amino-terminal residue. Thus, in the polypeptide Tyr-Gly-Gly-Phe- 
Leu (YGGFL), tyrosine is the amino-terminal (N-terminal) residue and 
leucine is the carboxyl-terminal (C-terminal) residue (Figure 2.14). Leu- 
Phe-Gly-Gly-Tyr (LFGGY) is a different polypeptide, with different 
chemical properties. 

A polypeptide chain consists of a regularly repeating part, called the 
main chain or backbone, and a variable part, comprising the distinctive side 
chains (Figure 2.15). The polypeptide backbone is rich in hydrogen-bonding 
potential. Each residue contains a carbonyl group (C=O), which is a good 
hydrogen-bond acceptor, and, with the exception of proline, an NH group, 
which is a good hydrogen-bond donor. These groups interact with each 
other and with functional groups from side chains to stabilize particular 
structures, as will be discussed in Section 2.3. 

Most natural polypeptide chains contain between 50 and 2000 amino 
acid residues and are commonly referred to as proteins. The largest single 
polypeptide known is the muscle protein titin, which consists of more than 
27,000 amino acids. Polypeptide chains made of small numbers of amino 
acids are called oligopeptides or simply peptides. The mean molecular weight 
of an amino acid residue is about 110 g mol~!, and so the molecular weights 
of most proteins are between 5500 and 220,000 g mol” !. We can also refer 
to the mass of a protein, which is expressed in units of 
daltons; one dalton is equal to one atomic mass unit. 


H i i GH i Hd H a A protein with a molecular weight of 50,000 g mol! 
see ers 7 a aa eas é i pe _~ hasamass of 50,000 daltons, or 50 kDa (kilodaltons). 

bi dj “OH i : 4 if In some proteins, the linear polypeptide chain is 

Ro Ry cross-linked. The most common cross-links are disul- 


fide bonds, formed by the oxidation of a pair of cysteine 
residues (Figure 2.16). The resulting unit of two 
linked cysteines is called cystine. Extracellular proteins 


FIGURE 2.15 Components of a polypeptide chain. A polypeptide 
chain consists of a constant backbone (shown in black) and variable 
side chains (shown in green). 


often have several disulfide bonds, whereas intracel- 


lular proteins usually lack them. Rarely, nondisulfide ee LN 
cross-links derived from other side chains are present a Vas ~ 
in proteins. For example, collagen fibers in connective ne * 
tissue are strengthened in this way, as are fibrin blood ” 
clots (Section 10.4). 

Cysteine 
Proteins have unique amino acid sequences 
specified by genes 
In 1953, Frederick Sanger determined the amino acid @ 
sequence of insulin, a protein hormone (Figure 2.17). Hy ft 
This work is a landmark in biochemistry because it showed — ws we 
for the first time that a protein has a precisely defined amino , f 
acid sequence consisting only of L amino acids linked by ce) 
peptide bonds. This accomplishment stimulated other Cysteine 


scientists to carry out sequence studies of a wide variety 


Oxidation 


Reduction 


Cystine 


of proteins. Currently, the complete aminoacidsequences _two cysteine residues is an oxidation reaction. 


of more than 2,000,000 proteins are known. The striking 
fact is that each protein has a unique, precisely defined amino acid sequence. 
The amino acid sequence of a protein is referred to as its primary structure. 


A chain 
Gly-Ile-Val-Glu-Gln-Cys-Cys-Ala-Ser-Val-Cys-Ser-Leu-Tyr-Gln-Leu-Glu-Asn-Tyr-Cys-Asn 
5 10 15 | 21 
§ 


B chain | 


Phe-Val-Asn-Gln-His-Leu-Cys-Gly-Ser-His-Leu-Val-Glu-Ala-Leu-Tyr-Leu-Val-Cys-Gly-Glu-Arg-Gly-Phe-Phe-Tyr-Thr-Pro-Lys-Ala 


5 10 15 20 


FIGURE 2.17 Amino acid sequence of bovine insulin. 


A series of incisive studies in the late 1950s and early 1960s revealed that 
the amino acid sequences of proteins are determined by the nucleotide 
sequences of genes. The sequence of nucleotides in DNA specifies a com- 
plementary sequence of nucleotides in RNA, which in turn specifies the 
amino acid sequence of a protein. In particular, each of the 20 amino acids 
of the repertoire is encoded by one or more specific sequences of three 
nucleotides (Section 4.6). 

Knowing amino acid sequences is important for several reasons. First, 
knowledge of the sequence of a protein is usually essential to elucidating its 
function (e.g., the catalytic mechanism of an enzyme). In fact, proteins with 
novel properties can be generated by varying the sequence of known 
proteins. Second, amino acid sequences determine the three-dimensional 
structures of proteins. The amino acid sequence is the link between the 
genetic message in DNA and the three-dimensional structure that performs 
a protein’s biological function. Analyses of relations between amino acid 
sequences and three-dimensional structures of proteins are uncovering the 
rules that govern the folding of polypeptide chains. Third, alterations in 
amino acid sequence can lead to abnormal protein function and disease. 
Severe and sometimes fatal diseases, such as sickle-cell anemia (Chapter 7) 
and cystic fibrosis, can result from a change in a single amino acid within a 
protein. Fourth, the sequence of a protein reveals much about its evolution- 
ary history (Chapter 6). Proteins resemble one another in amino acid 
sequence only if they have a common ancestor. Consequently, molecular 
events in evolution can be traced from amino acid sequences; molecular 
paleontology is a flourishing area of research. 
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FIGURE 2.16 Cross-links. The formation of a disulfide bond from 
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FIGURE 2.18 Peptide bonds are planar. In a pair of linked 
amino acids, six atoms (C,, C, O, N, H, and C,) lie in a plane. 


Side chains are shown as green balls. 


Polypeptide chains are flexible yet conformationally restricted 


Examination of the geometry of the protein backbone reveals several 
important features. First, the peptide bond is essentially planar (Figure 2.18). 
Thus, for a pair of amino acids linked by a peptide bond, six atoms lie in the 
same plane: the a-carbon atom and CO group of the first amino acid and 
the NH group and a-carbon atom of the second amino acid. The nature of 
the chemical bonding within a peptide accounts for the bond’s 
planarity. The bond resonates between a single bond and a 
double bond. Because of this partial double-bond character, 
rotation about this bond is prevented and thus the conforma- 
tion of the peptide backbone is constrained. 


% q \\+ 
Fe ae <— Pe ata 
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Peptide-bond resonance structures 


The partial double-bond character is also expressed in 
the length of the bond between the CO and the NH groups. 
As shown in Figure 2.19, the C—N distance in a peptide 
bond is typically 1.32 A, which is between the values 
expected for a C—N single bond (1.49 A) and a C=N 
double bond (1.27 A). Finally, the peptide bond is uncharged, allowing 
polymers of amino acids linked by peptide bonds to form tightly packed 
globular structures. 

Two configurations are possible for a planar peptide bond. In the trans 
configuration, the two a-carbon atoms are on opposite sides of the peptide 
bond. In the cis configuration, these groups are on the same side of the pep- 
tide bond. Almost all peptide bonds in proteins are trans. This preference for 
trans over cis can be explained by the fact that steric clashes between groups 
attached to the a-carbon atoms hinder the formation of the cis 
configuration but do not arise in the trans configuration 
(Figure 2.20). By far the most common cis peptide bonds are 
X—Pro linkages. Such bonds show less preference for the trans 
configuration because the nitrogen of proline is bonded to two 
tetrahedral carbon atoms, limiting the steric differences 
between the trans and cis forms (Figure 2.21). 

In contrast with the peptide bond, the bonds between the 
amino group and the a-carbon atom and between the a-carbon 
atom and the carbonyl group are pure single bonds. The two 
adjacent rigid peptide units can rotate about these bonds, taking 
on various orientations. This freedom of rotation about two 


FIGURE 2.19 Typical bond lengths within a peptide bonds of each amino acid allows proteins to fold in many different 
unit. The peptide unit is shown in the trans configuration. ways. The rotations about these bonds can be specified by 


‘ ) 


Trans Cis 


FIGURE 2.20 Trans and cis peptide bonds. The trans form is strongly favored because of 
steric clashes, indicated by the orange semicircles, that arise in the cis form. 


XQ 
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FIGURE 2.21 Trans and cis X-Pro bonds. The energies of these forms are similar to one 
another because steric clashes, indicated by the orange semicircles, arise in both forms. 
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FIGURE 2.22 Rotation about bonds in a polypeptide. The structure of each amino acid in 
a polypeptide can be adjusted by rotation about two single bonds. (A) Phi (¢) is the angle of 
rotation about the bond between the nitrogen and the a-carbon atoms, whereas psi (i) is the 
angle of rotation about the bond between the a-carbon and the carbonyl carbon atoms. 

(B) A view down the bond between the nitrogen and the a-carbon atoms, showing how @ is 
measured. (C) A view down the bond between the a-carbon and the carbonyl carbon atoms, 
showing how yf is measured. 


torsion angles (Figure 2.22). The angle of rotation about the bond between the 
nitrogen and the a-carbon atoms is called phi (#). The angle of rotation 
about the bond between the a-carbon and the carbonyl carbon atoms is 
called psi (y). A clockwise rotation about either bond as viewed from the 
nitrogen atom toward the a-carbon atom or from the a-carbon atom toward 
the carbonyl group corresponds to a positive value. The @ and yf angles 
determine the path of the polypeptide chain. 

Are all combinations of @ and wW possible? Gopalasamudram 
Ramachandran recognized that many combinations are forbidden because 
of steric collisions between atoms. The allowed values can be visualized on 
a two-dimensional plot called a Ramachandran plot (Figure 2.23). Three- 
quarters of the possible (@, ys) combinations are excluded simply by local 
steric clashes. Steric exclusion, the fact that two atoms cannot be in the same 
place at the same time, can be a powerful organizing principle. 

The ability of biological polymers such as proteins to fold into well- 
defined structures is remarkable thermodynamically. An unfolded polymer 
exists as arandom coil: each copy of an unfolded polymer will have a differ- 
ent conformation, yielding a mixture of many possible conformations. 
The favorable entropy associated with a mixture of many conformations 
opposes folding and must be overcome by interactions favoring the folded 
form. Thus, highly flexible polymers with a large number of possible con- 
formations do not fold into unique structures. The rigidity of the peptide 
unit and the restricted set of allowed and angles limits the number of 
structures accessible to the unfolded form sufficiently to allow protein folding 
to take place. 
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Torsion angle 

A measure of the rotation about a bond, 
usually taken to lie between —180 and +180 
degrees. Torsion angles are sometimes called 
dihedral angles. 
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Screw sense 

Describes the direction in which a helical 
structure rotates with respect to its axis. If, 
viewed down the axis of a helix, the chain turns 
in a clockwise direction, it has a right-handed 
screw sense. If the turning is counterclockwise, 
the screw sense is left-handed. 
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FIGURE 2.23 A Ramachandran plot showing the values of ¢ and w#. Not all ¢ and » 
values are possible without collisions between atoms. The most favorable regions are shown 
in dark green; borderline regions are shown in light green. The structure on the right is 
disfavored because of steric clashes. 


2.3 Secondary Structure: Polypeptide Chains Can Fold 
into Regular Structures Such As the Alpha Helix, 
the Beta Sheet, and Turns and Loops 


Can a polypeptide chain fold into a regularly repeating structure? In 1951, 
Linus Pauling and Robert Corey proposed two periodic structures called 
the a@ helix (alpha helix) and the B pleated sheet (beta pleated sheet). 
Subsequently, other structures such as the B turn and omega (Q) loop were 
identified. Although not periodic, these common turn or loop structures are 
well defined and contribute with a helices and B sheets to form the final 
protein structure. Alpha helices, 8 strands, and turns are formed by a regu- 
lar pattern of hydrogen bonds between the peptide N—H and C=O 
groups of amino acids that are near one another in the linear sequence. Such 
folded segments are called secondary structure. 


The alpha helix is a coiled structure stabilized by intrachain 
hydrogen bonds 


In evaluating potential structures, Pauling and Corey considered which con- 
formations of peptides were sterically allowed and which most fully exploited 
the hydrogen-bonding capacity of the backbone NH and CO groups. The 
first of their proposed structures, the a helix, is a rodlike structure 
(Figure 2.24). A tightly coiled backbone forms the inner part of the rod and 
the side chains extend outward in a helical array. The o helix is stabilized by 
hydrogen bonds between the NH and CO groups of the main chain. In par- 
ticular, the CO group of each amino acid forms a hydrogen bond with the 
NH group of the amino acid that is situated four residues ahead in the 
sequence (Figure 2.25). Thus, except for amino acids near the ends of an a 
helix, all the main-chain CO and NH groups are hydrogen bonded. Each resi- 
due is related to the next one by a rise, also called translation, of 1.5 A along 
the helix axis and a rotation of 100 degrees, which gives 3.6 amino acid resi- 
dues per turn of helix. Thus, amino acids spaced three and four apart in the 
sequence are spatially quite close to one another in an @ helix. In contrast, 
amino acids spaced two apart in the sequence are situated on opposite sides 
of the helix and so are unlikely to make contact. The pitch of the a helix is the 
length of one complete turn along the helix axis and is equal to the product of 
the rise (1.5 A) and the number of residues per turn (3.6), or 5.4 A. The screw 


FIGURE 2.24 Structure of the a helix. (A) A ribbon depiction shows the a-carbon atoms 
and side chains (green). (B) A side view of a ball-and-stick version depicts the hydrogen 
bonds (dashed lines) between NH and CO groups. (C) An end view shows the coiled 
backbone as the inside of the helix and the side chains (green) projecting outward. (D) A 
space-filling view of part C shows the tightly packed interior core of the helix. 
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FIGURE 2.25 Hydrogen-bonding scheme for an « helix. In the « helix, the CO group of 
residue / forms a hydrogen bond with the NH group of residue / + 4. 


sense of an @ helix can be right-handed (clockwise) or left-handed (counter- 
clockwise). The Ramachandran plot reveals that both the right-handed and 
the left-handed helices are among allowed conformations (Figure 2.26). 
However, right-handed helices are energetically more favorable because 
there is less steric clash between the side chains and the backbone. Essentially 
all a helices found in proteins ave right-handed. In schematic representations 
of proteins, o helices are depicted as twisted ribbons or rods (Figure 2.27). 

Not all amino acids can be readily accommodated in an a helix. 
Branching at the B-carbon atom, as in valine, threonine, and isoleucine, 
tends to destabilize a helices because of steric clashes. Serine, aspartate, and 
asparagine also tend to disrupt @ helices because their side chains contain 
hydrogen-bond donors or acceptors in close proximity to the main chain, 
where they compete for main-chain NH and CO groups. Proline also is a 
helix breaker because it lacks an NH group and because its ring structure 
prevents it from assuming the ¢ value to fit into an @ helix. 

The a-helical content of proteins ranges widely, from none to almost 
100%. For example, about 75% of the residues in ferritin, a protein that 
helps store iron, are in a helices (Figure 2.28). Indeed, about 25% of all 
soluble proteins are composed of a helices connected by loops and turns of 
the polypeptide chain. Single a helices are usually less than 45 A long. 
Many proteins that span biological membranes also contain « helices. 
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FIGURE 2.26 Ramachandran plot for 
helices. Both right- and left-handed helices 
lie in regions of allowed conformations in the 
Ramachandran plot. However, essentially all 
a helices in proteins are right-handed. 


(A) (B) 


FIGURE 2.27 Schematic views of a 
helices. (A) A ribbon depiction. (B) A 
cylindrical depiction. 


k FIGURE 2.28 A largely a-helical protein. 
Ferritin, an iron-storage protein, is built from a 
bundle of « helices. [Drawn from 1AEW.pdb.] 
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FIGURE 2.29 Ramachandran plot for B 
strands. The red area shows the sterically 
allowed conformations of extended, 
8-strand-like structures. 
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Beta sheets are stabilized by hydrogen bonding between 
polypeptide strands 


Pauling and Corey proposed another periodic structural motif, which they 
named the B pleated sheet (B because it was the second structure that they 
elucidated, the a helix having been the first). The B pleated sheet (or, more 
simply, the B sheet) differs markedly from the rodlike a helix. It is composed 
of two or more polypeptide chains called B strands. A B strand is almost fully 
extended rather than being tightly coiled as in the a helix. A range of 
extended structures are sterically allowed (Figure 2.29). 

The distance between adjacent amino acids along a B strand 1s approxi- 
mately 3.5 A, in contrast with a distance of 1.5 A along an a helix. The side 
chains of adjacent amino acids point in opposite directions (Figure 2.30). 


FIGURE 2.30 Structure of a B strand. The side chains (green) are alternately above and 
below the plane of the strand. 


A 8 sheet is formed by linking two or more £ strands lying next to one 
another through hydrogen bonds. Adjacent strands in a B sheet can run in 
opposite directions (antiparallel 8 sheet) or in the same direction (parallel B 
sheet). In the antiparallel arrangement, the NH group and the CO group of 
each amino acid are respectively hydrogen bonded to the CO group and the 
NH group of a partner on the adjacent chain (Figure 2.31). In the parallel 
arrangement, the hydrogen-bonding scheme is slightly more complicated. 
For each amino acid, the NH group is hydrogen bonded to the CO group of 
one amino acid on the adjacent strand, whereas the CO group is hydrogen 
bonded to the NH group on the amino acid two residues farther along the 
chain (Figure 2.32). Many strands, typically 4 or 5 but as many as 10 or 
more, can come together in B sheets. Such B sheets can be purely antiparal- 
lel, purely parallel, or mixed (Figure 2.33). 


FIGURE 2.31 An antiparallel 8 sheet. Adjacent B strands run in opposite directions, as 
indicated by the arrows. Hydrogen bonds between NH and CO groups connect each amino 
acid to a single amino acid on an adjacent strand, stabilizing the structure. 
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FIGURE 2.32 A parallel sheet. Adjacent B strands run in the same direction, as indicated 
by the arrows. Hydrogen bonds connect each amino acid on one strand with two different 
amino acids on the adjacent strand. 


FIGURE 2.33 Structure of a mixed B sheet. The arrows indicate directionality of each strand. 


In schematic representations, B strands are usually depicted by broad 
arrows pointing in the direction of the carboxyl-terminal end to indicate the 
type of B sheet formed—parallel or antiparallel. More structurally diverse 
than a helices, 8B sheets can be almost flat but most adopt a somewhat 
twisted shape (Figure 2.34). The B sheet is an important structural element 
in many proteins. For example, fatty acid-binding proteins, important for 
lipid metabolism, are built almost entirely from B sheets (Figure 2.35). 


(A) 
k& FIGURE 2.35 A protein rich in B 


FIGURE 2.34 A schematic twisted 8 sheet. (A) A schematic model. (B) The schematic view sheets. The structure of a fatty acid- 
rotated by 90 degrees to illustrate the twist more clearly. binding protein. [Drawn from 1FTP.pdb.] 
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FIGURE 2.36 Structure of a reverse 
turn. The CO group of residue / of the 
polypeptide chain is hydrogen bonded to 
the NH group of residue / + 3 to stabilize 
the turn. 


hk FIGURE 2.37 Loops on a protein 
surface. A part of an antibody molecule 
has surface loops (shown in red) that 
mediate interactions with other molecules. 
[Drawn from 7FAB.pdb.] 


Polypeptide chains can change direction by making reverse 
turns and loops 


Most proteins have compact, globular shapes owing to reversals in the 
direction of their polypeptide chains. Many of these reversals are accom- 
plished by a common structural element called the reverse turn (also known 
as the B turn or hairpin turn), illustrated in Figure 2.36. In many reverse 
turns, the CO group of residue i of a polypeptide is hydrogen bonded to the 
NH group of residue i + 3. This interaction stabilizes abrupt changes in 
direction of the polypeptide chain. In other cases, more-elaborate structures 
are responsible for chain reversals. These structures are called loops or 
sometimes 2 loops (omega loops) to suggest their overall shape. Unlike a 
helices and B strands, loops do not have regular, periodic structures. 
Nonetheless, loop structures are often rigid and well defined (Figure 2.37). 
Turns and loops invariably lie on the surfaces of proteins and thus often 
participate in interactions between proteins and other molecules. 


Fibrous proteins provide structural support for cells and tissues 


Special types of helices are present in the two proteins a-keratin and collagen. 
These proteins form long fibers that serve a structural role. 

a-Keratin, which is an essential component of wool, hair, and skin, con- 
sists of two right-handed « helices intertwined to form a type of left-handed 
superhelix called an a-helical coiled coil. a-Keratin is amember of a superfam- 
ily of proteins referred to as coiled-coil proteins (Figure 2.38). In these proteins, 
two or more @ helices can entwine to form a very stable structure, which can 
havea length of 1000 A (100 nm, or 0.1 xm) or more. There are approximately 
60 members of this family in humans, including intermediate filaments, pro- 
teins that contribute to the cell cytoskeleton (internal scaffolding in a cell), and 
the muscle proteins myosin and tropomyosin (Section 35.2). Members of this 
family are characterized by a central region of 300 amino acids that contains 
imperfect repeats of a sequence of seven amino acids called a heptad repeat. 

The two helices in a-keratin associate with each other by weak interactions 
such as van der Waals forces and ionic interactions. The left-handed supercoil 
alters the two right-handed a helices such that there are 3.5 residues per turn 
instead of 3.6. Thus, the pattern of side-chain interactions can be repeated 
every seven residues, forming the heptad repeats. Two helices with such 
repeats are able to interact with one another if the repeats are complementary 
(Figure 2.39). For example, the repeating residues may be hydrophobic, allow- 
ing van der Waals interactions, or have opposite charge, allowing ionic interac- 
tions. In addition, the two helices may be linked by disulfide bonds formed by 
neighboring cysteine residues. The bonding of the helices accounts for the 
physical properties of wool, an example of an a-keratin. Wool is extensible and 
can be stretched to nearly twice its length because the « helices stretch, breaking 
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®& FIGURE 2.38 An a-helical coiled coil. (A) Space-tfilling model. (B) Ribbon diagram. The 
two helices wind around one another to form a superhelix. Such structures are found in many 
proteins, including keratin in hair, quills, claws, and horns. [Drawn from 1C1G.pdb.] 


the weak interactions between neighboring helices. However, the covalent 
disulfide bonds resist breakage and return the fiber to its original state once the Leucine (Leu) 
stretching force is released. The number of disulfide bond cross-links further residue 
defines the fiber’s properties. Hair and wool, having fewer cross-links, are flex- 
ible. Horns, claws, and hooves, having more cross-links, are much harder. 
A different type of helix is present in collagen, the most abundant protein 
of mammals. Collagen is the main fibrous component of skin, bone, tendon, 
cartilage, and teeth. This extracellular protein is a rod-shaped molecule, about Leu Leu 
3000 A long and only 15 A in diameter. It contains three helical polypeptide 
chains, each nearly 1000 residues long. Glycine appears at every third residue in 
the amino acid sequence, and the sequence glycine-proline-hydroxyproline 
recurs frequently (Figure 2.40). Hydroxyproline is a derivative of proline that has 
a hydroxyl group in place of one of the hydrogen atoms on the pyrrolidine ring. 
The collagen helix has properties different from those of the a helix. 
Hydrogen bonds within a strand are absent. Instead, the helix is stabilized by 
steric repulsion of the pyrrolidine rings of the proline and hydroxyproline vesi- 
dues (Figure 2.41). The pyrrolidine rings keep out of each other’s way when 
the polypeptide chain assumes its helical form, which has about three resi- 
dues per turn. Three strands wind around one another to form a superhelical 
cable that is stabilized by hydrogen bonds between strands. The hydrogen 


bonds form between the peptide NH groups of glycine residues and the CO hk FIGURE 2.39 Heptad repeats in a 
groups of residues on the other chains. The hydroxyl groups of hydroxypro- coiled-coil protein. Every seventh residue 
line residues also participate in hydrogen bonding. in each helix is leucine. The two helices are 


held together by van der Waals interactions 
primarily between the leucine residues. 
[Drawn from 2ZTA.pdb.] 


ine 
-Gly-Pro-Met-Gly-Pro-Ser-Gly-Pro-Arg- 

22 
-Gly-Leu-Hyp-Gly-Pro-Hyp-Gly-Ala-Hyp- 

31 
-Gly-Pro-Gln-Gly-Phe-Gln-Gly-Pro-Hyp- 

40 
FIGURE 2.41 Conformation of a single strand of a collagen triple helix. fo ee 
-Gly-Pro-Met-Gly-Pro-Arg-Gly-Pro-Hyp- 

The inside of the triple-stranded helical cable is very crowded and 58 
-Gly-Pro-Hyp-Gly-Lys-Asn-Gly-Asp-Asp- 


accounts for the requirement that glycine be present at every third position 


on each strand (Figure 2.42A). The only residue that can fit in an interior FIGURE 2.40 Amino acid sequence of 


position is glycine. The amino acid residue on either side of glycine is located a part of a collagen chain. Every 

on the outside of the cable, where there is room for the bulky rings of proline third residue is a glycine. Proline and 
and hydroxyproline residues (Figure 2.42B). hydroxyproline (Hyp) also are alundant. 
(A) (B) 


FIGURE 2.42 Structure of the protein collagen. (A) Space- 
filling model of collagen. Each strand is shown in a different 
color. (B) Cross section of a model of collagen. Each strand 
is hydrogen bonded to the other two strands. The a-carbon 
atom of a glycine residue is identified by the letter G. Every 
third residue must be glycine because there is no space in 
the center of the helix. Notice that the pyrrolidine rings of the 
proline residues are on the outside. 


45 


46 


CHAPTER 2 Protein Composition 
and Structure 


The importance of the positioning of glycine inside the triple helix 

is illustrated in the disorder osteogenesis imperfecta, also known as 
brittle bone disease. In this condition, which can vary from mild to very 
severe, other amino acids replace the internal glycine residue. This replace- 
ment leads to a delayed and improper folding of collagen. The most serious 
symptom is severe bone fragility. Defective collagen in the eyes causes the 
whites of the eyes to have a blue tint (blue sclera). 


2.4 Tertiary Structure: Water-Soluble Proteins Fold into 
Compact Structures with Nonpolar Cores 


Let us now examine how amino acids are grouped together in a complete 
protein. X-ray crystallographic and nuclear magnetic resonance (NMR) 
studies (Section 3.5) have revealed the detailed three-dimensional structures 
of thousands of proteins. We begin here with an examination of myoglobin, 
the first protein to be seen in atomic detail. 

Myoglobin, the oxygen storage protein in muscle, is a single polypeptide 
chain of 153 amino acids (Chapter 7). The capacity of myoglobin to bind 
oxygen depends on the presence of heme, a nonpolypeptide prosthetic 
(helper) group consisting of protoporphyrin IX and a central iron atom. 
Myoglobin is an extremely compact molecule. Its overall dimensions are 
45 x 35 x 25 A, an order of magnitude less than if it were fully stretched 
out (Figure 2.43). About 70% of the main chain is folded into eight a heli- 
ces, and much of the rest of the chain forms turns and loops between helices. 

The folding of the main chain of myoglobin, like that of most other pro- 
teins, is complex and devoid of symmetry. The overall course of the poly- 
peptide chain of a protein is referred to as its tertiary structure. A unifying 
principle emerges from the distribution of side chains. Strikingly, the inte- 
rior consists almost entirely of nonpolar residues such as leucine, valine, 
methionine, and phenylalanine (Figure 2.44). Charged residues such as 
aspartate, glutamate, lysine, and arginine are absent from the inside of myo- 
globin. The only polar residues inside are two histidine residues, which play 
critical roles in binding iron and oxygen. The outside of myoglobin, on the 
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h& FIGURE 2.43 Three-dimensional structure of myoglobin. (A) A ribbon diagram shows 
that the protein consists largely of a helices. (B) A space-filling model in the same orientation 
shows how tightly packed the folded protein is. Notice that the heme group is nestled into a 
crevice in the compact protein with only an edge exposed. One helix is blue to allow 
comparison of the two structural depictions. [Drawn from 1A6N.pdb.] 


other hand, consists of both polar and nonpolar residues. The space-filling 
model shows that there is very little empty space inside. 

This contrasting distribution of polar and nonpolar residues reveals a 
key facet of protein architecture. In an aqueous environment, protein 
folding is driven by the strong tendency of hydrophobic residues to be 
excluded from water. Recall that a system is more thermodynamically 
stable when hydrophobic groups are clustered rather than extended 
into the aqueous surroundings (p. 9). The polypeptide chain therefore folds 
so that its hydrophobic side chains are buried and its polar, charged chains 
are on the surface. Many a helices and B strands are amphipathic; that is, 
the a helix or B strand has a hydrophobic face, which points into the 
protein interior, and a more polar face, which points into solution. The 
fate of the main chain accompanying the hydrophobic side chains is 
important, too. An unpaired peptide NH or CO group markedly prefers 
water to a nonpolar milieu. The secret of burying a segment of main 
chain in a hydrophobic environment is to pair all the NH and CO groups 
by hydrogen bonding. This pairing is neatly accomplished in an « helix 
or B sheet. Van der Waals interactions between tightly packed hydrocar- 
bon side chains also contribute to the stability of pro- 
teins. We can now understand why the set of 20 amino 
acids contains several that differ subtly in size and 
shape. They provide a palette from which to choose to 
fill the interior of a protein neatly and thereby maxi- 
mize van der Waals interactions, which require inti- 
mate contact. 

Some proteins that span biological membranes are 
“the exceptions that prove the rule” because they have 
the reverse distribution of hydrophobic and hydrophilic 
amino acids. For example, consider porins, proteins 
found in the outer membranes of many bacteria 
(Figure 2.45). Membranes are built largely of hydropho- 
bic alkane chains (Section 12.2). Thus, porins are cov- 
ered on the outside largely with hydrophobic residues 
that interact with the neighboring alkane chains. In 
contrast, the center of the protein contains many charged 
and polar amino acids that surround a water-filled chan- 
nel running through the middle of the protein. Thus, 
because porins function in hydrophobic environments, 
they are “inside out” relative to proteins that function in 
aqueous solution. 


Water-filled 


hydrophilic channel 


k& FIGURE 2.44 Distribution of amino 
acids in myoglobin. (A) A space-filling 
model of myoglobin with hydrophobic 
amino acids shown in yellow, charged 
amino acids shown in blue, and others 
shown in white. Notice that the surface of 
the molecule has many charged amino 
acids, as well as some hydrophobic amino 
acids. (B) In this cross-sectional view, 
notice that mostly hydrophobic amino acids 
are found on the inside of the structure, 
whereas the charged amino acids are 
found on the protein surface. [Drawn from 
1MBD.pdb.] 


hk FIGURE 2.45 “Inside out” amino 
acid distribution in porin. The outside of 
porin (which contacts hydrophobic groups 
in membranes) is covered largely with 
hydrophobic residues, whereas the center 
includes a water-filled channel lined with 
charged and polar amino acids. [Drawn 
from 1PRN.pdb.] 
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h& FIGURE 2.46 The helix-turn-helix 
motif, a supersecondary structural 
element. Helix-turn-helix motifs are found 


in many DNA-binding proteins. [Drawn from 
1LMB.pdb.] 


Certain combinations of secondary structure are present in many pro- 
teins and frequently exhibit similar functions. These combinations are 
called motifs or supersecondary structures. For example, an o helix separated 
from another a helix by a turn, called a helix-turn-helix unit, is found in 
many proteins that bind DNA (Figure 2.46). 

Some polypeptide chains fold into two or more compact regions that 
may be connected by a flexible segment of polypeptide chain, rather like 
pearls on a string. These compact globular units, called domains, range in 
size from about 30 to 400 amino acid residues. For example, the extracel- 
lular part of CD4, a protein on the surface of certain cells of the immune 
system (Section 34.4), comprises four similar domains of approximately 
100 amino acids each (Figure 2.47). Proteins may have domains in common 
even if their overall tertiary structures are different. 


h& FIGURE 2.47 Protein domains. The cell-surface protein CD4 consists of four similar 
domains. [Drawn from 1WIO.pdb.] 


2.5 Quaternary Structure: Polypeptide Chains Can 
Assemble into Multisubunit Structures 


Four levels of structure are frequently cited in discussions of protein 
architecture. So far, we have considered three. Primary structure is the 
amino acid sequence. Secondary structure refers to the spatial arrange- 
ment of amino acid residues that are nearby in the sequence. Some of 
these arrangements give rise to periodic structures. The a helix and B 
strand are elements of secondary structure. Tertiary structure refers to the 
spatial arrangement of amino acid residues that are far apart 
in the sequence and to the pattern of disulfide bonds. We 
now turn to proteins containing more than one polypeptide 
chain. Such proteins exhibit a fourth level of structural 
organization. Each polypeptide chain in such a protein is 
called a subunit. Quaternary structure refers to the spatial 
arrangement of subunits and the nature of their interac- 
tions. The simplest sort of quaternary structure is a dimer, 
consisting of two identical subunits. This organization is 
present in the DNA-binding protein Cro, found in a 
bacterial virus called \ (Figure 2.48). More-complicated 


hk FIGURE 2.48 Quaternary structure. The Cro protein of quaternary structures also are common. More than one type 
bacteriophage d is a dimer of identical subunits. [Drawn from of subunit can be present, often in variable numbers. For 
5CRO.pdb ] 


example, human hemoglobin, the oxygen-carrying protein 

in blood, consists of two subunits of one type (designated a) 
and two subunits of another type (designated §), as illustrated in Figure 
2.49. Thus, the hemoglobin molecule exists as an a8, tetramer. Subtle 
changes in the arrangement of subunits within the hemoglobin molecule 
allow it to carry oxygen from the lungs to tissues with great efficiency 
(Chapter 7). 


FIGURE 2.49 The af, tetramer of human hemoglobin. The structure of the two a 
subunits (red) is similar to but not identical with that of the two B subunits (yellow). The 
molecule contains four heme groups (gray with the iron atom shown in purple). (A) The ribbon 
diagram highlights the similarity of the subunits and shows that they are composed mainly of 
a helices. (B) The space-filling model illustrates how the heme groups occupy crevices in the 
protein. [Drawn from 1A3N.pdb.] 


Viruses make the most of a limited amount of genetic information by 
forming coats that use the same kind of subunit repetitively in a symmetric 
array. The coat of rhinovirus, the virus that causes the common cold, 
includes 60 copies of each of four subunits (Figure 2.50). The subunits come 
together to form a nearly spherical shell that encloses the viral genome. 


2.6 The Amino Acid Sequence of a Protein Determines Its 
Three-Dimensional Structure 


How is the elaborate three-dimensional structure of proteins attained? The 
classic work of Christian Anfinsen in the 1950s on the enzyme ribonuclease 
revealed the relation between the amino acid sequence of a protein and its 
conformation. Ribonuclease is a single polypeptide chain consisting of 124 
amino acid residues cross-linked by four disulfide bonds (Figure 2.51). 
Anfinsen’s plan was to destroy the three-dimensional structure of the enzyme 
and to then determine what conditions were required to restore the structure. 
Agents such as urea or guanidinium chloride effectively disrupt a protein’s 
noncovalent bonds. Although the mechanism of action of these agents is not 
fully understood, computer simulations suggest that they replace water as the 
molecule solvating the protein and are then able to disrupt the van der Waals 
interactions stabilizing the protein structure. The disulfide bonds can be 
cleaved reversibly by reducing them with a reagent such as 8-mercaptoethanol 
(Figure 2.52). In the presence of a large excess of B-mercaptoethanol, the 
disulfides (cystines) are fully converted into sulfhydryls (cysteines). 
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FIGURE 2.52 Role of B-mercaptoethanol in reducing disulfide bonds. Note that, as the 
disulfides are reduced, the B-mercaptoethanol is oxidized and forms dimers. 


FIGURE 2.50 Complex quaternary 
structure. The coat of human rhinovirus, 
the cause of the common cold, comprises 
60 copies of each of four subunits. The 
three most prominent subunits are shown 
as different colors. 


FIGURE 2.51 Amino acid sequence of 
bovine ribonuclease. The four disulfide 
bonds are shown in color. [After C. H. W. 
Hirs, S. Moore, and W. H. Stein, J. Biol. 
Chem. 235:633-647, 1960.] 


) NH2 

a 
H)N~ NH) H)N* NH> 
Urea Guanidinium chloride 
H2 
TO gH 
H2 

B-Mercaptoethanol 

49 


FIGURE 2.53 Reduction and denaturation 
of ribonuclease. 


Scrambled ribonuclease 
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FIGURE 2.54 Reestablishing correct 
disulfide pairing. Native ribonuclease can be 
re-formed from scrambled ribonuclease in the 
presence of a trace of B-mercaptoethanol. 
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Native ribonuclease Denatured reduced ribonuclease 


Most polypeptide chains devoid of cross-links assume a random-coil 
conformation in 8 M urea or 6 M guanidinium chloride. When ribonuclease 
was treated with B-mercaptoethanol in 8 M urea, the product was a fully 
reduced, randomly coiled polypeptide chain devoid of enzymatic activity. 
When a protein is converted into a randomly coiled peptide without its 
normal activity, it is said to be denatured (Figure 2.53). 

Anfinsen then made the critical observation that the denatured ribonu- 
clease, freed of urea and B-mercaptoethanol by dialysis (Section 3.1), slowly 
regained enzymatic activity. He perceived the significance of this chance 
finding: the sulfhydryl groups of the denatured enzyme became oxidized by 
air, and the enzyme spontaneously refolded into a catalytically active form. 
Detailed studies then showed that nearly all the original enzymatic activity 
was regained if the sulfhydryl groups were oxidized under suitable condi- 
tions. All the measured physical and chemical properties of the refolded 
enzyme were virtually identical with those of the native enzyme. These 
experiments showed that the information needed to specify the catalytically 
active structure of ribonuclease is contained in its amino acid sequence. 
Subsequent studies have established the generality of this central principle 
of biochemistry: sequence specifies conformation. The dependence of confor- 
mation on sequence is especially significant because of the intimate connec- 
tion between conformation and function. 

A quite different result was obtained when reduced ribonuclease was 
reoxidized while it was still in 8 M urea and the preparation was then dia- 
lyzed to remove the urea. Ribonuclease reoxidized in this way had only 1% 
of the enzymatic activity of the native protein. Why were the outcomes so 
different when reduced ribonuclease was reoxidized in the presence and 
absence of urea? The reason is that the wrong disulfides formed pairs in 
urea. There are 105 different ways of pairing eight cysteine molecules to 
form four disulfides; only one of these combinations is enzymatically active. 
The 104 wrong pairings have been picturesquely termed “scrambled” 
ribonuclease. Anfinsen found that scrambled ribonuclease spontaneously 
converted into fully active, native ribonuclease when trace amounts of 
B-mercaptoethanol were added to an aqueous solution of the protein 
(Figure 2.54). The added B-mercaptoethanol catalyzed the rearrangement 
of disulfide pairings until the native structure was regained in about 10 
hours. This process was driven by the decrease in free energy as the scrambled 
conformations were converted into the stable, native conformation of the 
enzyme. The native disulfide pairings of ribonuclease thus contribute to the 
stabilization of the thermodynamically preferred structure. 

Similar refolding experiments have been performed on many other pro- 
teins. In many cases, the native structure can be generated under suitable 
conditions. For other proteins, however, refolding does not proceed 
efficiently. In these cases, the unfolded protein molecules usually become 
tangled up with one another to form aggregates. Inside cells, proteins called 
chaperones block such undesirable interactions. Additionally, it is now 


evident that some proteins do not assume a defined structure until they 
interact with molecular partners, as we will see shortly. 


Amino acids have different propensities for forming « helices, 
B sheets, and turns 


How does the amino acid sequence of a protein specify its three-dimensional 
structure? How does an unfolded polypeptide chain acquire the form of the 
native protein? These fundamental questions in biochemistry can be 
approached by first asking a simpler one: What determines whether a par- 
ticular sequence in a protein forms an a helix, a B strand, or a turn? One 
source of insight is to examine the frequency of occurrence of particular 
amino acid residues in these secondary structures (Table 2.3). Residues 
such as alanine, glutamate, and leucine tend to be present in a helices, 
whereas valine and isoleucine tend to be present in B strands. Glycine, 
asparagine, and proline are more commonly observed in turns. 

Studies of proteins and synthetic peptides have revealed some reasons 
for these preferences. Branching at the B-carbon atom, as in valine, threo- 
nine, and isoleucine, tends to destabilize a helices because of steric clashes. 
These residues are readily accommodated in B strands, where their side 
chains project out of the plane containing the main chain. Serine and aspar- 
agine tend to disrupt a helices because their side chains contain hydrogen- 
bond donors or acceptors in close proximity to the main chain, where they 
compete for main-chain NH and CO groups. Proline tends to disrupt both 
« helices and B strands because it lacks an NH group and because its ring 
structure restricts its @ value to near 60 degrees. Glycine readily fits into 
all structures, but its conformational flexibility renders it well-suited to 
reverse turns. 

Can we predict the secondary structure of a protein by using this 
knowledge of the conformational preferences of amino acid residues? 


TABLE 2.3 Relative frequencies of amino acid residues in secondary structures 


Amino acid a helix B sheet Reverse turn 
Glu 1.59 0.52 1.01 
Ala 1.41 0.72 0.82 
Leu 1.34 1.22 0.57 
Met 10) 1.14 0.52 
Gln 1.27 0.98 0.84 
Lys 1E23) 0.69 1.07 
Arg 1.21 0.84 0.90 
His 1.05 0.80 0.81 
Val 0.90 1.87 0.41 
Tle 1.09 1.67 0.47 
Tyr 0.74 1.45 0.76 
Cys 0.66 1.40 0.54 
Trp 1.02 1.35 0.65 
Phe 1.16 133) 0.59 
Thr 0.76 1.17 0.96 
Gly 0.43 0.58 Waa 
Asn 0.76 0.48 1.34 
Pro 0.34 0.31 2 
Ser 0.57 0.96 1.22 
Asp 0.99 0.39 1.24 


Note: The amino acids are grouped according to their preference for « helices (top group), B sheets (middle 
group), or turns (bottom group). 

Source: T. E. Creighton, Proteins: Structures and Molecular Properties, 2d ed. (W. H. Freeman and Company, 
1992), p. 256. 
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Accurate predictions of secondary structure adopted by even a 
short stretch of residues have proved to be difficult. What 
stands in the way of more-accurate prediction? Note that the 
conformational preferences of amino acid residues are not 
tipped all the way to one structure (Table 2.3). For example, 
glutamate, one of the strongest helix formers, prefers a helix to 
8 strand by only a factor of three. The preference ratios of most 
other residues are smaller. Indeed, some penta- and hexapeptide 
sequences have been found to adopt one structure in one protein 
and an entirely different structure in another (Figure 2.55). 
Hence, some amino acid sequences do not uniquely determine 
secondary structure. Tertiary interactions—interactions 
between residues that are far apart in the sequence—may be 
decisive in specifying the secondary structure of some segments. 


FIGURE 2.55 Alternative conformations of a peptide Context is often crucial: the conformation of a protein has 


sequence. Many sequences can adopt alternative 


conformations in different proteins. Here the sequence 
VDLLKN shown in red assumes an « helix in one protein 


evolved to work in a particular environment. Nevertheless, sub- 
stantial improvements in secondary structure prediction have 


context (left) and a 8 strand in another (right). [Drawn from been achieved by using families of related sequences, each of 


(left) 83WRP.pdb and (right) 2HLA.pdb.] 
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FIGURE 2.56 Transition from folded to 
unfolded state. Most proteins show a 
sharp transition from the folded to the 
unfolded form on treatment with increasing 


concentrations of denaturants. 
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FIGURE 2.57 Components of a partly denatured protein 
solution. In a half-unfolded protein solution, half the molecules are 


fully folded and half are fully unfolded. 
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which adopts the same structure. 


Protein folding is a highly cooperative process 


Proteins can be denatured by any treatment that disrupts the weak bonds 
stabilizing tertiary structure, such as heating, or by chemical denaturants 
such as urea or guanidinium chloride. For many proteins, a comparison of 
the degree of unfolding as the concentration of denaturant increases reveals 
a sharp transition from the folded, or native, form to the unfolded, or dena- 
tured form, suggesting that only these two conformational states are pres- 
ent to any significant extent (Figure 2.56). A similar sharp transition is 
observed if denaturants are removed from unfolded proteins, allowing the 
proteins to fold. 

The sharp transition seen in Figure 2.56 suggests that protein folding 
and unfolding is an “all or none” process that results from a cooperative tran- 
sition. For example, suppose that a protein is placed in conditions under 
which some part of the protein structure is thermodynamically unstable. As 
this part of the folded structure is disrupted, the interactions between it and 
the remainder of the protein will be lost. The loss of these interactions, in 
turn, will destabilize the remainder of the structure. Thus, 
conditions that lead to the disruption of any part of a 
protein structure are likely to unravel the protein com- 
pletely. The structural properties of proteins provide a 
clear rationale for the cooperative transition. 

The consequences of cooperative folding can be illus- 
trated by considering the contents of a protein solution 
under conditions corresponding to the middle of the tran- 
sition between the folded and the unfolded forms. Under 
these conditions, the protein is “half folded.” Yet the solu- 
tion will appear to have no partly folded molecules but, 
instead, look as if it is a 50/50 mixture of fully folded and 
fully unfolded molecules (Figure 2.57). Although the pro- 
tein may appear to behave as if it exists in only two states, 
this simple two-state existence is an impossibility at a 
molecular level. Even simple reactions go through reac- 
tion intermediates, and so a complex molecule such as a 
protein cannot simply switch from a completely unfolded 
state to the native state in one step. Unstable, transient 


intermediate structures must exist between the native and denatured state 
(p. 54). Determining the nature of these intermediate structures is an area of 
intense biochemical research. 


Proteins fold by progressive stabilization of intermediates rather than 
by random search 


How does a protein make the transition from an unfolded structure to a 
unique conformation in the native form? One possibility a priori would 
be that all possible conformations are sampled to find the energetically 
most favorable one. How long would such a random search take? 
Consider a small protein with 100 residues. Cyrus Levinthal calculated 
that, if each residue can assume three different conformations, the total 
number of structures would be 3!%, which is equal to 5 X 104”. If it takes 
10~13 5 to convert one structure into another, the total search time would 
be 5 X 104” X 10713 s, which is equal to 5 X 10°4s, or 1.6 X 1077 years. 
In reality, small proteins can fold in less than a second. Clearly, it would 
take much too long for even a small protein to fold properly by randomly 
trying out all possible conformations. The enormous difference between 
calculated and actual folding times is called Levinthal’s paradox. This 
paradox clearly reveals that proteins do not fold by trying every possible 
conformation; instead, they must follow at least a partly defined folding 
pathway consisting of intermediates between the fully denatured protein 
and its native structure. 

The way out of this paradox is to recognize the power of cumulative selec- 
tion. Richard Dawkins, in The Blind Watchmaker, asked how long it would 
take a monkey poking randomly at a typewriter to reproduce Hamlet’s 
remark to Polonius, “Methinks it is like a weasel” (Figure 2.58). An astro- 
nomically large number of keystrokes, on the order of 104°, would be 
required. However, suppose that we preserved each correct character and 
allowed the monkey to retype only the wrong ones. In this case, only a few 
thousand keystrokes, on average, would be needed. The crucial difference 
between these cases is that the first employs a completely random search, 
whereas, in the second, partly correct intermediates ave retained. 

The essence of protein folding is the tendency to retain partly correct inter- 
mediates. However, the protein-folding problem is much more difficult 
than the one presented to our simian Shakespeare. First, the criterion of 
correctness is not a residue-by-residue scrutiny of conformation by an 
omniscient observer but rather the total free energy of the transient species. 
Second, proteins are only marginally stable. The free-energy difference 
between the folded and the unfolded states of a typical 100-residue protein 
is 42 kJ mol~! (10 kcal mol‘), and thus each residue contributes on aver- 
age only 0.42 kJ mol! (0.1 kcal mol~') of energy to maintain the folded 
state. This amount is less than the amount of thermal energy, which is 
2.5 kJ mol! (0.6 kcal mol~'!) at room temperature. This meager stabilization 
energy means that correct intermediates, especially those formed early in 
folding, can be lost. The analogy is that the monkey would be somewhat 
free to undo its correct keystrokes. Nonetheless, the interactions that lead 
to cooperative folding can stabilize intermediates as structure builds up. 
Thus, local regions that have significant structural preference, though not 
necessarily stable on their own, will tend to adopt their favored structures 
and, as they form, can interact with one other, leading to increasing stabi- 
lization. This conceptual framework 1s often referred to as the nucleation- 
condensation model. 

A simulation of the folding of a protein, based on the nucleation- 
condensation model, is shown in Figure 2.59. This model suggests that 
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FIGURE 2.58 Typing-monkey analogy. A 
monkey randomly poking a typewriter could 
write a line from Shakespeare’s Hamiet, 
provided that correct keystrokes were 
retained. In the two computer simulations 
shown, the cumulative number of keystrokes 
is given at the left of each line. 
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and Structure 


(1) (2) (3) (4) (5) 
Denatured Native 


FIGURE 2.59 Proposed folding pathway of chymotrypsin inhibitor. Local regions with 
sufficient structural preference tend to adopt their favored structures initially (1). These 
structures come together to form a nucleus with a nativelike, but still mobile, structure (4). This 
structure then fully condenses to form the native, more rigid structure (5). [From A. R. Fersht 
and V. Daggett. Cell 108:573-582, 2002; with permission from Elsevier.] 


certain pathways may be preferred. Although Figure 2.59 suggests a dis- 
crete pathway, each of the intermediates shown represents an ensemble of 
similar structures, and thus a protein follows a general rather than a precise 
pathway in its transition from the unfolded to the native state. The energy 
surface for the overall process of protein folding can be visualized as a fun- 
nel (Figure 2.60). The wide rim of the funnel represents the wide range of 
structures accessible to the ensemble of denatured protein molecules. As 
the free energy of the population of protein molecules decreases, the pro- 
teins move down into narrower parts of the funnel and fewer conforma- 
tions are accessible. At the bottom of the funnel is the folded state with its 

well-defined conformation. Many paths can lead to this same 


Beginning of helix formation and collapse energy minimum. 


Energy 


FIGURE 2.60 Folding funnel. The folding funnel depicts the 
thermodynamics of protein folding. The top of the funnel 
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Prediction of three-dimensional structure from sequence 
remains a great challenge 


The prediction of three-dimensional structure from sequence 
has proved to be extremely difficult. The local sequence appears 
to determine only between 60 and 70% of the secondary struc- 
ture; long-range interactions are required to stabilize the full 
secondary structure and the tertiary structure. 

Investigators are exploring two fundamentally different 
approaches to predicting three-dimensional structure from 


alent ot amino acid sequence. The first is ab initio (Latin, “from the 
protein in native beginning”) prediction, which attempts to predict the folding 
conformation of an amino acid sequence without prior knowledge about 


similar sequences in known protein structures. Computer- 
based calculations are employed that attempt to minimize the 
free energy of a structure with a given amino acid sequence or 
to simulate the folding process. The utility of these methods is 
limited by the vast number of possible conformations, the 
marginal stability of proteins, and the subtle energetics of 
weak interactions in aqueous solution. The second approach 
takes advantage of our growing knowledge of the three-dimen- 
sional structures of many proteins. In these knowledge-based 


—~ 100 


represents all possible denatured conformations —that is, methods, an amino acid sequence of unknown structure 1s 
maximal conformational entropy. Depressions on the sides of examined for compatibility with known protein structures or 
the funnel represent semistable intermediates that can facilitate fragments therefrom. If a significant match is detected, the 


or hinder the formation of the native structure, depending on 
their depth. Secondary structures, such as helices, form and 
collapse onto one another to initiate folding. [After D. L. Nelson 


known structure can be used as an initial model. Knowledge- 
based methods have been a source of many insights into the 


and M. M. Gox, Lehninger Principles of Biochemistry, 5th ed. three-dimensional conformation of proteins of known sequence 
(W. H. Freeman and Company, 2008), p. 143.] 


but unknown structure. 


Some proteins are inherently unstructured and can exist 
in multiple conformations 


The discussion of protein folding thus far is based on the paradigm that a given 
protein amino acid sequence will fold into a particular three-dimensional struc- 
ture. This paradigm holds well for many proteins. However, it has been known 
for some time that some proteins can adopt two different structures, only one 
of which results in protein aggregation and pathological conditions (p. 56). 
Such alternate structures originating from a unique amino acid sequence were 
thought to be rare, the exception to the paradigm. Recent work has called into 
question the universality of the idea that each amino acid sequence gives rise to 
one structure for certain proteins, even under normal cellular conditions. 

Our first example is a class of proteins referred to as intrinsically unstructured 
proteins ([UPs). As the name suggests, these proteins, completely or in part, do 
not have a discrete three-dimensional structure under physiological conditions. 
Indeed, an estimated 50% of eukaryotic proteins have at least one unstructured 
region greater than 30 amino acids in length. Unstructured regions are rich in 
charged and polar amino acids with few hydrophobic residues. These proteins 
assume a defined structure on interaction with other proteins. This molecular 
versatility means that one protein can assume different structures and interact 
with the different partners, yielding different biochemical functions. [UPs 
appear to be especially important in signaling and regulatory pathways. 

Another class of proteins that do not adhere to the paradigm is metamorphic 
proteins. These proteins appear to exist in an ensemble of structures of 
approximately equal energy that are in equilibrium. Small molecules or 
other proteins may bind to different members of the ensemble, resulting in 
various complexes, each having a different biochemical function. An espe- 
cially clear example of a metamorphic protein is the chemokine lymphotac- 
tin. Chemokines are small signaling proteins in the immune system that 
bind to receptor proteins on the surface of immune-system cells, instigating 
an immunological response. Lymphotactin exists in two very different 
structures that are in equilibrium (Figure 2.61). One structure is a character- 
istic of chemokines, consisting of a three-stranded B sheet and a carboxyl- 
terminal helix. This structure binds to its receptor and activates it. The 
alternative structure is an identical dimer of all B sheets. When in this struc- 
ture, lymphotactin binds to glycosaminoglycan, a complex carbohydrate 
(Chapter 11). The biochemical activities of each structure are mutually 
exclusive: the chemokine structure cannot bind the glycosaminoglycan, and 
the B-sheet structure cannot activate the receptor. Yet, remarkably, both 
activities are required for full biological activity of the chemokine. 


N 
Chemokine Glycosaminoglycan-binding 
structure structure 


FIGURE 2.61 Lymphotactin exists in two conformations, which are in equilibrium. 
[R. L. Tuinstra, F. C. Peterson, S. Kutlesa, E. S. Elgin, M. A. Kron, and B. F. Volkman. Proc. Natl. 
Sci. U.S.A. 105:5057-5062, 2008, Fig. 2A.] 
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Note that IUPs and metamorphic proteins effectively expand the 
protein-encoding capacity of the genome. In some cases, a gene can encode 
a single protein that has more than one structure and function. These 
examples also illustrate the dynamic nature of the study of biochemistry 
and its inherent excitement: even well-established ideas are often subject 
to modifications. 


Protein misfolding and aggregation are associated with some 
neurological diseases 


Understanding protein folding and misfolding is of more than aca- 
demic interest. A host of diseases, including Alzheimer disease, 
Parkinson disease, Huntington disease, and transmissible spongiform 
encephalopathies (prion disease), are associated with improperly folded 
proteins. All of these diseases result in the deposition of protein aggregates, 
called amyloid fibrils or plaques. These diseases are consequently referred to 
as amyloidoses. A common feature of amyloidoses is that normally soluble 
proteins are converted into insoluble fibrils rich in B sheets. The correctly 
folded protein is only marginally more stable than the incorrect form. But 
the incorrect form aggregates, pulling more correct forms into the incorrect 
form. We will focus on the transmissible spongiform encephalopathies. 
One of the great surprises in modern medicine was that certain infectious 
neurological diseases were found to be transmitted by agents that were similar 
in size to viruses but consisted only of protein. These diseases include bovine 
spongiform encephalopathy (commonly referred to as mad cow disease) and the 
analogous diseases in other organisms, including Creutzfeld—Jakob disease 
(CJD) in human beings, scrapie in sheep, and chronic wasting disease in deer 
and elk. The agents causing these diseases are termed prions. Prions are com- 
posed largely, if not exclusively, of a cellular protein called PrP, which is nor- 
mally present in the brain; its function is still the focus of active research. The 
infectious prions are aggregated forms of the PrP protein termed PrP“. 
How does the structure of the protein in the aggregated form differ from 
that of the protein in its normal state in the brain? The normal cellular protein 
PrP contains extensive regions of a helix and relatively little B strand. The 
structure of the form of PrP present in infected brains, termed PrP*©, has not 
yet been determined because of challenges posed by its insoluble and hetero- 
geneous nature. However, a variety of evidence indicates that some parts of 
the protein that had been in a-helical or turn conformations have been con- 
verted into B-strand conformations (Figure 2.62). The 
B strands of largely planar monomers stack on one 
another with their side chains tightly interwoven. A 
side view shows the extensive network of hydrogen 


bonds between the monomers. These fibrous protein 
aggregates are often referred to as amyloid forms. 
With the realization that the infectious agent in 
prion diseases is an aggregated form of a protein that is 
already present in the brain, a model for disease trans- 
mission emerges (Figure 2.63). Protein aggregates built 
of abnormal forms of PrP*€ act as sites of nucleation to 
which other PrP molecules attach. Prion diseases can 


FIGURE 2.62 A model of the human prion protein amyloid. thus be transferred from one individual organism to 
A detailed model of a human prion amyloid fibril deduced from spin another through the transfer of an aggregated nucleus, 


labeling and electron paramagnetic resonance (EPR) spectroscopy 
studies shows that protein aggregation is due to the formation of large 
parallel B sheets. The black arrow indicates the long axis of the fibril. 


as likely happened in the mad cow disease outbreak in 
the United Kingdom that emerged in the late 1980s. 


[N. J. Cobb, F. D. Sénnichsen, H. Mchaourab, and W. K. Surewicz. Cattle fed on animal feed containing material from 
Proc. Natl, Acad. Sci. U.S.A. 104: 18946-18951, 2007, Fig. 4] diseased cows developed the disease in turn. 
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Amyloid fibers are also seen in the brains of patients with certain noninfec- 
tious neurodegenerative disorders such as Alzheimer and Parkinson diseases. 
For example, the brains of patients with Alzheimer disease contain protein 
aggregates called amyloid plaques that consist primarily of a single polypeptide 
termed AB. This polypeptide is derived from a cellular protein called amyloid 
precursor protein (APP) through the action of specific proteases. Polypeptide 
AB is prone to form insoluble aggregates. Despite the difficulties posed by the 
protein’s insolubility, a detailed structural model for AB has been derived 
through the use of NMR techniques that can be applied to solids rather than 
to materials in solution. As expected, the structure is rich in B strands, which 
come together to form extended parallel 8-sheet structures (Figure 2.62). 

How do such aggregates lead to the death of the cells that harbor them? 
The answer is still controversial. One hypothesis is that the large aggregates 
themselves are not toxic but, instead, smaller aggregates of the same pro- 
teins may be the culprits, perhaps damaging cell membranes. 


Protein modification and cleavage confer new capabilities 


Proteins are able to perform numerous functions that rely solely on 

the versatility of their 20 amino acids. In addition, many proteins are 
covalently modified, through the attachment of groups other than amino 
acids, to augment their functions (Figure 2.64). For example, acetyl groups are 
attached to the amino termini of many proteins, a modification that makes 
these proteins more resistant to degradation. As discussed earlier (p. 45), the 
addition of hydroxyl groups to many proline residues stabilizes fibers of newly 
synthesized collagen. The biological significance of this modification is evi- 
dent in the disease scurvy: a deficiency of vitamin C results in insufficient 
hydroxylation of collagen, and the abnormal collagen fibers that result are 
unable to maintain normal tissue strength (Section 27.6). Another specialized 
amino acid is y-carboxyglutamate. In vitamin K deficiency, insufficient 
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FIGURE 2.64 Finishing touches. Some common and important covalent modifications of 
amino acid side chains are shown. 


FIGURE 2.63 The protein-only model for 
prion-disease transmission. A nucleus 
consisting of proteins in an abnormal 
conformation grows by the addition of 
proteins from the normal pool. 
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(A) 


carboxylation of glutamate in prothrombin, a clotting protein, can lead to 
hemorrhage (Section 10.4). Many proteins, especially those that are present 
on the surfaces of cells or are secreted, acquire carbohydrate units on specific 
asparagine, serine, or threonine residues (Chapter 11). The addition of sugars 
makes the proteins more hydrophilic and able to participate in interactions 
with other proteins. Conversely, the addition of a fatty acid to an a-amino 
group or a cysteine sulfhydryl group produces a more hydrophobic protein. 

Many hormones, such as epinephrine (adrenaline), alter the activities of 
enzymes by stimulating the phosphorylation of the hydroxyl amino acids 
serine and threonine; phosphoserine and phosphothreonine are the most ubiqui- 
tous modified amino acids in proteins. Growth factors such as insulin act by 
triggering the phosphorylation of the hydroxyl group of tyrosine residues to 
form phosphotyrosine. The phosphoryl groups on these three modified amino 
acids are readily removed; thus the modified amino acids are able to act as 
reversible switches in regulating cellular processes. The roles of phosphoryla- 
tion in signal transduction will be discussed extensively in Chapter 14. 

The preceding modifications consist of the addition of special groups to 
amino acids. Other special groups are generated by chemical rearrangements of 
side chains and, sometimes, the peptide backbone. For example, the jellyfish 
Aequorea victoria produces green fluorescent protein (GFP), which emits green 
light when stimulated with blue light. The source of the fluorescence is a group 
formed by the spontaneous rearrangement and oxidation of the sequence 
Ser-Tyr-Gly within the center of the protein (Figure 2.65A). Since the discovery 


(9) 


FIGURE 2.65 Chemical rearrangement in GFP. (A) The structure of green fluorescent 
protein (GFP). The rearrangement and oxidation of the sequence Ser-Tyr-Gly is the source of 
fluorescence. (B) Mutants of GFP emit light across the visible spectrum. (C) A melanoma cell 
line engineered to express one of these GFP mutants, red fluorescent protein (RFP), was then 
injected into a mouse whose blood vessels express GFP. In this fluorescence micrograph, the 
formation of new blood vessels (green) in the tumor (red) is readily apparent. [(A) Drawn from 
1GFL.pdb; (B) R.Y. Tsien. Integr. Biol. 2:77-93, 2010, Fig. 12; (C) M. Yang, et al. Proc. Natl. Acad. 
Sci. U.S.A. 100:14259-14262, 2008, Fig. 2B] 


of GFP, a number of mutants have been engineered which absorb and emit light 
across the entire visible spectrum (Figure 2.65B). These proteins are of great 
utility to researchers as markers within cells (Figure 2.65C). 

Finally, many proteins are cleaved and trimmed after synthesis. For 
example, digestive enzymes are synthesized as inactive precursors that can be 
stored safely in the pancreas. After release into the intestine, these precursors 
become activated by peptide-bond cleavage (Section 10.4). In blood clotting, 
peptide-bond cleavage converts soluble fibrinogen into insoluble fibrin. A 
number of polypeptide hormones, such as adrenocorticotropic hormone, 
arise from the splitting of a single large precursor protein. Likewise, many 
viral proteins are produced by the cleavage of large polyprotein precursors. 
We shall encounter many more examples of modification and cleavage as 
essential features of protein formation and function. Indeed, these finishing 
touches account for much of the versatility, precision, and elegance of protein 
action and regulation. 


SUMMARY 


Protein structure can be described at four levels. The primary structure 
refers to the amino acid sequence. The secondary structure refers to the 
conformation adopted by local regions of the polypeptide chain. Tertiary 
structure describes the overall folding of the polypeptide chain. Finally, 
quaternary structure refers to the specific association of multiple polypep- 
tide chains to form multisubunit complexes. 


2.1 Proteins Are Built from a Repertoire of 20 Amino Acids 

Proteins are linear polymers of amino acids. Each amino acid consists 
of a central tetrahedral carbon atom linked to an amino group, a car- 
boxylic acid group, a distinctive side chain, and a hydrogen atom. 
These tetrahedral centers, with the exception of that of glycine, are 
chiral; only the L isomer exists in natural proteins. Nearly all natural 
proteins are constructed from the same set of 20 amino acids. The side 
chains of these 20 building blocks vary tremendously in size, shape, 
and the presence of functional groups. They can be grouped as follows: 
(1) hydrophobic side chains, including the aliphatic amino acids— 
glycine, alanine, valine, leucine, isoleucine, methionine, and proline— 
and aromatic side chains—phenylalanine, and tryptophan; (2) polar 
side chains, including hydroxyl-containing side chains—serine, 
threonine and tyrosine; the sulfhydryl-containing cysteine; and 
carboxamide-containing side chains—asparagine and glutamine; 
(3) basic side chains—lysine, arginine, and histidine; and (4) acidic side 
chains—aspartic acid and glutamic acid. These groupings are some- 
what arbitrary and many other sensible groupings are possible. 


2.2 Primary Structure: Amino Acids Are Linked by Peptide Bonds to Form 

Polypeptide Chains 

The amino acids in a polypeptide are linked by amide bonds formed 
between the carboxyl group of one amino acid and the amino group of 
the next. This linkage, called a peptide bond, has several important 
properties. First, it is resistant to hydrolysis, and so proteins are remark- 
ably stable kinetically. Second, the peptide group is planar because the 
C—N bond has considerable double-bond character. Third, each 
peptide bond has both a hydrogen-bond donor (the NH group) and a 
hydrogen-bond acceptor (the CO group). Hydrogen bonding between 
these backbone groups is a distinctive feature of protein structure. 
Finally, the peptide bond is uncharged, which allows proteins to form 
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2.3 


2.4 


2.5 


2.6 


tightly packed globular structures having significant amounts of the 
backbone buried within the protein interior. Because they are linear 
polymers, proteins can be described as sequences of amino acids. Such 
sequences are written from the amino to the carboxyl terminus. 


Secondary Structure: Polypeptide Chains Can Fold into 

Regular Structures Such As the Alpha Helix, the Beta Sheet, 

and Turns and Loops 

Two major elements of secondary structure are the a helix and the B 
strand. In the a helix, the polypeptide chain twists into a tightly packed 
rod. Within the helix, the CO group of each amino acid is hydrogen 
bonded to the NH group of the amino acid four residues farther along 
the polypeptide chain. In the B strand, the polypeptide chain 1s nearly 
fully extended. Two or more B strands connected by NH-to-CO 
hydrogen bonds come together to form B sheets. The strands in B 
sheets can be antiparallel, parallel, or mixed. 


Tertiary Structure: Water-Soluble Proteins Fold into Compact Structures 
with Nonpolar Cores 

The compact, asymmetric structure that individual polypeptides attain 
is called tertiary structure. The tertiary structures of water-soluble pro- 
teins have features in common: (1) an interior formed of amino acids 
with hydrophobic side chains and (2) a surface formed largely of hydro- 
philic amino acids that interact with the aqueous environment. The 
hydrophobic interactions between the interior residues are the driving 
force for the formation of the tertiary structure of water-soluble pro- 
teins. Some proteins that exist in a hydrophobic environment, such as in 
membranes, display the inverse distribution of hydrophobic and hydro- 
philic amino acids. In these proteins, the hydrophobic amino acids are 
on the surface to interact with the environment, whereas the hydrophilic 
groups are shielded from the environment in the interior of the protein. 


Quaternary Structure: Polypeptide Chains Can Assemble into 
Multisubunit Structures 

Proteins consisting of more than one polypeptide chain display quater- 
nary structure; each individual polypeptide chain is called a subunit. 
Quaternary structure can be as simple as two identical subunits or as 
complex as dozens of different subunits. In most cases, the subunits are 
held together by noncovalent bonds. 


The Amino Acid Sequence of a Protein Determines Its 
Three-Dimensional Structure 
The amino acid sequence determines the three-dimensional structure 
and, hence, all other properties of a protein. Some proteins can be 
unfolded completely yet refold efficiently when placed under condi- 
tions in which the folded form of the protein is stable. The amino acid 
sequence of a protein is determined by the sequences of bases ina DNA 
molecule. This one-dimensional sequence information is extended into 
the three-dimensional world by the ability of proteins to fold spontane- 
ously. Protein folding is a highly cooperative process; structural inter- 
mediates between the unfolded and folded forms do not accumulate. 

Some proteins, such as intrinsically unstructured proteins and meta- 
morphic proteins, do not strictly adhere to the one-sequence—one- 
structure paradigm. Because of this versatility, these proteins expand 
the protein encoding capacity of the genome. 

The versatility of proteins is further enhanced by covalent modifi- 
cations. Such modifications can incorporate functional groups not 
present in the 20 amino acids. Other modifications are important to the 
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sity, and chemical reactivity, proteins make possible most of the key 


processes associated with life. 


Appendix 


APPENDIX: Visualizing Molecular Structures II: Proteins 


Scientists have developed powerful techniques for the deter- 
mination of protein structures, as will be considered in Chapter 
3. In most cases, these techniques allow the positions of the 
thousands of atoms within a protein structure to be deter- 
mined. The final results from such an experiment include the 
x, y, and z coordinates for each atom in the structure. These 
coordinate files are compiled in the Protein Data Bank (http:// 
www.pdb.org) from which they can be readily downloaded. 
These structures comprise thousands or even tens of thou- 
sands of atoms. The complexity of proteins with thousands of 
atoms presents a challenge for the depiction of their structure. 
Several different types of representations are used to portray 
proteins, each with its own strengths and weaknesses. The 
types that you will see most often in this book are space-filling 
models, ball-and-stick models, backbone models, and ribbon 
diagrams. Where appropriate, structural features of particular 
importance or relevance are noted in an illustration’s legend. 


Space-Filling Models 

Space-filling models are the most realistic type of represen- 
tation. Each atom is shown as a sphere with a size corre- 
sponding to the van der Waals radius of the atom (Section 1.3). 
Bonds are not shown explicitly but are represented by the 
intersection of the spheres shown when atoms are closer 
together than the sum of their van der Waals radii. All atoms 
are shown, including those that make up the backbone and 
those in the side chains. A space-filling model of lysozyme is 
depicted in Figure 2.66. 

Space-filling models convey a sense of how little open 
space there is in a protein’s structure, which always has 
many atoms in van der Waals contact with one another. 
These models are particularly useful in showing conforma- 
tional changes in a protein from one set of circumstances to 
another. A disadvantage of space-filling models is that the 
secondary and tertiary structures of the protein are difficult 
to see. Thus, these models are not very effective in distin- 
guishing one protein from another—many space-filling 
models of proteins look very much alike. 


Ball-and-Stick Models 
Ball-and-stick models are not as realistic as space-filling 
models. Realistically portrayed atoms occupy more space, 
determined by their van der Waals radii, than do the atoms 
depicted in ball-and-stick models. However, the bonding 
arrangement is easier to see because the bonds are explicitly 
represented as sticks (Figure 2.67). A ball-and-stick model 
reveals a complex structure more clearly than a space-filling 
model does. However, the depiction is so complicated that 
structural features such as a helices or potential binding 
sites are difficult to discern. 

Because space-filling and ball-and-stick models depict 
protein structures at the atomic level, the large number of 


atoms in a complex structure makes it difficult to distin- 
guish the relevant structural features. Thus, representa- 
tions that are more schematic—such as backbone models 
and ribbon diagrams—have been developed for the depic- 
tion of macromolecular structures. In these representa- 
tions, most or all atoms are not shown explicitly. 


Backbone Models 
Backbone models show only the backbone atoms of a poly- 
peptide chain or even only the a-carbon atom of each 


FIGURE 2.66 Space-filling model of lysozyme. Notice how tightly 
packed the atoms are, with little unfilled space. All atoms are shown 
with the exception of hydrogen atoms. Hydrogen atoms are often 
omitted because their positions are not readily determined by x-ray 
crystallographic methods and because their omission somewhat 
improves the clarity of the structure’s depiction. 


FIGURE 2.67 Ball-and-stick model of lysozyme. Again, hydrogen 
atoms are omitted. 
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FIGURE 2.68 Backbone model of lysozyme. 


amino acid. Atoms are linked by lines representing bonds; 
if only a-carbon atoms are depicted, lines connect a-carbon 
atoms of amino acids that are adjacent in the amino acid 
sequence (Figure 2.68). In this book, backbone models 
show only the lines connecting the a-carbon atoms; other 
carbon atoms are not depicted. 

A backbone model shows the overall course of the poly- 
peptide chain much better than a space-filling or ball-and- 
stick model does. However, secondary structural elements 
are still difficult to see. 


Ribbon Diagrams 
Ribbon diagrams are highly schematic and most commonly 
used to accent a few dramatic aspects of protein structure, 
such as the o helix (depicted as a coiled ribbon or a cylinder), 
the B strand (a broad arrow), and loops (thin tubes), to pro- 
vide clear views of the folding patterns of proteins (Figure 
2.69). The ribbon diagram allows the course of a polypeptide 
chain to be traced and readily shows the secondary structural 
elements. Thus, ribbon diagrams of proteins that are related 
to one another by evolutionary divergence appear similar 
(Figure 6.15), whereas unrelated proteins are clearly distinct. 
In this book, coiled ribbons will be generally used to 
depict a helices. However, for membrane proteins, which 
are often quite complex, cylinders will be used rather than 
coiled ribbons. This convention will also make membrane 
proteins with their membrane-spanning «a helices easy to 
recognize (Figure 12.18). 


B strand 


o helix 


FIGURE 2.69 Ribbon diagram of lysozyme. The a helices are 
shown as coiled ribbons; 8 strands are depicted as arrows. More 
irregular structures are shown as thin tubes. 


Bear in mind that the open appearance of ribbon dia- 
grams is deceptive. As noted earlier, protein structures are 
tightly packed and have little open space. The openness of 
ribbon diagrams makes them particularly useful as frame- 
works in which to highlight additional aspects of protein 
structure. Active sites, substrates, bonds, and other struc- 
tural fragments can be included in ball-and-stick or space- 
filling form within a ribbon diagram (Figure 2.70). 


Disulfide bonds 


Active-site 
aspartate residue 


Disulfide bonds 


FIGURE 2.70 Ribbon diagram of lysozyme with highlights. Four 
disulfide bonds and a functionally important aspartate residue are 
shown in ball-and-stick form. 
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side chain (R group) (p. 29) 

L amino acid (p. 29) 

dipolar ion (zwitterion) (p. 29) 
peptide bond (amide bond) (p. 35) 
disulfide bond (p. 36) 


psi (ys) angle (p. 39) 


primary structure (p. 37) 
torsion angle (p. 39) 
phi (¢) angle (p. 39) 


Ramachandran plot (p. 39) 


secondary structure (p. 40) 
a helix (p. 40) 

rise (translation) (p. 40) 

B pleated sheet (p. 42) 

B strand (p. 42) 


reverse turn (8 turn; hairpin turn) 


(p. 44) 
coiled coil (p. 44) 
heptad repeat (p. 44) 
tertiary structure (p. 46) 


(p. 48) 
domain (p. 48) 


subunit (p. 48) 
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Problems 


cooperative transition (p. 52) 


intrinsically unstructured protein (IUP) 
(p. 55) 
metamorphic protein (p. 55) 


prion (p. 56) 


1. Identify. Examine the following four amino acids (A—D): 


coo- CcOoO™ ‘i coo- 
*HoN—cH  “HsN——CH oe *H3;N—CH 
Ha. fla Cn 1 CH) 
oD Z CH CH, 
/* 
NS H;C CH CH, 
bu CH, 
NH; 
A B C D 


What are their names, three-letter abbreviations, and one- 
letter symbols? 


2. Properties. In reference to the amino acids shown in Problem 
1, which are associated with the following characteristics? 


(a) Hydrophobic side chain 
(b) Basic side chain 

(c) Three ionizable groups 
( 


d) pK, of approximately 10 in proteins 
(e) Modified form of phenylalanine 


3. Match ’em. Match each amino acid in the left-hand col- 
umn with the appropriate side-chain type in the right-hand 
column. 


(a) Leu (1) hydroxyl-containing 
(b) Glu (2) acidic 

(c) Lys (3) basic 

(d) Ser (4) sulfur-containing 
(e) Cys (5) nonpolar aromatic 
(f) Trp (6) nonpolar aliphatic 


4. Solubility. In each of the following pairs of amino acids, 
identify which amino acid would be more soluble in water: 


(a) Ala, Leu; (b) Tyr, Phe; (c) Ser, Ala; (d) Trp, His. 


5. Bonding is good. Which of the following amino acids 
have R groups that have hydrogen-bonding potential? Ala, 
Gly, Ser, Phe, Glu, Tyr, He, and Thr. 


6. Name those components. Examine the segment of a pro- 
tein shown here. 


(a) What three amino acids are present? 

(b) Of the three, which is the N-terminal amino acid? 
(c) Identify the peptide bonds. 

(d) Identify the a-carbon atoms. 


7. Who’s charged? Draw the structure of the dipeptide Gly- 
His. What is the charge on the peptide at pH 5.5? pH 7.5? 


8. Alphabet soup. How many different polypeptides of 
50 amino acids in length can be made from the 20 common 
amino acids? 


9. Sweet tooth, but calorie conscious. Aspartame (NutraSweet), 
an artificial sweetener, is a dipeptide composed of Asp-Phe in 
which the carboxyl terminus is modified by the attachment of 
a methyl! group. Draw the structure of Aspartame at pH 7. 


10. Vertebrate proteins? What is meant by the term polypep- 
tide backbone? 


11. Not a sidecar. Define the term side chain in the context 
of amino acid or protein structure. 


12. One from many. Differentiate between amino acid com- 
position and amino acid sequence. 


13. Shape and dimension. (a) Tropomyosin, a 70-kDa muscle 
protein, is a two-stranded a-helical coiled coil. Estimate the 
length of the molecule. (b) Suppose that a 40-residue seg- 
ment of a protein folds into a two-stranded antiparallel B 
structure with a 4-residue hairpin turn. What is the longest 
dimension of this motif? 


14. Contrasting isomers. Poly-L-leucine in an organic sol- 
vent such as dioxane is a helical, whereas poly-L-isoleucine 
isnot. Why do these amino acids with the same number and 
kinds of atoms have different helix-forming tendencies? 


15. Exceptions to the rule. Ramachandran plots for two 
amino acids differ significantly from that shown in Figure 
2.23. Which two, and why? 


16. Active again. A mutation that changes an alanine resi- 
due in the interior of a protein to valine is found to lead toa 
loss of activity. However, activity is regained when a second 


64 
CHAPTER 2. Protein Composition and Structure 


mutation at a different position changes an isoleucine resi- 
due to glycine. How might this second mutation lead to a 
restoration of activity? 


17. Exposure issues. Many of the loops on proteins are com- 
posed of hydrophilic amino acids. Why might this be the case? 


18. Shuffle test. An enzyme that catalyzes disulfide—sulf- 
hydryl exchange reactions, called protein disulfide isomer- 
ase (PDI), has been isolated. PDI rapidly converts inactive 
scrambled ribonuclease into enzymatically active ribonu- 
clease. In contrast, insulin is rapidly inactivated by PDI. 
What does this important observation imply about the 
relation between the amino acid sequence of insulin and its 
three-dimensional structure? 


19. Stretching a target. A protease is an enzyme that cata- 
lyzes the hydrolysis of the peptide bonds of target proteins. 
How might a protease bind a target protein so that its main 
chain becomes fully extended in the vicinity of the vulner- 
able peptide bond? 


20. Often irreplaceable. Glycine is a highly conserved 
amino acid residue in the evolution of proteins. Why? 


21. Potential partners. Identify the groups in a protein that 
can form hydrogen bonds or electrostatic bonds with an 
arginine side chain at pH 7. 


22. Permanent waves. The shape of hair is determined in 
part by the pattern of disulfide bonds in keratin, its major 
protein. How can curls be induced? 


23. Location is everything 1. Most proteins have hydro- 
philic exteriors and hydrophobic interiors. Would you 
expect this structure to apply to proteins embedded in the 
hydrophobic interior of a membrane? Explain. 


24. Location is everything 2. Proteins that span biological 
membranes often contain a helices. Given that the insides 
of membranes are highly hydrophobic (Section 12.2), pre- 
dict what type of amino acids would be in such an a helix. 
Why is an @ helix particularly suited to existence in the 
hydrophobic environment of the interior of a membrane? 


25. Neighborhood peer pressure? Table 2.1 shows the typical 
pK, values for ionizable groups in proteins. However, more 
than 500 pK, values have been determined for individual 
groups in folded proteins. Account for this discrepancy. 


26. Greasy patches. The a and B subunits of hemoglobin 
bear a remarkable structural similarity to myoglobin. 
However, in the subunits of hemoglobin, certain residues 
that are hydrophilic in myoglobin are hydrophobic. Why 
might this be the case? 


27. Maybe size does matter. Osteogenesis imperfecta dis- 
plays a wide range of symptoms, from mild to severe. On 


the basis of your knowledge of amino acid and collagen 
structure, propose a biochemical basis for the variety of 
symptoms. 


28. Issues of stability. Proteins are quite stable. The lifetime 
of a peptide bond in aqueous solution is nearly 1000 years. 
However, the free energy of hydrolysis of proteins is nega- 
tive and quite large. How can you account for the stability 
of the peptide bond in light of the fact that hydrolysis 
releases much energy? 


29. Minor species. For an amino acid such as alanine, the 
major species in solution at pH 7 is the zwitterionic 
form. Assume a pK, value of 8 for the amino group and 
a pK, value of 3 for the carboxylic acid. Estimate the 
ratio of the concentration of the neutral amino acid spe- 
cies (with the carboxylic acid protonated and the amino 
group neutral) to that of the zwitterionic species at pH 7 
(Section 1.3). 


30. A matter of convention. All L amino acids have an S 
absolute configuration except L-cysteine, which has the R 
configuration. Explain why L-cysteine is designated as 
having the R absolute configuration. 


31. Hidden message. Translate the following amino acid 
sequence into one-letter code: Glu-Leu-Val-Ile-Ser-Ile- 
Ser-Leu-TIle-Val-He-Asn-Gly-Ile-Asn-Leu-Ala-Ser-Val- 
Glu-Gly-Ala-Ser. 


32. Who goes first? Would you expect Pro—X peptide 
bonds to tend to have cis conformations like those of 
X—Pro bonds? Why or why not? 


33. Matching. For each of the amino acid derivatives shown 
here (A—E), find the matching set of @ and w values (a-e). 


(A) (B) (C) . (D) (E) 

YY FN TT WW TY 
(a) (b) (c) (d) (e) 
¢=120°, =180°, $=180°, $¢=0°, ¢=-60°, 
w=120° y=0° w=180° w=180° w= -40° 


34. Scrambled ribonuclease. When performing his experi- 
ments on protein refolding, Christian Anfinsen obtained a 
quite different result when reduced ribonuclease was reoxi- 
dized while it was still in 8 M urea and the preparation was 
then dialyzed to remove the urea. Ribonuclease reoxidized 
in this way had only 1% of the enzymatic activity of the 
native protein. Why were the outcomes so different when 
reduced ribonuclease was reoxidized in the presence and 
absence of urea? 


Exploring Proteins and 
Proteomes 


CHAPTER 
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Milk, a source of nourishment for all mammals, is composed, in part, of a 
variety of proteins. The protein components of milk are revealed by the 
technique of MALDI-TOF mass spectrometry, which separates molecules 
on the basis of their mass-to-charge ratio. [(Left) Okea/istockphoto.com. 
(Right) Courtesy of Dr. Brian Chait.] 


Proteins play crucial roles in nearly all biological processes—in catalysis, 
signal transmission, and structural support. This remarkable range of 
functions arises from the existence of thousands of proteins, each folded 
into a distinctive three-dimensional structure that enables it to interact 
with one or more of a highly diverse array of molecules. A major goal of 
biochemistry is to determine how amino acid sequences specify the confor- 
mations, and hence functions, of proteins. Other goals are to learn how 
individual proteins bind specific substrates and other molecules, mediate 
catalysis, and transduce energy and information. 

It is often preferable to study a protein of interest after it has been sepa- 
rated from other components within the cell so that the structure and func- 
tion of this protein can be probed without any confounding effects from 
contaminants. Hence, the first step in these studies is the purification of the 
protein of interest. Proteins can be separated from one another on the basis 
of solubility, size, charge, and binding ability. After a protein has been puri- 
fied, its amino acid sequence can be determined. Many protein sequences, 
often deduced from genome sequences, are available in vast sequence data- 
bases. If the sequence of a purified protein has been archived in a publicly 
searchable database, the job of the investigator becomes much easier. The 
investigator need determine only a small stretch of amino acid sequence of 
the protein to find its match in the database. Alternatively, such a protein 
might be identified by matching its mass to those deduced for proteins in the 
database. Mass spectrometry provides a powerful method for determining 
the mass and sequence of a protein. 


OUTLINE 


3.1 The Purification of Proteins Is an 
Essential First Step in 
Understanding Their Function 


3.2 Immunology Provides Important 
Techniques with Which to 
Investigate Proteins 


3.3 Mass Spectrometry Is a Powerful 
Technique for the Identification of 
Peptides and Proteins 


3.4 Peptides Can Be Synthesized by 
Automated Solid-Phase Methods 


3.5 Three-Dimensional Protein 
Structure Can Be Determined by 
X-ray Crystallography and NMR 
Spectroscopy 
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After a protein has been purified and its identity confirmed, the chal- 
lenge remains to determine its function within a physiologically relevant 
context. Antibodies are choice probes for locating proteins in vivo and 
measuring their quantities. Monoclonal antibodies, able to recognize spe- 
cific proteins, can be obtained in large amounts and used to detect and 
quantify the protein both in isolation and in cells. Peptides and proteins can 
be chemically synthesized, providing tools for research and, in some cases, 
highly pure material for use as drugs. Finally, x-ray crystallography and 
nuclear magnetic resonance (NMR) spectroscopy are the principal tech- 
niques for elucidating three-dimensional structure, the key determinant of 
function. 

The exploration of proteins by this array of physical and chemical tech- 
niques has greatly enriched our understanding of the molecular basis of life. 
These techniques make it possible to tackle some of the most challenging 
questions of biology in molecular terms. 


The proteome is the functional representation of the genome 


As will be discussed in Chapter 5, the complete DNA base sequences, or 
genomes, of many organisms are now available. For example, the round- 
worm Caenorhabditis elegans has a genome of 97 million bases and about 
19,000 protein-encoding genes, whereas that of the fruit fly Drosophila 
melanogaster contains 180 million bases and about 14,000 genes. The com- 
pletely sequenced human genome contains 3 billion bases and about 23,000 
genes. However, these genomes are simply inventories of the genes that 
could be expressed within a cell under specific conditions. Only a subset of 
the proteins encoded by these genes will actually be present in a given bio- 
logical context. The proteome—derived from proteins expressed by the 
genome—of an organism signifies a more complex level of information 
content, encompassing the types, functions, and interactions of proteins 
within its biological environment. 

The proteome is not a fixed characteristic of the cell. Because it repre- 
sents the functional expression of information, it varies with cell type, 
developmental stage, and environmental conditions, such as the presence 
of hormones. The proteome is much larger than the genome because 
almost all gene products are proteins that can be chemically modified in a 
variety of ways. Furthermore, these proteins do not exist in isolation; they 
often interact with one another to form complexes with specific functional 
properties. Whereas the genome is “hard wired,” the proteome is highly 
dynamic. An understanding of the proteome is acquired by investigating, 
characterizing, and cataloging proteins. In some, but not all, cases, this 
process begins by separating a particular protein from all other biomole- 
cules in the cell. 


3.1 The Purification of Proteins Is an Essential First 
Step in Understanding Their Function 


An adage of biochemistry is “Never waste pure thoughts on an impure pro- 
tein.” Starting from pure proteins, we can determine amino acid sequences 
and investigate biochemical functions. From the amino acid sequences, we 
can map evolutionary relationships between proteins in diverse organisms 
(Chapter 6). By using crystals grown from pure protein, we can obtain x-ray 
data that will provide us with a picture of the protein’s tertiary structure— 
the shape that determines function. 


The assay: How do we recognize the protein that we are looking for? 


Purification should yield a sample containing only one type of molecule— 
the protein in which the biochemist is interested. This protein sample may 
be only a fraction of 1% of the starting material, whether that starting mate- 
rial consists of one type of cell in culture or a particular organ from a plant 
or animal. How is the biochemist able to isolate a particular protein from a 
complex mixture of proteins? 

A protein can be purified by subjecting the impure mixture of the 
starting material to a series of separations based on physical properties 
such as size and charge. To monitor the success of this purification, the 
biochemist needs a test, called an assay, for some unique identifying 
property of the protein. A positive result on the assay indicates that the 
protein is present. Although assay development can be a challenging 
task, the more specific the assay, the more effective the purification. For 
enzymes, which are protein catalysts (Chapter 8), the assay usually mea- 
sures enzyme activity—that is, the ability of the enzyme to promote a 
particular chemical reaction. This activity is often measured indirectly. 
Consider the enzyme lactate dehydrogenase, which catalyzes the follow- 
ing reaction in the synthesis of glucose: 


Cc Lactate Os 
| dehydrogenase 
ee a + NAD+ =——————= | + NADH + Ht 
C 
4 
CHs 0” CH; 
Lactate Pyruvate 


Reduced nicotinamide adenine dinucleotide (NADH, Figure 15.13) 
absorbs light at 340 nm, whereas oxidized nicotinamide adenine dinucleo- 
tide (NAD*) does not. Consequently, we can follow the progress of the 
reaction by examining how much light-absorbing ability is developed by a 
sample in a given period of time—for instance, within 1 minute after the 
addition of the enzyme. Our assay for enzyme activity during the purifica- 
tion of lactate dehydrogenase is thus the increase in the absorbance of light 
at 340 nm observed in 1 minute. 

To analyze how our purification scheme is working, we need one additional 
piece of information—the amount of protein present in the mixture being 
assayed. There are various rapid and reasonably accurate means of determin- 
ing protein concentration. With these two experimentally determined num- 
bers—enzyme activity and protein concentration—we then calculate the 
specific activity, the ratio of enzyme activity to the amount of protein in the 
mixture. Ideally, the specific activity will rise as the purification proceeds and 
the protein mixture will contain the protein of interest to a greater extent. In 
essence, the overall goal of the purification is to maximize the specific activity. 
For a pure enzyme, the specific activity will have a constant value. 


Proteins must be released from the cell to be purified 


Having found an assay and chosen a source of protein, we now fractionate 
the cell into components and determine which component is enriched in the 
protein of interest. In the first step, a homogenate is formed by disrupting the 
cell membrane, and the mixture is fractionated by centrifugation, yielding a 
dense pellet of heavy material at the bottom of the centrifuge tube and a 
lighter supernatant above (Figure 3.1). The supernatant is again centrifuged 
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FIGURE 3.1 Differential centrifugation. 
Cells are disrupted in a homogenizer and 
the resulting mixture, called the 

homogenate, is centrifuged in a step-by- 


step fashion of increasing centrifugal force. 


The denser material will form a pellet at 
lower centrifugal force than will the less- 
dense material. The isolated fractions 
can be used for further purification. 
[Photographs courtesy of Dr. S. Fleischer 
and Dr. B. Fleischer.] 
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at a greater force to yield yet another pellet and supernatant. This proce- 
dure, called differential centrifugation, yields several fractions of decreasing 
density, each still containing hundreds of different proteins. The fractions 
are each separately assayed for the desired activity. Usually, one fraction will 
be enriched for such activity, and it then serves as the source of material to 
which more-discriminating purification techniques are applied. 


Proteins can be purified according to solubility, size, charge, 
and binding affinity 


Several thousand proteins have been purified in active form on the basis of 
such characteristics as solubility, size, charge, and specific binding affinity. 
Usually, protein mixtures are subjected to a series of separations, each based 
on a different property. At each step in the purification, the preparation is 
assayed and its specific activity is determined. A variety of purification 
techniques are available. 


Salting out. Most proteins are less soluble at high salt concentrations, an 
effect called salting out. The salt concentration at which a protein precipi- 
tates differs from one protein to another. Hence, salting out can be used 
to fractionate proteins. For example, 0.8 M ammonium sulfate precipi- 
tates fibrinogen, a blood-clotting protein, whereas a concentration of 
2.4 M is needed to precipitate serum albumin. Salting out is also useful for 
concentrating dilute solutions of proteins, including active fractions 
obtained from other purification steps. Dialysis can be used to remove the 
salt if necessary. 


Dialysis. Proteins can be separated from small molecules such 
as salt by dialysis through a semipermeable membrane, such as 
a cellulose membrane with pores (Figure 3.2). The protein 
mixture is placed inside the dialysis bag, which is then sub- 
merged in a buffer solution that is devoid of the small mole- 


cules to be separated away. Molecules having dimensions Concentrated ~ 


eae : di 
significantly greater than the pore diameter are retained igre 


inside the dialysis bag. Smaller molecules and ions capable of Buffer —~] 


passing through the pores of the membrane diffuse down their 
concentration gradients and emerge in the solution outside the 
bag. This technique is useful for removing a salt or other small 
molecule from a cell fractionate, but it will not distinguish 
between proteins effectively. 


Gel-filtration chromatography. More-discriminating separations on the 
basis of size can be achieved by the technique of gel-filtration chromatogra- 
phy, also known as molecular exclusion chromatography (Figure 3.3). The 
sample 1s applied to the top of a column consisting of porous beads made of 
an insoluble but highly hydrated polymer such as dextran or agarose (which 
are carbohydrates) or polyacrylamide. Sephadex, Sepharose, and Biogel are 
commonly used commercial preparations of these beads, which are typically 
100 jzm (0.1 mm) in diameter. Small molecules can enter these beads, but 
large ones cannot. The result is that small molecules are distributed in the 
aqueous solution both inside the beads and between them, whereas large 
molecules are located only in the solution between the beads. Large mole- 
cules flow more rapidly through this column and emerge first because a smaller 
volume is accessible to them. Molecules of medium size occasionally enter the 
beads and will flow from the column at an intermediate position, while 
small molecules, which take a longer, tortuous path, will exit last. 


Ion-exchange chromatography. To obtain a protein of high purity, one 
chromatography step is usually not sufficient, because other proteins in the 
crude mixture will likely co-elute with the desired material. Additional 
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FIGURE 3.2 Dialysis. Protein molecules 
(red) are retained within the dialysis bag, 
whereas small molecules (blue) diffuse 
down their concentration gradient into the 
surrounding medium. 


FIGURE 3.3 Gel-filtration 
chromatography. A mixture 
of proteins in a small volume 
is applied to a column filled 
with porous beads. Because 
large proteins cannot enter 


a —_— the internal volume of the 
=| Id Flow direction 
] beads, they emerge sooner 


than do small ones. 
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FIGURE 3.4 lon-exchange 


chromatography. This technique separates 


proteins mainly according to their net 


charge. 
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FIGURE 3.5 Affinity chromatography. 
Affinity chromatography of concanavalin 
A (shown in yellow) using a solid support 
containing covalently attached glucose 


residues (G). 
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purity can be achieved by performing sequential separations that are based 
on distinct molecular properties. For example, in addition to size, proteins 
can be separated on the basis of their net charge by ion-exchange chromatog- 
raphy. If a protein has a net positive charge at pH 7, it will usually bind to a 
column of beads containing carboxylate groups, whereas a negatively 
charged protein will not (Figure 3.4). The bound protein can then be eluted 
(released) by increasing the concentration of sodium chloride or another salt 
in the eluting buffer; sodium ions compete with positively charged groups 
on the protein for binding to the column. Proteins that have a low density of 
net positive charge will tend to emerge first, followed by those having a 
higher charge density. This procedure 1s also referred to as cation exchange 
to indicate that positively charged groups will bind to the anionic beads. 
Positively charged proteins (cationic proteins) can be separated by chroma- 
tography on negatively charged carboxymethylcellulose (CM-cellulose) 
columns. Conversely, negatively charged proteins (anionic proteins) can be 
separated by anion exchange on positively charged diethylaminoethylcellu- 
lose (DEAE-cellulose) columns. 


Carboxymethyl 
(CM) group 
(ionized form) 


Diethylaminoethyl 
(DEAE) group 
(protonated form) 


Affinity chromatography. Affinity chromatography is another powerful 
means of purifying proteins that is highly selective for the protein of inter- 
est. This technique takes advantage of the high affinity of many proteins for 
specific chemical groups. For example, the plant protein concanavalin A is 
a carbohydrate-binding protein, or lectin (Section 11.4), that has affinity for 
glucose. When a crude extract is passed through a column of beads contain- 
ing covalently attached glucose residues, concanavalin A binds to the beads, 
whereas most other proteins do not (Figure 3.5). The bound concanavalin A 
can then be released from the column by adding a concentrated solution of 
glucose. The glucose in solution displaces the column-attached glucose 
residues from binding sites on concanavalin A. Affinity chromatography is 
a powerful means of isolating transcription factors—proteins that regulate 
gene expression by binding to specific DNA sequences. A protein mixture 
is passed through a column containing specific DNA sequences attached to 
a matrix; proteins with a high affinity for the sequence will bind and be 
retained. In this instance, the transcription factor is released by washing 
with a solution containing a high concentration of salt. 

In general, affinity chromatography can be effectively used to isolate a 
protein that recognizes group X by (1) covalently attaching X or a derivative 
of it to a column; (2) adding a mixture of proteins to this column, which is 
then washed with buffer to remove unbound proteins; and (3) eluting the 
desired protein by adding a high concentration of a soluble form of X or 
altering the conditions to decrease binding affinity. Affinity chromatogra- 
phy is most effective when the interaction of the protein and the molecule 
that is used as the bait is highly specific. 

Affinity chromatography can be used to isolate proteins expressed from 
cloned genes (Section 5.2). Extra amino acids are encoded in the cloned gene 


that, when expressed, serve as an affinity tag that can be readily trapped. 
For example, repeats of the codon for histidine may be added such that the 
expressed protein has a string of histidine residues (called a His tag) on one 
end. The tagged proteins are then passed through a column of beads con- 
taining covalently attached, immobilized nickel(II) or other metal ions. The 
His tags bind tightly to the immobilized metal ions, binding the desired 
protein, while other proteins flow through the column. The protein can 
then be eluted from the column by the addition of imidazole or some other 
chemical that binds to the metal ions and displaces the protein. 


High-performance liquid chromatography. A technique called high- 
performance liquid chromatography (HPLC) is an enhanced version of the 
column techniques already discussed. The column materials are much more 
finely divided and, as a consequence, possess more interaction sites and 
thus greater resolving power. Because the column is made of finer material, 
pressure must be applied to the column to obtain adequate flow rates. The 
net result is both high resolution and rapid separation. In a typical HPLC 
setup, a detector that monitors the absorbance of the eluate at a particular 
wavelength is placed immediately after the column. In the sample HPLC 
elution profile shown in Figure 3.6, proteins are detected by setting the 
detector to 220 nm (the characteristic absorbance wavelength of the peptide 
bond). In a short span of 10 minutes, a number of sharp peaks representing 
individual proteins can be readily identified. 


Proteins can be separated by gel electrophoresis and displayed 


How can we tell that a purification scheme is effective? One way is to ascer- 
tain that the specific activity rises with each purification step. Another is to 
determine that the number of different proteins in each sample declines at 
each step. The technique of electrophoresis makes the latter method 
possible. 


Gel electrophoresis. A molecule with a net charge will move in an electric 
field. This phenomenon, termed electrophoresis, offers a powerful means of 
separating proteins and other macromolecules, such as DNA and RNA. 
The velocity of migration (v) of a protein (or any molecule) in an electric 
field depends on the electric field strength (E), the net charge on the protein 
(z), and the frictional coefficient (f). 


v = Ez/f (1) 


The electric force Ez driving the charged molecule toward the oppositely 
charged electrode is opposed by the viscous drag fv arising from friction 
between the moving molecule and the medium. The frictional coefficient f 
depends on both the mass and shape of the migrating molecule and the vis- 
cosity (17) of the medium. For a sphere of radius r, 


f = O7nyr (2) 


Electrophoretic separations are nearly always carried out in porous gels (or 
on solid supports such as paper) because the gel serves as a molecular sieve 
that enhances separation (Figure 3.7). Molecules that are small compared 
with the pores in the gel readily move through the gel, whereas molecules 
much larger than the pores are almost immobile. Intermediate-size mole- 
cules move through the gel with various degrees of facility. The electric 
field is applied such that proteins migrate from the negative to the positive 
electrodes, typically from top to bottom. Electrophoresis is performed in a 
thin, vertical slab of polyacrylamide gel. Polyacrylamide gels are choice 
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FIGURE 3.6 High-performance liquid 
chromatography (HPLC). Gel filtration by 
HPLC clearly defines the individual proteins 
because of its greater resolving power: 

(1) thyroglobulin (669 kDa), (2) catalase 
(232 kDa), (8) bovine serum albumin 

(67 kDa), (4) ovalbumin (43 kDa), and 

(5) ribonuclease (13.4 kDa). [Data from 

K. J. Wilson and T. D. Schlabach. In Current 
Protocols in Molecular Biology, vol. 2, suppl. 
41, F. M. Ausubel, R. Brent, R. E. Kingston, 
D. D. Moore, J. G. Seidman, J. A. Smith, and 
K. Struhl, Eds. (Wiley, 1998), p. 10.14.1]] 


FIGURE 3.7 
Polyacrylamide gel 
electrophoresis. (A) Gel- 
electrophoresis apparatus. 
Typically, several samples 
undergo electrophoresis on 
one flat polyacrylamide gel. A 
microliter pipette is used to 
place solutions of proteins in 
the wells of the slab. A cover 
is then placed over the gel 
chamber and voltage is 
applied. The negatively 
charged SDS (sodium 
dodecyl sulfate)—-protein 
complexes migrate in the 
direction of the anode, at the 
bottom of the gel. (B) The 
sieving action of a porous 
polyacrylamide gel separates 
proteins according to size, 
with the smallest moving 
most rapidly. 
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supporting media for electrophoresis because they are chemically inert 
and readily formed by the polymerization of acrylamide with a small 
amount of the cross-linking agent methylenebisacrylamide to make a three- 
dimensional mesh (Figure 3.8). Electrophoresis is distinct from gel filtra- 
tion in that, because of the electric field, all of the molecules, regardless of 
size, are forced to move through the same matrix. 
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FIGURE 3.8 Formation of a polyacrylamide gel. A three-dimensional mesh is formed by 
copolymerizing activated monomer (blue) and cross-linker (red). 


Proteins can be separated largely on the basis of mass by electrophoresis 
in a polyacrylamide gel under denaturing conditions. The mixture of pro- 
teins is first dissolved in a solution of sodium dodecyl sulfate (SDS), an 
anionic detergent that disrupts nearly all noncovalent interactions in native 
proteins. B-Mercaptoethanol (2-thioethanol) or dithiothreitol is added to 
reduce disulfide bonds. Anions of SDS bind to main chains at a ratio of 
about one SDS anion for every two amino acid residues. The negative charge 
acquired on binding SDS is usually much greater than the charge on the 
native protein; the contribution of the protein to the total charge of the 


SDS-protein complex is thus rendered insignificant. As a result, this com- 
plex of SDS with a denatured protein has a large net negative charge that is 
roughly proportional to the mass of the protein. The SDS—protein com- 
plexes are then subjected to electrophoresis. When the electrophoresis is 
complete, the proteins in the gel can be visualized by staining them with 
silver nitrate or a dye such as Coomassie blue, which reveals a series of 
bands (Figure 3.9). Radioactive labels, if they have been incorporated into 
proteins, can be detected by placing a sheet of x-ray film over the gel, a 
procedure called autoradiography. 

Small proteins move rapidly through the gel, whereas large proteins stay at 
the top, near the point of application of the mixture. The mobility of most 
polypeptide chains under these conditions is linearly proportional to the 
logarithm of their mass (Figure 3.10). Some carbohydrate-rich proteins and 
membrane proteins do not obey this empirical relation, however. SDS— 
polyacrylamide gel electrophoresis (often referred to as SDS-PAGE) is rapid, 
sensitive, and capable of a high degree of resolution. As little as 0.1 ug (~2 pmol) 
of a protein gives a distinct band when stained with Coomassie blue, and 
even less (~0.02 rg) can be detected with a silver stain. Proteins that differ 
in mass by about 2% (e.g., 50 and 51 kDa, arising from a difference of about 
10 amino acids) can usually be distinguished with SDS-PAGE. 

We can examine the efficacy of our purification scheme by analyzing a 
part of each fraction by electrophoresis. The initial fractions will display 
dozens to hundreds of proteins. As the purification progresses, the number 
of bands will diminish, and the prominence of one of the bands should 
increase. This band should correspond to the protein of interest. 


Isoelectric focusing. Proteins can also be separated electrophoretically on 
the basis of their relative contents of acidic and basic residues. The isoelectric 
point (pI) of a protein is the pH at which its net charge is zero. At this pH, 
its electrophoretic mobility is zero because z in equation 1 is equal to zero. 
For example, the pI of cytochrome c, a highly basic electron-transport pro- 
tein, is 10.6, whereas that of serum albumin, an acidic protein in blood, is 
4.8. Suppose that a mixture of proteins undergoes electrophoresis in a pH 
gradient in a gel in the absence of 5DS. Each protein will move until it 
reaches a position in the gel at which the pH is equal to the pl of the protein. 
This method of separating proteins according to their isoelectric point is 
called isoelectric focusing. The pH gradient in the gel is formed first by sub- 
jecting a mixture of polyampholytes (small multi-charged polymers) having 
many different pI values to electrophoresis. Isoelectric focusing can readily 
resolve proteins that differ in pI by as little as 0.01, which means that pro- 
teins differing by one net charge can be separated (Figure 3.11). 
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FIGURE 3.11 The principle of isoelectric focusing. A pH gradient is established in a gel 
before loading the sample. (A) Each protein, represented by the different colored circles, will 
possess a net positive charge in the regions of the gel where the pH is lower than its respective 
pl value and a net negative charge where the pH is greater than its pl. When voltage is applied 
to the gel, each protein will migrate to its pl, the location at which it has no net charge. 

(B) The proteins form bands that can be excised and used for further experimentation. 
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FIGURE 3.9 Staining of proteins after 
electrophoresis. Mixtures of proteins 

from cellular extracts subjected to 
electrophoresis on an SDS-polyacrylamide 
gel can be visualized by staining with 
Coomassie blue. The first lane contains a 
mixture of proteins of known molecular 
weights, which can be used to estimate the 
sizes of the bands in the samples. [©Dr. 
Robert Farrell.] 
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FIGURE 3.10 Electrophoresis can 
determine mass. The electrophoretic mobility 
of many proteins in SDS-polyacrylamide 
gels is inversely proportional to the 
logarithm of their mass. [Data from K. Weber 
and M. Osborn, The Proteins, vol. 1, 3d ed. 
(Academic Press, 1975), p. 179.] 
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FIGURE 3.12 Two-dimensional gel 
electrophoresis. (A) A protein sample is 
initially fractionated in one dimension by 
isoelectric focusing as described in 

Figure 3.11. The isoelectric focusing gel is 
then attached to an SDS-polyacrylamide 
gel, and electrophoresis is performed in the 
second dimension, perpendicular to the 
original separation. Proteins with the same 
pl are now separated on the basis of mass. 
(B) Proteins from E. coli were separated by 
two-dimensional gel electrophoresis, 
resolving more than a thousand different 
proteins. [(B) Courtesy of Dr. Patrick H. 
O'Farrell.) 


FIGURE 3.13 Alterations in protein levels 
detected by two-dimensional gel 
electrophoresis. Samples of (A) normal 
colon mucosa and (B) colorectal tumor 
tissue from the same person were analyzed 
by two-dimensional gel electrophoresis. 

In the gel section shown, changes in the 


intensity of several spots are evident, including 


a dramatic increase in levels of the protein 
indicated by the arrow, corresponding to 
the enzyme glyceraldehyde-3-phosphate 


dehydrogenase. [Courtesy of Lin Quinsong © 


2010, The American Society for Biochemistry 
and Molecular Biology.] 
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Two-dimensional electrophoresis. Isoelectric focusing can be combined 
with SDS-PAGE to obtain very high resolution separations. A single sample 
is first subjected to isoelectric focusing. This single-lane gel is then placed 
horizontally on top of an SDS—polyacrylamide slab. The proteins are thus 
spread across the top of the polyacrylamide gel according to how far they 
migrated during isoelectric focusing. They then undergo electrophoresis 
again in a perpendicular direction (vertically) to yield a two-dimensional 
pattern of spots. In such a gel, proteins have been separated in the horizon- 
tal direction on the basis of isoelectric point and in the vertical direction on 
the basis of mass. Remarkably, more than a thousand different proteins in 
the bacterium Escherichia coli can be resolved in a single experiment by two- 
dimensional electrophoresis (Figure 3.12). 

Proteins isolated from cells under different physiological conditions 
can be subjected to two-dimensional electrophoresis. The intensities of 
individual spots on the gels can then be compared, which indicates that the 
concentrations of specific proteins have changed in response to the physi- 
ological state (Figure 3.13). How can we discover the identity of a protein 
that is showing such responses? Although many proteins are displayed on 
a two-dimensional gel, they are not identified. It is now possible to identify 
proteins by coupling two-dimensional gel electrophoresis with mass 
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Total protein ‘Totalactivity Specific activity Yield Purification 


Step (mg) (units) (units mg~!) (%) level 
Homogenization 15,000 150,000 10 100 1 
Salt fractionation 4,600 138,000 30 92 3 
Ton-exchange 

chromatography 1,278 115,500 90 77 9 
Gel-filtration 

chromatography 68.8 75,000 1,100 50 110 
Affinity 

chromatography 1.75 52,500 30,000 35 3,000 


spectrometric techniques. We will examine these powerful techniques 
shortly (Section 3.3). 


A protein purification scheme can be quantitatively 
evaluated 


: . ca . Homogenate Salt lon-exchange — Gel-filtration Affinity 
To determine the success of a protein purification scheme, fractionation chromatography chromatography chromatography 


we monitor each step of the procedure by determining the 
specific activity of the protein mixture and by subjecting it 
to SDS-PAGE analysis. Consider the results for the puri- 
fication of a fictitious protein, summarized in Table 3.1 
and Figure 3.14. At each step, the following parameters 
are measured: 


——— 


. 


PL TMM ITE 


Total Protein. The quantity of protein present in a fraction 
is obtained by determining the protein concentration of a 
part of each fraction and multiplying by the fraction’s 
total volume. 


— aaa] 
Total Activity. The enzyme activity for the fraction is — 
obtained by measuring the enzyme activity in the volume 


of fraction used in the assay and multiplying by the 
fraction’s total volume. 


Specific Activity. This parameter is obtained by dividing 


total activity by total protein. FIGURE 3.14 Electrophoretic analysis of a protein 


. F : ie : purification. The purification scheme in Table 3.1 was analyzed 
Yield. This parameter is a measure of the activity retained by SDS-PAGE. Each lane contained 50 ag of sample. The 


after each purification step as a percentage of the activity effectiveness of the purification can be seen as the band for 
in the crude extract. The amount of activity in the initial the protein of interest becomes more prominent relative to 
extract is taken to be 100%. other bands. 


Purification Level. This parameter is a measure of the increase in purity and 
is obtained by dividing the specific activity, calculated after each purifica- 
tion step, by the specific activity of the initial extract. 


As we see in Table 3.1, the first purification step, salt fractionation, leads 
to an increase in purity of only 3-fold, but we recover nearly all the target 
protein in the original extract, given that the yield is 92%. After dialysis to 
lower the high concentration of salt remaining from the salt fractionation, 
the fraction is passed through an ion-exchange column. The purification 
now increases to 9-fold compared with the original extract, whereas the 
yield falls to 77%. Gel-filtration chromatography brings the level of purifi- 
cation to 110-fold, but the yield is now at 50%. The final step is affinity 
chromatography with the use of a ligand specific for the target enzyme. This 
step, the most powerful of these purification procedures, results in a purifi- 
cation level of 3000-fold but lowers the yield to 35%. The SDS-PAGE 
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analysis in Figure 3.14 shows that, if we load a constant amount of protein 
onto each lane after each step, the number of bands decreases in proportion 
to the level of purification, and the amount of protein of interest increases as 
a proportion of the total protein present. 

A good purification scheme takes into account both purification levels 
and yield. A high degree of purification and a poor yield leave little protein 
with which to experiment. A high yield with low purification leaves many 
contaminants (proteins other than the one of interest) in the fraction and 
complicates the interpretation of subsequent experiments. 


Ultracentrifugation is valuable for separating biomolecules 
and determining their masses 


We have already seen that centrifugation is a powerful and generally appli- 
cable method for separating a crude mixture of cell components. This tech- 
nique is also valuable for the analysis of the physical properties of biomol- 
ecules. Using centrifugation, we can determine such parameters as mass 
and density, learn about the shape of a molecule, and investigate the interac- 
tions between molecules. To deduce these properties from the centrifuga- 
tion data, we require a mathematical description of how a particle behaves 
when a centrifugal force is applied. 

A particle will move through a liquid medium when subjected to a centrifu- 
gal force. A convenient means of quantifying the rate of movement is to calculate 
the sedimentation coefficient, s, of a particle by using the following equation: 


s = m(1 — vp)/f 


where m is the mass of the particle, v is the partial specific volume (the 
reciprocal of the particle density), p is the density of the medium, and f is 
the frictional coefficient (a measure of the shape of the particle). The 
(1 — vp) term is the buoyant force exerted by liquid medium. 

Sedimentation coefficients are usually expressed in Svedberg units (S), 
equal to 10~!% s. The smaller the S value, the more slowly a molecule moves 
in a centrifugal field. The S values for a number of biomolecules and cellular 
components are listed in Table 3.2 and Figure 3.15. 

Several important conclusions can be drawn from the preceding equation: 


1. The sedimentation velocity of a particle depends in part on its mass. A 
more massive particle sediments more rapidly than does a less massive par- 
ticle of the same shape and density. 


2. Shape, too, influences the sedimentation velocity because it affects the 
viscous drag. The frictional coefficient f of a compact particle is smaller than 
that of an extended particle of the same mass. Hence, elongated particles 
sediment more slowly than do spherical ones of the same mass. 


TABLE 3.2 S values and molecular weights of sample proteins 


Protein S value (Svedberg units) Molecular weight 
Pancreatic trypsin inhibitor 1 6,520 
Cytochrome c 1.83 12,310 
Ribonuclease A 1.78 13,690 
Myoglobin 1.97 17,800 
Trypsin 2.5 23,200 
Carbonic anhydrase 3.23 28,800 
Concanavalin A 3.8 51,260 
Malate dehydrogenase 5.76 74,900 
Lactate dehydrogenase 7,54 146,200 


Source: T. Creighton, Proteins, 2d ed. (W. H. Freeman and Company, 1993), Table 7.1. 
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FIGURE 3.15 Density and sedimentation coefficients of cellular components. [Data from 
L. J. Kleinsmith and V. M. Kish, Principles of Cell and Molecular Biology, 2d ed. (HarperCollins, 
1995), p. 138.] 


3. A dense particle moves more rapidly than does a less dense one because 
the opposing buoyant force (1 — vp) is smaller for the denser particle. 


4, The sedimentation velocity also depends on the density of the solution 
(p). Particles sink when vp < 1, float when vp > 1, and do not move 
when vp = 1. 


A technique called zonal, band, or most commonly gradient centrifugation 
can be used to separate proteins with different sedimentation coefficients. 
The first step is to form a density gradient in a centrifuge tube. Differing 
proportions of a low-density solution (such as 5% sucrose) and a high-density 
solution (such as 20% sucrose) are mixed to create a linear gradient of sucrose 
concentration ranging from 20% at the bottom of the tube to 5% at the top 
(Figure 3.16). The role of the gradient is to prevent convective flow. A small 
volume of a solution containing the mixture of proteins to be separated is 
placed on top of the density gradient. When the rotor is spun, proteins move 
through the gradient and separate according to their sedimentation coeffi- 
cients. The time and speed of the centrifugation is determined empirically. 
The separated bands, or zones, of protein can be harvested by making a hole 
in the bottom of the tube and collecting drops. The drops can be measured 
for protein content and catalytic activity or another functional property. This 
sedimentation-velocity technique readily separates proteins differing in 
sedimentation coefficient by a factor of two or more. 


Low-density High-density 
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FIGURE 3.16 Zonal centrifugation. The 
steps are as follows: (A) form a density 
gradient, (B) layer the sample on top of 

the gradient, (C) place the tube in a 
swinging-bucket rotor and centrifuge it, 
and (D) collect the samples. [Information from 
D. Freifelder, Physical Biochemistry, 2d ed. 

(W. H. Freeman and Company, 1982), p. 397.] 
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CHAPTER 3 Exploring Proteins and 
Proteomes 


The mass of a protein can be directly determined by sedimentation 
equilibrium, in which a sample is centrifuged at low speed such that a 
concentration gradient of the sample is formed. However, this sedimenta- 
tion is counterbalanced by the diffusion of the sample from regions of high 
to low concentration. When equilibrium has been achieved, the shape of the 
final gradient depends solely on the mass of the sample. The sedimentation- 
equilibrium technique for determining mass is very accurate and can be applied 
without denaturing the protein. Thus the native quaternary structure of multi- 
meric proteins is preserved. In contrast, SDS—polyacrylamide gel electropho- 
resis provides an estimate of the mass of dissociated polypeptide chains under 
denaturing conditions. Note that, if we know the mass of the dissociated 
components of a multimeric protein as determined by SDS—polyacrylamide 
analysis and the mass of the intact multimer as determined by sedimenta- 
tion-equilibrium analysis, we can determine the number of copies of each 
polypeptide chain present in the protein complex. 


Protein purification can be made easier with the use of recombinant 
DNA technology 


In Chapter 5, we shall consider the widespread effect of recombinant DNA 
technology on all areas of biochemistry and molecular biology. The applica- 
tion of recombinant methods to the overproduction of proteins has enabled 
dramatic advances in our understanding of their structure and function. 
Before the advent of this technology, proteins were isolated solely from their 
native sources, often requiring a large amount of tissue to obtain a sufficient 
amount of protein for analytical study. For example, the purification of 
bovine deoxyribonuclease in 1946 required nearly ten pounds of beef pan- 
creas to yield one gram of protein. As a result, biochemical studies on puri- 
fied material were often limited to abundant proteins. 

Armed with the tools of recombinant technology, the biochemist is now 
able to enjoy a number of significant advantages: 


1. Proteins can be expressed in large quantities. The homogenate serves as 
the starting point in a protein purification scheme. For recombinant sys- 
tems, a host organism that is amenable to genetic manipulation, such as the 
bacterium Escherichia coli or the yeast Pichia pastoris, is utilized to express 
a protein of interest. The biochemist can exploit the short doubling times 
and ease of genetic manipulation of such organisms to produce large 
amounts of protein from manageable amounts of culture. As a result, puri- 
fication can begin with a homogenate that is often highly enriched with the 
desired molecule. Moreover, a protein can be easily obtained regardless of 
its natural abundance or its species of origin. 


2. Affinity tags can be fused to proteins. As described above, affinity chro- 
matography can be a highly selective step within a protein purification 
scheme. Recombinant DNA technology enables the attachment of any one 
of anumber of possible affinity tags to a protein (such as the “His tag” men- 
tioned earlier). Hence, the benefits of affinity chromatography can be real- 
ized even for those proteins for which a binding partner is unknown or not 
easily determined. 


3. Proteins with modified primary structures can be readily generated. A power- 
ful aspect of recombinant DNA technology as applied to protein purification 
is the ability to manipulate genes to generate variants of a native protein 
sequence (Section 5.2). We learned in Section 2.4 that many proteins consist 
of compact domains connected by flexible linker regions. With the use of 
genetic-manipulation strategies, fragments of a protein that encompass single 
domains can be generated, an advantageous approach when expression of the 


entire protein is limited by its size or solubility. Additionally, as we will see in 
Section 9.1, amino acid substitutions can be introduced into the active site of an 
enzyme to precisely probe the roles of specific residues within its catalytic cycle. 


3.2 Immunology Provides Important Techniques with 
Which to Investigate Proteins 


The purification of a protein enables the biochemist to explore its function 
and structure within a precisely controlled environment. However, the iso- 
lation of a protein removes it from its native context within the cell, where 
its activity is most physiologically relevant. Advances in the field of immu- 
nology (Chapter 34) have enabled the use of antibodies as critical reagents 
for exploring the functions of proteins within the cell. The exquisite speci- 
ficity of antibodies for their target proteins provides a means to tag a specific 
protein so that it can be isolated, quantified, or visualized. 


Antibodies to specific proteins can be generated 


Immunological techniques begin with the generation of antibodies to a par- 
ticular protein. An antibody (also called an immunoglobulin, Ig) is itself a pro- 
tein (Figure 3.17); it is synthesized by vertebrates in response to the presence 
of a foreign substance, called an antigen. Antibodies have specific and high 
affinity for the antigens that elicited their synthesis. The binding of antibody 
to antigen is a step in the immune response that protects the animal from 
infection (Chapter 34). Foreign proteins, polysaccharides, and nucleic acids 
can be antigens. Small foreign molecules, such as synthetic peptides, also can 
elicit antibodies, provided that the small molecule is attached to a macromo- 
lecular carrier. An antibody recognizes a specific group or cluster of amino 
acids on the target molecule called an antigenic determinant or epitope. The 
specificity of the antibody—antigen interaction is a consequence of the shape 
complementarity between the two surfaces (Figure 3.18). Animals have a very 
large repertoire of antibody-producing cells, each producing an antibody that 
contains a unique surface for antigen recognition. When an antigen is intro- 
duced into an animal, it is recognized by a select few cells from this popula- 
tion, stimulating the proliferation of these cells. This process ensures that 
more antibodies of the appropriate specificity are produced. 
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FIGURE 3.17 Antibody structure. 

(A) Immunoglobulin G (IgG) consists of four 
chains, two heavy chains (blue) and two 
light chains (red), linked by disulfide bonds. 
The heavy and light chains come together 
to form F,,, domains, which have the 
antigen-binding sites at the ends. The two 
heavy chains form the F, domain. Notice 
that the F,,, domains are linked to the F, 
domain by flexible linkers. (B) A more 
schematic representation of an IgG 
molecule. [Drawn from 1IGT.pdb.] 
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hk FIGURE 3.18 Antigen-antibody 
interactions. A protein antigen, in this 
case lysozyme, binds to the end of an F,, 
domain of an antibody. Notice that the 
end of the antibody and the antigen have 
complementary shapes, allowing a large 
amount of surface to be buried on binding. 
[Drawn from 1YQV.pdb.] 
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FIGURE 3.19 Polyclonal and monoclonal 
antibodies. Most antigens have several 
epitopes. Polyclonal antibodies are 
heterogeneous mixtures of antibodies, each 
specific for one of the various epitopes on 
an antigen. Monoclonal antibodies are all 
identical, produced by clones of a single 
antibody-producing cell. They recognize 
one specific epitope. [Information from 

R. A. Goldsby, T. J. Kindt, and B. A. Osborne, 
Kuby Immunology, 4th ed. (W. H. Freeman 
and Company, 2000), p. 154.] 
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Antigen | 


Immunological techniques depend on the ability to generate antibodies 
to a specific antigen. To obtain antibodies that recognize a particular pro- 
tein, a biochemist injects the protein into a rabbit twice, 3 weeks apart. The 
injected protein acts as an antigen, stimulating the reproduction of cells 
producing antibodies that recognize it. Blood is drawn from the immunized 
rabbit several weeks later and centrifuged to separate blood cells from the 
supernatant, or serum. The serum, called an antiserum, contains antibodies to 
all antigens to which the rabbit has been exposed. Only some of them will be 
antibodies to the injected protein. Moreover, antibodies that recognize a par- 
ticular antigen are nota single molecular species. For instance, 2,4-dinitrophenol 
(DNP) was used as an antigen to generate antibodies. Analyses of anti- DNP 
antibodies revealed a wide range of binding affinities; the dissociation constants 
ranged from about 0.1 nM to 1 uM. Correspondingly, a large number of bands 
were evident when anti-DNP antibody was subjected to isoelectric focusing. 
These results indicate that cells are producing many different antibodies, 
each recognizing a different surface feature of the same antigen. These anti- 
bodies are termed polyclonal, referring to the fact that they are derived from 
multiple antibody-producing cell populations (Figure 3.19). The heteroge- 
neity of polyclonal antibodies can be advantageous for certain applications, 
such as the detection of a protein of low abundance, because each protein 
molecule can be bound by more than one antibody at multiple distinct 
antigenic sites. 


Monoclonal antibodies with virtually any desired specificity 
can be readily prepared 


The discovery of a means of producing monoclonal antibodies of virtually any 
desired specificity was a major breakthrough that intensified the power of 
immunological approaches. As with impure proteins, working with an impure 
mixture of antibodies makes it difficult to interpret data. Ideally, one would 


isolate a clone of cells producing a single, identical antibody. The problem is 
that antibody-producing cells isolated from an organism have short life spans. 

Immortal cell lines that produce monoclonal antibodies do exist. These 
cell lines are derived from a type of cancer, multiple myeloma, which is a 
malignant disorder of antibody-producing cells. In this cancer, a single 
transformed plasma cell divides uncontrollably, generating a very large num- 
ber of cells of a single kind. Such a group of cells is a clone because the cells are 
descended from the same cell and have identical properties. The identical 
cells of the myeloma secrete large amounts of a single immunoglobulin genera- 
tion after generation. While these antibodies have proven useful for elucidat- 
ing antibody structure, nothing is known about their specificity. Hence, they 
have little utility for the immunological methods described in the next pages. 

César Milstein and Georges Kohler discovered that large amounts of anti- 
bodies of nearly any desired specificity can be obtained by fusing a short-lived 
antibody-producing cell with an immortal myeloma cell. An antigen is injected 
into a mouse, and its spleen is removed several weeks later (Figure 3.20). 
A mixture of plasma cells from this spleen is fused in vitro with myeloma cells. 
Each of the resulting hybrid cells, called hybridoma cells, indefinitely pro- 
duces the identical antibody specified by the parent cell from the spleen. 
Hybridoma cells can then be screened by a specific assay for the antigen—anti- 
body interaction to determine which ones produce antibodies of the preferred 
specificity. Collections of cells shown to produce the desired antibody are 
subdivided and reassayed. This process is repeated until a pure cell line, aclone 
producing a single antibody, is isolated. These positive cells can be grown in 
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FIGURE 3.20 Preparation of monoclonal 
antibodies. Hybridoma cells are formed by 
the fusion of antibody-producing cells and 
myeloma cells. The hybrid cells are allowed 
to proliferate by growing them in selective 
medium. They are then screened to 
determine which ones produce antibody 

of the desired specificity. [Information from 
C. Milstein. Monoclonal antibodies. 

Copyright © 1980 by Scientific American, 

Inc. All rights reserved.] 


Ste 


FIGURE 3.21 Fluorescence micrograph 
of a developing Drosophila embryo. The 
embryo was stained with a fluorescence- 
labeled monoclonal antibody for the DNA- 
binding protein encoded by engrailed, an 
essential gene in specifying the body plan. 
[Courtesy of Dr. Nipam Patel and Dr. Corey 
Goodman.] 
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culture medium or injected into mice to induce myelomas. Alternatively, the 
cells can be frozen and stored for long periods. 

The hybridoma method of producing monoclonal antibodies has opened 
new vistas in biology and medicine. Large amounts of identical antibodies 
with tailor-made specificities can be readily prepared. They are sources of 
insight into relations between antibody structure and specificity. Moreover, 
monoclonal antibodies can serve as precise analytical and preparative 
reagents. Proteins that guide development have been identified with the use 
of monoclonal antibodies as tags (Figure 3.21). Monoclonal antibodies 
attached to solid supports can be used as affinity columns to purify scarce 
proteins. This method has been used to purify interferon (an antiviral pro- 
tein) 5000-fold from a crude mixture. Clinical laboratories use monoclonal 
antibodies in many assays. For example, the detection in blood of isozymes 
that are normally localized in the heart points to a myocardial infarction 
(heart attack). Blood transfusions have been made safer by antibody screen- 
ing of donor blood for viruses that cause AIDS (acquired immune defi- 
ciency syndrome), hepatitis, and other infectious diseases. Monoclonal 
antibodies can be used as therapeutic agents. For example, trastuzumab 
(Herceptin) is a monoclonal antibody useful for treating some forms of 
breast cancer. 


Proteins can be detected and quantified by using an enzyme-linked 
immunosorbent assay 


Antibodies can be used as exquisitely specific analytic reagents to quantify 
the amount of a protein or other antigen present in a biological sample. The 
enzyme-linked immunosorbent assay (ELISA) makes use of an enzyme that 
reacts with a colorless substrate to produce a colored product. The enzyme 
is covalently linked to a specific antibody that recognizes a target antigen. If 
the antigen is present, the antibody—enzyme complex will bind to it and, on 
addition of the substrate, the enzyme will catalyze the reaction, generating 
the colored product. Thus, the presence of the colored product indicates the 
presence of the antigen. Rapid and convenient, ELISAs can detect less than 
a nanogram (10? g) of a specific protein. ELISAs can be performed with 
either polyclonal or monoclonal antibodies, but the use of monoclonal anti- 
bodies yields more-reliable results. 


es We will consider two among the several types of ELISA. The indi- 

rect ELISA is used to detect the presence of antibody and is the basis 
of the test for HIV infection. The HIV test detects the presence of anti- 
bodies that recognize viral core protein antigens. Viral core proteins are 
adsorbed to the bottom of a well. Antibodies from the person being tested 
are then added to the coated well. Only someone infected with HIV will 
have antibodies that bind to the antigen. Finally, enzyme-linked antibod- 
ies to human antibodies (e.g., enzyme-linked goat antibodies that recog- 
nize human antibodies) are allowed to react in the well, and unbound 
antibodies are removed by washing. Substrate is then applied. An enzyme 
reaction yielding a colored product suggests that the enzyme-linked anti- 
bodies were bound to human antibodies, which in turn implies that the 
patient has antibodies to the viral antigen (Figure 3.22A). This assay is 
quantitative: the rate of the color-formation reaction is proportional to the 
amount of antibody originally present. 

The sandwich ELISA is used to detect antigen rather than antibody. 
Antibody to a particular antigen is first adsorbed to the bottom of a well. 
Next, solution containing the antigen (such as blood or urine, in medical 
diagnostic tests) is added to the well and binds to the antibody. Finally, a 
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FIGURE 3.22 Indirect ELISA and sandwich ELISA. (A) In indirect ELISA, the production 

of color indicates the amount of an antibody to a specific antigen. (B) In sandwich ELISA, the 
production of color indicates the quantity of antigen. [Information from R. A. Goldsby, T. J. Kindt, 
and B. A. Osborne, Kuby Immunology, 4th ed. (W. H. Freeman and Company, 2000), p. 162.] 


second, different antibody to the antigen is added. This antibody is enzyme 
linked and is processed as described for indirect ELISA. In this case, the 
rate of color formation is directly proportional to the amount of antigen 
present. Consequently, it permits the measurement of small quantities of 
antigen (Figure 3.22B). 


Western blotting permits the detection of proteins separated 
by gel electrophoresis 


Very small quantities of a protein of interest in a cell or in body fluid can be 
detected by an immunoassay technique called western blotting (Figure 3.23). 
A sample is subjected to electrophoresis on an 5DS—polyacrylamide gel. A 
polymer sheet is pressed against the gel, transferring the resolved proteins on 
the gel to the sheet, which makes the proteins more accessible for reaction. 
An antibody that is specific for the protein of interest, called the primary 
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FIGURE 3.23 Western blotting. Proteins 
on an SDS-polyacrylamide gel are 
transferred to a polymer sheet. The sheet is 
first treated with a primary antibody, which 
is specific for the protein of interest, and 
then washed to remove unbound antibody. 
Next, the sheet is treated with a secondary 
antibody, which recognizes the primary 
antibody, and washed again. Since the 
secondary antibody is labeled (here, with a 
fluorescent tag indicated by the yellow 
circle), the band containing the protein of 
interest can be identified. 
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FIGURE 3.24 Actin filaments. Fluorescence 
micrograph of a cell shows actin filaments 
stained green using an antibody specific to 
actin. [David Becker/Science Source.] 
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FIGURE 3.26 Immunoelectron 
microscopy. The opaque particles (150-A, 
or 15-nm, diameter) in this electron 
micrograph are clusters of gold atoms 
bound to antibody molecules. A gold- 
labeled antibody against a channel protein 
(Section 13.4) identifies membrane vesicles 
at the termini of neurons that contain this 
protein. [Courtesy of Dr. Peter Sargent.] 
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antibody, is added to the sheet and reacts with the antigen. The antibody— 
antigen complex on the sheet can then be detected by rinsing the sheet with 
a second antibody, called the secondary antibody, that is specific for the pri- 
mary antibody (e.g., a goat antibody that recognizes mouse antibodies). 
Typically, the secondary antibody is fused to an enzyme that produces a 
chemiluminescent or colored product or contains a fluorescent tag, enabling 
the identification and quantitation of the protein of interest. Western blot- 
ting makes it possible to find a protein in a complex mixture, the proverbial 
needle in a haystack. It is the basis for the test for infection by hepatitis C, 
where it is used to detect a core protein of the virus. This technique is also 
very useful in monitoring protein purification and in the cloning of genes. 


Fluorescent markers make the visualization of proteins 
in the cell possible 


Biochemistry is often performed in test tubes or polyacrylamide gels. However, 
most proteins function in the context of a cell. Fluorescent markers provide a 
powerful means of examining proteins in their biological context. Cells can be 
stained with fluorescence-labeled antibodies and examined by fluorescence 
microscopy to reveal the location of a protein of interest. For example, arrays of 
parallel bundles are evident in cells stained with antibody specific for actin, a 
protein that polymerizes into filaments (Figure 3.24). Actin filaments are con- 
stituents of the cytoskeleton, the internal scaffolding of cells that controls their 
shape and movement. By tracking protein location, fluorescent markers also 
provide clues to protein function. For instance, the mineralocorticoid receptor 
protein binds to steroid hormones (Section 26.4), including cortisol. The 
receptor was linked to a yellow variant of green fluorescent protein (GFP), a 
naturally fluorescent protein isolated from the jellyfish Aequovea victoria 
(Chapter 2). Fluorescence microscopy revealed that, in the absence of the hor- 
mone, the receptor is located in the cytoplasm (Figure 3.25A). On addition of 
the steroid, the receptor is translocated to the nucleus, where it binds to DNA 
(Figure 3.25B). These results indicate that the mineralocorticoid receptor pro- 
tein is a transcription factor that controls gene expression. 
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FIGURE 3.25 Nuclear localization of a steroid receptor. (A) The mineralocorticoid receptor, 
made visible by attachment to a yellow variant of GFP, is located predominantly in the cytoplasm 
of the cultured cell. (B) Subsequent to the addition of corticosterone (a glucocorticoid steroid 
that also binds to the mineralocorticoid receptor), the receptor moves into the nucleus. [M. Nishi, 
M. Tanaka, K.-i. Matsuda, M. Sunaguchi, and M. Kawata. J. Neurosci. 24:4918-4927, 2004, Fig. 7A.] 


The highest resolution of fluorescence microscopy 1s about 0.2 jum (200 
nm, or 2000 A), the wavelength of visible light. Finer spatial resolution can 
be achieved by electron microscopy if the antibodies are tagged with elec- 
tron-dense markers. For example, antibodies conjugated to clusters of gold 
or to ferritin (which has an electron-dense core rich in iron) are highly visible 
under the electron microscope. Immunoelectron microscopy can define the 
position of antigens to a resolution of 10 nm (100 A) or finer (Figure 3.26). 


3.3 Mass Spectrometry is a Powerful Technique for 
the Identification of Peptides and Proteins 


In many instances, the study of a particular biological process in its native 
context is advantageous. For example, if we are interested in a pathway that 
is localized to the nucleus of a cell, we might conduct studies on an isolated 
nuclear extract. In these experiments, identification of the proteins present 
in the sample is often critical. Antibody-based techniques, such as the 
ELISA method described in the previous section, can be very helpful 
toward this goal. However, these techniques are limited to the detection of 
proteins for which an antibody is already available. Mass spectrometry 
enables the highly precise and sensitive measurement of the atomic compo- 
sition of a particular molecule, or analyte, without prior knowledge of its 
identity. Originally, this method was relegated to the study of the chemical 
composition and molecular mass of gases or volatile liquids. However, tech- 
nological advances in the past two decades have dramatically expanded the 
utility of mass spectrometry to the study of proteins, even those found at 
very low concentrations within highly complex mixtures, such as the con- 
tents of a particular cell type. 

Mass spectrometry enables the highly accurate and sensitive detec- 
tion of the mass of an analyte. This information can be used to determine 
the identity and chemical state of the molecule of interest. Mass spec- 
trometers operate by converting analyte molecules into gaseous, charged 
forms (gas-phase ions). Through the application of electrostatic poten- 
tials, the ratio of the mass of each ion to its charge (the mass-to-charge 
ratio, or m/z) can be measured. Although a wide variety of techniques 
employed by mass spectrometers are used in current practice, each of them 
comprises three essential components: the ion source, the mass analyzer, 
and the detector. Let us consider the first two in greater detail, because 
improvements in them have contributed most significantly to the analysis 
of biological samples. 

The ion source achieves the first critical step in mass spectrometric 
analysis: conversion of the analyte into gas-phase ions (ionization). Until 
recently, proteins could not be ionized efficiently because of their high 
molecular weights and low volatility. However, the development of tech- 
niques such as matrix-assisted laser desorption/ionization (MALDI) and 
electrospray ionization (ESI) has enabled the clearing of this significant 
hurdle. In MALDI, the analyte is evaporated to dryness in the presence of 
a volatile, aromatic compound (the matrix) that can absorb light at specific 
wavelengths. A laser pulse tuned to one of these wavelengths excites and 
vaporizes the matrix, converting some of the analyte into the gas phase. 
Subsequent gaseous collisions enable the intermolecular transfer of charge, 
ionizing the analyte. In ESI, a solution of the analyte is passed through an 
electrically charged nozzle. Droplets of the analyte, now charged, emerge 
from the nozzle into a chamber of very low pressure, evaporating the solvent 
and ultimately yielding the ionized analyte. 

The newly formed analyte ions then enter the mass analyzer, where they 
are distinguished on the basis of their mass-to-charge ratios. There are a 
number of different types of mass analyzers. For this discussion, we will 
consider one of the simplest, the time-of-flight (TOF) mass analyzer, in 
which ions are accelerated through an elongated chamber under a fixed 
electrostatic potential. Given two ions of identical net charge, the smaller 
ion will require less time to traverse the chamber than will the larger ion. 
The mass of each ion can be determined by measuring the time required for 
each ion to pass through the chamber. 
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FIGURE 3.27 MALDI-TOF mass 
spectrometry. (1) The protein sample, 
embedded in an appropriate matrix, is 
ionized by the application of a laser beam. 
(2) An electric field accelerates the ions 
through the flight tube toward the detector. 
(3) The lightest ions arrive first. (4) The 
ionizing laser pulse also triggers a clock that 
measures the time of flight (TOF) for the 
ions. [Information from J. T. Watson, 
Introduction to Mass Spectrometry, 

3d ed. (Lippincott-Raven, 1997), p. 279.] 


FIGURE 3.28 MALDI-TOF mass 
spectrum of insulin and B- 
lactoglobulin. A mixture of 5 pmol each of 
insulin (I) and B-lactoglobulin (L) was 
ionized by MALDI, which produces 
predominantly singly charged molecular 
ions from peptides and proteins—the 
insulin ion (I+H)* and the lactoglobulin ion 
(L+H)*t. Molecules with multiple charges, 
such as those for B-lactoglobulin indicated 
by the blue arrows, as well as small 
quantities of a singly charged dimer of 
insulin (2 I+H)* also are produced. 

[Data from J. T. Watson, Introduction to Mass 
Spectrometry, 3d ed. (Lippincott-Raven, 
1997), p. 282.] 
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The sequential action of the ion source and the mass analyzer enables 
the highly sensitive measurement of the mass of potentially massive ions, 
such as those of proteins. Consider an example of a MALDI ion source 
coupled to a TOF mass analyzer: the MALDI-TOF mass spectrometer 
(Figure 3.27). Gas-phase ions generated by the MALDI ion source pass 
directly into the TOF analyzer, where the mass-to-charge ratios are 
recorded. In Figure 3.28, the MALDI-TOF mass spectrum of a mixture of 
5 pmol each of insulin and lactoglobulin is shown. The masses determined 
by MALDI-TOF are 5733.9 and 18,364, respectively. A comparison with 
the calculated values of 5733.5 and 18,388 reveals that MALDI-TOF is 
clearly an accurate means of determining protein mass. 

In the ionization process, a family of ions, each of the same mass but car- 
rying different total net charges, is formed from a single analyte. Because the 
mass spectrometer detects ions on the basis of their mass-to-charge ratio, 
these ions will appear as separate peaks in the mass spectrum. For example, 
in the mass spectrum of B-lactoglobulin shown in Figure 3.28, peaks near 
m/z = 18,364 (corresponding to the +1 charged ion) and m/z = 9,183 
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(corresponding to the +2 charged ion) are visible (indicated by the blue 
arrows). Although multiple peaks for the same ion may appear to be a nui- 
sance, they enable the spectrometrist to measure the mass of an analyte ion 
more than once in a single experiment, improving the overall precision of 
the calculated result. 


Peptides can be sequenced by mass spectrometry 


Mass spectrometry is one of several techniques for the determination of the 
amino acid sequence of a protein. As we shall discuss shortly, this sequence 
data can be a valuable source of information. For many years, chemical 
methods were the primary means for peptide sequencing. In the most com- 
mon of these methods, Edman degradation, the N-terminal amino acid of a 
polypeptide is labeled with phenyl isothiocyanate. Subsequent cleavage 
yields the phenylthiohydantoin (PTH)-amino acid derivative, which can be 
identified by spectroscopic methods, and the polypeptide chain, now short- 
ened by one residue (Figure 3.29). This procedure can then be repeated on 
the shortened peptide, yielding another PTH—amino acid, which can again 
be identified by chromatography. 

While technological advancements have improved the speed and sensi- 
tivity of the Edman degradation, these parameters have largely been sur- 
passed by the application of mass spectrometric methods. The use of mass 
spectrometry for protein sequencing takes advantage of the fact that ions of 
proteins that have been analyzed by a mass spectrometer, the precursor ions, 
can be broken into smaller peptide chains by bombardment with atoms of an 
inert gas such as helium or argon. These new fragments, or product ions, can 
be passed through a second mass analyzer for further mass characterization. 
The utilization of two mass analyzers arranged in this manner 1s referred to 
as tandem mass spectrometry. Importantly, the product-ion fragments are 
formed in chemically predictable ways that can provide clues to the amino 
acid sequence of the precursor ion. For polypeptide analytes, disruption of 
individual peptide bonds will yield two smaller peptide ions, containing the 
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FIGURE 3.29 The Edman 

degradation. The labeled amino-terminal 
residue (PTH-—alanine in the first round) can 
be released without hydrolyzing the rest 

of the peptide. Hence, the amino-terminal 
residue of the shortened peptide 
(Gly-Asp-Phe-Arg-Gly) can be determined in 
the second round. Three more rounds of 

the Edman degradation reveal the complete 
sequence of the original peptide. 
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FIGURE 3.30 Peptide sequencing by 
tandem mass spectrometry. (A) Within 
the mass spectrometer, peptides can be 
fragmented by bombardment with inert 
gaseous ions to generate a family of 
product ions in which individual amino 
acids have been removed from one end. 
In this example, the carboxyl fragment of 
the cleaved peptide bond is ionized. (B) 
The product ions are detected in the 
second mass analyzer. The mass 
differences between the peaks indicate the 
sequence of amino acids in the precursor 
ion. [Data from H. Steen and M. Mann. Nat. 
Rev. Mol. Cell Biol. 5:699-711, 2004.] 
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sequences before and after the cleavage site. Hence, a family of ions can be 
detected; each ion represents a fragment of the original peptide with one or 
more amino acids removed from one end (Figure 3.30A). For simplicity, only 
the carboxyl-terminal peptide fragments are shown in Figure 3.30A. Figure 
3.30B depicts a representative mass spectrum from a fragmented peptide. 
The mass differences between the peaks in this fragmentation experiment 
indicate the amino acid sequence of the precursor peptide ion. 


Proteins can be specifically cleaved into small peptides 
to facilitate analysis 


In principle, it should be possible to sequence an entire protein using the 
Edman degradation or mass spectrometric methods. In practice, the Edman 
degradation is limited to peptides of 50 residues, because not all peptides in 
the reaction mixture release the amino acid derivative at each step. For 
instance, if the efficiency of release for each round were 98%, the proportion 
of “correct” amino acid released after 60 rounds would be (0.98), or 0.3—a 
hopelessly impure mix. Similarly, sequencing of long peptides by mass spec- 
trometry yields a mass spectrum that can be complex and difficult to inter- 
pret. This obstacle can be circumvented by cleaving a protein into smaller 
peptides that can be sequenced. Protein cleavage can be achieved by chemi- 
cal reagents, such as cyanogen bromide, or proteolytic enzymes, such as 
trypsin. Table 3.3 gives several other ways of specifically cleaving polypep- 
tide chains. Note that these methods are sequence specific: they disrupt the 
protein backbone at particular amino acid residues in a predictable manner. 

The peptides obtained by specific chemical or enzymatic cleavage are 
separated by some type of chromatography. The sequence of each purified 
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Reagent Cleavage site 
Chemical cleavage 
Cyanogen bromide Carboxyl side of methionine residues 
O-Iodosobenzoate Carboxyl side of tryptophan residues 
Hydroxylamine Asparagine-glycine bonds 
2-Nitro-5-thiocyanobenzoate Amino side of cysteine residues 


Enzymatic cleavage 


Trypsin Carboxy] side of lysine and arginine residues 

Clostripain Carboxyl side of arginine residues 

Staphylococcal protease Carboxyl side of aspartate and glutamate residues 
(glutamate only under certain conditions) 

Thrombin Carboxyl side of arginine 

Chymotrypsin Carboxy] side of tyrosine, tryptophan, phenylalanine, 


leucine, and methionine 


Carboxypeptidase A Amino side of C-terminal amino acid (not arginine, 
lysine, or proline) 


peptide is then determined by the methods described above. (Alay, Gly, Lys, Phe, Thr, Trp, Val) 
At this point, the amino acid sequences of segments of the 
protein are known, but the order of these segments is not yet 
defined. How can we order the peptides to obtain the primary 
structure of the original protein? The necessary additional 
information is obtained from overlap peptides (Figure 3.31). 


Digestion 
and 
peptide sequencing 


Trypsin Chymotrypsin 


A second enzyme is used to split the polypeptide chain at dif- pi — Be Te ays i aL 

ferent linkages. For example, chymotrypsin cleaves preferen- Thr —Phe— Val — Lys Gly Lys Thr —Phe 

tially on the carboxy] side of aromatic and some other bulky Mjaee 

nonpolar residues (Chapter 9). Because these chymotryptic fragments 

peptides overlap two or more tryptic peptides, they can be 

used to establish the order of the peptides. The entire amino 

acid sequence of the polypeptide chain is then known. Tryptic peptide Tryptic peptide 
Additional steps are necessary if the initial protein ‘ Thr —Phe— Val iis Ae Ala — Trp — Gly — Lys 

sample is actually several polypeptide chains. SDS~—gel > chyniolepatic overlap pantide 


electrophoresis under reducing conditions should display 

the number of chains. Alternatively, the number of distinct | FIGURE 3.31 Overlap peptides. The peptide obtained by _ 

N-terminal amino acids could be determined. After a pro- chymotryptic digestion overlaps two tryptic peptides, establishing 
: é ore : their order. 

tein has been identified as being made up of two or more 

polypeptide chains, denaturing agents, such as urea or 

guanidine hydrochloride, are used to dissociate chains held together by 

noncovalent bonds. The dissociated chains must be separated from one 

another before sequence determination can begin. Polypeptide chains 

linked by disulfide bonds are separated by reduction with thiols such as 

B-mercaptoethanol or dithiothreitol. To prevent the cysteine residues 

from recombining, they are alkylated with iodoacetate to form stable 

S-carboxymethy] derivatives (Figure 3.32). Sequencing can then be per- 

formed as already described. 


Genomic and proteomic methods are complementary 


Despite the technological advancements in both chemical and mass spectro- 
metric methods of peptide sequencing, heroic effort is required to elucidate 
the sequence of large proteins, those with more than 1000 residues. For 
sequencing such proteins, a complementary experimental approach based 
on recombinant DNA technology is often more efficient. As will be dis- 
cussed in Chapter 5, long stretches of DNA can be cloned and sequenced, 
and the nucleotide sequence can be translated to reveal the amino acid 
sequence of the protein encoded by the gene (Figure 3.33). Recombinant 
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FIGURE 3.32 Disulfide-bond 

reduction. Polypeptides linked by disulfide 
bonds can be separated by reduction with 
dithiothreitol followed by alkylation to 
prevent them from re-forming. 


DNA sequence 


Amino acid 


sequence 


DNA technology is producing a wealth of amino acid sequence information 
at a remarkable rate. 

Nevertheless, even with the use of the DNA base sequence to determine 
primary structure, there is still a need to work with isolated proteins. The 
amino acid sequence deduced by reading the DNA sequence is that of the 
nascent protein, the direct product of the translational machinery. However, 
many proteins undergo posttranslational modifications after their syntheses. 
Some have their ends trimmed, and others arise by cleavage of a larger initial 
polypeptide chain. Cysteine residues in some proteins are oxidized to form 
disulfide links, connecting either parts within a chain or separate polypeptide 
chains. Specific side chains of some proteins are altered. Amino acid 
sequences derived from DNA sequences are rich in information, but they do 
not disclose these modifications. Chemical analyses of proteins in their 
mature form are needed to delineate the nature of these changes, which are 
critical for the biological activities of most proteins. Thus, genomic and pyro- 
teomic analyses are complementary approaches to elucidating the structural 
basis of protein function. 


The amino acid sequence of a protein provides valuable information 


Regardless of the method used for its determination, the amino acid 
sequence of a protein can provide the biochemist with a wealth of informa- 
tion as to the protein’s structure, function, and history. 


1. The sequence of a protein of interest can be compared with all other known 
sequences to ascertain whether significant similarities exist. A search for kin- 
ship between a newly sequenced protein and the millions of previously 
sequenced ones takes only a few seconds on a personal computer (Chapter 6). 
If the newly isolated protein is a member of an established class of protein, 
we can begin to infer information about the protein’s structure and func- 
tion. For instance, chymotrypsin and trypsin are members of the serine 
protease family, a clan of proteolytic enzymes that have a common catalytic 
mechanism based on a reactive serine residue (Chapter 9). If the sequence 
of the newly isolated protein shows sequence similarity with trypsin or 
chymotrypsin, the result suggests that it may be a serine protease. 


2. Comparison of sequences of the same protein in different species yields a 
wealth of information about evolutionary pathways. Genealogical relationships 
between species can be inferred from sequence differences between their pro- 
teins. If we assume that the random mutation rate of proteins over time is 
constant, then careful sequence comparison of related proteins between two 
organisms can provide an estimate for when these two evolutionary lines 
diverged. For example, a comparison of serum albumins found in primates 
indicates that human beings and African apes diverged 5 million years ago, 
not 30 million years ago as was once thought. Sequence analyses have opened 
a new perspective on the fossil record and the pathway of human evolution. 


GGG TIC TIG GGA GCA GCA GGA AGC ACT ATG GGC GCA 


Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala 


FIGURE 3.33 DNA sequence yields the amino acid sequence. The complete nucleotide 
sequence of HIV-1 (human immunodeficiency virus), the cause of AIDS (acquired immune 
deficiency syndrome), was determined within a year after the isolation of the virus. A part of the 
DNA sequence specified by the RNA genome of the virus is shown here with the corresponding 
amino acid sequence (deduced from a knowledge of the genetic code). 


3. Amino acid sequences can be searched for the presence of internal repeats. 
Such internal repeats can reveal the history of an individual protein itself. 
Many proteins apparently have arisen by duplication of primordial genes 
followed by their diversification. For example, calmodulin, a ubiquitous 
calcium sensor in eukaryotes, contains four similar calctum-binding mod- 
ules that arose by gene duplication (Figure 3.34). 


4. Many proteins contain amino acid sequences that serve as signals designat- 
ing their destinations or controlling their processing. For example, a protein 
destined for export from a cell or for location in a membrane contains a 
signal sequence, a stretch of about 20 hydrophobic residues near the amino 
terminus that directs the protein to the appropriate membrane. Another 
protein may contain a stretch of amino acids that functions as a nuclear 
localization signal, directing the protein to the nucleus. 


5. Sequence data provide a basis for preparing antibodies specific for a protein 
of interest. One or more parts of the amino acid sequence of a protein will 
elicit an antibody when injected into a mouse or rabbit. These specific anti- 
bodies can be very useful in determining the amount of a protein present in 
solution or in the blood, ascertaining its distribution within a cell, or cloning 
its gene (Section 3.2). 


6. Amino acid sequences are valuable for making DNA probes that are spe- 
cific for the genes encoding the corresponding proteins. Knowledge of a pro- 
tein’s primary structure permits the use of reverse genetics. DNA sequences 
that correspond to a part of the amino acid sequence can be constructed on 
the basis of the genetic code. These DNA sequences can be used as probes 
to isolate the gene encoding the protein so that the entire sequence of the 
protein can be determined. The gene in turn can provide valuable informa- 
tion about the physiological regulation of the protein. Protein sequencing is 
an integral part of molecular genetics, just as DNA cloning is central to the 
analysis of protein structure and function. We will revisit some of these top- 
ics in more detail in Chapter 5. 


Individual proteins can be identified by mass spectrometry 


The combination of mass spectrometry with chromatographic and peptide- 
cleavage techniques enables highly sensitive protein identification in com- 
plex biological mixtures. When a protein is cleaved by chemical or enzymatic 
methods (Table 3.3), a specific and predictable family of peptide fragments 
is formed. We learned in Chapter 2 that each protein has a unique, precisely 
defined amino acid sequence. Hence, the identity of the individual peptides 
formed from this cleavage reaction—and, importantly, their corresponding 
masses—is a distinctive signature for that particular protein. Protein cleav- 
age, followed by chromatographic separation and mass spectrometry, 
enables rapid identification and quantitation of these signatures, even if 
they are present at very low concentrations. This technique for protein iden- 
tification is referred to as peptide mass fingerprinting. 

The speed and sensitivity of mass spectrometry has made this technology 
critical for the study of proteomics. Let us consider the analysis of the 
nuclear-pore complex from yeast, which facilitates the transport of large mol- 
ecules into and out of the nucleus. This huge macromolecular complex was 
purified from yeast cells by careful procedures. The purified complex was 
fractionated by HPLC followed by gel electrophoresis. Individual bands 
from the gel were isolated, cleaved with trypsin, and analyzed by MALDI- 
TOF mass spectrometry. The fragments produced were compared with 
amino acid sequences deduced from the DNA sequence of the yeast genome 
as shown in Figure 3.35. A total of 174 nuclear-pore proteins were identified 
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hk FIGURE 3.34 Repeating motifs ina 
protein chain. Calmodulin, a calcium 
sensor, contains four similar units (Shown in 
red, yellow, blue, and orange) in a single 
polypeptide chain. Notice that each unit 
binds a calcium ion (shown in green). 
[Drawn from 1CLL.pdb.] 


FIGURE 3.35 Proteomic analysis by mass 
spectrometry. This mass spectrum was 
obtained by analyzing a trypsin-treated band 
in a gel derived from a yeast nuclear-pore 
sample. Many of the peaks were found to 
match the masses predicted for peptide 
fragments from three proteins (Nup120p, 
Kap122p, and Kap120p) within the yeast 
genome. The band corresponded to an 
apparent molecular mass of 100 kDa. 

[Data from M. P. Rout, J. D. Aitchison, 

A. Suprapto, K. Hjertaas, Y. Zhao, and 

B. T. Chait. J. Cell Biol. 148:635-651, 2000.] 
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in this manner. Many of these proteins had not previously been identified 
as being associated with the nuclear pore despite years of study. Furthermore, 
mass spectrometric methods are sensitive enough to detect essentially all 
components of the pore if they are present in the samples used. Thus, a 
complete list of the components constituting this macromolecular complex 
could be obtained in a straightforward manner. Proteomic analysis of this 
type 1s growing in power as mass spectrometric and biochemical fraction- 
ation methods are refined. 


3.4 Peptides Can Be Synthesized by Automated 
Solid-Phase Methods 


Peptides of defined sequence can be synthesized to assist in biochemical 
analysis. These peptides are valuable tools for several purposes. 


1. Synthetic peptides can serve as antigens to stimulate the formation of specific 
antibodies. Suppose we want to isolate the protein expressed by a specific 
gene. Peptides can be synthesized that match the translation of part of the 
gene’s nucleic acid sequence, and antibodies can be generated that target 
these peptides. These antibodies can then be used to isolate the intact pro- 
tein or localize it within the cell. 


2. Synthetic peptides can be used to isolate receptors for many hormones and 
other signal molecules. For example, white blood cells are attracted to 
bacteria by formylmethionyl (fMet) peptides released in the breakdown of 
bacterial proteins. Synthetic formylmethionyl peptides have been useful in 
identifying the cell-surface receptor for this class of peptide. Moreover, 
synthetic peptides can be attached to agarose beads to prepare affinity chro- 
matography columns for the purification of receptor proteins that specifi- 
cally recognize the peptides. 


3. Synthetic peptides can serve as drugs. Vasopressin is a peptide hor- 
mone that stimulates the reabsorption of water in the distal tubules of 
the kidney, leading to the formation of more-concentrated urine. Patients with 
diabetes insipidus are deficient in vasopressin (also called antidiuretic hormone), 
and so they excrete large volumes of dilute urine (more than 5 liters per day) 
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and are continually thirsty. This defect can be treated by administering 
1-desamino-8-D-arginine vasopressin, a synthetic analog of the missing 
hormone (Figure 3.36). This synthetic peptide is degraded in vivo much more 
slowly than vasopressin and does not increase blood pressure. 


4, Finally, studying synthetic peptides can help define the rules governing the 
three-dimensional structure of proteins. We can ask whether a particular 
sequence by itself tends to fold into an a helix, a B strand, or a hairpin turn 
or behaves as arandom coil. The peptides created for such studies can incor- 
porate amino acids not normally found in proteins, allowing more variation 
in chemical structure than is possible with the use of only 20 amino acids. 


How are these peptides constructed? The amino group of one amino acid H3C O R 
is linked to the carboxyl group of another. However, a unique product is Hs i i Be é 
formed only if a single amino group and a single carboxy] group are avail- H5C Oo”  N rae 


able for reaction. Therefore, it is necessary to block some groups and to 2 ib 
activate others to prevent unwanted reactions. First, the carboxyl-terminal ; - 
amino acid is attached to an insoluble resin by its carboxyl group, effectively ee pol acid). = 
protecting it from further peptide-bond-forming reactions (Figure 3.37). 
The a-amino group of this amino acid is blocked with a protecting group 
such as a tert-butyloxycarbonyl (t-Boc) group. The t-Boc protecting group (wen { 
of this amino acid is then removed with trifluoroacetic acid. 

The next amino acid (in the protected t-Boc form) and dicyclohexylcar- Dicydloleexcyicartiodiinida 
bodiimide (DCC) are added together. At this stage, only the carboxyl (Dcc) 
group of the incoming amino acid and the amino group of the resin-bound 
amino acid are free to form a peptide bond. DCC reacts with the carboxyl 
group of the incoming amino acid, activating it for the peptide-bond- 
forming reaction. After the peptide bond has formed, excess reagents are 
washed away, leaving the desired dipeptide product attached to the beads. 
Additional amino acids are linked by the same sequence of reactions. 
At the end of the synthesis, the peptide is released from the beads by the 
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FIGURE 3.37 Solid-phase peptide 
synthesis. The sequence of steps in 
solid-phase synthesis is: (1) anchoring of 
the C-terminal amino acid to a solid resin, 
(2) deprotection of the amino terminus, and 
(8) coupling of the free amino terminus with 
the DCC-activated carboxyl group of the 
next amino acid. Steps 2 and 3 are 
repeated for each added amino acid. 
Finally, in step 4, the completed peptide is 
released from the resin. 
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addition of hydrofluoric acid (HF), which cleaves the carboxyl ester anchor 
without disrupting peptide bonds. Protecting groups on potentially reac- 
tive side chains, such as that of lysine, also are removed at this time. 

A major advantage of this solid-phase method, first developed by 
R. Bruce Merrifield, is that the desired product at each stage is bound to 
beads that can be rapidly filtered and washed. Hence, there is no need to 
purify intermediates. All reactions are carried out in a single vessel, elimi- 
nating losses caused by repeated transfers of products. This cycle of reactions 
can be readily automated, which makes it feasible to routinely synthesize 
peptides containing about 50 residues in good yield and purity. In fact, 
the solid-phase method has been used to synthesize interferons (155 residues) 
that have antiviral activity and ribonuclease (124 residues) that is catalytically 
active. The protecting groups and cleavage agents may be varied for increased 
flexibility or convenience. 


Synthetic peptides can be linked to create even longer molecules. 
With the use of specially developed peptide-ligation methods, proteins of 
100 amino acids or more can by synthesized in very pure form. These meth- 
ods enable the construction of even sharper tools for examining protein 
structure and function. 


3.5 Three-Dimensional Protein Structure Can Be Determined 
by X-ray Crystallography and NMR Spectroscopy 


Elucidation of the three-dimensional structure of a protein is often the 
source of a tremendous amount of insight into its corresponding function, 
inasmuch as the specificity of active sites and binding sites is defined by the 
precise atomic arrangement within these regions. For example, knowledge 
of the structure of a protein enables the biochemist to predict its mechanism 
of action, the effects of mutations on its function, and the desired features of 
drugs that may inhibit or augment its activity. X-ray crystallography and 
nuclear magnetic resonance spectroscopy are the two most important tech- 
niques for elucidating the conformation of proteins. 


X-ray crystallography reveals three-dimensional structure 
in atomic detail 


X-ray crystallography was the first method developed to determine protein 
structure in atomic detail. This technique provides the clearest visualization of 
the precise three-dimensional positions of most atoms within a protein. Of all 
forms of radiation, x-rays provide the best resolution for the determination of 
molecular structures because their wavelength approximately corresponds to 
the length of a covalent bond. The three components in an x-ray crystallographic 
analysis are a protein crystal, a source of x-rays, and a detector (Figure 3.38). 
X-ray crystallography first requires the preparation of a protein or pro- 
tein complex in crystal form, in which all protein molecules are oriented in 
a fixed, repeated arrangement with respect to one another. Slowly adding 
ammonium sulfate or another salt to a concentrated solution of protein to 
reduce its solubility favors the formation of highly ordered crystals—the 
process of salting out discussed on page 68. For example, myoglobin crys- 
tallizes in 3 M ammonium sulfate. Protein crystallization can be quite chal- 
lenging: a concentrated solution of highly pure material is required and it is 
often difficult to predict which experimental conditions will yield the most- 
effective crystals. Methods for screening many different crystallization 
conditions using a small amount of protein sample have been developed. 
Typically, hundreds of conditions must be tested to obtain crystals fully suit- 
able for crystallographic studies. Nevertheless, increasingly large and com- 
plex proteins have been crystallized. For example, poliovirus, an 8500-kDa 
assembly of 240 protein subunits surrounding an RNA core, has been crys- 
tallized and its structure solved by x-ray methods. Crucially, proteins fre- 
quently crystallize in their biologically active configuration. Enzyme crystals 
may display catalytic activity if the crystals are suffused with substrate. 
After a suitably pure crystal of protein has been obtained, a source of 
x-rays is required. A beam of x-rays of wavelength 1.54 A is produced by 
accelerating electrons against a copper target. Equipment suitable for gener- 
ating x-rays in this manner is available in many laboratories. Alternatively, 
x-rays can be produced by synchrotron radiation, the acceleration of electrons 
in circular orbits at speeds close to the speed of light. Synchrotron- generated 
x-ray beams are much more intense than those generated by electrons hitting 
copper. The higher intensity enables the acquisition of high quality data from 
smaller crystals over a shorter exposure times. Several facilities throughout 


95 


3.5 Crystallography and NMR Spectroscopy 


Detector 


\¥ 


X-ray 
source 


X-ray 
beam 


Crystal —V 
@ 


re 


V/s 
Hii 
PFA 


FIGURE 3.38 An x-ray crystallographic 
experiment. An x-ray source generates a 
beam, which is diffracted by a crystal. The 
resulting diffraction pattern is collected on a 


detector. 


96 the world generate synchrotron radiation, such as the Advanced Light Source at 
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Argonne National Laboratory outside Chicago and the Photon Factory in 
Tsukuba City, Japan. 


When a narrow beam of x-rays is directed at the protein crystal, most of 
the beam passes directly through the crystal while a small part is scattered 
in various directions. These scattered, or diffracted, x-rays can be detected 


FIGURE 3.39 An x-ray diffraction pattern. A protein crystal 
diffracts x-rays to produce a pattern of spots, or reflections, on the 
detector surface. The white silhouette in the center of the image is 
from a beam stop which protects the detector from the intense, 
undiffracted x-rays. [S. Lansky, O. Alalouf, V. Solomon, A. Alhassid, 
L. Govada, N. E. Chayan, H. Belrhali, Y. Shoham, and G. Shoham. 
Acta Cryst. F69:430-434, 2013, Fig. 2.] 


Figure 3.40 Interpretation of an electron-density map. (A) A 
segment of an electron-density map is drawn as a three-dimensional 
contour plot, in which the regions inside the “cage” represent the 
regions of highest electron density. (B) A model of the protein is built 
into this map so as to maximize the placement of atoms within this 
density. [Drawn from 1FCH.pdb.] 


by x-ray film or by a solid-state electronic detector. The 
scattering pattern provides abundant information about 
protein structure. The basic physical principles underly- 
ing the technique are: 


1. Electrons scatter x-rays. The amplitude of the wave 
scattered by an atom is proportional to its number of elec- 
trons. Thus, a carbon atom scatters six times as strongly as 
a hydrogen atom does. 


2. The scattered waves recombine. Each diffracted beam 
comprises waves scattered by each atom in the crystal. 
The scattered waves reinforce one another at the film or 
detector if they are in phase (in step) there, and they cancel 
one another if they are out of phase. 


3. The way in which the scattered waves recombine depends 
only on the atomic arrangement. 


The protein crystal is mounted and positioned in a pre- 
cise orientation with respect to the x-ray beam and the film. 
The crystal is rotated so that the beam can strike the crystal 
from many directions. This rotational motion results in an 
x-ray photograph consisting of a regular array of spots 
called reflections. The x-ray photograph shown in Figure 3.39 
is a two-dimensional section through a three-dimensional 
array of 72,000 reflections. The intensities and positions of 
these reflections are the basic experimental data of an x-ray 
crystallographic analysis. Each reflection is formed from a 
wave with an amplitude proportional to the square root of 
the observed intensity of the spot. Each wave also has a 
phase—that is, the timing of its crests and troughs relative 
to those of other waves. Additional experiments or calcula- 
tions must be performed to determine the phases corre- 
sponding to each reflection. 

The next step is to reconstruct an image of the protein 
from the observed reflections. In light microscopy or 
electron microscopy, the diffracted beams are focused by 
lenses to directly form an image. However, appropriate 
lenses for focusing x-rays do not exist. Instead, the image is 
formed by applying a mathematical relation called a Fourier 
transform to the measured amplitudes and calculated phases 
of every observed reflection. The image obtained is referred 
to as the electron-density map. It is a three-dimensional 
graphic representation of where the electrons are most 
densely localized and is used to determine the positions of 
the atoms in the crystallized molecule (Figure 3.40). Critical 
to the interpretation of the map is its resolution, which is 
determined by the number of scattered intensities used in 
the Fourier transform. The fidelity of the image depends on 
this resolution, as shown by the optical analogy in Figure 3.41. 


(B) 


(9) 


A resolution of 6 A reveals the course of the polypeptide chain but few other 
structural details. The reason is that polypeptide chains pack together so that 
their centers are between 5 A and 10 A apart. Maps at higher resolution are 
needed to delineate groups of atoms, which lie between 2.8 A and 4.0 A apart, 
and individual atoms, which are between 1.0 A and 1.5 A apart (Figure 3.42). 
The ultimate resolution of an x-ray analysis is determined by the degree of 
perfection of the crystal. For proteins, this limiting resolution is often about 
2 A; however, in exceptional cases, resolutions of 1.0 A have been obtained. 


Nuclear magnetic resonance spectroscopy can reveal 
the structures of proteins in solution 


X-ray crystallography is the most powerful method for determining protein 
structures. However, some proteins do not readily crystallize. Furthermore, 
although structures of crystallized proteins very closely represent those of pro- 
teins free of the constraints imposed by the crystalline environment, structures 
in solution can be sources of additional insights. Nuclear magnetic resonance 
(NMR) spectroscopy is unique in being able to reveal the atomic structure of 
macromolecules in solution, provided that highly concentrated solutions 


FIGURE 3.41 Resolution affects the 
quality of an image. The effect of 
resolution on the quality of a reconstructed 
image is shown by an optical analog of 
x-ray diffraction: (A) a photograph of the 
Parthenon; (B) an optical diffraction pattern 
of the Parthenon; (C and D) images 
reconstructed from the pattern in part 

B. More data were used to obtain image 
D than image C, which accounts for the 
higher quality of image D. [Courtesy of 

Dr. Thomas Steitz (part A) and Dr. David 
DeRosier (part B).] 


FIGURE 3.42 Resolution affects the 
quality of the electron density map. The 
electron density maps of a tyrosine residue 
at four different resolution levels (1.0 A, 

2.0 A, 2.7 A, and 3.0 A) are shown. At the 
lower resolution levels (2.7 A and 3.0 A), 
only a group of atoms corresponding to the 
side chain is visible, whereas at the highest 
resolution (1.0 A), individual atoms within 
the side chain are distinguishable. [Data 
from www.rcsb.org/pdb/101/static101. 
do?p=education_discussion/Looking-at- 
Structures/resolution.html] 
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TABLE 3.4 Biologically important 
nuclei giving NMR signals 


Natural abundance 


Nucleus (% by weight of the element) 
1 99.984 
2H 0.016 
BC 1.108 
14) 99.635 
15N 0.365 
Ha) 0.037 
23Na 100.0 
23Me 10.05 
31p 100.0 
35C] 75.4 
39K 93.1 
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FIGURE 3.43 Basis of NMR 
spectroscopy. The energies of the two 
orientations of a nucleus of spin LP (such 
as 31P and 'H) depend on the strength of 
the applied magnetic field. Absorption of 
electromagnetic radiation of appropriate 
frequency induces a transition from the 
lower to the upper level. 


Intensity — 


(~1 mM, or 15 mg ml! fora 15-kDa protein) can be obtained. This technique 
depends on the fact that certain atomic nuclei are intrinsically magnetic. Only 
a limited number of isotopes display this property, called spin, and those most 
important to biochemistry are listed in Table 3.4. The simplest example is the 
hydrogen nucleus (1H), which is a proton. The spinning of a proton generates 
a magnetic moment. This moment can take either of two orientations, or spin 
states (called a and B), when an external magnetic field is applied (Figure 3.43). 
The energy difference between these states is proportional to the strength of 
the imposed magnetic field. The a state has a slightly lower energy because it 
is aligned with this applied field. Hence, in a given population of nuclei, 
slightly more will occupy the a state (by a factor of the order of 1.00001 ina 
typical experiment). A spinning proton in an a state can be raised to an excited 
state (B state) by applying a pulse of electromagnetic radiation (a radio- 
frequency, or RF, pulse), provided that the frequency corresponds to the 
energy difference between the a and the £ states. In these circumstances, the 
spin will change from a to B; in other words, resonance will be obtained. 

These properties can be used to examine the chemical surroundings of the 
hydrogen nucleus. The flow of electrons around a magnetic nucleus generates 
a small local magnetic field that opposes the applied field. The degree of such 
shielding depends on the surrounding electron density. Consequently, nuclei 
in different environments will change states, or resonate, at slightly different 
field strengths or radiation frequencies. A resonance spectrum for a molecule 
is obtained by keeping the magnetic field constant and varying the frequency 
of the electromagnetic radiation. The nuclei of the perturbed sample absorb 
electromagnetic radiation at a frequency that can be measured. The different 
frequencies, termed chemical shifts, are expressed in fractional units 6 (parts 
per million, or ppm) relative to the shifts of a standard compound, such as a 
water-soluble derivative of tetramethylsilane, that is added with the sample. 
For example, a—CH, proton typically exhibits a chemical shift (5) of 1 ppm, 
compared with a chemical shift of 7 ppm for an aromatic proton. The 
chemical shifts of most protons in protein molecules fall between 0 and 
9 ppm (Figure 3.44). Most protons in many proteins can be resolved by using 
this technique of one-dimensional NMR. With this information, we can then 
deduce changes to a particular chemical group under different conditions, 
such as the conformational change of a protein from a disordered structure to 
an a helix in response to a change in pH. 
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FIGURE 3.44 One-dimensional NMR spectra. (A) ‘H-NMR spectrum of ethanol 
(CH,CH,Oh) shows that the chemical shifts for the hydrogen are clearly resolved. (B) ‘H-NMR 
spectrum of a 55 amino acid fragment of a protein having a role in RNA splicing shows a 
greater degree of complexity. A large number of peaks are present and many overlap. 

[(A) Data from C. Branden and J. Tooze, Introduction to Protein Structure (Garland, 1991), p. 280; 
(B) courtesy of Dr. Barbara Amann and Dr. Wesley McDermott.] 
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We can garner even more information by examining how the spins on dif- 
ferent protons affect their neighbors. By inducing a transient magnetization in 
a sample through the application of a radio-frequency pulse, we can alter the 
spin on one nucleus and examine the effect on the spin of a neighboring nucleus. 
Especially revealing is a two-dimensional spectrum obtained by nuclear Overhauser 
enhancement spectroscopy (NOESY), which graphically displays pairs of protons 
that are in close proximity, even if they are not close together in the primary 
structure. The basis for this technique is the nuclear Overhauser effect (NOE), 
an interaction between nuclei that is proportional to the inverse sixth power of 
the distance between them. Magnetization is transferred from an excited 
nucleus to an unexcited one if the two nuclei are less than about 5 A apart 
(Figure 3.45A). In other words, the effect provides a means of detecting the 
location of atoms relative to one another in the three-dimensional structure of 
the protein. The peaks that lie along the diagonal of a NOESY spectrum 
(shown in white in Figure 3.45B) correspond to those present in a one-dimen- 
sional NMR experiment. The peaks apart from the diagonal (shown in red in 
Figure 3.45B), referred to as off-diagonal peaks or cross-peaks, provide crucial 
new information: they identify pairs of protons that are less than 5 A apart. A 
two-dimensional NOESY spectrum for a protein comprising 55 amino acids is 
shown in Figure 3.46. The large number of off-diagonal peaks reveals short 
proton—proton distances. The three-dimensional structure of a protein can be 
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FIGURE 3.45 The nuclear Overhauser 
effect. The nuclear Overhauser effect 
(NOE) identifies pairs of protons that 

are in close proximity. (A) Schematic 
representation of a polypeptide chain 
highlighting five particular protons. Protons 
2 and 5 are in close proximity (~4 A apart), 
whereas other pairs are farther apart. 

(B) A highly simplified NOESY spectrum. 
The diagonal shows five peaks 
corresponding to the five protons in part A. 
The peak above the diagonal and the 
symmetrically related one below reveal that 
proton 2 is close to proton 5. 


FIGURE 3.46 Detecting short proton- 
proton distances. A NOESY spectrum 
for a 55 amino acid domain from a protein 
having a role in RNA splicing. Each 
off-diagonal peak corresponds to a short 
proton-proton separation. This spectrum 
reveals hundreds of such short 
proton-proton distances, which can be 
used to determine the three-dimensional 
structure of this domain. [Courtesy of Dr. 
Barbara Amann and Dr. Wesley McDermott.] 
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(B) 


FIGURE 3.48 A family of structures. A set 
of 25 structures for a 28 amino acid domain 
from a zinc-finger-DNA-binding protein. The 
red line traces the average course of the 
protein backbone. Each of these structures 
is consistent with hundreds of constraints 
derived from NMR experiments. The 
differences between the individual structures 
are due to a combination of imperfections in 
the experimental data and the dynamic 
nature of proteins in solution. [Courtesy of 
Dr. Barbara Amann.] 
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Calculated structure 


FIGURE 3.47 Structures calculated on the basis of NUR 
constraints. (A) NOESY observations show that protons 
(connected by dotted red lines) are close to one another in 
space. (B) A three-dimensional structure calculated with these 
proton pairs constrained to be close together. 


reconstructed with the use of such proximity relations. Structures are calculated 
such that protons that must be separated by less than 5 A on the basis of 
NOESY spectra are close to one another in the three-dimensional structure 
(Figure 3.47). If a sufficient number of distance constraints are applied, the 
three-dimensional structure can nearly be determined uniquely. 

In practice, a family of related structures is generated by NMR spectros- 
copy for three reasons (Figure 3.48). First, not enough constraints may be 
experimentally accessible to fully specify the structure. Second, the distances 
obtained from analysis of the NOESY spectrum are only approximate. 
Finally, the experimental observations are made not on single molecules but 
on a large number of molecules in solution that may have slightly different 
structures at any given moment. Thus, the family of structures generated 
from NMR structure analysis indicates the range of conformations for the 
protein in solution. At present, structural determination by NMR spectros- 
copy is generally limited to proteins less than 50 kDa, but its resolving power 
is certain to increase. The power of NMR has been greatly enhanced by the 
ability of recombinant DNA technology to produce proteins labeled uni- 
formly or at specific sites with '3C, !5N, and 7H (Chapter 5). 

The structures of nearly 97,000 proteins had been elucidated by x-ray 
crystallography and NMR spectroscopy by the end of 2013 and several new 
structures are now determined each day. The coordinates are collected at the 
Protein Data Bank (www.pdb.org), and the structures can be accessed for 
visualization and analysis. Knowledge of the detailed molecular architec- 
ture of proteins has been a source of insight into how proteins recognize 
and bind other molecules, how they function as enzymes, how they fold, 
and how they evolved. This extraordinarily rich harvest is continuing at a 
rapid pace and is greatly influencing the entire field of biochemistry as well 
as other biological and physical sciences. 


SUMMARY 


The rapid progress in gene sequencing has advanced another goal of 
biochemistry—elucidation of the proteome. The proteome is the complete 
set of proteins expressed and includes information about how they are 
modified, how they function, and how they interact with other molecules. 


3.1. The Purification of Proteins Is an Essential First Step in Understanding 
Their Function 
Proteins can be separated from one another and from other molecules on 
the basis of such characteristics as solubility, size, charge, and binding 
affinity. 5DS—polyacrylamide gel electrophoresis separates the polypeptide 
chains of proteins under denaturing conditions largely according to mass. 
Proteins can also be separated electrophoretically on the basis of net 


3.2 


3.3 


3.4 


3.5 


charge by isoelectric focusing in a pH gradient. Ultracentrifugation and 
gel-filtration chromatography resolve proteins according to size, whereas 
ion-exchange chromatography separates them mainly on the basis of net 
charge. The high affinity of many proteins for specific chemical groups is 
exploited in affinity chromatography, in which proteins bind to columns 
containing beads bearing covalently linked substrates, inhibitors, or other 
specifically recognized groups. The mass of a protein can be determined 
by sedimentation-equilibrium measurements. 


Immunology Provides Important Techniques with Which to 

Investigate Proteins 

Proteins can be detected and quantitated by highly specific antibodies; 
monoclonal antibodies are especially useful because they are homoge- 
neous. Enzyme-linked immunosorbent assays and western blots of 
SDS-—polyacrylamide gels are used extensively. Proteins can also be 
localized within cells by immunofluorescence microscopy and immu- 
noelectron microscopy. 


Mass Spectrometry Is a Powerful Technique for the Identification 

of Peptides and Proteins 

Techniques such as matrix-assisted laser desorption/ionization (MALDI) 
and electrospray ionization (ESI) allow the generation of ions of proteins 
and peptides in the gas phase. The mass of such protein ions can be deter- 
mined with great accuracy and precision. Masses determined by these 
techniques act as protein name tags because the mass of a protein or pep- 
tide is precisely determined by its amino acid composition and, hence, by 
its sequence. In addition to chemical methods, such as the Edman degra- 
dation, tandem mass spectrometry enables the rapid and highly accurate 
sequencing of peptides. These sequences are rich in information concern- 
ing the kinship of proteins, their evolutionary relationships, and diseases 
produced by mutations. Knowledge of a sequence provides valuable clues 
to conformation and function. Mass spectrometric techniques are central 
to proteomics because they make it possible to analyze the constituents of 
large macromolecular assemblies or other collections of proteins. 


Peptides Can Be Synthesized by Automated Solid-Phase Methods 
Polypeptide chains can be synthesized by automated solid-phase methods 
in which the carboxyl end of the growing chain is linked to an insoluble 
support. The carboxyl group of the incoming amino acid is activated by 
dicyclohexylcarbodiimide and joined to the amino group of the growing 
chain. Synthetic peptides can serve as drugs and as antigens to stimulate 
the formation of specific antibodies. They can also be sources of insight 
into the relation between amino acid sequence and conformation. 


Three-Dimensional Protein Structure Can Be Determined by X-ray 
Crystallography and NMR Spectroscopy 

X-ray crystallography and nuclear magnetic resonance spectroscopy 
have greatly enriched our understanding of how proteins fold, recognize 
other molecules, and catalyze chemical reactions. X-ray crystallography 
is possible because electrons scatter x-rays. The diffraction pattern pro- 
duced can be analyzed to reveal the arrangement of atoms in a protein. 
The three-dimensional structures of tens of thousands of proteins are 
now known in atomic detail. Nuclear magnetic resonance spectroscopy 
reveals the structure and dynamics of proteins in solution. The chemical 
shift of nuclei depends on their local environment. Furthermore, the 
spins of neighboring nuclei interact with each other in ways that provide 
definitive structural information. This information can be used to 
determine complete three-dimensional structures of proteins. 
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Summary 


KEY TERMS 


proteome (p. 66) 

assay (p. 67) 

specific activity (p. 67) 

homogenate (p. 67) 

salting out (p. 68) 

dialysis (p. 69) 

gel-filtration chromatography (p. 69) 
ion-exchange chromatography (p. 70) 
cation exchange (p. 70) 

anion exchange (p. 70) 

affinity chromatography (p. 70) 


high-performance liquid chromatography 


(HPLC) (p. 71) 
gel electrophoresis (p. 71) 


isoelectric focusing (p. 73) 
two-dimensional electrophoresis (p. 74) 
sedimentation coefficient 

(Svedberg unit, S) (p. 76) 
antibody (p. 79) 
antigen (p. 79) 
antigenic determinant (epitope) (p. 79) 
polyclonal antibody (p. 80) 
monoclonal antibody (p. 80) 


enzyme-linked immunosorbent 


assay (ELISA) (p. 82) 
western blotting (p. 83) 
fluorescence microscopy (p. 84) 
green fluorescent protein (GFP) (p. 84) 
matrix-assisted laser desorption/ 


electrospray ionization (ESI) (p. 85) 

time-of-flight (TOF) mass 
analyzer (p. 85) 

Edman degradation (p. 87) 

phenyl isothiocyanate (p. 87) 

tandem mass spectrometry (p. 87) 

overlap peptide (p. 89) 

peptide mass fingerprinting (p. 91) 

solid-phase method (p. 94) 

x-ray crystallography (p. 95) 

Fourier transform (p. 96) 

electron-density map (p. 96) 


nuclear magnetic resonance 
(NMR) spectroscopy (p. 97) 
chemical shift (p. 98) 


isoelectric point (p. 73) 


ionization (MALDI) (p. 85) 


PROBLEMS 


1. Valuable reagents. The following reagents are often used 
in protein chemistry: 


CNBr Trypsin 

Urea Performic acid 
Mercaptoethanol 6N HCl 
Chymotrypsin Phenyl isothiocyanate 


Which one is the best suited for accomplishing each of the 
following tasks? 


(a) Determination of the amino acid sequence of a small 
peptide. 
(b) Reversible denaturation of a protein devoid of disulfide 


bonds. Which additional reagent would you need if disul- 
fide bonds were present? 


(c) Hydrolysis of peptide bonds on the carboxyl side of 
aromatic residues. 

(d) Cleavage of peptide bonds on the carboxyl side of 
methionines. 

(e) Hydrolysis of peptide bonds on the carboxyl side of 
lysine and arginine residues. 


2. The only constant is change. Explain how two different 
cell types from the same organism will have identical 
genomes but may have vastly divergent proteomes. 


3. Crafting a new breakpoint. Ethyleneimine reacts with cyste- 
ine side chains in proteins to form S-aminoethyl derivatives. 
The peptide bonds on the carboxyl side of these modified cys- 
teine residues are susceptible to hydrolysis by trypsin. Why? 


4. Spectrometry. The absorbance A of a solution is defined as 
A = log, )([)/D) 


in which J) is the incident-light intensity and I is the trans- 
mitted-light intensity. The absorbance is related to the molar 
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absorption coefficient (extinction coefficient) ¢ (in M7! 
cm7!), concentration c (in M), and path length | (in cm) by 


A = elc 
The absorption coefficient of myoglobin at 580 nm is 
15,000 M~! cm7~!. What is the absorbance of a 1 mg ml7! 


solution across a 1-cm path? What percentage of the inci- 
dent light is transmitted by this solution? 


5. It’s in the bag. Suppose that you precipitate a protein with 
1M (NH,),SO, and that you wish to reduce the concentra- 
tion of the (NH,),5O,. You take 1 ml of your sample and 
dialyze it in 1000 ml of buffer. At the end of dialysis, what 
is the concentration of (NH,),5O, in your sample? How 
could you further lower the (NH,),SO, concentration? 


6. Too much or not enough. Why do proteins precipitate at high 
salt concentrations? Although many proteins precipitate at 
high salt concentrations, some proteins require salt to dissolve 
in water. Explain why some proteins require salt to dissolve. 


7. A slow mover. Tropomyosin, a 70-kDa muscle protein, 
sediments more slowly than does hemoglobin (65 kDa). 
Their sedimentation coefficients are 2.65 and 4.315, respec- 
tively. Which structural feature of tropomyosin accounts 
for its slow sedimentation? 


8. Sedimenting spheres. What is the dependence of the sedi- 
mentation coefficient s of a spherical protein on its mass? 
How much more rapidly does an 80-kDa protein sediment 
than does a 40-kDa protein? 


9. Frequently used in shampoos. The detergent sodium 
dodecyl sulfate (SDS) denatures proteins. Suggest how SDS 
destroys protein structure. 


10. Size estimate. The relative electrophoretic mobilities of 
a 30-kDa protein and a 92-kDa protein used as standards 


on an SDS-polyacrylamide gel are 0.80 and 0.41, respec- 
tively. What is the apparent mass of a protein having a 
mobility of 0.62 on this gel? 


11. Unexpected migration. Some proteins migrate anoma- 
lously in SDS-PAGE gels. For instance, the molecular weight 
determined from an SDS-PAGE gel is sometimes very differ- 
ent from the molecular weight determined from the amino 
acid sequence. Suggest an explanation for this discrepancy. 


12. Sorting cells. Fluorescence-activated cell sorting (FACS) 
is a powerful technique for separating cells according to 
their content of particular molecules. For example, a fluo- 
rescence-labeled antibody specific for a cell-surface protein 
can be used to detect cells containing such a molecule. 
Suppose that you want to isolate cells that possess a recep- 
tor enabling them to detect bacterial degradation products. 
However, you do not yet have an antibody directed against 
this receptor. Which fluorescence-labeled molecule would 
you prepare to identify such cells? 


13. Column choice. (a) The octapeptide AVGWRVKS was 
digested with the enzyme trypsin. Which method would be 
most appropriate for separating the products: ion-exchange 
or gel-filtration chromatography? Explain. (b) Suppose that 
the peptide was digested with chymotrypsin. What would 
be the optimal separation technique? Explain. 


14. Power(ful) tools. Monoclonal antibodies can be conju- 
gated to an insoluble support by chemical methods. Explain 
how these antibody-bound beads can be exploited for pro- 
tein purification. 


15. Assay development. You wish to isolate an enzyme from 
its native source and need a method for measuring its activ- 
ity throughout the purification. However, neither the sub- 
strate nor the product of the enzyme-catalyzed reaction can 
be detected by spectroscopy. You discover that the product 
of the reaction is highly antigenic when injected into mice. 
Propose a strategy to develop a suitable assay for this 
enzyme. 


16. Making more enzyme? In the course of purifying an 
enzyme, a researcher performs a purification step that 
results in an increase in the total activity to a value greater 
than that present in the original crude extract. Explain how 
the amount of total activity might increase. 


17. Divide and conquer. The determination of the mass of a 
protein by mass spectrometry often does not allow its 
unique identification among possible proteins within a 
complete proteome, but determination of the masses of all 
fragments produced by digestion with trypsin almost 
always allows unique identification. Explain. 


18. Know your limits. Which two amino acids are indistin- 
guishable in peptide sequencing by the tandem mass spec- 
trometry method described in this chapter and why? 


103 
Problems 


19. Protein purification problem. Complete the following table. 


Total Total Specific 
Purification protein activity activity Purification Yield 
Procedure (mg) (units) (unitsmg™!) — level (%) 
Crude extract 20,000 4,000,000 1 100 
(NH,),SO, 


precipitation 5,000 3,000,000 
DEAE-cellulose 

chromatography 1,500 1,000,000 
Gel-filtration 


chromatography 500 750,000 
Affinity 
chromatography 45 675,000 


20. Part of the mix. Your frustrated colleague hands you a 
mixture of four proteins with the following properties: 


Isoelectric point (pI) Molecular weight (in kDa) 


Protein A 4.1 80 
Protein B 9.0 81 
Protein C 8.8 37 
Protein D 3.9 172 


(a) Propose a method for the isolation of Protein B from the 
other proteins. (b) If Protein B also carried a His tag at its 
N-terminus, how could you revise your method? 


21. The challenge of flexibility. Structures of proteins com- 
prising domains separated by flexible linker regions can be 
quite difficult to solve by x-ray crystallographic methods. 
Why might this be the case? What are possible experimen- 
tal approaches to circumvent this barrier? 


Chapter Integration Problems 


22. Quaternary structure. A protein was purified to homo- 
geneity. Determination of the mass by gel-filtration chro- 
matography yields 60 kDa. Chromatography in the pres- 
ence of 6 M urea yields a 30-kDa species. When the chro- 
matography is repeated in the presence of 6 M urea and 10 
mM £-mercaptoethanol, a single molecular species of 15 
kDa results. Describe the structure of the molecule. 


23. Helix—coil transitions. (a) NMR measurements have 
shown that poly-L-lysine is a random coil at pH 7 but becomes 
a helix as the pH is raised above 10. Account for this pH- 
dependent conformational transition. (b) Predict the pH 
dependence of the helix—coil transition of poly-L-glutamate. 


24. Peptide mass determination. You have isolated a protein 
from the bacterium E. coli and seek to confirm its identity 
by trypsin digestion and mass spectrometry. Determination 
of the masses of several peptide fragments has enabled you 
to deduce the identity of the protein. However, there is a 
discrepancy with one of the peptide fragments, which you 
believe should have the sequence MLNSFK and an 
(M+H)* value of 739.38. In your experiments, you repeat- 
edly obtain an (M+H)t value of 767.38. What is the cause 
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of this discrepancy and what does it tell you about the 
region of the protein from which this peptide is derived? 


25. Peptides on a chip. Large numbers of different peptides 
can be synthesized in a small area on a solid support. This 
high-density array can then be probed with a fluorescence- 
labeled protein to find out which peptides are recognized. 
The binding of an antibody to an array of 1024 different 
peptides occupying a total area the size of a thumbnail is 
shown in the adjoining illustration. How would you syn- 
thesize such a peptide array? (Hint: Use light instead of acid 
to deprotect the terminal amino group in each round of 
synthesis. ) 


Fluorescence scan of an array of 1024 peptides in a 1.6-cm? 
area. Each synthesis site is a 400-j4m square. A fluorescently 
labeled monoclonal antibody was added to the array to identify 
peptides that are recognized. The height and color of each square 
denote the fluorescence intensity. [Information from S. P. A. Fodor 
et al., Science 251(1991):767.] 


26. Exchange rate. The amide hydrogen atoms of peptide 
bonds within proteins can exchange with protons in the sol- 
vent. In general, amide hydrogen atoms in buried regions of 
proteins and protein complexes exchange more slowly than 
those on the solvent-accessible surface do. Determination of 
these rates can be used to explore the protein-folding reac- 
tion, probe the tertiary structure of proteins, and identify the 
regions of protein-protein interfaces. These exchange reac- 
tions can be followed by studying the behavior of the protein 
in solvent that has been labeled with deuterium (7H), a stable 
isotope of hydrogen. What two methods described in this 
chapter could be readily applied to the study of hydrogen— 
deuterium exchange rates in proteins? 


Data Interpretation Problems 


27. Protein sequencing 1. Determine the sequence of hexa- 
peptide on the basis of the following data. Note: When the 
sequence is not known, a comma separates the amino acids 


(Table 3.3). 


Amino acid composition: (2R,A,5, V, Y) 
N-terminal analysis of the hexapeptide: A 
Trypsin digestion: (R,A,V) and (R,5S, Y) 
Carboxypeptidase digestion: No digestion. 
Chymotrypsin digestion: (A,R,V,Y) and (R,5) 


28. Protein sequencing 2. Determine the sequence of a pep- 
tide consisting of 14 amino acids on the basis of the follow- 
ing data. 

Amino acid composition: (45,2L,F,G,I,K,M,T,W,Y) 
N-terminal analysis: 5 

Carboxypeptidase digestion: L 

Trypsin digestion: (38,2L,F,I,M,T,W) (G,K,5,Y) 
Chymotrypsin digestion: (F,I,8) (G,K,L) (L,5) (M,T) 
(S,W) (5, Y) 


N-terminal analysis of (F,1,S) peptide: 5 


Cyanogen bromide treatment: (25,F,G,I,K,L,M*,T,Y) 
(28,L,W) 


M*, methionine detected as homoserine 


29. Applications of two-dimensional electrophoresis. Performic 
acid cleaves the disulfide linkage of cystine and converts the 
sulfhydryl groups into cysteic acid residues, which are then 
no longer capable of disulfide-bond formation. 


Consider the following experiment: You suspect that a pro- 
tein containing three cysteine residues has a single disulfide 
bond. You digest the protein with trypsin and subject the 
mixture to electrophoresis along one end of a sheet of paper. 
After treating the paper with performic acid, you subject 
the sheet to electrophoresis in the perpendicular direction 
and stain it with a reagent that detects proteins. How would 
the paper appear if the protein did not contain any disulfide 
bonds? If the protein contained a single disulfide bond? 
Propose an experiment to identify which cysteine residues 


form the disulfide bond. 
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DNA, RNA, and the Flow of 
Genetic Information 


CHAPTER 


Family resemblance, very evident in this photograph of four sisters, 
results from having genes in common. Genes must be expressed 
to exert an effect, and proteins regulate such expression. One 
such regulatory protein, a zinc-finger protein (zinc ion is blue, 
protein is red), is shown bound to a control region of DNA (black). 
[(Left) © Nicholas Nixon, courtesy Fraenkel Gallery, San Francisco. 


(Right) Drawn from 1AAY.pdb.] 


NA and RNA are long linear polymers, called nucleic acids, that carry 
information in a form that can be passed from one generation to the next. 
These macromolecules consist of a large number of linked nucleotides, each 
composed of a sugar, a phosphate, and a base. Sugars linked by phosphates 
form a common backbone that plays a structural role, whereas the sequence of 
bases along a nucleic acid strand carries genetic information. The DNA mole- 
cule has the form of a double helix, a helical structure consisting of two comple- 
mentary nucleic acid strands. Each strand serves as the template for the other 
in DNA replication. The genes of all cells and many viruses are made of DNA. 
Genes specify the kinds of proteins that are made by cells, but DNA is 
not the direct template for protein synthesis. Rather, a DNA strand is cop- 
ied into a class of RNA molecules called messenger RNA (mRNA), the 
information-carrying intermediates in protein synthesis. This process of 
transcription is followed by translation, the synthesis of proteins according 
to instructions given by mRNA templates. Information processing in all 
cells is quite complex. The scheme that underlies information processing at 
the level of gene expression was first proposed by Francis Crick in 1958. 


Replication 


a 


Transcription Translation 


DNA — RNA —~ Protein 


Crick called this scheme the central dogma. The basic tenets of this dogma 
are true, but, as we will see later, this scheme is not as simple as depicted. 
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CHAPTER 4 DNA, RNA, and the Flow 
of Genetic Information 


Base; 


Phosphate Phosphate 


FIGURE 4.1 Polymeric structure of 
nucleic acids. 


Deoxyribose 


FIGURE 4.2 Ribose and deoxyribose. 
Atoms in sugar units are numbered with 
primes to distinguish them from atoms in 
bases (see Figure 4.4). 


Base;,; 


This flow of information depends on the genetic code, which defines the 
relation between the sequence of bases in DNA (or its mRNA transcript) and 
the sequence of amino acids in a protein. The code is nearly the same in all 
organisms: a sequence of three bases, called a codon, specifies an amino acid. 
There is another step in the expression of most eukaryotic genes, which are 
mosaics of nucleic acid sequences called introns and exons. Both are transcribed, 
but before translation takes place, introns are cut out of newly synthesized RNA 
molecules, leaving mature RNA molecules with continuous exons. The exis- 
tence of introns and exons has crucial implications for the evolution of proteins. 


4.1 A Nucleic Acid Consists of Four Kinds of Bases 
Linked to a Sugar—-Phosphate Backbone 


The nucleic acids DNA and RNA are well suited to function as the carriers of 
genetic information by virtue of their covalent structures. These macromole- 
cules are linear polymers built up from similar units connected end to end 
(Figure 4.1). Each monomer unit within the 
polymer is a nucleotide. A single nucleotide 
unit consists of three components: a sugar, a 
phosphate, and one of four bases. The 
sequence of bases in the polymer uniquely char- 
acterizes a nucleic acid and constitutes a form 
of linear information—information analogous 
to the letters that spell a person’s name. 


Base;,> 


Phosphate Phosphate Phosphate 


RNA and DNA differ in the sugar component and one of the bases 


The sugar in deoxyribonucleic acid (DNA) is deoxyribose. The prefix deoxy 
indicates that the 2’-carbon atom of the sugar lacks the oxygen atom that is 
linked to the 2’-carbon atom of ribose, as shown in Figure 4.2. Note that 
sugar carbons are numbered with primes to differentiate them from atoms 
in the bases. The sugars in both nucleic acids are linked to one another by 
phosphodiester bridges. Specifically, the 3’-hydroxyl (3’-OH) group of the 
sugar moiety of one nucleotide is esterified to a phosphate group, which is, 
in turn, joined to the 5’-hydroxyl group of the adjacent sugar. The chain of 
sugars linked by phosphodiester bridges is referred to as the backbone of the 
nucleic acid (Figure 4.3). Whereas the backbone is constant in a nucleic 
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FIGURE 4.3 Backbones of DNA and RNA. The backbones of these nucleic acids are 
formed by 3’-to-5’ phosphodiester linkages. A sugar unit is highlighted in red and a 
phosphate group in blue. 
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Uracil is present in RNA instead 
Pyrimidine Cytosine Uracil Thymine of thymine. 


acid, the bases vary from one monomer to the next. Two of the bases of 
DNA are derivatives of purine—adenine (A) and guanine (G)—and two of 
pyrimidine—cytosine (C) and thymine (T), as shown in Figure 4.4. 

Ribonucleic acid (RNA), like DNA, is a long unbranched polymer con- 
sisting of nucleotides joined by 3’ -to-5’ phosphodiester linkages (Figure 4.3). 
The covalent structure of RNA differs from that of DNA in two respects. 
First, the sugar units in RNA are riboses rather than deoxyriboses. Ribose 
contains a 2'-hydroxyl group not present in deoxyribose. Second, one of the 
four major bases in RNA is uracil (U) instead of thymine (T). 

Note that each phosphodiester bridge has a negative charge. This negative 
charge repels nucleophilic species such as hydroxide ions, which are capable 
of hydrolytic attack on the phosphate backbone. This resistance is crucial for 
maintaining the integrity of information stored in nucleic acids. The absence 
of the 2’-hydroxyl group in DNA further increases its resistance to hydrolysis. 
The greater stability of DNA probably accounts for its use rather than RNA 
as the hereditary material in all modern cells and in many viruses. 


Nucleotides are the monomeric units of nucleic acids 


The building blocks of nucleic acids and the precursors of these building 
blocks play many other roles throughout the cell—for instance, as energy 
currency and as molecular signals. Consequently, it is important to be 
familiar with the nomenclature of nucleotides and their precursors. A unit 
consisting of a base bonded to a sugar is referred to as a nucleoside. The four 
nucleoside units in RNA are called adenosine, guanosine, cytidine, and uri- 
dine, whereas those in DNA are called deoxyadenosine, deoxyguanosine, 
deoxycytidine, and thymidine. In each case, N-9 of a purine or N-1 of a 
pyrimidine is attached to C-1' of the sugar by an N-glycosidic linkage 
(Figure 4.5). The base lies above the plane of the sugar when the structure 


is written in the standard orientation; that is, the configuration of the NH, 

N-glycosidic linkage is B (Section 11.1). Note that thymidine contains ZN 

deoxyribose; by convention, the prefix deoxy is not added because thymine- palyeoridtc linkage 4 | Nw 

containing nucleosides are only rarely found in RNA. 6 _e \ N _/ 
A nucleotide is a nucleoside joined to one or more phosphoryl groups by OY aa 

an ester linkage. Nucleotide triphosphates, nucleosides joined to three ‘i 

phosphoryl groups, are the monomers—the building blocks—that are linked 

to form RNA and DNA. The four nucleotide units that link to form DNA HO OH 

are nucleotide monophosphates called deoxyadenylate, deoxyguanylate, FIGURE 4.5 B-Glycosidic linkage in 


deoxycytidylate, and thymidylate. Similarly, the most common nucleotides a nucleoside. 
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FIGURE 4.7 Structure of a DNA 
strand. The strand has a 5’ end, which is 
usually attached to a phosphoryl group, 
and a 3’ end, which is usually a free 
hydroxyl group. 


that link to form RNA are nucleotide monophosphates adenylate, guanylate, 
cytidylate and uridylate. 

This nomenclature does not describe the number of phosphoryl groups 
or the site of attachment to carbon of the ribose. A more precise nomencla- 
ture is also commonly used. A compound formed by the attachment of a 
phosphoryl group to C-5’ of a nucleoside sugar (the most common site of 
phosphate esterification) 1s called anucleoside 5'-phosphate ora 5'-nucleotide. 
In this naming system for nucleotides, the number of phosphoryl groups 
and the attachment site are designated. Look, for example, at adenosine 
5'-triphosphate (ATP; Figure 4.6). This nucleotide is tremendously impor- 
tant because, in addition to being a building block for RNA, it is the most 
commonly used energy currency. The energy released from cleavage of the 
triphosphate group is used to power many cellular processes (Chapter 15). 
Another nucleotide is deoxyguanosine 3’-monophosphate (3'-dGMP; 
Figure 4.6). This nucleotide differs from ATP in that it contains guanine 
rather than adenine, deoxyribose rather than ribose (indicated by the pre- 
fix “d’”), and one rather than three phosphoryl groups. In addition, the 
phosphoryl group is esterified to the hydroxyl group in the 3’ rather than 
the 5’ position. 
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FIGURE 4.6 Nucleotides adenosine 5’-triphosphate (5'-ATP) and deoxyguanosine 
3’-monophosphate (3’-dGMP). 


DNA molecules are very long and have directionality 


Scientific communication frequently requires the sequence of a nucleic 
acid—in some cases, a sequence thousands of nucleotides in length—to be 
written like that on page 17. Rather than writing the cumbersome chemical 
structures, scientists have adopted the use of abbreviations. The abbrevi- 
ated notations pApCpG or ACG denote a trinucleotide of DNA consisting 
of the building blocks deoxyadenylate monophosphate, deoxycytidylate 
monophosphate, and deoxyguanylate monophosphate linked by a phos- 
phodiester bridge, where “p” denotes a phosphory1 group (Figure 4.7). The 
5’ end will often have a phosphoryl group attached to the 5'-OH group. 
Note that, like a polypeptide (Section 2.2), a DNA chain has directionality, 
commonly called polarity. One end of the chain has a free 5’-OH group (or 
a 5'-OH group attached to a phosphoryl group) and the other end has a free 
3'-OH group, neither of which is linked to another nucleotide. By conven- 
tion, the base sequence is written in the 5’-to-3' direction. Thus, ACG indi- 
cates that the unlinked 5’-OH group is on deoxyadenylate, whereas the 
unlinked 3’-OH group is on deoxyguanylate. Because of this polarity, ACG 
and GCA correspond to different compounds. 


A striking characteristic of naturally occurring DNA molecules is their 
length. A DNA molecule must comprise many nucleotides to carry the 
genetic information necessary for even the simplest organisms. For exam- 
ple, the DNA ofa virus such as polyoma, which can cause cancer in certain 
organisms, consists of two paired strands of DNA, each 5100 nucleotides in 
length. The E. coli genome is a single DNA molecule consisting of two 
strands of 4.6 million nucleotides each (Figure 4.8). 

The DNA molecules of higher organisms can be much larger. The 
human genome comprises approximately 3 billion nucleotides in each 
strand of DNA, divided among 24 distinct molecules of DNA called chro- 
mosomes (22 autosomal chromosomes plus the X and Y sex chromosomes) 
of different sizes. One of the largest known DNA molecules is found in 
the Indian muntjac, an Asiatic deer; its genome is nearly as large as the 
human genome but is distributed on only 3 chromosomes (Figure 4.9). 
The largest of these chromosomes has two strands of more than 1 billion 
nucleotides each. If such a DNA molecule could be fully extended, it 
would stretch more than 1 foot in length. Some plants contain even larger 
DNA molecules. 


FIGURE 4.9 The Indian muntjac and its chromosomes. Cells from a female Indian muntjac 
(right) contain three pairs of very large chromosomes (stained orange). The cell shown is a 
hybrid containing a pair of human chromosomes (stained green) for comparison. [(Left) Hugh 
Lansdown/Shutterstock. (Right) J.-Y. Lee, M. Koi, E. J. Stanbridge, M. Oshimura, A. T. Kumamoto, 
and A. P. Feinberg. Nat. Genet. 7:30, 1994.] 


4.2 A Pair of Nucleic Acid Strands with Complementary 
Sequences Can Form a Double-Helical Structure 


As discussed in Chapter 1, the covalent structure of nucleic acids accounts 
for their ability to carry information in the form of a sequence of bases along 
a nucleic acid strand. The bases on the two separate nucleic acid strands 
form specific base pairs in such a way that a helical structure is formed. The 
double-helical structure of DNA facilitates the replication of the genetic 
material—that is, the generation of two copies of a nucleic acid from one. 


The double helix is stabilized by hydrogen bonds and van 
der Waals interactions 


The ability of nucleic acids to form specific base pairs was discovered in the 
course of studies directed at determining the three-dimensional structure of 
DNA. Maurice Wilkins and Rosalind Franklin obtained x-ray diffraction pho- 
tographs of fibers of DNA (Figure 4.10). The characteristics of these diffraction 
patterns indicated that DNA is formed of two strands that wind in a regular 
helical structure. From these data and others, James Watson and Francis Crick 
deduced a structural model for DNA that accounted for the diffraction pattern 


FIGURE 4.8 Electron micrograph of part 
of the E. coli genome. [Dr. Gopal Murti/ 
Science Photo Library/Photo Researchers.] 


FIGURE 4.10 X-ray diffraction 
photograph of a hydrated DNA 
fiber. When crystals of a biomolecule are 
irradiated with x-rays, the x-rays are 
diffracted and these diffracted x-rays are 
seen as a series of spots, called reflections, 
on a screen behind the crystal. The structure 
of the molecule can be determined by the 
pattern of the reflections (Section 3.5). In 
regard to DNA crystals, the central cross is 
diagnostic of a helical structure. The strong 
arcs on the meridian arise from the stack of 
nucleotide bases, which are 3.4 A 
apart. [Science Photo Library.] 
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FIGURE 4.11 Watson-Crick model of 
double-helical DNA. (A) Side view. 
Adjacent bases are separated by 3.4 A. 
The structure repeats along the helical axis 
(vertical) at intervals of 34 A, which 
corresponds to approximately 10 
nucleotides on each chain. (B) Axial view, 
looking down the helix axis, reveals a 
rotation of 36° per base and shows that 
the bases are stacked on top of one 
another [Source: J. L. Tymoczko, J. Berg, 
and L. Stryer, Biochemistry: A Short Course, 
2nd ed. (W. H. Freeman and Company, 
2013), Fig. 33.11,]. 
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and was the source of some remarkable insights into the functional 
properties of nucleic acids (Figure 4.11). 

The features of the Watson—Crick model of DNA deduced 
from the diffraction patterns are: 


1. Two helical polynucleotide strands are coiled around a com- 
mon axis with a right-handed screw sense (pp. 40-41). The strands 
are antiparallel, meaning that they have opposite directionality. 


2. The sugar—phosphate backbones are on the outside and the 
purine and pyrimidine bases lie on the inside of the helix. 


3. The bases are nearly perpendicular to the helix axis, and adja- 
cent bases are separated by approximately 3.4 A. The helical 
structure repeats on the order of every 34 A, with about 10.4 
bases per turn of helix. There is a rotation of nearly 36 degrees 
per base (360 degrees per full turn/10.4 bases per turn). 


4. The diameter of the helix is about 20 A. 


How is such a regular structure able to accommodate an 
arbitrary sequence of bases, given the different sizes and shapes 
of the purines and pyrimidines? In attempting to answer this 
question, Watson and Crick discovered that guanine can be 
paired with cytosine and adenine with thymine to form base 
pairs that have essentially the same shape (Figure 4.12). These 
base pairs are held together by specific hydrogen bonds, which, 
although weak (4-21 kJ mol, or 1-5 kcal mol ~!), stabilize the 
helix because of their large numbers in a DNA molecule. These 
base-pairing rules account for the observation, originally made 
by Erwin Chargaff in 1950, that the ratios of adenine to thy- 
mine and of guanine to cytosine are nearly the same in all 
species studied, whereas the adenine-to-guanine ratio varies 
considerably (Table 4.1). 

Inside the helix, the bases are essentially stacked one on top of 
another (Figure 4.11B). The stacking of base pairs contributes to 
the stability of the double helix in two ways. First, the formation 
of the double helix is facilitated by the hydrophobic effect (p. 9). 
The hydrophobic bases cluster in the interior of the helix away 


FIGURE 4.12 Structures of the base pairs O 
proposed by Watson and Crick. 


Adenine 


TABLE 4.1 Base compositions experimentally determined for a variety of organisms 


Organism A:T G:C A:G 
Human being 1.00 1.00 1.56 
Salmon 102 1.02 1.43 
Wheat 1.00 0.97 1.22 
Yeast ROS: 1.02 IO 
Escherichia coli 1.09 0.99 1.05 
Serratia marcescens 0.95 0.86 0.70 


from the surrounding water, whereas the more polar surfaces are exposed to 
water. This arrangement is reminiscent of protein folding, where hydropho- 
bic amino acids are in the protein’s interior and the hydrophilic amino acids 
are on the exterior (Section 2.4). Second, the stacked base pairs attract one 
another through van der Waals forces (p. 8), appropriately referred to as 
stacking forces, further contributing to stabilization of the helix (Figure 
4.13). The energy associated with a single van der Waals interaction is quite 
small, typically from 2 to 4 kJ mol~! (0.5-1.0 kcal mol~'). In the double 
helix, however, a large number of atoms are in van der Waals contact, and 
the net effect, summed over these atom pairs, is substantial. In addition, 
base stacking in DNA is favored by the conformations of the somewhat rigid 
five-membered rings of the backbone sugars. 


DNA can assume a variety of structural forms 


Watson and Crick based their model (known as 
the B-DNA helix) on x-ray diffraction patterns 
of highly hydrated DNA fibers, which provided 
information about properties of the double helix 
that are averaged over its constituent residues. 
Under physiological conditions, most DNA is 
in the B form. X-ray diffraction studies of less- 
hydrated DNA fibers revealed a different form 
called A-DNA. Like B-DNA, A-DNA is a 
right-handed double helix made up of anti-par- 
allel strands held together by Watson—Crick 
base-pairing. The A-form helix is wider 
and shorter than the B-form helix, and its base 
pairs are tilted rather than perpendicular to the 
helix axis (Figure 4.14). 

If the A-form helix were simply a property 
of dehydrated DNA, it would be of little sig- 
nificance. However, double-stranded regions 
of RNA and at least some RNA—DNA hybrids 
adopt a double-helical form very similar to 
that of A-DNA. What is the biochemical basis 
for differences between the two forms of 
DNA? Many of the structural differences 
between B-DNA and A-DNA arise from dif- 
ferent puckerings of their ribose units (Figure 
4.15). In A-DNA, C-3’ lies out of the plane (a 
conformation referred to as C-3’ endo) formed 


Base stacking 
(van der Waal interactions) 


FIGURE 4.13 A side view of DNA. Base 
pairs are stacked nearly one on top of 
another in the double helix. The stacked 
bases interact with van der Waals forces. 
Such stacking forces help stabilize the 
double helix. [Source: J. L. Tymoczko, J. 
Berg, and L. Stryer, Biochemistry: A Short 
Course, 2nd ed. (W. H. Freeman and 
Company, 2013), Fig. 33.13.]. 
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pairs of B-form and A-form DNA depict their right-handed helical structures. Notice 


, , : that the B-form helix is longer and narrower than the A-form helix. The carbon 
called C-2" endo). The C-3'-endo puckering atoms of the backbone are shown in white, [Drawn from 4BNA pdb and 1DNZ.pdb) 


in A-DNA leads to an 11-degree tilting of 
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H_ CH,OH CH,OH the base pairs away from perpendicular to the helix. RNA helices 
id 3 0 H\—o are further induced to take the A-DNA form because of steric 
NG aS hindrance from the 2'-hydroxyl group: the 2’-oxygen atom would 
HO be too close to three atoms of the adjoining phosphoryl group 
ou HO H and to one atom in the next base. In an A-form helix, in contrast, 
C-3"-endo C-2-endo the 2'-oxygen atom projects outward, away from other atoms. 

(A form) (B form) 


FIGURE 4.15 Sugar pucker. In A-form 
DNA, the C-3’ carbon atom lies above the 
approximate plane defined by the four other 
sugar nonhydrogen atoms (called C-3’ 
endo). In B-form DNA, each deoxyribose is 
in a C-2'-endo conformation, in which C-2’ 
lies out of the plane. 


& FIGURE 4.16 Z-DNA. DNA oligomers 
such as CGCGCG adopt an alternative 
conformation under some conditions. This 
conformation is called Z-DNA because the 
phosphoryl groups zigzag along the 
backbone. [Drawn from 131D.pdb.] 


112 


The phosphoryl and other groups in the A-form helix bind fewer 
H,O molecules than do those in B-DNA. Hence, dehydration 
favors the A form. 


Z-DNA is a left-handed double helix in which backbone 
phosphates zigzag 


A third type of double helix is left-handed, in contrast with the right-handed 
screw sense of the A and B helices. Furthermore, the phosphoryl groups in 
the backbone are zigzagged; hence, this form of DNA is called Z-DNA 
(Figure 4.16). 

Although the biological role of Z-DNA is still under investigation, 
Z-DNA-binding proteins have been isolated, one of which is required for 
viral pathogenesis of poxviruses, including variola, the agent of smallpox. 
The existence of Z-DNA shows that DNA is a flexible, dynamic molecule 
whose parameters are not as fixed as depictions suggest. The properties of 
A-, B-, and Z-DNA are compared in Table 4.2. 


Top view Side view 


TABLE 4.2 Comparison of A-, B-, and Z-DNA 


A B Z 
Shape Broadest Intermediate Narrowest 
Rise per base pair 2.3A 3.4A 3.8A 
Helix diameter ~26A ~20A ~18A 
Screw sense Right-handed Right-handed Left-handed 
Glycosidic bond* anti anti Alternating anti and syn 
Base pairs per turn of helix iG 10.4 12 
Pitch per turn of helix 25.3A 35.4A 45.6A 
Tilt of base pairs from 
perpendicular to helix axis 19 degrees 1 degree 9 degrees 


*Syn and anti refer to the orientation of the N-glycosidic bond between the base and deoxyribose. In the 
anti orientation, the base extends away from the deoxyribose. In the syn orientation, the base is above the 
deoxyribose. Pyrimidines can be in anti orientations only, whereas purines can be anti or syn. 


FIGURE 4.17 Electron micrographs 

of circular DNA from mitochondria. 

(A) Relaxed form. (B) Supercoiled form 
[Courtesy of Dr. David Clayton.] 


Some DNA molecules are circular and supercoiled 


The DNA molecules in human chromosomes are linear. However, electron 
microscopic and other studies have shown that intact DNA molecules from 
bacteria and archaea are circular (Figure 4.17A). The term circular refers to 
the continuity of the DNA strands, not to their geometric form. DNA mol- 
ecules inside cells necessarily have a very compact shape. Note that the 
E. coli chromosome, fully extended, would be about 1000 times as long as 
the greatest diameter of the bacterium. 

A closed DNA molecule has a property unique to circular DNA. The 
axis of the double helix can itself be twisted or supercoiled into a superhelix 
(Figure 4.17B). A circular DNA molecule without any superhelical turns is 
known as a relaxed molecule. Supercoiling is biologically important for two 
reasons. First, a supercoiled DNA molecule is more compact than its relaxed 
counterpart. Second, supercoiling may hinder or favor the capacity of the 
double helix to unwind and thereby affect the interactions between DNA and 
other molecules. These topological features of DNA will be considered fur- 
ther in Chapter 28. 


Single-stranded nucleic acids can adopt elaborate structures 


Single-stranded nucleic acids often fold back on themselves to form well- 
defined structures. Such structures are especially prominent in RNA and 
RNA-containing complexes such as the ribosome—a large complex of 
RNAs and proteins on which proteins are synthesized. 

The simplest and most-common structural motif formed is a stem-loop, 
created when two complementary sequences within a single strand come 
together to form double-helical structures (Figure 4.18). In many cases, 
these double helices are made up entirely of Watson—Crick base pairs. In 
other cases, however, the structures include mismatched base pairs or 
unmatched bases that bulge out from the helix. Such mismatches destabilize 
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FIGURE 4.18 Stem-loop structures. 
Stem-loop structures can be formed from 
single-stranded DNA and RNA molecules. 
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the local structure but introduce deviations from the standard double- 
helical structure that can be important for higher-order folding and for 
function (Figure 4.19). 

Single-stranded nucleic acids can adopt structures that are more com- 
plex than simple stem-loops through the interaction of more widely sepa- 
rated bases. Often, three or more bases interact to stabilize these structures. 
In such cases, hydrogen-bond donors and acceptors that do not participate 
in Watson—Crick base pairs participate in hydrogen bonds to form nonstan- 
dard pairings. Metal ions such as magnesium ion (Mg?") often assist in the 
stabilization of these more elaborate structures. These complex structures 
allow RNA to perform a host of functions that the double-stranded DNA 
molecule cannot. Indeed, the complexity of some RNA molecules rivals 
that of proteins, and these RNA molecules perform a number of functions 
that had formerly been thought the exclusive domain of proteins. 


(B) 


FIGURE 4.19 Complex structure of an RNA molecule. A single-stranded RNA molecule 
can fold back on itself to form a complex structure. (A) The nucleotide sequence showing 
Watson-Crick base pairs and other nonstandard base pairings in stem-loop structures. 

(B) The three-dimensional structure and one important long-range interaction between three 
bases. In the three-dimensional structure at the left, cytidine nucleotides are shown in blue, 
adenosine in red, guanosine in black, and uridine in green. In the detailed projection, hydrogen 
bonds within the Watson—Crick base pair are shown as dashed black lines; additional 
hydrogen bonds are shown as dashed green lines. 


4.3 The Double Helix Facilitates the Accurate 
Transmission of Hereditary Information 


The double-helical model of DNA and the presence of specific base pairs 
immediately suggested how the genetic material might replicate. The 
sequence of bases of one strand of the double helix precisely determines the 
sequence of the other strand: a guanine base on one strand is always paired 
with a cytosine base on the other strand, and so on. Thus, separation of a 
double helix into its two component strands would yield two single- 
stranded templates onto which new double helices could be constructed, 
each of which would have the same sequence of bases as the parent double 
helix. Consequently, as DNA is replicated, one of the strands of each 
daughter DNA molecule is newly synthesized, whereas the other is passed 
unchanged from the parent DNA molecule. This distribution of parental 
atoms is achieved by semiconservative replication. 


Differences in DNA density established the validity of the 
semiconservative replication hypothesis 


Matthew Meselson and Franklin Stahl carried out a critical test of this hypoth- 
esis in 1958. They labeled the parent DNA with !°N, a heavy isotope of nitro- 
gen, to make it denser than ordinary DNA. The labeled DNA was generated by 
growing E. coli for many generations in a medium that contained '"NH,Cl as 
the sole nitrogen source. After the incorporation of heavy nitrogen was com- 
plete, the bacteria were abruptly transferred toa medium that contained !N, the 
ordinary isotope of nitrogen. The question asked was: What is the distribution 
of '4N and !°N in the DNA molecules after successive rounds of replication? 

The distribution of !#N and !5N was revealed by the technique of density- 
gradient equilibrium sedimentation. A small amount of DNA was dissolved in 
a concentrated solution of cesium chloride having a density close to that of the 
DNA (1.7 gcm7). This solution was centrifuged until it was nearly at equi- 
librium. At that point, the opposing processes of sedimentation and diffusion 
created a gradient in the concentration of cesium chloride across the centrifuge 
cell. The result was a stable density gradient ranging from 1.66 to 1.76 gcm~?. 
The DNA molecules in this density gradient were driven by centrifugal force 
into the region where the solution’s density was equal to their own. The DNA 
yielded a narrow band that was detected by its absorption of ultraviolet light. 
A mixture of '*N DNA and SN DNA molecules gave clearly separate bands 
because they differ in density by about 1% (Figure 4.20). 

DNA was extracted from the bacteria at various times after they were 
transferred from a !°N to a '4*N medium. Analysis of these samples by the 
density-gradient technique showed that there was a single band of DNA 
after one generation. The density of this band was precisely halfway 
between the densities of the '¥N DNA and !°N DNA bands (Figure 4.21). 
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FIGURE 4.20 Resolution of '4N DNA 
and '®N DNA by density-gradient 
centrifugation. (A) Ultraviolet-absorption 
photograph of a centrifuged cell showing 
the two distinct bands of DNA. (B) 
Densitometric tracing of the absorption 
photograph. [Data from M. Meselson and 
F. W. Stahl. Proc. Natl. Acad. Sci. U. S. A. 
44:671-682, 1958.] 


FIGURE 4.21 Detection of 
semiconservative replication of 

E. coli DNA by density-gradient 
centrifugation. The position of a band of 
DNA depends on its content of '4N and 
15, After 1.0 generation, all of the DNA 
molecules were hybrids containing equal 
amounts of '4N and 19N. [Data from 

M. Meselson and F. W. Stahl. Proc. Natl. 
Acad. Sci. U. S. A. 44:671-682, 1958.] 


Original parent molecule 


S| 


First-generation daughter molecules 


Second-generation daughter molecules 
FIGURE 4.22 Diagram of semiconservative 
replication. Parental DNA is shown in 
blue and newly synthesized DNA in 
red. [Information from M. Meselson and 


F. W. Stahl. Proc. Natl. Acad. Sci. U. S. A. 
44:671-682, 1958.] 
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The absence of !*N DNA indicated that parental DNA was not preserved 
as an intact unit after replication. The absence of !*N DNA indicated that 
all the daughter DNA derived some of their atoms from the parent DNA. 
This proportion had to be half because the density of the hybrid 
DNA band was halfway between the densities of the '*N DNA and '5N 
DNA bands. 

After two generations, there were equal amounts of two bands of DNA. 
One was hybrid DNA, and the other was 14\] DNA. Meselson and Stahl 
concluded from these incisive experiments that replication was semiconser- 
vative, and so each new double helix contains a parent strand and a newly 
synthesized strand. Their results agreed perfectly with the Watson—Crick 
model for DNA replication (Figure 4.22). 


The double helix can be reversibly melted 


During DNA replication and transcription, the two strands of the double 
helix must be separated from each other, at least in a local region. The two 
strands of a DNA helix readily come apart when the hydrogen bonds 
between base pairs are disrupted. In the laboratory, the double helix can be 
disrupted by heating a solution of DNA or by adding acid or alkali to ionize 
its bases. The dissociation of the double helix is called melting because it 
occurs abruptly at a certain temperature. The melting temperature (T.,,) of 
DNA is defined as the temperature at which half the helical structure is lost. 
Inside cells, however, the double helix is not melted by the addition of heat. 
Instead, proteins called helicases use chemical energy (from ATP) to disrupt 
the helix (Chapter 28). 

Stacked bases in nucleic acids absorb less ultraviolet light than do 
unstacked bases, an effect called hypochromism. Thus, the melting of nucleic 
acids is readily monitored by measuring their absorption of light, which is 
maximal at a wavelength of 260 nm (Figure 4.23). 

Separated complementary strands of nucleic acids spontaneously reas- 
sociate to form a double helix when the temperature is lowered below T_. 
This renaturation process is sometimes called annealing. The facility with 
which double helices can be melted and then reassociated is crucial for the 
biological functions of nucleic acids. 
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FIGURE 4.23 Hypochromism. (A) Single-stranded DNA absorbs light more effectively than 
does double-helical DNA. (B) The absorbance of a DNA solution at a wavelength of 260 nm 
increases when the double helix is melted into single strands. 


The ability to melt and reanneal DNA reversibly in the laboratory pro- 
vides a powerful tool for investigating sequence similarity. For instance, 
DNA molecules from two different organisms can be melted and allowed to 
reanneal, or hybridize, in the presence of each other. If the sequences are 
similar, hybrid DNA duplexes, with DNA from each organism contribut- 
ing a strand of the double helix, can form. The degree of hybridization is an 
indication of the relatedness of the genomes and hence the organisms. 
Similar hybridization experiments with RNA and DNA can locate genes in 
a cell’s DNA that correspond to a particular RNA. We will return to this 
important technique in Chapter 5. 


4.4 DNA Is Replicated by Polymerases That Take 
Instructions from Templates 


We now turn to the molecular mechanism of DNA replication. The full 
replication machinery in a cell comprises more than 20 proteins engaged 
in intricate and coordinated interplay. In 1958, Arthur Kornberg and his 
colleagues isolated from E. coli the first known of the enzymes, called 
DNA polymerases, that promote the formation of the bonds joining units 
of the DNA backbone. E. coli has a number of DNA polymerases, desig- 
nated by roman numerals, that participate in DNA replication and repair 
(Chapter 28). 


DNA polymerase catalyzes phosphodiester-bridge formation 


DNA polymerases catalyze the step-by-step addition of deoxyribonucleotide 
units to a DNA strand (Figure 4.24). The reaction catalyzed, in its simplest 
form, is 


(DNA), + dNTP — (DNA)n+i + PP; 


where dNTP stands for any deoxyribonucleotide and PP; is a pyrophos- 
phate ion. 
DNA synthesis has the following characteristics: 


1. The reaction requires all four activated precursors—that is, the deoxy- 
nucleoside 5'-triphosphates dATP, dGTP, dCTP, and TTP—as well as 
Me?" ion. 


2. The new DNA strand is assembled directly on a preexisting DNA template. 
DNA polymerases catalyze the formation of a phosphodiester linkage effi- 
ciently only if the base on the incoming nucleoside triphosphate is comple- 
mentary to the base on the template strand. Thus, DNA polymerase is a 
template-directed enzyme that synthesizes a product with a base sequence 
complementary to that of the template. 
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FIGURE 4.24 Polymerization reaction catalyzed by DNA polymerases. 
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FIGURE 4.25 Strand-elongation reaction. DNA polymerases catalyze the formation of a 
phosphodiester bridge. [Source: J. L. Tymoczko, J. Berg, and L. Stryer, Biochemistry: A Short 
Course, 2nd ed. (W. H. Freeman and Company, 2013), Fig. 34.2.] 


3. DNA polymerases require a primer to begin synthesis. A primer strand hav- 
ing a free 3’-OH group must be already bound to the template strand. The 
chain-elongation reaction catalyzed by DNA polymerases is a nucleophilic 
attack by the 3’-OH terminus of the growing strand on the innermost phos- 
phorus atom of the deoxynucleoside triphosphate (Figure 4.25). A phos- 
phodiester bridge is formed and pyrophosphate is released. The subsequent 
hydrolysis of pyrophosphate to yield two ions of orthophosphate (P.) by 
pyrophosphatase helps drive the polymerization forward. Elongation of the 
DNA chain proceeds in the 5'-to-3' direction. 


4. Many DNA polymerases are able to correct mistakes in DNA by removing 
mismatched nucleotides. These polymerases have a distinct nuclease activity 
that allows them to excise incorrect bases by a separate reaction. This nucle- 
ase activity contributes to the remarkably high fidelity of DNA replication, 
which has an error rate of less than 10~® per base pair. 


The genes of some viruses are made of RNA 


Genes in all cellular organisms are made of DNA. The same is true for some 
viruses but, for others, the genetic material is RNA. Viruses are genetic ele- 
ments enclosed in protein coats that can move from one cell to another but 
are not capable of independent growth. A well-studied example of an RNA 
virus is the tobacco mosaic virus, which infects the leaves of tobacco plants. 
This virus consists of a single strand of RNA (6390 nucleotides) surrounded 
by a protein coat of 2130 identical subunits. An RNA polymerase that takes 
direction from an RNA template, called an RNA-directed RNA polymerase, 
copies the viral RNA. The infected cells die because of virus-instigated 
programmed cell death; in essence, the virus instructs the cell to commit 
suicide. Cell death results in discoloration in the tobacco leaf in a variegated 
pattern, hence the name mosaic virus. 

Another important class of RNA virus comprises the retroviruses, so 
called because the genetic information flows from RNA to DNA rather 
than from DNA to RNA. This class includes human immunodeficiency 
virus 1 (HIV-1), the cause of acquired immunodeficiency syndrome 
(AIDS), as well as a number of RNA viruses that produce tumors in 
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Reverse Reverse Reverse FIGURE 4.26 Flow of information from 
transcriptase transcriptase transcriptase RNA to DNA in retroviruses. The RNA 
: ee . genome of a retrovirus is converted into 
Synthesis of Digestion of RNA Synthesis of second DNA b ; 
DNA complementary strand of DNA y reverse transcriptase, an enzyme 
to RNA brought into the cell by the infecting virus 
particle. Reverse transcriptase possesses 
several activities and catalyzes the 
synthesis of a complementary DNA strand, 
Viral RNA DNA-RNA DNA transcript Double-helical the digestion of the RNA, and the 
hybrid of viral RNA viral DNA subsequent synthesis of the DNA strand. 


susceptible animals. Retrovirus particles contain two copies of a single- 
stranded RNA molecule. On entering the cell, the RNA is copied into 
DNA through the action of a viral enzyme called reverse transcriptase, 
which acts as both a polymerase and an RNase (Figure 4.26). The result- 
ing double-helical DNA version of the viral genome can become incorpo- 
rated into the chromosomal DNA of the host and is replicated along with 
the normal cellular DNA. At a later time, the integrated viral genome is 
expressed to form viral RNA and viral proteins, which assemble into new 
virus particles. 


4.5 Gene Expression Is the Transformation of DNA 
Information into Functional Molecules 


The information stored as DNA becomes useful when it is expressed in the 
production of RNA and proteins. This rich and complex topic is the subject 
of several chapters later in this book, but here we introduce the basics of 
gene expression. DNA can be thought of as archival information, stored and 
manipulated judiciously to minimize damage (mutations). It is expressed in 
two steps. First, an RNA copy is made that encodes directions for protein 
synthesis. This messenger RNA can be thought of as a photocopy of the 
original information: it can be made in multiple copies, used, and then dis- 
posed of. Second, the information in messenger RNA is translated to syn- 
thesize functional proteins. Other types of RNA molecules exist to facilitate 
this translation. 


Several kinds of RNA play key roles in gene expression 


Scientists used to believe that RNA played a passive role in gene expression, 
as a mere conveyor of information. However, recent investigations have 
shown that RNA plays a variety of roles, from catalysis to regulation. Cells 
contain several kinds of RNA (Table 4.3): 


TABLE 4.3 RNA molecules in E. coli 


Relative Sedimentation Number of 

Type amount (%) — coefficient (S) Mass (kDa) nucleotides 
Ribosomal RNA (rRNA) 80 23 1.2 x 103 3700 
16 0.55 X 103 1700 
5 3.6 x10! 120 
Transfer RNA (tRNA) 115) 4 2.5 x10! 75 


Messenger RNA (mRNA) 5 Heterogeneous 


120 


CHAPTER 4 DNA, RNA, and the Flow 
of Genetic Information 


Kilobase (kb) 
A unit of length equal to 1000 base pairs of 
a double-stranded nucleic acid molecule (or 
1000 bases of a single-stranded molecule). 
One kilobase of double-stranded DNA 
has a length of 0.34 jzm at its maximal 
extension (called the contour length) and a 
mass of about 660 kDa. 


1. Messenger RNA (mRNA) is the template for protein synthesis, or trans- 
lation. An mRNA molecule may be produced for each gene or group of 
genes that is to be expressed in E. coli, whereas a distinct mRNA is pro- 
duced for each gene in eukaryotes. Consequently, mRNA is a heteroge- 
neous class of molecules. In prokaryotes, the average length of an mRNA 
molecule is about 1.2 kilobases (kb). In eukaryotes, mRNA has structural 
features, such as stem-loop structures, that regulate the efficiency of trans- 
lation and the lifetime of the mRNA. 


2. Transfer RNA (tRNA) carries amino acids in an activated form to the 
ribosome for peptide-bond formation, in a sequence dictated by the mRNA 
template. There is at least one kind of tRNA for each of the 20 amino acids. 
Transfer RNA consists of about 75 nucleotides (having a mass of about 
25 kDa). 


3. Ribosomal RNA (rRNA) is the major component of ribosomes (Chapter 
30). In prokaryotes, there are three kinds of rRNA, called 23S, 16S, and 5S 
RNA because of their sedimentation behavior (p. 74). One molecule of each 
of these species of rRNA is present in each ribosome. Ribosomal RNA was 
once believed to play only a structural role in ribosomes. We now know that 
rRNA is the actual catalyst for protein synthesis. 


Ribosomal RNA is the most abundant of these three types of RNA. 
Transfer RNA comes next, followed by messenger RNA, which constitutes 
only 5% of the total RNA. Eukaryotic cells contain additional small RNA 
molecules that play a variety of roles including the regulation of gene 
expression, processing of RNA and the synthesis of proteins. We will exam- 
ine these small RNAs in later chapters. In this chapter, we will consider 
rRNA, mRNA, and tRNA. 


All cellular RNA is synthesized by RNA polymerases 


The synthesis of RNA from a DNA template is called transcription and is 
catalyzed by the enzyme RNA polymerase (Figure 4.27). RNA polymerase 


h FIGURE 4.27 RNA Polymerase. This large enzyme comprises many subunits, including 
8 (red) and 8’ (yellow), which form a “claw” that holds the DNA to be transcribed. Notice that 
the active site includes a Mg?* ion (green) at the center of the structure. The curved tubes 
making up the protein in the image represent the backbone of the polypeptide chain. [Drawn 
from 1L9Z, pdb.] 
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catalyzes the initiation and elongation of RNA chains. The reaction cata- 
lyzed by this enzyme is 


(RNA) n residues + ribonucleoside triphosphate == (RNA) 41 residues + PP; 


RNA polymerase requires the following components: 


1. A template. The preferred template is double-stranded DNA. Single- 
stranded DNA also can serve as a template. RNA, whether single or double 
stranded, is not an effective template; nor are RNA—DNA hybrids. 


2. Activated precursors. All four ribonucleoside triphosphates—ATP, GTP, 
UTP, and CT P—are required. 


3. A divalent metal ion. Either Mg?* or Mn?" is effective. 


The synthesis of RNA is like that of DNA in several respects (Figure 4.28). 
First, the direction of synthesis is 5’ — 3’. Second, the mechanism of elon- 
gation is similar: the 3’-OH group at the terminus of the growing chain 
makes a nucleophilic attack on the innermost phosphoryl group of the 
incoming nucleoside triphosphate. Third, the synthesis is driven forward by 
the hydrolysis of pyrophosphate. In contrast with DNA polymerase, how- 
ever, RNA polymerase does not require a primer. In addition, the ability of 
RNA polymerase to correct mistakes is not as extensive as that of DNA 
polymerase. 

All three types of cellular RNA—mRNA, tRNA, and rRNA—are syn- 
thesized in E. coli by the same RNA polymerase according to instructions 
given by a DNA template. In mammalian cells, there is a division of labor 
among several different kinds of RNA polymerases. We shall return to these 
RNA polymerases in Chapter 29. 


RNA polymerases take instructions from DNA templates 


RNA polymerase, like the DNA polymerases described earlier, takes 
instructions from a DNA template. The earliest evidence was the finding 
that the base composition of newly synthesized RNA is the complement of 
that of the DNA template strand, as exemplified by the RNA synthesized 
from a template of single-stranded DNA from the )X174 virus (Table 4.4). 
The strongest evidence for the fidelity of transcription came from 
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FIGURE 4.28 Transcription mechanism 
of the chain-elongation reaction 
catalyzed by RNA polymerase. [Source: 
J. L. Tymoczko, J. Berg, and L. Stryer, 
Biochemistry: A Short Course, 2nd ed. 

(W. H. Freeman and Company, 2018), Fig. 36.3.] 


Table 4.4 Base composition 
(percentage) of RNA 
synthesized from a viral 


DNA template 
DNA template 
(plus, or coding, 
strand of 6X174) RNA product 
A 25 U 25 
a 33 A BD; 
G 24 Cc 23 
G 18 G 20 
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base-sequence studies. For instance, the nucleotide sequence of a segment 
of the gene encoding the enzymes required for tryptophan synthesis was 
determined with the use of DNA-sequencing techniques (Section 5.1). 
Likewise, the sequence of the mRNA for the corresponding gene was deter- 
mined. The results showed that the RNA sequence is the precise comple- 
ment of the DNA template sequence (Figure 4.29). 


5’— GCGGCGACG CG CAGUU AAU CCCACAG CCG CCAGUU CCG CUGGCGGCAU— 3’ mRNA 
3’— CGCCGC TGCGCGTCAA TTAGGGTGTCGGCGGTCAAGGCGACCGCCGTA— 5’ Template strand of DNA 
5’— GCGGCGACGCGCAG TTAATCCCACAG CCG CCAG TTCCGCTGGCGGCAT— 3’ Coding strand of DNA 


Consensus sequence 

Not all base sequences of promoter sites are 
identical. However, they do possess common 
features, which can be represented by an 
idealized consensus sequence. Each base in 
the consensus sequence TATAAT is found in 
most prokaryotic promoters. Nearly all 
promoter sequences differ from this consensus 
sequence at only one or two bases. 


FIGURE 4.29 Complementarity between mRNA and DNA. The base sequence of MRNA 
(red) is the complement of that of the DNA template strand (blue). The sequence shown here 
is from the tryptophan operon, a segment of DNA containing the genes for five enzymes that 
catalyze the synthesis of tryptophan. The other strand of DNA (black) is called the coding 
strand because it has the same sequence as the RNA transcript except for thymine (T) in 
place of uracil (U). 


Transcription begins near promoter sites and ends at terminator sites 


RNA polymerase must detect and transcribe discrete genes from within 
large stretches of DNA. What marks the beginning of the unit to be tran- 
scribed? DNA templates contain regions called promoter sites that 
specifically bind RNA polymerase and determine where transcription 
begins. In bacteria, two sequences on the 5’ (upstream) side of the first 
nucleotide to be transcribed function as promoter sites (Figure 4.30A). 
One of them, called the Pribnow box, has the consensus sequence 
TATAAT and is centered at —10 (10 nucleotides on the 5’ side of the 
first nucleotide transcribed, which is denoted by +1). The other, called 
the —35 region, has the consensus sequence TTGACA. The first nucleo- 
tide transcribed is usually a purine. 

Eukaryotic genes encoding proteins have promoter sites witha TATAAA 
consensus sequence, called a TATA box or a Hogness box, centered at about 
—25 (Figure 4.30B). Many eukaryotic promoters also have a CAAT box 
with a GGNCAATCT consensus sequence centered at about —75. The 
transcription of eukaryotic genes is further stimulated by enhancer sequences, 
which can be quite distant (as many as several kilobases) from the start site, 
on either its 5’ or its 3’ side. 


-10 +1 
DNA template TATAAT 
—35 region Pribnow —_—es 
box Start of 
RNA 
(A) Prokaryotic promoter site 
-75 —25 +1 
DNA template TATAAA 
CAAT box TATA box —e—ee> 
(sometimes present) (Hogness box) Start of 
RNA 


(B) Eukaryotic promoter site 


FIGURE 4.30 Promoter sites for transcription in (A) prokaryotes and (B) eukaryotes. 
Consensus sequences are shown. The first nucleotide to be transcribed is numbered +1. 
The adjacent nucleotide on the 5’ side is numbered —1. The sequences shown are those 
of the coding strand of DNA. 


In E. coli, RNA polymerase proceeds along the DNA template, tran- 
scribing one of its strands until it synthesizes a terminator sequence. This 
sequence encodes a termination signal, which is a base-paired hairpin on the 
newly synthesized RNA molecule (Figure 4.31). This hairpin is formed by 
base-pairing of self-complementary sequences that are rich in G and C. 
Nascent RNA spontaneously dissociates from RNA polymerase when this 
hairpin is followed by a string of U residues. Alternatively, RNA synthesis 
can be terminated by the action of rho, a protein. Less is known about the 
termination of transcription in eukaryotes. A more detailed discussion of 
the initiation and termination of transcription will be given in Chapter 29. 
The important point now is that discrete start and stop signals for transcrip- 
tion are encoded in the DNA template. 

In eukaryotes, the messenger RNA is modified after transcription 
(Figure 4.32). A “cap” structure, a guanosine nucleotide attached to the 
mRNA with an unusual 5’-5’ triphosphate linkage, is attached to the 5’ 
end, and a sequence of adenylates, the poly(A) tail, is added to the 3’ end. 
These modifications will be presented in detail in Chapter 29. 


i Poly(A) tail 
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FIGURE 4.32 Modification of mRNA. Messenger RNA in eukaryotes is modified after 
transcription. A nucleotide “cap” structure is added to the 5’ end, and a poly(A) tail is added 
at the 3’ end. 


Transfer RNAs are the adaptor molecules in protein synthesis 


We have seen that mRNA is the template for protein synthesis. How then 
does it direct amino acids to become joined in the correct sequence to form 
a protein? In 1958, Francis Crick wrote: 


RNA presents mainly a sequence of sites where hydrogen bonding 
could occur. One would expect, therefore, that whatever went onto the 
template in a specific way did so by forming hydrogen bonds. It is there- 
fore a natural hypothesis that the amino acid is carried to the template 
by an adaptor molecule, and that the adaptor is the part that actually fits 
onto the RNA. In its simplest form, one would require twenty adaptors, 
one for each amino acid. 


This highly innovative hypothesis soon became established as fact. The 
adaptors in protein synthesis are transfer RNAs. The structure and reac- 
tions of these remarkable molecules will be considered in detail in 
Chapter 30. For the moment, it suffices to note that tRNAs contain an 
amino acid-attachment site and a template-recognition site. AtRNA mol- 
ecule carries a specific amino acid in an activated form to the ribosome. 
The carboxyl group of this amino acid is esterified to the 3’- or 
2'-hydroxyl group of the ribose unit of an adenylate at the 3’ end of the 
tRNA molecule. The adenylate is always preceded by two cytidylates to 
form the CCA arm of the tRNA (Figure 4.33). The joining of an amino 
acid to a tRNA molecule to form an aminoacyl-tRNA 1s catalyzed by a 
specific enzyme called an aminoacyl-tRNA synthetase. This esterification 
reaction is driven by ATP cleavage. There is at least one specific synthe- 
tase for each of the 20 amino acids. The template-recognition site on 
tRNA is a sequence of three bases called an anticodon (Figure 4.34). The 
anticodon on tRNA recognizes a complementary sequence of three bases, 
called a codon, on mRNA. 
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FIGURE 4.31 Base sequence of the 3’ 
end of an MRNA transcript in E. coli. 
A stable hairpin structure is followed by 
a sequence of uridine (U) residues. 
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FIGURE 4.33 Attachment of an amino 
acid to a tRNA molecule. The amino acid 
(shown in blue) is esterified to the 3’-hydroxyl 
group of the terminal adenylate of tRNA. 
[Source: J. L. Tymoczko, J. Berg, and L. Stryer, 
Biochemistry: A Short Course, 2nd ed. (W. H. 
Freeman and Company, 2013), Fig. 39.3.] 
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FIGURE 4.34 General structure of an aminoacyl-tRNA. The amino acid is attached at the 
8’ end of the RNA. The anticodon is the template-recognition site. Notice that the tRNA has a 
cloverleaf structure with many hydrogen bonds (green dots) between bases. 


4.6 Amino Acids Are Encoded by Groups of Three Bases 
Starting from a Fixed Point 


The genetic code is the relation between the sequence of bases in DNA (or 
its RNA transcripts) and the sequence of amino acids in proteins. 
Experiments by Marshall Nirenberg, Har Gobind Khorana, Francis Crick, 
Sydney Brenner, and others established the following features of the genetic 
code by 1961: 


1. Three nucleotides encode an amino acid. Proteins are built from a basic set 
of 20 amino acids, but there are only four bases. Simple calculations show 
that a minimum of three bases is required to encode at least 20 amino acids. 
Genetic experiments showed that an amino acid is in fact encoded by a group 
of three bases, or codon. 


2. The code is nonoverlapping. Consider a base sequence ABCDEF. In an 
overlapping code, ABC specifies the first amino acid, BCD the next, CDE 
the next, and so on. In a nonoverlapping code, ABC designates the first 
amino acid, DEF the second, and so forth. Genetic experiments again 
established the code to be nonoverlapping. 


3. The code has no punctuation. In principle, one base (denoted as Q) might 
serve as a “comma” between groups of three bases. 


... QABCQDEFQGHIQJKLQ... 


However, it is not the case. Rather, the sequence of bases is read sequen- 
tially from a fixed starting point, without punctuation. 


4. The code has directionality. The code is read from the 5’ end of the mes- 
senger RNA to its 3’ end. 


5. The genetic code is degenerate. Most amino acids are encoded by more 
than one codon. There are 64 possible base triplets and only 20 amino acids, 
and in fact 61 of the 64 possible triplets specify particular amino acids. 
Three triplets (called stop codons) designate the termination of translation. 
Thus, for most amino acids, there is more than one code word. 


Major features of the genetic code 


All 64 codons have been deciphered (Table 4.5). Because the code is highly 
degenerate, only tryptophan and methionine are encoded by just one triplet 
each. Each of the other 18 amino acids is encoded by two or more. Indeed, 
leucine, arginine, and serine are specified by six codons each. 

Codons that specify the same amino acid are called synonyms. For exam- 
ple, CAU and CAC are synonyms for histidine. Note that synonyms are not 
distributed haphazardly throughout the genetic code. In Table 4.5, an 
amino acid specified by two or more synonyms occupies a single box (unless 
it is specified by more than four synonyms). The amino acids in a box are 
specified by codons that have the same first two bases but differ in the third 
base, as exemplified by GUU, GUC, GUA, and GUG. Thus, most syn- 
onyms differ only in the last base of the triplet. Inspection of the code shows 
that XYC and XYU always encode the same amino acid, and XYG and 
XYA usually encode the same amino acid as well. The structural basis for 
these equivalences of codons becomes evident when we consider the nature 
of the anticodons of tRNA molecules (Section 30.3). 

What is the biological significance of the extensive degeneracy of the 
genetic code? If the code were not degenerate, 20 codons would designate 
amino acids and 44 would lead to chain termination. The probability of 
mutating to chain termination would therefore be much higher with a non- 
degenerate code. Chain-termination mutations usually lead to inactive 
proteins, whereas substitutions of one amino acid for another are usually 
rather harmless. Moreover, the code is constructed such that a change in 
any single nucleotide base of a codon results in a synonym or an amino acid 
with similar chemical properties. Thus, degeneracy minimizes the deleterious 
effects of mutations. 


TABLE 4.5 The genetic code 


First Position Second Position Third Position 
(5' end) U C A G (3' end) 
Phe Ser Tyr Cys U 
U Phe Ser Tyr Cys Cc 
Leu Ser Stop Stop A 
Leu Ser Stop Trp G 
Leu Pro His Arg U 
Cc Leu Pro His Arg CG 
Leu Pro Gln Arg A 
Leu Pro Gln Arg G 
lle Thr Asn Ser U 
A lle Thr Asn Ser G 
lle Thr Lys Arg A 
Met Thr Lys Arg G 
Val Ala Asp Gly U 
G Val Ala Asp Gly C 
Val Ala Glu Gly A 
Val Ala Glu Gly G 
Note: This table identifies the amino acid encoded by each triplet. For example, the codon 5’- AUG-3’ on 
mRNA specifies methionine, whereas CAU specifies histidine. UAA, UAG, and UGA are termination 
signals. AUG is part of the initiation signal, in addition to coding for internal methionine residues. 
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TABLE 4.6 Distinctive codons of 
human mitochondria 


Standard Mitochondrial 

Codon code code 
UGA Stop Trp 

UGG Trp Trp 

AUA Ile Met 
AUG Met Met 
AGA Arg Stop 
AGG Arg Stop 


Messenger RNA contains start and stop signals for protein synthesis 


Messenger RNA is translated into proteins on ribosomes—large molecular 
complexes assembled from proteins and ribosomal RNA. How is mRNA 
interpreted by the translation apparatus? The start signal for protein syn- 
thesis is complex in bacteria. Polypeptide chains in bacteria start with a 
modified amino acid—namely, formylmethionine (fMet). A specific tRNA, 
the initiator tRNA, carries f{Met. This fMet-tRNA recognizes the codon 
AUG. However, AUG is also the codon for an internal methionine residue. 
Hence, the signal for the first amino acid in a prokaryotic polypeptide chain 
must be more complex than that for all subsequent ones. AUG is only part 
of the initiation signal (Figure 4.35). In bacteria, the initiating AUG codon 
is preceded several nucleotides away by a purine-rich sequence, called the 
Shine-Dalgarno sequence, that base-pairs with a complementary sequence 
in a ribosomal RNA molecule (Section 30.3). In eukaryotes, the AUG clos- 
est to the 5’ end of an mRNA molecule is usually the start signal for protein 
synthesis. This particular AUG is read by an initiator tRNA conjugated to 
methionine. After the initiator AUG has been located, the reading frame is 
established—groups of three nonoverlapping nucleotides are defined, 
beginning with the initiator AUG codon. 

As already mentioned, UAA, UAG, and UGA designate chain termina- 
tion. These codons are read not by tRNA molecules but rather by specific 
proteins called release factors (Section 30.3). Binding of a release factor to the 
ribosome releases the newly synthesized protein. 


-10 +1 
5’- Purine-rich ~~~ AUG ~~~~~~~~~~~~~~~m RNA 


Base-pairs with fMet. —-~--~~~-----__ > Protein 


ribosomal RNA 
(A) Prokaryotic start signal 


+1 First AUG from 5’ end 
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(B) Eukaryotic start signal 


FIGURE 4.35 Initiation of protein synthesis. Start signals are required for the initiation of 
protein synthesis in (A) prokaryotes and (B) eukaryotes. 


The genetic code is nearly universal 


Most organisms use the same genetic code. This universality accounts for 
the fact that human proteins, such as insulin, can be synthesized in the bac- 
terium E. coli and harvested from it for the treatment of diabetes. However, 
genome-sequencing studies have shown that not all genomes are translated 
by the same code. Ciliated protozoa, for example, differ from most organ- 
isms in that UAA and UAG are read as codons for amino acids rather than 
as stop signals; UGA is their sole termination signal. The first variations in 
the genetic code were found in mitochondria from a number of species, 
including human beings (Table 4.6). The genetic code of mitochondria can 
differ from that of the rest of the cell because mitochondrial DNA encodes 
a distinct set of transfer RNAs, adaptor molecules that recognize the alter- 
native codons. Thus, the genetic code is nearly but not absolutely universal. 

Why has the code remained nearly invariant through billions of years of 
evolution, from bacteria to human beings? A mutation that altered the 
reading of mRNA would change the amino acid sequence of most, if not all, 


proteins synthesized by that particular organism. Many of these changes 
would undoubtedly be deleterious, and so there would be strong selection 
against a mutation with such pervasive consequences. 


4.7 Most Eukaryotic Genes Are Mosaics of 
Introns and Exons 


In bacteria, polypeptide chains are encoded by a continuous array of triplet 
codons in DNA. For many years, genes in higher organisms were assumed 
to be organized in the same manner. This view was unexpectedly shattered 
in 1977, when investigators discovered that most eukaryotic genes are dis- 
continuous. The mosaic nature of eukaryotic genes was revealed by electron 
microscopic studies of hybrids formed between mRNA and a segment of 
DNA containing the corresponding gene (Figure 4.36). For example, the 


(A) DNA 


Duplex DNA JP RNAS 


Displaced strand of DNA 
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of DNA mRNA 


Duplex DNA 


FIGURE 4.36 Detection of introns by electron microscopy. An mRNA molecule (shown 

in red) is hybridized to genomic DNA containing the corresponding gene. (A) A single loop of 
single-stranded DNA (shown in blue) is seen if the gene is continuous. (B) Two loops of single- 
stranded DNA (blue) and a loop of double-stranded DNA (blue and green) are seen if the gene 
contains an intron. Additional loops are evident if more than one intron is present. 


gene for the B chain of hemoglobin is interrupted within its amino acid- 
coding sequence by a long stretch of of 550 non-coding base pairs and a 
short one of 120 base pairs. Thus, the B-globin gene is 
split into three coding sequences (Figure 4.37). Non-coding 
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Introns 
a oo 


Tegions are called introns (for intervening sequences), gusesjeeries 


whereas coding regions are called exons (for expressed 


‘ 240 120 500 550 250 Base pairs 
sequences). The average human gene has 8 introns, and “crishbegaie 3 
some have more than 100. The size ranges from 50 to 
10,000 nucleotides. FIGURE 4.37 Structure of the B-globin gene. 


RNA processing generates mature RNA 


At what stage in gene expression are introns removed? Newly synthesized 
RNA molecules (pre-mRNA or primary transcript) isolated from nuclei are 
much larger than the mRNA molecules derived from them; in regard to 
B-globin RNA, the former consists of approximately 1600 nucleotides and 
the latter approximately 900 nucleotides. In fact, the primary transcript of 
the B-globin gene contains two regions that are not present in the mRNA. 
These regions in primary transcript are excised, and the coding sequences are 
simultaneously linked by a precise splicing complex to form the mature mRNA 


e— es 


B-Globin gene 


Transcription, 
cap formation, and 
poly(A) addition 


Cap ———, — |= (A), 
Primary transcript 


Splicing 


Cap_ |p (A), 
B-Globin mRNA 


FIGURE 4.38 Transcription and 
processing of the B-globin gene. The gene 
is transcribed to yield the primary transcript, 
which is modified by cap and poly(A) 
addition. The introns in the primary RNA 
transcript are removed to form the mRNA. 
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Recombination 


(Figure 4.38). A common feature in the expression of discontinuous, or 
split, genes is that their exons are ordered in the same sequence in mRNA 
asin DNA. Thus, the codons in split genes, like continuous genes, are in the 
same linear order as the amino acids in the polypeptide products. 

Splicing is a complex operation that is carried out by spliceosomes, which 
are assemblies of proteins and small RNA molecules (snRNA). RNA plays 
the catalytic role (Section 29.3). Spliceosomes recognize signals in the nascent 
RNA that specify the splice sites. Introns nearly always begin with GU 
and end with an AG that is preceded by a pyrimidine-rich tract (Figure 4.39). 
This consensus sequence is part of the signal for splicing. 


5’ 3’ 
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FIGURE 4.39 Consensus sequence for the splicing of mRNA precursors. 


Many exons encode protein domains 


Most genes of higher eukaryotes, such as birds and mammals, are split. 
Lower eukaryotes, such as yeast, have a much higher proportion of continu- 
ous genes. In prokaryotes, split genes are extremely rare. Have introns been 
inserted into genes in the evolution of higher organisms? Or have introns been 
removed from genes to form the streamlined genomes of prokaryotes and 
simple eukaryotes? Comparisons of the DNA sequences of genes encoding 
evolutionarily conserved proteins suggest that introns were present in ancestral 
genes and were lost in the evolution of organisms that have become optimized for 
very rapid growth, such as prokaryotes. The positions of introns in some genes 
are at least 1 billion years old. Furthermore, acommon mechanism of splicing 
developed before the divergence of fungi, plants, and vertebrates, as shown by 
the finding that mammalian cell extracts can splice yeast RNA. 

What advantages might split genes confer? Many exons encode discrete 
structural and functional domains of proteins. An attractive hypothesis is that 
new proteins arose in evolution by the rearrangement of exons encoding discrete 
structural elements, binding sites, and catalytic sites, a process called exon shuf- 
fling. Because it preserves functional units but allows them to interact in new 
ways, exon shuffling is a rapid and efficient means of generating novel genes 
(Figure 4.40). Figure 4.41 shows the composition of a gene that was formed in 
part by exon shuffling. DNA can break and recombine in introns with no del- 
eterious effect on encoded proteins. In contrast, the exchange of 
sequences within different exons usually leads to loss of function. 

Another advantage of split genes is the potential for generating 
a series of related proteins by alternative splicing of the primary 
transcript. For example, a precursor of an antibody-producing 
cell forms an antibody that is anchored in the cell’s plasma mem- 
brane (Figure 4.42). The attached antibody recognizes a specific 
foreign antigen, an event that leads to cell differentiation and 
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FIGURE 4.40 Exon shuffling. Exons can be readily shuffled 
by recombination of DNA to expand the genetic repertoire. 
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their primary transcript in an alternative manner to form soluble 
antibody molecules that are secreted rather than retained on the 
cell surface. Alternative splicing is a facile means of forming a set of 
proteins that are variations of a basic motif without requiring a gene 
for each protein. Because of alternative splicing, the proteome is 
more diverse than the genome in eukaryotes. 
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SUMMARY 
4.1 A Nucleic Acid Consists of Four Kinds of Bases Linked to a 


4.2 


Sugar—Phosphate Backbone 

DNA and RNA are linear polymers of a limited number of monomers. 
In DNA, the repeating units are nucleotides, with the sugar being a 
deoxyribose and the bases being adenine (A), thymine (T), guanine 
(G), and cytosine (C). In RNA, the sugar is a ribose and the base uracil 
(U) is used in place of thymine. DNA is the molecule of heredity in all 
prokaryotic and eukaryotic organisms. In viruses, the genetic material 
is either DNA or RNA. 


A Pair of Nucleic Acid Strands with Complementary Sequences Can 
Form a Double-Helical Structure 

All cellular DNA consists of two very long, helical polynucleotide strands 
coiled around a common axis. The sugar—-phosphate backbone of each 
strand is on the outside of the double helix, whereas the purine and pyrim- 
idine bases are on the inside stabilized by stacking forces. The two strands 
are held together by hydrogen bonds between pairs of bases: adenine is 
always paired with thymine, and guanine is always paired with cytosine. 
Hence, one strand of a double helix is the complement of the other. The 
two strands of the double helix run in opposite directions. Genetic infor- 
mation is encoded in the precise sequence of bases along a strand. 


FIGURE 4.41 The tissue plasminogen 
activator (TPA) gene was generated by 
exon shuffling. The gene for TPA encodes 
an enzyme that functions in hemostasis 
(Section 10.4). This gene consists of 4 
exons, one (F) derived from the fibronectin 
gene which encodes an extracellular matrix 
protein, one from the epidermal growth 
factor gene (EGF), and two from the 
plasminogen gene (K, Section 10.4), the 
substrate of the TPA protein. The K domain 
appears to have arrived by exon shuffling 
and then been duplicated to generate the 
TPA gene that exists today. [Information 
from: www.ehu.es/ehusfera/genetica/ 
2012/10/02/demostracion-molecular- 
de-microevolucion/] 


FIGURE 4.42 Alternative splicing. 
Alternative splicing generates mRNAs 
that are templates for different forms of a 
protein: (A) a membrane-bound antibody 
on the surface of a lymphocyte and (B) its 
soluble counterpart, exported from the 
cell. The membrane-bound antibody is 
anchored to the plasma membrane by 

a helical segment (highlighted in yellow) 
that is encoded by its own exon. 
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4.3 


4.4 


4.5 


DNA is a structurally dynamic molecule that can exist in a variety of 
helical forms: A-DNA, B-DNA (the classic Watson—Crick helix), and 
Z-DNA. In A-, B-, and Z-DNA, two antiparallel chains are held together 
by Watson—Crick base pairs and stacking interactions between bases in 
the same strand. A- and B-DNA are right-handed helices. In B-DNA, 
the base pairs are nearly perpendicular to the helix axis. Z-DNA is a left- 
handed helix. Most of the DNA in a cell is in the B-form. 

Double-stranded DNA can also wrap around itself to form a super- 
coiled structure. The supercoiling of DNA has two important conse- 
quences. Supercoiling compacts the DNA and, because supercoiled 
DNA is partly unwound, it is more accessible for interactions with 
other biomolecules. 

Single-stranded nucleic acids, most notably RNA, can form com- 
plicated three-dimensional structures that may contain extensive dou- 
ble-helical regions that arise from the folding of the chain into hairpins. 


The Double Helix Facilitates the Accurate Transmission of 

Hereditary Information 

The structural nature of the double helix readily accounts for the accu- 
rate replication of genetic material because the sequence of bases in one 
strand determines the sequence of bases in the other strand. In replica- 
tion, the strands of the helix separate and a new strand complementary 
to each of the original strands is synthesized. Thus, two new double 
helices are generated, each composed of one strand from the original 
molecule and one newly synthesized strand. This mode of replication is 
called semiconservative replication because each new helix retains one 
of the original strands. 

In order for replication to take place, the strands of the double helix 
must be separated. In vitro, heating a solution of double-helical DNA 
separates the strands, a process called melting. On cooling, the strands 
reanneal and re-form the double helix. In the cell, special proteins tem- 
porarily separate the strands in replication. 


DNA Is Replicated by Polymerases That Take Instructions from Templates 
In the replication of DNA, the two strands of a double helix unwind and 
separate as new strands are synthesized. Each parent strand, with the 
help of a primer, acts as a template for the formation of a new comple- 
mentary strand. The replication of DNA is a complex process carried out 
by many proteins, including several DNA polymerases. The activated 
precursors in the synthesis of DNA are the four deoxyribonucleoside 
5'-triphosphates. The new strand is synthesized in the 5’ — 3’ direction 
by a nucleophilic attack by the 3'-hydroxyl terminus of the primer strand 
on the innermost phosphorus atom of the incoming deoxyribonucleoside 
triphosphate. Most important, DNA polymerases catalyze the forma- 
tion of a phosphodiester linkage only if the base on the incoming nucleo- 
tide is complementary to the base on the template strand. In other words, 
DNA polymerases are template-directed enzymes. The genes of some 
viruses, such as tobacco mosaic virus, are made of single-stranded RNA. 
An RNA-directed RNA polymerase mediates the replication of this 
viral RNA. Retroviruses, exemplified by HIV-1, have a single-stranded 
RNA genome that undergoes reverse transcription into double-stranded 
DNA by reverse transcriptase, an RNA-directed DNA polymerase. 


Gene Expression Is the Transformation of DNA Information into 
Functional Molecules 

The flow of genetic information in normal cells is from DNA to RNA 
to protein. The synthesis of RNA from a DNA template is called 


4.6 


4.7 


transcription, whereas the synthesis of a protein from an RNA tem- 
plate is termed translation. Cells contain several kinds of RNA, among 
which are messenger RNA (mRNA), transfer RNA (tRNA), and ribo- 
somal RNA (rRNA), which vary in size from 75 to more than 5000 
nucleotides. All cellular RNA is synthesized by RNA polymerases 
according to instructions given by DNA templates. The activated 
intermediates are ribonucleoside triphosphates and the direction of 
synthesis, like that of DNA, is 5’ > 3’. RNA polymerase differs from 
DNA polymerase in not requiring a primer. 


Amino Acids Are Encoded by Groups of Three Bases Starting from 

a Fixed Point 

The genetic code is the relation between the sequence of bases in DNA 
(or its RNA transcript) and the sequence of amino acids in proteins. 
Amino acids are encoded by groups of three bases (called codons) start- 
ing from a fixed point. Sixty-one of the 64 codons specify particular 
amino acids, whereas the other 3 codons (UAA, UAG, and UGA) are 
signals for chain termination. Thus, for most amino acids, there is more 
than one code word. In other words, the code is degenerate. The genetic 
code is nearly the same in all organisms. Natural mRNAs contain start 
and stop signals for translation, just as genes do for directing where 
transcription begins and ends. 


Most Eukaryotic Genes Are Mosaics of Introns and Exons 

Most genes in higher eukaryotes are discontinuous. Coding sequences 
in these split genes, called exons, are separated by noncoding sequences, 
called introns, which are removed in the conversion of the primary 
transcript into mRNA and other functional mature RNA molecules. 
Split genes, like continuous genes, are colinear with their polypeptide 
products. A striking feature of many exons is that they encode func- 
tional domains in proteins. New proteins probably arose in the course 
of evolution by the shuffling of exons. Introns may have been present 
in primordial genes but were lost in the evolution of such fast-growing 
organisms as bacteria and yeast. 


KEY TERMS 
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double helix (p. 105) semiconservative replication (p. 114) 
deoxyribonucleic acid (DNA) (p. 106) DNA polymerase (p. 117) 
deoxyribose (p. 106) template (p. 117) 


ribose (p. 106) primer (p. 118) 


purine (p. 107) reverse transcriptase (p. 119) 
pyrimidine (p. 107) messenger RNA (mRNA) (p. 120) 
ribonucleic acid (RNA) (p. 107) translation (p. 120) 

nucleoside (p. 107) transfer RNA (tRNA) (p. 120) 
nucleotide (p. 107) ribosomal RNA (rRNA) (p. 120) 
B-DNA (p. 111) transcription (p. 120) 

A-DNA (p. 111) RNA polymerase (p. 120) 
Z-DNA (p. 112) promoter site (p. 122) 


PROBLEMS 


anticodon (p. 123) 

codon (p. 123) 

genetic code (p. 124) 
ribosome (p. 126) 
Shine-Dalgarno sequence (p. 126) 
intron (p. 127) 

exon (p. 127) 

splicing (p. 128) 
spliceosomes (p. 128) 

exon shuffling (p. 128) 
alternative splicing (p. 128) 


1. A t instead of an s? Differentiate between a nucleoside 
and a nucleotide. 


2. A lovely pair. What is a Watson—Crick base pair? 


3. Chargaff rules! Biochemist Erwin Chargaff was the first 
to note that, in DNA, [A] = [T] and [G] = [C], equalities 


now called Chargaff’s rule. Using this rule, determine the 


percentages of all the bases in DNA that is 20% thymine. 
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4. But not always. A single strand of RNA is 20% U. What 
can you predict about the percentages of the remaining 
bases? 


5. Complements. Write the complementary sequence 
(in the standard 5’ — 3’ notation) for (a) GATCAA, 
(b) TCGAAGC, (c) ACGCGT, and (d) TACCAT. 


6. Compositional constraint. The composition (in mole- 
fraction units) of one of the strands of a double-helical 
DNA molecule is [A] = 0.30 and [G] = 0.24. (a) What can 
you say about [T] and [C] for the same strand? (b) What 
can you say about [A], [G], [T], and [C] of the complemen- 
tary strand? 


7. Size matters. Why are GC and AT the only base pairs 
permissible in the double helix? 


8. Strong, but not strong enough. Why does heat denature, 
or melt, DNA in solution? 


9. Uniqueness. The human genome contains 3 billion nucle- 
otides arranged in a vast array of sequences. What is the 
minimum length of a DNA sequence that will, in all prob- 
ability, appear only once in the human genome? You need 
consider only one strand and may assume that all four 
nucleotides have the same probability of appearance. 


10. Coming and going. What does it mean to say that the 
DNA strands in a double helix have opposite directionality 
or polarity? 


11. All for one. If the forces—hydrogen bonds and stacking 
forces—holding a helix together are weak, why is it difficult 
to disrupt a double helix? 


12. Overcharged. DNA in the form of a double helix must 
be associated with cations, usually Mg?*+. Why is this 
requirement the case? 


13. Not quite from A to Z. Describe the three forms that a 
double helix can assume. 


14. Lost DNA. The DNA of a deletion mutant of \ bacte- 
riophage has a length of 15 ym instead of 17 um. How 
many base pairs are missing from this mutant? 


15. Axial ratio. What is the axial ratio (length:diameter) of 
a DNA molecule 20 pm long? 


16. Guide and starting point. Define template and primer as 
they relate to DNA synthesis. 


17. An unseen pattern. What result would Meselson and 
Stahl have obtained if the replication of DNA were conser- 
vative (1.e., the parental double helix stayed together)? Give 
the expected distribution of DNA molecules after 1.0 and 
2.0 generations for conservative replication. 


18. Which way? Explain, on the basis of nucleotide struc- 
ture, why DNA synthesis proceeds in the 5’-to-3’ direction. 


19. Tagging DNA. 


(a) Suppose that you want to radioactively label DNA but 
not RNA in dividing and growing bacterial cells. Which 
radioactive molecule would you add to the culture medium? 
(b) Suppose that you want to prepare DNA in which the 
backbone phosphorus atoms are uniformly labeled with *”P. 
Which precursors should be added to a solution containing 
DNA polymerase and primed template DNA? Specify the 
position of radioactive atoms in these precursors. 


20. Finding a template. A solution contains DNA poly- 
merase and the Mg?* salts of dATP, dGTP, dCTP, and 
TTP. The following DNA molecules are added to aliquots of 
this solution. Which of them would lead to DNA synthesis? 


(a) A single-stranded closed circle containing 1000 nucleo- 
tide units. 

(b) A double-stranded closed circle containing 1000 nucle- 
otide pairs. 

(c) A single-stranded closed circle of 1000 nucleotides 
base- paired to a linear strand of 500 nucleotides with a free 
3’-OH terminus. 

(d) A double-stranded linear molecule of 1000 nucleotide 
pairs with a free 3’-OH group at each end. 


21. Retrograde. What is a retrovirus and how does informa- 
tion flow for a retrovirus differ from that for the infected cell? 


22. The right start. Suppose that you want to assay reverse 
transcriptase activity. If polyriboadenylate is the template in 
the assay, what should you use as the primer? Which radio- 
active nucleotide should you use to follow chain elongation? 


23. Essential degradation. Reverse transcriptase has ribo- 
nuclease activity as well as polymerase activity. What is the 
role of its ribonuclease activity? 


24. Virus hunting. You have purified a virus that infects 
turnip leaves. Treatment of a sample with phenol removes 
viral proteins. Application of the residual material to scraped 
leaves results in the formation of progeny virus particles. 
You infer that the infectious substance is a nucleic acid. 
Propose a simple and highly sensitive means of determining 
whether the infectious nucleic acid is DNA or RNA. 


25. Mutagenic consequences. Spontaneous deamination of 
cytosine bases in DNA takes place at low but measurable 
frequency. Cytosine is converted into uracil by loss of its 
amino group. After this conversion, which base pair occu- 
pies this position in each of the daughter strands resulting 
from one round of replication? Two rounds of replication? 


26. Information content. 

(a) How many different 8-mer sequences of DNA are there? 
(Hint: There are 16 possible dinucleotides and 64 possible 
trinucleotides.) We can quantify the information-carrying 
capacity of nucleic acids in the following way. Each position 
can be one of four bases, corresponding to two bits of 


information (2? = 4). Thus, a chain of 5100 nucleotides cor- 
responds to 2 X 5100 = 10,200 bits, or 1275 bytes (1 byte = 
8 bits). 

(b) How many bits of information are stored in an 8-mer 
DNA sequence? In the E. coli genome? In the human genome? 
(c) Compare each of these values with the amount of infor- 
mation that can be stored on a computer compact disc, or CD 
(about 700 megabytes). 


27. Key polymerases. Compare DNA polymerase and RNA 
polymerase from E. coli in regard to each of the following 
features: (a) activated precursors, (b) direction of chain 
elongation, (c) conservation of the template, and (d) need 
for a primer. 


28. Different strands. Explain the difference between the 
coding strand and the template strand in DNA. 


29. Family resemblance. Differentiate among mRNA, 
rRNA and tRNA. 


30. A code you can live by. What are the key characteristics 
of the genetic code? 


31. Encoded sequences. 


(a) Write the sequence of the mRNA molecule synthesized 
froma DNA template strand having the following sequence: 


5'-ATCGTACCGTTA-3’ 


(b) What amino acid sequence is encoded by the following 
base sequence of an mRNA molecule? Assume that the 
reading frame starts at the 5’ end. 


5'-UUGCCUAGUGAUUGGAUG-3’ 


(c) What is the sequence of the polypeptide formed on 
addition of poly(UUAC) to a cell-free protein-synthesizing 
system? 


32. A tougher chain. RNA is readily hydrolyzed by alkali, 
whereas DNA is not. Why? 


33. A picture is worth a thousand words. Write a reaction 
sequence showing why RNA is more susceptible to nucleo- 
philic attack than DNA. 


34. Flowing information. What is meant by the phrase gene 
expression? 


35. Wecan all agree on that. What is a consensus sequence? 


36. A potent blocker. Cordycepin (3'-deoxyadenosine) is an 
adenosine analog. When converted into cordycepin 5’-tri- 
phosphate, it inhibits RNA synthesis. How does cordyce- 
pin 5’-triphosphate block the synthesis of RNA? 


37. Silent RNA. The code word GGG cannot be deci- 
phered in the same way as can UUU, CCC, and AAA, 
because poly(G) does not act as a template. Poly(G) forms a 
triple-stranded helical structure. Why is it an ineffective 
template? 
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38. Sometimes it is not so bad. What is meant by the degen- 
eracy of the genetic code? 


39. In fact, it can be good. What is the biological benefit of 
a degenerate genetic code? 


40. To bring together as associates. Match the components 
in the right-hand column with the appropriate process in 
the left-hand column. 


. RNA polymerase 
. DNA polymerase 
. Ribosome 
dNTP 

tRNA 

NTP 

. mRNA 

. primer 

. rRNA 


10. promoter 


(a) Replication 
(b) Transcription 
(c) Translation 
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41. A lively contest. Match the components in the right- 
hand column with the appropriate process in the left-hand 
column. 


(a) {Met 1. continuous message 

(b) Shine-Dalgarno 2. removed during processing 

(c) intron 3. the first of many amino 
acids 

(d) exon 4, joins exons 

(e) pre-mRNA 5. joined to make the final 
message 

(f) mRNA 6. locates the start site 

(g) spliceosome 7. discontinuous message 


42. Two from one. Synthetic RNA molecules of defined 
sequence were instrumental in deciphering the genetic 
code. Their synthesis first required the synthesis of DNA 
molecules to serve as templates. Har Gobind Khorana syn- 
thesized, by organic-chemical methods, two complemen- 
tary deoxyribonucleotides, each with nine residues: 
d(TAC), and d(GTA),. Partly overlapping duplexes that 
formed on mixing these oligonucleotides then served as 
templates for the synthesis by DNA polymerase of long, 
repeating double-helical DNA strands. The next step was 
to obtain long polyribonucleotide chains with a sequence 
complementary to only one of the two DNA strands. How 


did Khorana obtain only poly(UAC)? Only poly(GUA)? 


43. Triple entendre. The RNA transcript of a region of T4 
phage DNA contains the sequence 5’-AAAUGAGGA-3’. 
This sequence encodes three different polypeptides. What 
are they? 


44, A new translation. A transfer RNA with a UGU antico- 
don is enzymatically conjugated to !'*C-labeled cysteine. The 
cysteine unit is then chemically modified to alanine (with 
the use of Raney nickel, which removes the sulfur atom of 
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cysteine). The altered aminoacyl-tRNA is added to a pro- 
tein-synthesizing system containing normal components 
except for this tRNA. The mRNA added to this mixture 
contains the following sequence: 


5'-UUUUGCCAUGUUUGUGCU-3' 


What is the sequence of the corresponding radiolabeled 
peptide? 


45. A tricky exchange. Define exon shuffling and explain 
why its occurrence might be an evolutionary advantage. 


46. From one, many. Explain why alternative splicing 
increases the coding capacity of the genome. 


47. The unity of life. What is the significance of the fact that 
human mRNA can be accurately translated in E. coli? 


Chapter Integration Problems 


48. Back to the bench. A protein chemist told a molecular 
geneticist that he had found a new mutant hemoglobin in 
which aspartate replaced lysine. The molecular geneticist 
expressed surprise and sent his friend scurrying back to the 
laboratory. (a) Why did the molecular geneticist doubt the 
reported amino acid substitution? (b) Which amino acid 
substitutions would have been more palatable to the molec- 
ular geneticist? 


49. Eons apart. The amino acid sequences of a yeast protein 
and a human protein having the same function are found to 
be 60% identical. However, the corresponding DNA 
sequences are only 45% identical. Account for this differing 
degree of identity. 


Data Interpretation Problems 


50. 3 is greater than 2. The adjoining illustration graphs the 
relation between the percentage of GC base pairs in DNA 
and the melting temperature. Suggest a plausible explana- 
tion for these results. 


100 - 


80+ 


Guanine + cytosine 
(mole percent) 


0 1 i i i 
60 70 80 90 100 110 


Tm CC) 


[Data from R. J. Britten and D. E. Kohne, Science 161:529-540, 1968.] 


51. Blast from the past. The illustration below is a graph 
called a Cyt curve (pronounced “cot”’). The y-axis shows the 
percentage of DNA that is double stranded. The x-axis is 
the product of the concentration of DNA and the time 
required for the double-stranded molecules to form. 
Explain why the mixture of poly(A) and poly(U) and the 
three DNAs shown vary in the Cot value required to com- 
pletely anneal. MS2 and T4 are bacterial viruses (bacterio- 
phages) with genome sizes of 3569 and 168,903 bp, respec- 
tively. The E. coli genome is 4.6 X 10° bp. 


(e) 


Poly(U) + 
poly(A) 


0.5 0.5 


Fraction reassociated 


10° 10° 107 10°73 107 0.1 1 
Cot (mole x s liter~!) 


10 100 1,000 10,000 


[Data from J. Marmur and P. Doty, J. Mol. Biol. 5:120, 1962.] 


52. Salt to taste. The graph below shows the effect of salt 
concentration on melting temperature of bacterial DNA. 
How does salt concentration affect the melting temperature 
of DNA? Account for this effect. 

0% 
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Exploring Genes and Genomes 


CHAPTER 


Processes such as the development from a caterpillar 
into a butterfly entail dramatic changes in patterns of 
gene expression. The expression levels of thousands of 
genes can be monitored through the use of DNA arrays. 


At the right, a DNA microarray reveals the expression 
levels of more than 12,000 human genes; the brightness 
and color of each spot indicates a change in the 
expression level of the corresponding gene. [(Left) Cathy 
Keifer/istockphoto.com. (Right) Agilent Technologies. ] 


G ince its emergence in the 1970s, recombinant DNA technology has 
revolutionized biochemistry. The genetic endowment of organisms 
can now be precisely changed in designed ways. Recombinant DNA 
technology is the fruit of several decades of basic research on DNA, RNA, 
and viruses. It depends, first, on having enzymes that can cut, join, and 
replicate DNA and those that can reverse transcribe RNA. Restriction 
enzymes cut very long DNA molecules into specific fragments that can 
be manipulated; DNA ligases join the fragments together. Many kinds of 
restriction enzymes are available. By applying this assortment cleverly, 
researchers can treat DNA sequences as modules that can be moved 
at will from one DNA molecule to another. Thus, recombinant DNA 
technology is based on the use of enzymes that act on nucleic acids as 
substrates. 

A second foundation is the base-pairing language that allows comple- 
mentary sequences to recognize and bind to each other. Hybridization with 
complementary DNA (cDNA) or RNA probes is a sensitive means of 
detecting specific nucleotide sequences. In recombinant DNA technology, 
base-pairing is used to construct new combinations of DNA as well as to 
detect and amplify particular sequences. 

Third, powerful methods have been developed for determining the 
sequence of nucleotides in DNA. These methods have been harnessed to 
sequence complete genomes: first, small genomes from viruses; then, larger 
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genomes from bacteria; and, finally, eukaryotic genomes, including the 
3-billion-base-pair human genome. Scientists are just beginning to exploit 
the enormous information content of these genome sequences. 

Finally, recombinant DNA technology critically depends on our ability 
to deliver foreign DNA into host organisms. For example, DNA fragments 
can be inserted into plasmids, where they can be replicated within a short 
period of time in their bacterial hosts. In addition, viruses efficiently deliver 
their own DNA (or RNA) into hosts, subverting them either to replicate the 
viral genome and produce viral proteins or to incorporate viral DNA into 
the host genome. 

These new methods have wide-ranging benefits across a broad spectrum 
of disciplines, including biotechnology, agriculture, and medicine. Among 
these benefits is the dramatic expansion of our understanding of human 
disease. Throughout this chapter, a specific disorder, amyotrophic lateral 
sclerosis (ALS), will be used to illustrate the effect that recombinant DNA 
technology has had on our knowledge of disease mechanisms. ALS was first 
described clinically in 1869 by the French neurologist Jean- Martin Charcot 
as a fatal neurodegenerative disease of progressive weakening and atrophy 
of voluntary muscles. ALS is commonly referred to as Lou Gehrig’s 
Disease, for the baseball legend whose career and life were prematurely cut 
short as a result of this devastating disease. For many years, little progress 
had been made in the study of the mechanisms underlying ALS. As we shall 
see, significant advances have been made with the use of research tools 
facilitated by recombinant DNA technology. 


5.1 The Exploration of Genes Relies on Key Tools 


The rapid progress in biotechnology—indeed its very existence—is a result 
of a few key techniques. 


1. Restriction-enzyme analysis. Restriction enzymes are precise molecular 
scalpels that allow an investigator to manipulate DNA segments. 


2. Blotting techniques. Southern and northern blots are used to separate and 
identify DNA and RNA sequences, respectively. The western blot, which 
uses antibodies to characterize proteins, was described in Chapter 3. 


3. DNA sequencing. The precise nucleotide sequence of a molecule of 
DNA can be determined. Sequencing has yielded a wealth of information 
concerning gene architecture, the control of gene expression, and protein 
structure. 


4. Solid-phase synthesis of nucleic acids. Precise sequences of nucleic acids 
can be synthesized de novo and used to identify or amplify other nucleic 
acids. 


5. The polymerase chain reaction (PCR). The polymerase chain reaction 
leads to a billionfold amplification of a segment of DNA. One molecule of 
DNA can be amplified to quantities that permit characterization and 
manipulation. This powerful technique can be used to detect pathogens and 
genetic diseases, determine the source of a hair left at the scene of a crime, 
and resurrect genes from the fossils of extinct organisms. 


A final set of techniques relies on the computer, without which, it would 
be impossible to catalog, access, and characterize the abundant information 
generated by the methods outlined above. Such uses of the computer will be 
presented in Chapter 6. 


Restriction enzymes split DNA into specific fragments 


Restriction enzymes, also called restriction endonucleases, recognize specific 
base sequences in double-helical DNA and cleave both strands of that 
duplex at specific places. To biochemists, these exquisitely precise scalpels 
are marvelous gifts of nature. They are indispensable for analyzing chromo- 
some structure, sequencing very long DNA molecules, isolating genes, and 
creating new DNA molecules that can be cloned. 

Restriction enzymes are found in a wide variety of prokaryotes. Their 
biological role is to cleave foreign DNA molecules, providing the host 
organism with a primitive immune system. Many restriction enzymes rec- 
ognize specific sequences of four to eight base pairs and hydrolyze a phos- 
phodiester bond in each strand in this region. A striking characteristic of 
these cleavage sites is that they almost always possess twofold rotational 
symmetry. In other words, the recognized sequence is palindromic, or an 
inverted repeat, and the cleavage sites are symmetrically positioned. For 
example, the sequence recognized by a restriction enzyme from Streptomyces 
achromogenes is 


Cleavage site 


1 
a ae aa re ee a dl 
hb ge EY 
3’ G—G—C+G—C—C 35’ 
ih 


Cleavage site Symmetry axis 

In each strand, the enzyme cleaves the C—G phosphodiester bond on the 
3’ side of the symmetry axis. As we shall see in Chapter 9, this symmetry 
corresponds to that of the structures of the restriction enzymes themselves. 

Several hundred restriction enzymes have been purified and character- 
ized. Their names consist of a three-letter abbreviation for the host organ- 
ism (e.g., Eco for Escherichia coli, Hin for Haemophilus influenzae, Hae for 
Haemophilus aegyptius) followed by a strain designation (if needed) and a 
roman numeral (to distinguish multiple enzymes from the same strain). The 
specificities of several of these enzymes are shown in Figure 5.1. 

Restriction enzymes are used to cleave DNA molecules into specific 
fragments that are more readily analyzed and manipulated than the entire 
parent molecule. For example, the 5.1-kb circular duplex DNA of the 
tumor-producing SV 40 virus is cleaved at one site by EcoRI, at four sites by 
Hpal, and at 11 sites by HindIII. A piece of DNA, called a restriction frag- 
ment, produced by the action of one restriction enzyme can be specifically 
cleaved into smaller fragments by another restriction enzyme. The pattern 
of such fragments can serve as a fingerprint of a DNA molecule, as will be 
considered shortly. Indeed, complex chromosomes containing hundreds of 
millions of base pairs can be mapped by using a series of restriction enzymes. 


Restriction fragments can be separated by gel 
electrophoresis and visualized 


In Chapter 3, we considered the use of gel electrophoresis to separate pro- 
tein molecules (Section 3.1). Because the phosphodiester backbone of DNA 
is highly negatively charged, this technique is also suitable for the separa- 
tion of nucleic acid fragments. Among the many applications of DNA elec- 
trophoresis are the detection of mutations that affect restriction fragment 
size (such as insertions and deletions) and the isolation, purification, and 
quantitation of a specific DNA fragment. 

For most gels, the shorter the DNA fragment, the farther the migration. 
Polyacrylamide gels are used to separate, by size, fragments containing as 
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Palindrome 
A word, sentence, or verse that reads the 


same from right to left as it does from left 
to right. 

Radar 

Senile felines 

Do geese see God? 

Roma tibi subito motibus ibit amor 
Derived from the Greek palindromos, 
“running back again.” 


y 
5’ GGATCC 3! 
3' CCTAGG 5’ 
t 


BamHI 


{ 
5' GAATTC 3’ 


e EcoRI 
3'CTTAAG 5’ 


Haelll 


5’ GCGC 3' 
3° CGCG5' 
t 


1 
5’ CTCGAG 3' 
3' GAGCTC5’ 
i) 


FIGURE 5.1 Specificities of some 
restriction endonucleases. The 
sequences that are recognized by these 
enzymes contain a twofold axis of 
symmetry. The two strands in these regions 
are related by a 180-degree rotation about 
the axis marked by the green symbol. The 
cleavage sites are denoted by red arrows. 
The abbreviated name of each restriction 
enzyme is given at the right of the 
sequence that it recognizes. Note that 

the cuts may be staggered or even. 
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Xhol 
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FIGURE 5.2 Gel-electrophoresis pattern 
of a restriction digest. This gel shows the 
fragments produced by cleaving DNA from 
two viral strains (odd- vs. even-numbered 
lanes) with each of four restriction enzymes. 
These fragments were made fluorescent by 
staining the gel with ethidium bromide. [Data 
from Carr et al., Emerging Infectious Diseases, 
www.cdc.gov/eid, 17(8), August 201 1.] 


FIGURE 5.3 Southern blotting. A DNA 
fragment containing a specific sequence 
can be identified by separating a mixture of 
fragments by electrophoresis, transferring 
them to nitrocellulose, and hybridizing 

with a °2P-labeled probe complementary 
to the sequence. The fragment containing 
the sequence is then visualized by 
autoradiography. 
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many as 1000 base pairs, whereas more-porous agarose gels are used to 
resolve mixtures of larger fragments (as large as 20 kb). An important fea- 
ture of these gels is their high resolving power. In certain kinds of gels, frag- 
ments differing in length by just one nucleotide out of several hundred can 
be distinguished. Bands or spots of radioactive DNA in gels can be visual- 
ized by autoradiography. Alternatively, a gel can be stained with a dye such 
as ethidium bromide, which fluoresces an intense orange under irradiation 
with ultraviolet light when bound to a double-helical DNA molecule 
(Figure 5.2). A band containing only 10 ng of DNA can be readily seen. 

It is often necessary to determine if a particular base sequence is repre- 
sented in a given DNA sample. For example, one may wish to confirm the 
presence of a specific mutation in genomic DNA isolated from patients 
known to be at risk for a particular disease. This specific sequence can be 
identified by hybridizing it with a labeled complementary DNA strand 
(Figure 5.3). A mixture of restriction fragments is separated by electropho- 
resis through an agarose gel, denatured to form single-stranded DNA, and 
transferred to a nitrocellulose sheet. The positions of the DNA fragments in 
the gel are preserved during the transfer. The nitrocellulose is then exposed 
to a 32P-labeled or fluorescently tagged DNA probe, a short stretch of single- 
stranded DNA which contains a known base sequence. The probe hybrid- 
izes with a restriction fragment having a complementary sequence, and 
autoradiography or fluorescence imaging then reveals the position of the 
restriction-fragment—probe duplex. A particular fragment amid a million 
others can be readily identified in this way. This powerful technique is 
named Southern blotting, for its inventor Edwin Southern. 

In a similar manner, RNA molecules of a specific sequence can also be 
readily identified. After separation by gel electrophoresis and transfer to 
nitrocellulose, specific sequences can be detected by DNA probes. This 
analogous technique for the analysis of RNA has been whimsically termed 
northern blotting. A further play on words accounts for the term western blot- 
ting, which refers to a technique for detecting a particular protein by stain- 
ing with specific antibody (Section 3.3). 


DNA can be sequenced by controlled termination of replication 


The analysis of DNA structure and its role in gene expression have been mark- 
edly facilitated by the development of powerful techniques for the sequencing of 
DNA molecules. One of the first and most widely-used techniques for DNA 
sequencing is controlled termination of replication, also referred to as the Sanger 
dideoxy method for its pioneer, Frederick Sanger. The key to this approach is 
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the generation of DNA fragments whose length is determined by the last base 
in the sequence (Figure 5.4). In the current application of this method, a DNA 
polymerase is used to make the complement of a particular sequence within a 
single-stranded DNA molecule. The synthesis is primed by a chemically syn- 
thesized fragment that is complementary to a part of the sequence known from 
other studies. In addition to the four deoxyribonucleoside triphosphates, the 
reaction mixture contains a small amount of the 2',3’-dideoxy analog of each 
nucleotide, each carrying a different fluorescent label attached to the base (e.g., 
a green emitter for termination at A and a red one for termination at T). 
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2', 3'-Dideoxy analog 


The incorporation of this analog blocks further growth of the new chain 
because it lacks the 3’-hydroxyl terminus needed to form the next phospho- 
diester bond. The concentration of the dideoxy analog is low enough that 
chain termination will take place only occasionally. The polymerase will 
insert the correct nucleotide sometimes and the dideoxy analog other times, 
stopping the reaction. For instance, if the dideoxy analog of dATP is pres- 
ent, fragments of various lengths are produced, but all will be terminated by 
the dideoxy analog. Importantly, this dideoxy analog of dATP will be 
inserted only where a T was located in the DNA being sequenced. Thus, the 
fragments of different length will correspond to the positions of T. 

The resulting fragments are separated by a technique known as capillary 
electrophoresis, in which the mixture is passed through a very narrow tube 
containing a gel matrix at high voltage to achieve efficient separation within 
a short time. As the DNA fragments emerge from the capillary, they are 
detected by their fluorescence; the sequence of their colors directly gives the 
base sequence. Sequences of as many as 1000 bases can be determined in 
this way. Indeed, automated Sanger sequencing machines can read more 
than 1 million bases per day. 


DNA probes and genes can be synthesized by automated 
solid-phase methods 


DNA strands, like polypeptides (Section 3.4), can be synthesized by the 
sequential addition of activated monomers to a growing chain that is linked 
to a solid support. The activated monomers are protected deoxyribonucleo- 
side 3' -phosphoramidites. In step 1, the 3’-phosphorus atom of this incoming 
unit becomes joined to the 5’-oxygen atom of the growing chain to form a 
phosphite triester (Figure 5.5). The 5’-OH group of the activated monomer 
is unreactive because it is blocked by a dimethoxytrityl (DMT) protecting 
group, and the 3’-phosphoryl oxygen atom is rendered unreactive by 
attachment of the B-cyanoethyl (GCE) group. Likewise, amino groups on 
the purine and pyrimidine bases are blocked. 

Coupling is carried out under anhydrous conditions because water reacts 
with phosphoramidites. In step 2, the phosphite triester (in which P is 
trivalent) is oxidized by iodine to form a phosphotriester (in which P is pen- 
tavalent). In step 3, the DMT protecting group on the 5'-OH group of the 
growing chain is removed by the addition of dichloroacetic acid, which 
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FIGURE 5.4 Fluorescence detection of 
oligonucleotide fragments produced by 
the dideoxy method. A sequencing 
reaction is performed with four chain- 
terminating dideoxy nucleotides, each 
labeled with a tag that fluoresces at a 
different wavelength. The color of each 
fragment indicates the identity of the last 
base in the chain. The fragments are 
separated by size using capillary 
electrophoresis and the fluorescence at 
each of the four wavelengths indicates 
the sequence of the complement of the 
original DNA template. 
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FIGURE 5.5 Solid-phase synthesis of a DNA chain by the phosphite triester 
method. The activated monomer added to the growing chain is a deoxyribonucleoside 
8'-phosphoramidite containing a dimethoxytrityl (DMT) protecting group on its 5’-oxygen 
atom, a B-cyanoethyl (8CE) protecting group on its 3’-phosphoryl oxygen atom, and a 
protecting group on the base. 


OCH; leaves other protecting groups intact. The DNA chain is now 
Dimethoxytrityl elongated by one unit and ready for another cycle of addi- 
(DMT) group tion. Each cycle takes only about 10 minutes and usually 
elongates more than 99% of the chains. 

Cc ' This solid-phase approach is ideal for the synthesis of 
ee \ noe base (protected)|) DNA, as it is for polypeptides, because the desired product 
a 0 stays on the insoluble support until the final release step. All 
the reactions take place in a single vessel, and excess soluble 
reagents can be added to drive reactions to completion. At the 
end of each step, soluble reagents and by-products are 
washed away from the resin that bears the growing chains. 
cGcawaiek <— a nf At the end of the synthesis, NH, is added to remove all pro- 
(BCE) group mie tg A 3 tecting groups and release the oligonucleotide from the solid 
we | support. Because elongation is never 100% complete, the new 
CHs DNA chains are of diverse lengths—the desired chain is the 
A-deaxyith onuclaoside 3’sphosphoranildite longest one. The sample can be purified by high-performance 
with DMT and BCE attached liquid chromatography or by electrophoresis on polyacryl- 
amide gels. DNA chains of as many as 100 nucleotides can be 

readily synthesized by this automated method. 

The ability to rapidly synthesize DNA chains of any selected sequence 
opens many experimental avenues. For example, a synthesized oligonucle- 
otide labeled at one end with %”P or a fluorescent tag can be used to search 
for a complementary sequence in a very long DNA molecule or even in a 
genome consisting of many chromosomes. The use of labeled oligonucle- 
otides as DNA probes is powerful and general. For example, a DNA probe 
that can base-pair to a known complementary sequence in a chromosome 
can serve as the starting point of an exploration of adjacent uncharted DNA. 
Such a probe can be used as a primer to initiate the replication of neighbor- 
ing DNA by DNA polymerase. An exciting application of the solid-phase 
approach is the synthesis of new tailor-made genes. New proteins with novel 


\ CH; 


oa 
NC—G HSC 


140 


properties can now be produced in abundance by the expression of synthetic 
genes. Finally, the synthetic scheme heretofore described can be slightly 
modified for the solid-phase synthesis of RNA oligonucleotides, which can 
be very powerful reagents for the degradation of specific mRNA molecules 
in living cells by a technique known as RNA interference (Section 5.4). 


Selected DNA sequences can be greatly amplified by the 
polymerase chain reaction 


In 1984, Kary Mullis devised an ingenious method called the polymerase 
chain reaction (PCR) for amplifying specific DNA sequences. Consider a 
DNA duplex consisting of a target sequence surrounded by nontarget DNA. 
Millions of copies of the target sequences can be readily obtained by PCR if 
the sequences flanking the target are known. PCR is carried out by adding 
the following components to a solution containing the target sequence: 
(1) a pair of primers that hybridize with the flanking sequences of the target, 
(2) all four deoxyribonucleoside triphosphates (dNTPs), and (3) a heat- 
stable DNA polymerase. A PCR cycle consists of three steps (Figure 5.6). 


1. Strand separation. The two strands of the parent DNA molecule are 
separated by heating the solution to 95°C for 15 s. 


2. Hybridization of primers. The solution is then abruptly cooled to 54°C to 
allow each primer to hybridize to a DNA strand. One primer hybridizes to 
the 3’ end of the target on one strand, and the other primer hybridizes to the 
3’ end on the complementary target strand. Parent DNA duplexes do not 
form, because the primers are present in large excess. Primers are typically 
from 20 to 30 nucleotides long. 


3. DNA synthesis. The solution is then heated to 72°C, the optimal tem- 
perature for heat-stable polymerases. One such enzyme is Taq DNA 
polymerase, which is derived from Thermus aquaticus, a thermophilic 
bacterium that lives in hot springs. The polymerase elongates both prim- 
ers in the direction of the target sequence because DNA synthesis is in the 
5'-to-3' direction. DNA synthesis takes place on both strands but extends 
beyond the target sequence. 


These three steps—strand separation, hybridization of primers, and DNA 
synthesis—constitute one cycle of the PCR amplification and can be carried 
out repetitively just by changing the temperature of the reaction mixture. The 
thermostability of the polymerase makes it feasible to carry out PCR in a 
closed container; no reagents are added after the first cycle. At the completion 
of the second cycle, four duplexes containing the target sequence have been 
generated (Figure 5.7). Of the eight DNA strands comprising these duplexes, 
two short strands constitute only the target sequence—the sequence includ- 
ing and bounded by the primers. Subsequent cycles will amplify the target 
sequence exponentially. Ideally, after n cycles, the desired sequence is ampli- 
fied 2"-fold. The amplification is a millionfold after 20 cycles and a billionfold 
after 30 cycles, which can be carried out in less than an hour. 

Several features of this remarkable method for amplifying DNA are note- 
worthy. First, the sequence of the target need not be known. All that is 
required is knowledge of the flanking sequences so that complementary 
primers can be synthesized. Second, the target can be much larger than the 
primers. Targets larger than 10 kb have been amplified by PCR. Third, prim- 
ers do not have to be perfectly matched to flanking sequences to amplify tar- 
gets. With the use of primers derived from a gene of known sequence, it is 
possible to search for variations on the theme. In this way, families of genes 
are being discovered by PCR. Fourth, PCR is highly specific because of the 
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FIGURE 5.6 The first cycle in the 
polymerase chain reaction (PCR). A cycle 
consists of three steps: DNA double strand 
separation, the hybridization of primers, 
and the extension of primers by DNA 
synthesis. 
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stringency of hybridization at relatively high temperature. Stringency 
is the required closeness of the match between primer and target, 
which can be controlled by temperature and salt. At high tempera- 
tures, only the DNA between hybridized primers is amplified. 
A gene constituting less than a millionth of the total DNA of a 
higher organism is accessible by PCR. Fifth, PCR is exquisitely 
sensitive. A single DNA molecule can be amplified and detected. 


PCR is a powerful technique in medical diagnostics, 
forensics, and studies of molecular evolution 
e PCR can provide valuable diagnostic information in medi- 
cine. Bacteria and viruses can be readily detected with the 
use of specific primers. For example, PCR can reveal the presence 
of small amounts of DNA from the human immunodeficiency 
virus (HIV) in persons who have not yet mounted an immune 
response to this pathogen. In these patients, assays designed to 
detect antibodies against the virus would yield a false negative test 
result. Finding Mycobacterium tuberculosis bacilli in tissue speci- 
mens is slow and laborious. With PCR, as few as 10 tubercle bacilli 
per million human cells can be readily detected. PCR is a promis- 
ing method for the early detection of certain cancers. This tech- 
nique can identify mutations of certain growth-control genes, such 
as the ras genes (Chapter 14). The capacity to greatly amplify 
selected regions of DNA can also be highly informative in moni- 
toring cancer chemotherapy. Tests using PCR can detect when 
cancerous cells have been eliminated and treatment can be stopped; 
they can also detect a relapse and the need to immediately resume 
treatment. PCR is ideal for detecting leukemias caused by chromo- 
somal rearrangements. 

In addition, PCR has made an impact on forensics and legal 
medicine. An individual DNA profile is highly distinctive 
because many genetic loci are highly variable within a population. 
For example, variations at one specific location determine a per- 
son’s HLA type (human leukocyte antigen type; Section 34.5); 
organ transplants are rejected when the HLA types of the donor 
and recipient are not sufficiently matched. PCR amplification of 
multiple genes is being used to establish biological parentage in 
disputed paternity and immigration cases. Analyses of blood 
stains and semen samples by PCR have implicated guilt or inno- 
cence in numerous assault and rape cases (Figure 5.8). The root of 
a single shed hair found at a crime scene contains enough DNA 
for typing by PCR. 

DNA is a remarkably stable molecule, particularly when 
shielded from air, light, and water. Under such circumstances, 
large fragments of DNA can remain intact for thousands of years 
or longer. PCR provides an ideal method for amplifying such 
ancient DNA molecules so that they can be detected and charac- 
terized (Section 6.5). PCR can also be used to amplify DNA from 
microorganisms that have not yet been isolated and cultured. As 
will be discussed in Chapter 6, sequences from these PCR products 


FIGURE 5.7 Multiple cycles of the polymerase chain reaction. The two short 
strands produced at the end of the third cycle (along with longer stands not 
shown) represent the target sequence. Subsequent cycles will amplify the target 
sequence exponentially and the parent sequence arithmetically. 


can be sources of considerable insight into evolutionary relationships 
between organisms. 


The tools for recombinant DNA technology have been used to 
identify disease-causing mutations 


Let us consider how the techniques just described have been utilized in 
concert to study ALS, introduced at the beginning of this chapter. Five 
percent of all patients suffering from ALS have family members who also 
have been diagnosed with the disease. A heritable disease pattern 1s indic- 
ative of a strong genetic component of disease causation. To identify these 
disease-causing genetic alterations, researchers identify polymorphisms 
(instances of genetic variation) within an affected family that correlate 
with the emergence of disease. These polymorphisms may themselves 
cause disease or be closely linked to the responsible genetic alteration. 
Restriction-fragment-length polymorphisms (RFLPs) are polymorphisms 
within restriction sites that change the sizes of DNA fragments produced 
by the appropriate restriction enzyme. Using restriction digests and 
Southern blots of the DNA from members of ALS-affected families, 
researchers identified RFLPs that were found preferentially in those fam- 
ily members diagnosed with the disease. For some of these families, strong 
evidence was obtained for the disease-causing mutation within a specific 
region of chromosome 21. 

After the probable location of one disease-causing gene had been identi- 
fied, this same research group compared the locations of the ALS-associated 
RFLPs with the known sequence of chromosome 21. They noted that this 
chromosomal locus contains the SOD1 gene, which encodes the Cu/Zn 
superoxide dismutase protein SOD1, an enzyme important for the protec- 
tion of cells against oxidative damage (Section 18.3). PCR amplification of 
regions of the SOD1 gene from the DNA of affected family members, fol- 
lowed by Sanger dideoxy sequencing of the targeted fragment, enabled the 
identification of 11 disease-causing mutations from 13 different families. 
This work was pivotal for focusing further inquiry into the roles that super- 
oxide dismutase and its corresponding mutant forms play in the pathology 
of some forms of ALS. 


5.2 Recombinant DNA Technology Has Revolutionized 
All Aspects of Biology 


The development of recombinant DNA technology has taken biology from 
an exclusively analytical science to a synthetic one. New combinations of 
unrelated genes can be constructed in the laboratory by applying recombi- 
nant DNA techniques. These novel combinations can be cloned—amplified 
many-fold—by introducing them into suitable cells, where they are repli- 
cated by the DNA-synthesizing machinery of the host. The inserted genes 
are often transcribed and translated in their new setting. What is most strik- 
ing is that the genetic endowment of the host can be permanently altered in 
a designed way. 


Restriction enzymes and DNA ligase are key tools in forming 
recombinant DNA molecules 


Let us begin by seeing how novel DNA molecules can be constructed in 
the laboratory. An essential tool for the manipulation of recombinant 
DNA is a vector, a DNA molecule that can replicate autonomously in an 
appropriate host organism. Vectors are designed to enable the rapid, 


FIGURE 5.8 DNA and forensics. DNA 
isolated from sperm obtained during the 
examination of a rape victim was amplified 
by PCR, then compared with DNA from the 
victim and three potential suspects —the 
victim’s husband and two additional 
individuals—using gel electrophoresis and 
autoradiography. Sperm DNA matched the 
pattern of Suspect 1, but not that of 
Suspect 2 or the victim’s husband. Sizing 
marker and K562 lanes refer to control DNA 
samples. [Martin Shields/Science Source.] 
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and bacteriophage lambda (\ phage), a virus, are choice vectors for clon- 


ing in E. coli. The vector can be prepared for accepting a new DNA frag- 
ment by cleaving it at a single specific site with a restriction enzyme. For 
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FIGURE 5.9 Joining of DNA molecules by the cohesive-end 
method. Two DNA molecules, cleaved with a common restriction 


enzyme such as EcoRI, can be ligated to form recombinant 
molecules. 


example, the plasmid pSC101, a 9.9-kb double-helical 
circular DNA molecule, is split at a unique site by the 
EcoRI restriction enzyme. The staggered cuts made by 
this enzyme produce complementary single-stranded ends, 
which have specific affinity for each other and hence are 
known as cohesive or sticky ends. Any DNA fragment can 
be inserted into this plasmid if it has the same cohesive 
ends. Such a fragment can be extracted from a larger piece 
of DNA by using the same restriction enzyme as was used 
to open the plasmid DNA (Figure 5.9). 

The single-stranded ends of the fragment are then 
complementary to those of the cut plasmid. The DNA frag- 
ment and the cut plasmid can be annealed and then joined 
by DNA ligase, which catalyzes the formation of a phospho- 
diester bond at a break in a DNA chain. DNA ligase 
requires a free 3'-hydroxyl group and a 5’-phosphory]l 
group. Furthermore, the chains joined by ligase must be in 


a double helix. An energy source such as ATP or NAD* is required for the 
joining reaction, as will be discussed in Chapter 28. 


5' ®)- -OH 3’ 
3’ HO- -®) 5’ 


DNA fragment or vector 


- al 5' (P)-[EGGAATTCGG -OH 3’ 


3' HO- GGCTTAAGCC -(P) 5’ 


Decameric linker 


5’ (P)-|CGGAATTCGG CGGAATTCGG -OH 3’ 
3’ HO- GGCTTAAGCC GGCTTAAGCC -(P) 5’ 


EcoRI restriction 
enzyme 


5’ (P)-/AATICGG CGG-OH 3! 
3! HO-GCC GGCTTAA -(P) 5’ 


FIGURE 5.10 Formation of cohesive ends. Cohesive ends can 
be formed by the addition and cleavage of a chemically 
synthesized linker. 


What if the target DNA is not naturally flanked by the 
appropriate restriction sites? How is the fragment cut and 
annealed to the vector? The cohesive-end method for join- 
ing DNA molecules can still be used in these cases by 
adding a short, chemically synthesized DNA linker that can 
be cleaved by restriction enzymes. First, the linker is 
covalently joined to the ends of a DNA fragment. For 
example, the 5’ ends of a decameric linker and a DNA 
molecule are phosphorylated by polynucleotide kinase 
and then joined by the ligase from T'4 phage (Figure 5.10). 
This ligase can form a covalent bond between blunt- 
ended double-helical DNA molecules. Cohesive ends are 
produced when these terminal extensions are cut by an 
appropriate restriction enzyme. Thus, cohesive ends cor- 
responding to a particular restriction enzyme can be added 
to virtually any DNA molecule. We see here the fruits of 
combining enzymatic and synthetic chemical approaches 
in crafting new DNA molecules. 


Plasmids and \ phage are choice vectors for DNA cloning in bacteria 


Many plasmids and bacteriophages have been ingeniously modified by 
researchers to enhance the delivery of recombinant DNA molecules into 
bacteria and to facilitate the genetic selection of bacteria harboring these 
vectors. As already mentioned, plasmids are circular double-stranded DNA 
molecules that occur naturally in some bacteria. They range in size from two 
to several hundred kilobases. Plasmids carry genes for the inactivation of 
antibiotics, the production of toxins, and the breakdown of natural prod- 
ucts. These accessory chromosomes can replicate independently of the host 
chromosome. In contrast with the host genome, they are dispensable under 
certain conditions. A bacterial cell may have no plasmids at all or it may 
house as many as 20 copies of a naturally-occurring plasmid. 


Many plasmids have been optimized for a particular 
experimental task. Some engineered plasmids, for exam- 
ple, can achieve nearly a thousand copies per bacterial 
cell. One class of plasmids, known as cloning vectors, is 
particularly suitable for the facile insertion and replica- 
tion of a collection of DNA fragments. These vectors 
often feature a polylinker region that includes many 
unique restriction sites within its sequence. This poly- 
linker can be cleaved with many different restriction 
enzymes or combinations of enzymes, providing great 
versatility in the DNA fragments that can be inserted. In 
addition, these plasmids contain reporter genes, which 
encode rapidly-detectable markers such as antibiotic- 
resistance enzymes or fluorescent proteins. Creative 
placement of these reporter genes within these plasmids 
enables the rapid identification of those vectors that har- 
bor the desired DNA insert. For example, in the cloning 
vector pUC18, insertion of DNA in the polylinker 
region (Figure 5.11) disrupts the lacZa gene, an effect 
called insertional inactivation. This gene encodes an 
essential fragment of the protein B-galactosidase, an 
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FIGURE 5.11 A polylinker in the plasmid pUC18. The plasmid 
pUC18 includes a polylinker within an essential fragment of the 
B-galactosidase gene (often called the facZ gene). Insertion of a DNA 
fragment into one of the many restriction sites within this polylinker 
can be detected by the absence of B-galactosidase activity. 


enzyme which naturally cleaves the milk sugar galactose (Section 11.2). 
B-Galactosidase also cleaves the synthetic substrate X-gal, releasing a 
blue dye. Bacterial cells containing a DNA insert at the polylinker will no 
longer produce the dye in the presence of X-gal, and are readily identified 


by their white color (Figure 5.12). 
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FIGURE 5.12 Insertional inactivation. 
Successful insertion of DNA fragments into 
the polylinker region of pUC18 will result 
in the disruption of the B-galactosidase 
gene. Bacterial colonies that harbor such 
plasmids will no longer convert X-gal into 
a colored product, and will appear white 
on the plate. 
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E. coli Bacterial 
DNA cell 


Another class of plasmids has been optimized for use as expression vectors 
for the production of large amounts of protein. In addition to antibiotic- 
resistance genes, they contain promoter sequences designed to drive the tran- 
scription of large amounts of a protein-coding DNA sequence. In addition to 
polylinkers, these vectors often contain sequences flanking the cloning site 
that simplify the addition of fusion tags to the protein of interest (Section 3.1), 
greatly facilitating the purification of the overexpressed protein. 

Another widely used vector, A phage, enjoys a choice of life styles: this 
bacteriophage can destroy its host or it can become part of its host 
(Figure 5.13). In the lytic pathway, viral functions are fully expressed: viral 
DNA and proteins are quickly produced and packaged into virus particles, 
leading to the lysis (destruction) of the host cell and the sudden appearance 
of about 100 progeny virus particles, or virions. In the lysogenic pathway, 
the phage DNA becomes inserted into the host-cell genome and can be 
replicated together with host-cell DNA for many generations, remaining 
inactive. Certain environmental changes can trigger the expression of this 
dormant viral DNA, which leads to the formation of progeny viruses and 
lysis of the host. Large segments of the 48-kb DNA of X phage are not 
essential for productive infection and can be replaced by foreign DNA, thus 
making \ phage an ideal vector. 
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FIGURE 5.13 Alternative infection modes for \ phage. Lambda phage can multiply within 
a host and lyse it (lytic pathway) or its DNA can become integrated into the host genome 
(lysogenic pathway), where it is dormant until activated. 


Mutant phages designed for cloning have been constructed. An espe- 
cially useful one called Agt-AB contains only two EcoRI cleavage sites 
instead of the five normally present (Figure 5.14). After cleavage, the mid- 
dle segment of this \ DNA molecule can be removed. The two remaining 
pieces of DNA (called arms) have a combined length equal to 72% of a 
normal genome length. This amount of DNA is too little to be packaged 
into a A particle, which can take up only DNA measuring from 78% to 105% 
of anormal genome. However, a suitably long DNA insert (such as 10 kb) 
between the two ends of \ DNA enables such a recombinant DNA mole- 
cule (93% of normal length) to be packaged. Nearly all infectious \ particles 
formed in this way will contain an inserted piece of foreign DNA. Another 
advantage of using these modified viruses as vectors is that they enter bac- 
teria much more easily than do plasmids. Among the variety of \ mutants 
that have been constructed for use as cloning vectors, one of them, called a 
cosmid, is essentially a hybrid of \ phage and a plasmid that can serve as a 
vector for large DNA inserts (as large as 45 kb). 
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Bacterial and yeast artificial chromosomes 


Much larger pieces of DNA can be propagated in bacterial artificial chromo- 
somes (BACs) or yeast artificial chromosomes (YACs). BACs are highly engi- 
neered versions of the E. coli fertility (F) factor that can include inserts as 
large as 300 kb. YACs contain a centromere, an autonomously replicating 
sequence (ARS, where replication begins), a pair of telomeres (normal ends 
of eukaryotic chromosomes), selectable marker genes, and a cloning site 
(Figure 5.15). Inserts as large as 1000 kb can be cloned into YAC vectors. 


Specific genes can be cloned from digests of genomic DNA 


Ingenious cloning and selection methods have made it possible to isolate 
small stretches of DNA in a genome containing more than 3 X 10° kb. The 
approach is to prepare a large collection (library) of DNA fragments and 
then to identify those members of the collection that have the gene of inter- 
est. Hence, to clone a gene that is present just once in an entire genome, two 
critical components must be available: a specific oligonucleotide probe for 
the gene of interest and a DNA library that can be screened rapidly. 

How is a specific probe obtained? In one approach, a probe for a gene can 
be prepared if a part of the amino acid sequence of the protein encoded by the 
gene is known. Peptide sequencing of a purified protein (Chapter 3) or 
knowledge of the sequence of a homologous protein from a related species 
(Chapter 6) are two potential sources of such information. However, a prob- 
lem arises because a single peptide sequence can be encoded by a number of 
different oligonucleotides (Figure 5.16). Thus, for this purpose, peptide 
sequences containing tryptophan and methionine are 
preferred, because these amino acids are specified by 
a single codon, whereas other amino acid residues 
have between two and six codons (Table 4.5). All the 
possible DNA sequences (or their complements) that 
encode the targeted peptide sequence are synthesized 
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FIGURE 5.15 Diagram of a yeast 
artificial chromosome (YAC). These 
vectors include features necessary for 
replication and stability in yeast cells. 
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by the solid-phase method and made radioactive by 
phosphorylating their 5’ ends with *2P. 

To prepare the DNA library, a sample containing 
many copies of total genomic DNA is first mechani- 
cally sheared or partly digested by restriction enzymes 


FIGURE 5.16 Probes generated from a protein sequence. A probe can 
be generated by synthesizing all possible oligonucleotides encoding a 
particular sequence of amino acids. Because of the degeneracy of the 
genetic code, 256 distinct oligonucleotides must be synthesized to ensure 
that the probe matching the sequence of seven amino acids in this example 
is present. 
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FIGURE 5.18 Screening a genomic 
library for a specific gene. Here, a plate 
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into large fragments (Figure 5.17). This process yields a nearly random 
population of overlapping DNA fragments. These fragments are then sepa- 
rated by gel electrophoresis to isolate the set of all fragments that are about 
15 kb long. Synthetic linkers are attached to the ends of these fragments, 
cohesive ends are formed, and the fragments are then inserted into a vector, 
such as \ phage DNA, prepared with the same cohesive ends. E. coli bacteria 
are then infected with these recombinant phages. These phages replicate 
themselves and then lyse their bacterial hosts. The resulting lysate contains 
fragments of human DNA housed in a sufficiently large number of virus 
particles to ensure that nearly the entire genome is represented. These phages 
constitute a genomic library. Phages can be propagated indefinitely such that 
the library can be used repeatedly over long periods. 

This genomic library is then screened to find the very small number of 
phages harboring the gene of interest. For the human genome, a calculation 
shows that a 99% probability of success requires screening about 500,000 
clones; hence, a very rapid and efficient screening process is essential. Rapid 
screening can be accomplished by DNA hybridization. 

A dilute suspension of the recombinant phages is first plated on a lawn 
of bacteria (Figure 5.18). Where each phage particle has landed and infected 
a bacterium, a plaque containing identical phages develops on the plate. 
A replica of this master plate is then made by applying a sheet of nitrocel- 
lulose. Infected bacteria and phage DNA released from lysed cells adhere to 
the sheet in a pattern of spots corresponding to the plaques. Intact bacteria 
on this sheet are lysed with NaOH, which also serves to denature the DNA 
so that it becomes accessible for hybridization with a “P-labeled probe. The 
presence of a specific DNA sequence in a single spot on the replica can be 
detected by using a radioactive complementary DNA or RNA molecule as 
a probe. Autoradiography then reveals the positions of spots harboring 
recombinant DNA. The corresponding plaques are picked out of the intact 
master plate and grown. A single investigator can readily screen a million 
clones in a day. This method makes it possible to isolate virtually any gene, 
provided that a probe is available. 


Complementary DNA prepared from mRNA can be 
expressed in host cells 


The preparation of eukaryotic DNA libraries presents unique challenges, 
especially if the researcher is interested primarily in the protein-coding 
region of a particular gene. Recall that most mammalian genes are mosaics 
of introns and exons. These interrupted genes cannot be expressed by bac- 
teria, which lack the machinery to splice introns out of the primary tran- 
script. However, this difficulty can be circumvented by causing bacteria to 
take up recombinant DNA that is complementary to mRNA, where the 
intronic sequences have been removed. 

The key to forming complementary DNA is the enzyme reverse transcrip- 
tase. As discussed in Section 4.3, a retrovirus uses this enzyme to form a 
DNA-RNA hybrid in replicating its genomic RNA. Reverse transcriptase 
synthesizes a DNA strand complementary to an RNA template if the tran- 
scriptase is provided with a DNA primer that is base-paired to the RNA and 
contains a free 3’-OH group. We can use a simple sequence of linked thymi- 
dine [oligo(T)]| residues as the primer. This oligo(T) sequence pairs with the 
poly(A) sequence at the 3’ end of most eukaryotic mRNA molecules 
(Section 4.4), as shown in Figure 5.19. The reverse transcriptase then synthe- 
sizes the rest of the cDNA strand in the presence of the four deoxyribonucle- 
oside triphosphates (step 1). The RNA strand of this RNA—DNA hybrid 
is subsequently hydrolyzed by raising the pH (step 2). Unlike RNA, DNA is 
resistant to alkaline hydrolysis. The single-stranded DNA is converted into 
double-stranded DNA by creating another primer site. The enzyme terminal 
transferase adds nucleotides—for instance, several residues of dG—to the 3’ 
end of DNA (step 3). Oligo(dC) can bind to dG residues and prime the syn- 
thesis of the second DNA strand (step 4). Synthetic linkers can be added to 
this double-helical DNA for ligation to a suitable vector. Complementary 
DNA for all mRNA that a cell contains can be made, inserted into vectors, 
and then inserted into bacteria. Such a collection is called a cDNA library. 
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FIGURE 5.19 Formation of a cDNA duplex. A complementary DNA (cDNA) duplex is 
created from mRNA by (1) use of reverse transcriptase to synthesize a cDNA strand, (2) 
digestion of the original RNA strand, (3) addition of several G bases to the DNA by terminal 
transferase, and (4) synthesis of a complementary DNA strand using the newly synthesized 
cDNA strand as a template. 


Complementary DNA molecules can be inserted into expression vec- 
tors to enable the production of the corresponding protein of interest. 
Clones of cDNA can be screened on the basis of their capacity to direct the 
synthesis of a foreign protein in bacteria, a technique referred to as expres- 
sion cloning. A labeled antibody specific for the protein of interest can be 
used to identify colonies of bacteria that express the corresponding protein 
product (Figure 5.20). As described earlier, spots of bacteria on a replica 
plate are lysed to release proteins, which bind to an applied nitrocellulose 
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FIGURE 5.20 Screening of cDNA 
clones. A method of screening for cDNA 
clones is to identify expressed products 
by staining with specific antibody. 


FIGURE 5.21 Synthesis of proinsulin by 
bacteria. Proinsulin, a precursor of insulin, 
can be synthesized by transformed 
(genetically altered) clones of F. coli. 

The clones contain the mammalian 
proinsulin gene. 


filter. With the addition of labeled antibody specific for the protein of inter- 
est, the location of the desired colonies on the master plate can be readily 
identified. This immunochemical screening approach can be used whenever 
a protein is expressed and corresponding antibody is available. 

Complementary DNA has many applications beyond the generation of 
genetic libraries. The overproduction and purification of most eukaryotic 
proteins in prokaryotic cells necessitates the insertion of cDNA into plas- 
mid vectors. For example, proinsulin, a precursor of insulin, is synthesized 
by bacteria-harboring plasmids that contain DNA complementary to 
mRNA for proinsulin (Figure 5.21). Indeed, bacteria produce much of the 
insulin used today by millions of diabetics. 


Proteins with new functions can be created through 
directed changes in DNA 


Much has been learned about genes and proteins by analyzing the effects that 
mutations have on their structure and function. In the classic genetic approach, 
mutations are generated randomly throughout the genome of a host organism, 
and those individuals exhibiting a phenotype of interest are selected. Analysis 
of these mutants then reveals which genes are altered, and DNA sequencing 
identifies the precise nature of the changes. Recombinant DNA technology 
makes the creation of specific mutations feasible in vitro. We can construct new 
genes with designed properties by making three kinds of directed changes: 
deletions, insertions, and substitutions. A variety of methods can be used to 
introduce these types of mutations, including the following examples. 


Site-directed mutagenesis. Mutant proteins with single amino acid substi- 
tutions can be readily produced by site-directed mutagenesis (Figure 5.22). 
Suppose that we want to replace a particular serine residue with cysteine. 
This mutation can be made if (1) we have a plasmid containing the gene or 
cDNA for the protein and (2) we know the base sequence around the site to 
be altered. If the serine of interest is encoded by TCT, mutation of the central 
base from C to G yields the TGT codon, which encodes cysteine. This type 
of mutation is called a point mutation because only one base is altered. To 
introduce this mutation into our plasmid, we prepare an oligonucleotide 
primer that is complementary to this region of the gene except that it contains 
TGT instead of TCT. The two strands of the plasmid are separated, and the 
primer is then annealed to the complementary strand. The mismatch of 1 of 
15 base pairs is tolerable if the annealing is carried out at an appropriate 
temperature. After annealing to the complementary strand, the primer is 
elongated by DNA polymerase, and the double-stranded circle is closed by 
adding DNA ligase. Subsequent replication of this duplex yields two kinds of 
progeny plasmid, half with the original TCT sequence and half with the 
mutant TGT sequence. Expression of the plasmid containing the new TGT 
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FIGURE 5.22 Oligonucleotide-directed mutagenesis. A primer containing a mismatched 
nucleotide is used to produce a desired change in the DNA sequence. 
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sequence will produce a protein with the desired substitution of cysteine for 
serine at a unique site. We will encounter many examples of the use of site- 
directed mutagenesis to precisely alter regulatory regions of genes and to 
produce proteins with tailor-made features. 


Cassette mutagenesis. In cassette mutagenesis, a variety of mutations, 
including insertions, deletions, and multiple point mutations, can be intro- 
duced into the gene of interest. A plasmid harboring the original gene is cut 
with a pair of restriction enzymes to remove a short segment (Figure 5.23). 
A synthetic double-stranded oligonucleotide—the cassette—carrying the 
genetic alterations of interest is prepared with cohesive ends that are com- 
plementary to the ends of the cut plasmid. Ligation of the cassette into the 
plasmid yields the desired mutated gene product. 


Mutagenesis by PCR. In Section 5.1, we learned how PCR can be used to 
amplify a specific region of DNA using primers that flank the region of 
interest. In fact, the creative design of PCR primers enables the introduc- 
tion of specific insertions, deletions, and substitutions into the amplified 
sequence. A number of methods have been developed for this purpose. 
Here, we shall consider one: inverse PCR to introduce deletions into plas- 
mid DNA (Figure 5.24). In this approach, primers are designed to flank the 
sequence to be deleted. However, these primers are oriented in the opposite 
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FIGURE 5.24 Deletion mutagenesis 
by inverse PCR. A deletion can be 
introduced into a plasmid with primers 
hat flank this region but are oriented 
away from the segment to be removed. 
PCR amplification yields a linear product 
hat contains the entire plasmid minus 
he unwanted sequence. If the primers 
contained a 5' phosphate group, this 
product can be recircularized using DNA 
igase, generating a plasmid with the 
desired mutation. 
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FIGURE 5.23 Cassette mutagenesis. 
DNA is cleaved at a pair of unique 
restriction sites by two different restriction 
endonucleases. A synthetic oligonucleotide 
with ends that are complementary to these 
sites (the cassette) is then ligated to the 
cleaved DNA. The method is highly 
versatile because the inserted DNA can 
have any desired sequence. 
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direction, such that they direct the amplification of the entire plasmid, 
minus the region to be deleted. If each of the primers contains a 5’ phosphate 
group, the amplified product can be recircularized with DNA ligase, yield- 
ing the desired deletion mutation. 


Designer genes. Novel proteins can also be created by splicing together gene 
segments that encode domains that are not associated in nature. For example, 
a gene for an antibody can be joined to a gene encoding a toxic protein, yield- 
ing a chimeric protein that kills those cells recognized by the antibody. These 
immunotoxins are being evaluated as anticancer agents. Furthermore, nonin- 
fectious coat proteins of viruses can be produced in large amounts by recom- 
binant DNA methods. They can serve as synthetic vaccines that are safer than 
conventional vaccines prepared by inactivating pathogenic viruses. A sub- 
unit of the hepatitis B virus produced in yeast is proving to be an effective 
vaccine against this debilitating viral disease. Finally, entirely new genes can 
be synthesized de novo by the solid-phase method described above. These 
genes can encode proteins with no known counterparts in nature. 


Recombinant methods enable the exploration of the functional 
effects of disease-causing mutations 


The application of recombinant DNA technology to the production of 
mutated proteins has had a significant effect in the study of ALS. Recall that 
genetic studies had identified a number of ALS-inducing mutations within 
the gene encoding Cu/Zn superoxide dismutase. As we shall learn in Section 
18.3, SOD1 catalyzes the conversion of the superoxide radical anion into 
hydrogen peroxide, which, in turn, is converted into molecular oxygen and 
water by catalase. To study the potential effect of ALS-causing mutations on 
SOD1 structure and function, the SOD1 gene was isolated from a human 
cDNA library by PCR amplification. The amplified fragments containing 
the gene were then digested by an appropriate restriction enzyme and inserted 
into a similarly digested plasmid vector. Mutations corresponding to those 
observed in ALS patients were introduced into these plasmids by oligonucle- 
otide-directed mutagenesis and the protein products were expressed and 
assayed for their catalytic activity. Surprisingly, these mutations did not sig- 
nificantly alter the enzymatic activity of the corresponding recombinant 
proteins. These observations have led to the prevailing notion that these 
mutations impart toxic properties to SOD1. Although the nature of this tox- 
icity is not yet completely understood, one hypothesis is that mutant SOD1 
is prone to form toxic aggregates in the cytoplasm of neuronal cells. 


5.3 Complete Genomes Have Been Sequenced 
and Analyzed 


The methods just described are extremely effective for the isolation and char- 
acterization of fragments of DNA. However, the genomes of organisms rang- 
ing from viruses to human beings contain considerably longer sequences, 
arranged in very specific ways crucial for their integrated functions. Is it pos- 
sible to sequence complete genomes and analyze them? For small genomes, 
this sequencing was accomplished soon after DNA-sequencing methods were 
developed. Sanger and his coworkers determined the complete sequence of 
the 5386 bases in the genome of the 6X174 DNA virus in 1977, just a quarter 
century after Sanger’s pioneering elucidation of the amino acid sequence 
of a protein. This tour de force was followed several years later by the deter- 
mination of the sequence of human mitochondrial DNA, a double-stranded 


circular DNA molecule containing 16,569 base pairs. It encodes 2 ribosomal 
RNAs, 22 transfer RNAs, and 13 proteins. Many other viral genomes were 
sequenced in subsequent years. However, the genomes of free-living organ- 
isms presented a great challenge because even the simplest comprises more 
than 1 million base pairs. Thus, sequencing projects require both rapid 
sequencing techniques and efficient methods for assembling many short 
stretches of 300 to 500 base pairs into a complete sequence. 


The genomes of organisms ranging from bacteria to multicellular 
eukaryotes have been sequenced 


With the development of automatic DNA sequencers based on fluorescent 
dideoxynucleotide chain terminators, high-volume, rapid DNA sequencing 
became a reality. The genome sequence of the bacterium Haemophilus 
influenzae was determined in 1995 by using a “shotgun” approach. The 
genomic DNA was sheared randomly into fragments that were then sequenced. 
Computer programs assembled the complete sequence by matching up over- 
lapping regions between fragments. The H. influenzae genome comprises 
1,830,137 base pairs and encodes approximately 1740 proteins (Figure 5.25). 
Using similar approaches, as well as more advanced methods described below, 
investigators have determined the sequences of more than 10,000 bacterial and 
archaeal species, including key model organisms such as E. coli, Salmonella 
typhimurium, and Archaeoglobus fulgidus, as well as pathogenic organisms 
such as Yersina pestis (bubonic plague) and Bacillus anthracis (anthrax). 

The first eukaryotic genome to be completely sequenced was that of 
baker’s yeast, Saccharomyces cerevisiae, in 1996. The yeast genome com- 
prises approximately 12 million base pairs, distributed on 16 chromosomes, 
and encodes more than 6000 proteins. This achievement was followed in 
1998 by the first complete sequencing of the genome of a multicellular 
organism, the nematode Caenorhabditis elegans, which contains 97 million 
base pairs. This genome includes more than 19,000 genes. The genomes of 
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FIGURE 5.25 A complete genome. The diagram depicts the genome of Haemophilus 
influenzae, the first complete genome of a free-living organism to be sequenced. The genome 
encodes more than 1700 proteins and 70 RNA molecules. The likely function of approximately 
one-half of the proteins was determined by comparisons with sequences of proteins already 
characterized in other species. [Data from R. D. Fleischmann et al., Science 269:496-512, 1995; 
scan courtesy of The Institute for Genomic Research. ] 
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FIGURE 5.26 The human genome. The human genome is arrayed on 46 
chromosomes— 22 pairs of autosomes and the X and Y sex chromosomes. 
The locations of several genes associated with important pathways in 


biochemistry are highlighted. 


many additional organisms widely used in biological and biomedical 
research have now been sequenced, including those of the fruit fly Drosophila 
melanogaster, the model plant Arabidopsis thaliana, the mouse, the rat, and 
the dog. Note that even after a genome sequence has been considered com- 
plete, some sections, such as the repetitive sequences that make up hetero- 
chromatin, may be missing because these DNA sequences are very difficult 
to manipulate with the use of standard techniques. 


The sequence of the human genome has been completed 


The ultimate goal of much of genomics research has been the sequencing and 
analysis of the human genome. Given that the human genome comprises 
approximately 3 billion base pairs of DNA distributed among 24 chromo- 
somes, the challenge of producing a complete sequence was daunting. However, 
through an organized international effort of academic laboratories and private 
companies, the human genome has progressed from a draft sequence first 
reported in 2001 to a finished sequence reported in late 2004 (Figure 5.26). 
The human genome is a rich source of informa- 
Glyceraldehyde tion about many aspects of humanity, including 
Peta biochemistry and evolution. Analysis of the genome 
(Chapter 16) will continue for many years to come. Developing an 
inventory of protein-encoding genes is one of the first 
Q tasks. At the beginning of the genome-sequencing 
project, the number of such genes was estimated to be 
approximately 100,000. With the availability of the 
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produce proteins. For example, more than half of the 
genomic regions that correspond to olfactory recep- 
tors—key molecules responsible for our sense of 
smell—are pseudogenes (Section 33.1). The corre- 
sponding regions in the genomes of other primates 
and rodents encode functional olfactory receptors. 
Nonetheless, the surprisingly small number of genes belies the complexity of 
the human proteome. Many genes encode more than one protein through mecha- 
nisms such as alternative splicing of mRNA and posttranslational modifications of 
proteins. The different proteins encoded by a single gene often display impor- 
tant variations in functional properties. 

The human genome contains a large amount of DNA that does not encode 
proteins. A great challenge in modern biochemistry and genetics is to elucidate 
the roles of this noncoding DNA. Much of this DNA is present because of the 
existence of mobile genetic elements. These elements, related to retroviruses 
(Section 4.3), have inserted themselves throughout the genome over time. Most 
of these elements have accumulated mutations and are no longer functional. For 
example, more than 1 million Alu sequences, each approximately 300 bases in 
length, are present in the human genome. Alu sequences are examples of 
SINES, short interspersed elements. The human genome also includes nearly 
1 million LINES, long interspersed elements, DNA sequences that can be as long 
as 10 kilobases (kb). The roles of these elements as neutral genetic parasites or 
instruments of genome evolution are under current investigation. 


Next-generation sequencing methods enable the rapid determination 
of a complete genome sequence 


Since the introduction of Sanger dideoxy method in the mid-1970s, sig- 
nificant advances have been made in DNA-sequencing technologies, 
enabling the readout of progressively longer sequences with higher 
fidelity and shorter run times. The development of next-generation 
sequencing (NGS) platforms has extended this capability to formerly 
unforeseen levels. By combining technological breakthroughs in the 
handling of very small amounts of liquid, high-resolution optics, and 
computing power, these methods have already made a significant impact 
on the ability to obtain whole genome sequences rapidly and cheaply 
(Chapter 1). 

Next-generation sequencing refers to a family of technologies, each of 
which utilizes a unique approach for the determination of a DNA sequence. 
All of these methods are highly parallel: from 1 million to 1 billion DNA 
fragment sequences are acquired in a single experiment. How are NGS 
methods capable of attaining such a high number of parallel runs? 
Individual DNA fragments are amplified by PCR on a solid support—a 
single bead or a small region of a glass slide—such that clusters of identical 
DNA fragments are distinguishable by high-resolution imaging. These 
fragments then serve as templates for DNA polymerase, where the addi- 
tion of nucleotide triphosphates is converted to a signal that can be detected 
in a highly sensitive manner. The technique used to detect individual base 
incorporation varies among the variety of NGS methods. However, most 
of these can be understood simply by considering the overall reaction of 
chain elongation catalyzed by DNA polymerase (Figure 5.27). In the 
reversible terminator method, the four nucleotides are added to the template 
DNA, with each base tagged with a unique fluorescent label and a 
reversibly-blocked 3° end. The blocked end assures that only one phospho- 
diester linkage will form. Once the nucleotide is incorporated into the 
growing strand, it is identified by its fluorescent tag, the blocking agent is 
removed, and the process is repeated. 

In pyrosequencing, nucleotides are added to the template DNA, one at a 
time in a defined order. One of the nucleotides will be incorporated into the 
growing strand, releasing a pyrophosphate which is detected by coupling 
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FIGURE 5.27 Detection methods in next-generation sequencing. Measurement of 
base incorporation in next-generation sequencing methods relies on the detection of the 
various products of the DNA polymerase reaction. Reversible terminator sequencing 
measures the nucleotide incorporation in a manner similar to Sanger sequencing, while 
pyrosequencing and ion semiconductor sequencing detect the release of pyrophosphate 
and protons, respectively. 
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the formation of pyrophosphate with the production of light by the sequen- 
tial action of the enzymes ATP sulfurylase and luciferase: 


ATP sulfurylase 


ATP + sulfate 


PP; + adenylyl sulfate 


ATP + luciferin === oxyluciferin + light 
The protocol for ion semiconductor sequencing is similar to pyrosequencing 
except that nucleotide incorporation is detected by sensitively measuring 
the very small changes in pH of the reaction mixture due to the release of 
proton upon nucleotide incorporation. 

Regardless of the sequencing method, the technology exists to quantify 
the signal produced by millions of DNA fragment templates simultane- 
ously. However, for many approaches, as few as 50 bases are read per frag- 
ment. Hence, significant computing power is required to both store the 
massive amounts of sequence data and perform the necessary alignments 
required to assemble a completed sequence. NGS methods are being used 
to answer an ever-growing number of questions in genomics, transcrip- 
tomics, and evolutionary biology, to name a few. Additionally, individual 
genome sequences will provide information about genetic variation within 
populations and may usher in an era of personalized medicine, when these 
data can be used to guide treatment decisions. 


Comparative genomics has become a powerful research tool 


Comparisons with genomes from other organisms are sources of insight into 
the human genome. The sequencing of the genome of the chimpanzee, our 
closest living relative, as well as that of other mammals that are widely used 
in biological research, such as the mouse and the rat, have been completed. 
Comparisons reveal that an astonishing 99% of human genes have counter- 
parts in these rodent genomes. However, these genes have been substan- 
tially reassorted among chromosomes in the estimated 75 million years of 
evolution since humans and rodents had a common ancestor (Figure 5.28). 
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FIGURE 5.28 Genome comparison. A schematic comparison of the human genome and the 
mouse genome shows reassortment of large chromosomal fragments. The small numbers to 
the right of the mouse chromosomes indicate the human chromosome to which each region 
is most closely related. 


The genomes of other organisms also have been determined specifically 
for use in comparative genomics. For example, the genomes of two species 
of puffer fish, Takifugu rubripes and Tetraodon nigroviridis, have been deter- 
mined. These genomes were selected because they are very small and lack 
much of the intergenic DNA present in such abundance in the human 
genome. The puffer fish genomes include fewer than 400 megabase pairs 
(Mbp), one-eighth of the number in the human genome, yet the puffer fish 
and human genomes contain essentially the same number of genes. 
Comparison of the genomes of these species with that of humans revealed 
more than 1000 formerly unrecognized human genes. Furthermore, com- 
parison of the two species of puffer fish, which had a common ancestor 
approximately 25 million years ago, is a source of insight into more-recent 
events in evolution. Comparative genomics is a powerful tool, both for 
interpreting the human genome and for understanding major events in the 
origin of genera and species. 


5.4 Eukaryotic Genes Can Be Quantitated and 
Manipulated with Considerable Precision 


After a gene of interest has been identified, cloned, and sequenced, it is 
often desirable to understand how that gene and its corresponding protein 
product function in the context of a whole cell or organism. It is now pos- 
sible to determine how the expression of a particular gene is regulated, how 
mutations in the gene affect the function of the corresponding protein prod- 
uct, and how the behavior of an entire cell or model organism is altered by 
the introduction of mutations within specific genes. Levels of transcription 
of large families of genes within cells and tissues can be readily quantitated 
and compared across a range of environmental conditions. Eukaryotic genes 
can be introduced into bacteria, and the bacteria can be used as factories to 
produce a desired protein product. DNA can also be introduced into the 
cells of higher organisms. Genes introduced into animals are valuable tools 
for examining gene action, and they are the basis of gene therapy. Genes 
introduced into plants can make the plants resistant to pests, able to grow in 
harsh conditions, or carry greater quantities of essential nutrients. The 
manipulation of eukaryotic genes holds much promise as a source of medi- 
cal and agricultural benefits. 


Gene-expression levels can be comprehensively examined 


Most genes are present in the same quantity in every cell—namely, one copy 
per haploid cell or two copies per diploid cell. However, the level at which a 
gene is expressed, as indicated by mRNA quantities, can vary widely, rang- 
ing from no expression to hundreds of mRNA copies per cell. Gene- 
expression patterns vary from cell type to cell type, distinguishing, for 
example, a muscle cell from a nerve cell. Even within the same cell, gene- 
expression levels may vary as the cell responds to changes in physiological 
circumstances. Note that mRNA levels sometimes correlate with the levels 
of proteins expressed, but this correlation does not always hold. Thus, care 
must be exercised when interpreting the results of mRNA levels alone. 
The quantity of individual mRNA transcripts can be determined by 
quantitative PCR (qPCR), or real-time PCR. RNA is first isolated from the 
cell or tissue of interest. With the use of reverse transcriptase, cDNA is 
prepared from this RNA sample. In one qPCR approach, the transcript of 
interest is PCR amplified with the appropriate primers in the presence 
of the dye SYBR Green I, which fluoresces brightly when bound to 


A puffer fish. [Beth Swanson/Shutterstock] 


157 


(A) 


10° L 
Threshold 


ilar! [| | | J 


2 6 10 14 18 22 26 30 34 38 42 46 50 
Cycle 


Fluorescence 
PS) 
& 


(B) 


Ci 


| | | | | | 
10° 10' 102 107 104 10° 10° 
Starting quantity 


FIGURE 5.29 Quantitative PCR. (A) In 
qPCR, fluorescence is monitored in the 
course of PCR amplification to determine 
Cy, the cycle at which this signal exceeds a 
defined threshold. Each color represents a 
different starting quantity of DNA. (B) C+ 
values are inversely proportional to the 
number of copies of the original CDNA 
template. [Data from N. J. Walker, Science 
296:557-559, 2002.] 


FIGURE 5.30 Using DNA microarrays to 
measure gene expression changes ina 
tumor. mRNA is isolated from two samples, 
tumor cells and a control sample. From 
these transcripts, cDNA is prepared in the 
presence of a fluorescent nucleotide, with a 
red label for the tumor sample and a green 
label for the control sample. The cDNA 
strands are separated, hybridized to the 
microarray, and the unbound DNA is 
washed away. Spots that are red indicate 
genes which are expressed more highly in 
the tumor, while the green spots indicate 
reduced expression relative to control. 
Spots that are black or yellow indicate 
comparable expression at either low or 
high levels, respectively. [Information from 

D. L. Nelson and M. M. Cox, Lehninger 
Principles of Biochemistry 6th ed. 

(W. H. Freeman and Company, 2013)] 
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double-stranded DNA. In the initial PCR cycles, not enough duplex is 
present to allow a detectable fluorescence signal. However, after repeated 
PCR cycles, the fluorescence intensity exceeds the detection threshold and 
continues to rise as the number of duplexes corresponding to the transcript 
of interest increases (Figure 5.29). Importantly, the cycle number at which 
the fluorescence becomes detectable over a defined threshold (or Cy) is 
indirectly proportional to the number of copies of the original template. 
After the relation between the original copy number and the C- has been 
established with the use of a known standard, subsequent qPCR experiments 
can be used to determine the number of copies of any desired transcript in 
the original sample, provided the appropriate primers are available. 
Although qPCR is a powerful technique for quantitation of a small 
number of transcripts in any given experiment, we can now use our knowIl- 
edge of complete genome sequences to investigate an entire transcriptome, 
the pattern and level of expression of all genes in a particular cell or tissue. 
One of the most powerful methods for this purpose is based on hybridiza- 
tion. Single-stranded oligonucleotides whose sequences correspond to cod- 
ing regions of the genome are affixed to a solid support such as a microscope 
slide, creating a DNA microarray. Importantly, the position of each sequence 
within the array is known. mRNA is isolated from the cells of interest 
(a tumor, for example) as well as a control sample (Figure 5.30). From this 
mRNA, cDNA is prepared (Section 5.2) in the presence of fluorescent 
nucleotides using different labels, usually green and red, for the two sam- 
ples. The samples are combined, separated into single strands, and hybrid- 
ized to the slide. The relative levels of green and red fluorescence at each 
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spot indicate the differences in expression for each gene. DNA chips have 
been prepared such that thousands of transcript levels can be assessed in a 
single experiment. Hence, over several arrays, the differences in expression 
of many genes across a number of different cell types or conditions can be 
measured (Figure 5.31). 

Microarray analyses can be quite informative in the study of gene-expres- 
sion changes in diseased mammals compared with their healthy counter- 
parts. As noted earlier, although ALS-causing mutations within the SOD1 
gene had been identified, the mechanism by which the mutant SOD 1 protein 
ultimately leads to motor-neuron loss remains a mystery. Many research 
groups have used microarray analysis of neuronal cells isolated from humans 
and mice carrying SOD1 gene mutations to search for clues into the path- 
ways of disease progression and to suggest potential avenues for treatment. 
These studies have implicated a variety of biochemical pathways, including 
immunological activation, handling of oxidative stress, and protein degrada- 
tion, in the cellular response to the mutant, toxic forms of SOD 1. 


New genes inserted into eukaryotic cells can be efficiently expressed 


Bacteria are ideal hosts for the amplification of DNA molecules. They can 
also serve as factories for the production of a wide range of prokaryotic and 
eukaryotic proteins. However, bacteria lack the necessary enzymes to carry 
out posttranslational modifications such as the specific cleavage of polypep- 
tides and the attachment of carbohydrate units. Thus, many eukaryotic 
genes can be expressed correctly only in eukaryotic host cells. The introduc- 
tion of recombinant DNA molecules into cells of higher organisms can also 
be a source of insight into how their genes are organized and expressed. 
How are genes turned on and off in embryological development? How does 
a fertilized egg give rise to an organism with highly differentiated cells that 
are organized in space and time? These central questions of biology can now 
be fruitfully approached by expressing foreign genes in mammalian cells. 
Recombinant DNA molecules can be introduced into animal cells in 
several ways. In one method, foreign DNA molecules precipitated by cal- 
cium phosphate are taken up by animal cells. A small fraction of the 
imported DNA becomes stably integrated into the chromosomal DNA. 
The efficiency of incorporation is low, but the method is useful because it 
is easy to apply. In another method, DNA is microinjected into cells. A fine- 
tipped glass micropipette containing a solution of foreign DNA is inserted 
into a nucleus (Figure 5.32). A skilled investigator can inject hundreds of 
cells per hour. About 2% of injected mouse cells are viable and 
contain the new gene. In a third method, viruses are used to 
introduce new genes into animal cells. The most effective vec- 
tors are retroviruses, whose genomes are encoded by RNA and 
replicate through DNA intermediates. A striking feature of 
the life cycle of a retrovirus is that the double-helical DNA 
form of its genome, produced by the action of reverse tran- 
scriptase, becomes randomly incorporated into host chromo- 
somal DNA. This DNA version of the viral genome, called 
proviral DNA, can be efficiently expressed by the host cell 
and replicated along with normal cellular DNA. Retroviruses 
do not usually kill their hosts. Foreign genes have been effi- 
ciently introduced into mammalian cells by infecting them 
with vectors derived from the Moloney murine leukemia virus, 
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FIGURE 5.31 Gene-expression analysis 
with the use of microarrays. The 
expression levels of thousands of genes 
can be simultaneously analyzed with DNA 
microarrays. Here, an analysis of 1733 
genes in 84 breast tumor samples reveals 
that the tumors can be divided into distinct 
classes based on their gene-expression 
patterns. In this “heat map” representation, 
each row represents a different gene and 
each column represents a different breast 
tumor sample (i.e., a separate microarray 
experiment). Red corresponds to gene 
induction and green corresponds to gene 
repression. [Data from C. M. Perou et al., 
Nature 406:747-752, 2000.] 
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a retrovirus which can accept inserts as long as 6 kb. Some 
genes introduced by this vector into the genome of a trans- 
formed host cell are efficiently expressed. 


FIGURE 5.32 Microinjection of DNA. Cloned plasmid DNA is 
being microinjected into the male pronucleus of a fertilized 
mouse egg. 
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FIGURE 5.33 Gene disruption by 
homologous recombination. (A) A 
mutated version of the gene to be disrupted 
is constructed, maintaining some regions of 
homology with the normal gene (red). When 
the foreign mutated gene is introduced into 
an embryonic stem cell, (B) recombination 
takes place at regions of homology and 

(C) the normal (targeted) gene is replaced, 
or “knocked out,” by the foreign gene. 

The cell is inserted into embryos, and 

mice lacking the gene (knockout mice) 

are produced. 
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Two other viral vectors are extensively used. Vaccinia virus, a large 
DNA-containing virus, replicates in the cytoplasm of mammalian cells, 
where it shuts down host-cell protein synthesis. Baculovirus infects insect 
cells, which can be conveniently cultured. Insect larvae infected with this 
virus can serve as efficient protein factories. Vectors based on these large- 
genome viruses have been engineered to express DNA inserts efficiently. 


Transgenic animals harbor and express genes introduced 
into their germ lines 


As shown in Figure 5.32, plasmids harboring foreign genes can be microin- 
jected into the male pronucleus of fertilized mouse eggs, which are then inserted 
into the uterus of a foster-mother mouse. A subset of the resulting embryos in 
this host will then harbor the foreign gene; these embryos may develop into 
mature animals. Southern blotting or PCR analysis of DNA isolated from the 
progeny can be used to determine which offspring carry the introduced gene. 
These transgenic mice are a powerful means of exploring the role of a specific 
gene in the development, growth, and behavior of an entire organism. 
Transgenic animals often serve as useful models for a particular disease process, 
enabling researchers to test the efficacy and safety of a newly developed therapy. 

Let us return to our example of ALS. Research groups have generated trans- 
genic mouse lines that express forms of human superoxide dismutase that har- 
bor mutations matching those identified in earlier genetic analyses. Many of 
these strains exhibit a clinical picture similar to that observed in ALS patients: 
progressive weakness of voluntary muscles and eventual paralysis, motor-neu- 
ron loss, and rapid progression to death. Since their first characterization in 
1994, these strains continue to serve as valuable sources of information for the 
exploration of the mechanism, and potential treatment, of ALS. 


Gene disruption and genome editing provide clues to gene 
function and opportunities for new therapies 


The function of a gene can also be probed by inactivating it and looking for 
resulting abnormalities. Powerful methods have been developed for accom- 
plishing gene disruption (also called gene knockout) in organisms such as yeast 
and mice. These methods rely on the process of homologous recombination 
(Section 28.5), in which two DNA molecules with strong sequence similarity 
exchange segments. If a region of foreign DNA is flanked by sequences that 
have high homology to a particular region of genomic DNA, two recombina- 
tion events will yield the transfer of the foreign DNA into the 
genome (Figure 5.33). In this manner, specific genes can be 
targeted if their flanking nucleotide sequences are known. 
For example, the gene-knockout approach has been 
applied to the genes encoding gene-regulatory proteins 
(also called transcription factors) that control the differentia- 
tion of muscle cells. When both copies of the gene for the 
regulatory protein myogenin are disrupted, an animal dies at 
birth because it lacks functional skeletal muscle. Microscopic 
inspection reveals that the tissues from which muscle nor- 
mally forms contain precursor cells that have failed to dif- 
ferentiate fully (Figure 5.34A and B). Heterozygous mice 
containing one normal myogenin gene and one disrupted 
gene appear normal, suggesting that a reduced level of gene 
expression is still sufficient for normal muscle develop- 
ment. The generation and characterization of this knockout 
strain provided strong evidence that functional myogenin is 
essential for proper development of skeletal muscle tissue 
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FIGURE 5.34 Consequences of gene disruption. Sections of muscle from normal (A) and 
myogenin-knockout (B) mice, as viewed under the light microscope. The unlabeled arrows in 
both panels identify comparable sections of the pelvic bone, indicating that similar anatomical 
regions are depicted. Muscles do not develop properly in mice having both myogenin genes 
disrupted. A poorly formed muscle fiber in the knockout strain is indicated by the M arrow. 
(C) The development of mature skeletal muscle from progenitor cells is a highly regulated 
process involving a number of intermediate cell types and multiple transcription factors. 
Through the gene-disruption studies in (A) and (B), myogenin was identified as an essential 
component of this pathway. [(A) and (B) From P. Hasty, et al., Nature 364:501-506, 1993; 

(C) Information from S. Hettmer and A. J. Wagers, Nat. Med. 16:171-173, 2010, Fig. 1] 


(Figure 5.34C). Analogous studies have probed the function of many other 
genes to generate animal models for known human genetic diseases. 

Manipulation of genomic DNA using homologous recombination, while 
a powerful tool, has limitations. Introduction of point mutations into genes, 
rather than knocking out the entire gene, can be difficult and time-consum- 
ing. In addition, these methods are generally limited to specific model organ- 
isms, such as yeast, mice, and fruit flies. Over the past 10 years, new methods 
for the highly specific modification of genomic DNA, or genome editing, have 
emerged. These approaches rely on engineered site-specific nucleases that 
introduce double-strand breaks at precisely-determined sequences within 
genomic DNA. In one approach, the nonspecific nuclease domain of the 
restriction enzyme FokI is fused to a DNA-binding domain designed to bind 
to a particular DNA sequence. In zinc-finger nucleases (ZFNs), 
the DNA-binding domain contains a series of zinc finger 
domains (Section 32.2), small zinc-binding motifs that each 
recognize a sequence of three base pairs. The preferred DNA 
binding sequence can be altered by changing the identity of only 
four contact residues within each finger. In transcription activa- 
tor-like effector nucleases (TALENSs), the DNA-binding domain 
is comprised of an array of TALE repeats. Each repeat contains 
34 amino acids and two a-helices, yet only two of these residues 
(at positions 12 and 13) are responsible for the unique recogni- 
tion of a single nucleotide within the double helix (Figure 5.35). 
Mutation of these residues within an array of repeats enables the 
recognition of a vast number of possible DNA target sequences 
with a high degree of specificity. 

How do these engineered nucleases effect a change in the 
genomic DNA sequence? Upon binding of the ZFN or 
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FIGURE 5.35 TALE repeats recognize 
individual bases in DNA. Each TALE 
repeat contains 34 amino acids, two of 
which specify its nucleotide binding partner. 
In this figure, the identity of these residues 
is indicated by the color of the repeat. 
TALE proteins can be designed to uniquely 
recognize extended oligonucleotide 
sequences. In this example, a 22 base-pair 
sequence is bound by a single TALE 
protein, the bacterial effector 

PthXo1. [Drawn from 3UGM.pdb] 


Array of TALE repeats 


5’ -TGCATCTCCCCCTACTGTACACCAC-3' 
3’ -ACGTAGAGGGGGATGACATGTGGTG- 5’ 
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FIGURE 5.37 RNA interference 
mechanism. A double-stranded RNA 
molecule is cleaved into 21-bp fragments 
by the enzyme Dicer to produce siRNAs. 
These siRNAs are incorporated into the 
RNA-induced silencing complex (RISC), 
where the single-stranded RNAs guide the 
cleavage of mRNAs that contain 
complementary sequences. 


Fokl 
nuclease 
ZFN/TALEN #1 domain 
Genomic DNA (OT 
et 
ZFN/TALEN #2 


| seer by 


ZFNs/TALENs 


| | 
Donor template DNA 


| ou repair 


FIGURE 5.36 Genome editing by site-specific nucleases. A pair of ZFNs or TALENs cleave 
opposite strands of a targeted gene (blue) within the genome. The DNA repair machinery of 
the cell will use a homologous donor template DNA fragment to fix the double-strand break, 
incorporating the desired modifications (green) into the targeted gene. 


TALEN to DNA, the nuclease domain cleaves the phosphate backbone of 
one of the DNA strands. A second nuclease, designed to recognize the 
opposite strand, introduces a second cleavage site, yielding a complete 
double-stranded break. The resulting cleavage site is repaired by the DNA 
repair machinery of the host cell (Section 28.5). Ifa DNA fragment contain- 
ing the desired sequence change is simultaneously introduced with the 
nucleases, the repair machinery will use this donor template to introduce 
these changes directly into the genomic sequence, in a manner similar to the 
homologous recombination process described above (Figure 5.36). 

Site-specific nuclease-based genome editing methods have now been 
applied to a variety of species, including model organisms used in the labo- 
ratory (rat, zebrafish, and fruit fly), various forms of livestock (pig, cow), 
and a number of plants. In addition, their use as therapeutic tools in humans 
is currently under investigation. For example, a ZFN which inactivates the 
human CCRS5 gene, a coreceptor for cellular invasion of human immunode- 
ficiency virus (HIV), is currently in clinical trials for the treatment of 
patients infected with HIV. 


RNA interference provides an additional tool for 
disrupting gene expression 


An extremely powerful tool for disrupting gene expression was serendipitously 
discovered in the course of studies that required the introduction of RNA into 
acell. The introduction of a specific double-stranded RNA molecule into a cell 
was found to suppress the transcription of genes that contained sequences 
present in the double-stranded RNA molecule. Thus, the introduction of a 
specific RNA molecule can interfere with the expression of a specific gene. 
The mechanism of RNA interference has been largely established 
(Figure 5.37). When a double-stranded RNA molecule is introduced into an 
appropriate cell, the RNA is cleaved by the enzyme Dicer into fragments 
approximately 21 nucleotides in length. Each fragment, termed a small interfer- 
ing RNA (siRNA), consists of 19 bp of double-stranded RNA and 2 bases of 
unpaired RNA on each 5’ end. The siRNA is loaded into an assembly of several 
proteins referred to as the RNA-induced silencing complex (RISC), which 
unwinds the RNA duplex and cleaves one of the strands, the so-called passenger 
strand. The uncleaved single-stranded RNA segment, the guide strand, remains 


incorporated into the enzyme. The fully assembled RISC cleaves mRNA mol- 
ecules that contain exact complements of the guide-strand sequence. Thus, 
levels of such mRNA molecules are dramatically reduced. The technique of 
RNA interference is called gene knockdown, because the expression of the gene 
is reduced but not eliminated, as is the case with gene knockouts. 

The machinery necessary for RNA interference is found in many cells. 
In some organisms such as C. elegans, RNA interference is quite efficient. 
Indeed, RNA interference can be induced simply by feeding C. elegans 
strains of E. coli that have been engineered to produce appropriate double- 
stranded RNA molecules. Although not as efficient in mammalian cells, 
RNA interference has emerged as a powerful research tool for reducing the 
expression of specific genes. Moreover, initial clinical trials of therapies 
based on RNA interference are underway. 


Tumor-inducing plasmids can be used to introduce new 
genes into plant cells 


The common soil bacterium Agrobacterium tumefaciens infects plants and 
introduces foreign genes into plant cells (Figure 5.38). A lump of tumor tis- 
sue called a crown gall grows at the site of infection. Crown galls synthesize 
opines, a group of amino acid derivatives that are metabolized by the infect- 
ing bacteria. In essence, the metabolism of the plant cell is diverted to satisfy 
the highly distinctive appetite of the intruder. Tumor-inducing plasmids (Ti 
plasmids) that are carried by A. tumefaciens carry instructions for the switch 
to the tumor state and the synthesis of opines. A small part of the Ti plasmid 
becomes integrated into the genome of infected plant cells; this 20-kb seg- 
ment is called T-DNA (transferred DNA; Figure 5.39). 

Ti-plasmid derivatives can be used as vectors to deliver foreign genes 
into plant cells. First, a segment of foreign DNA is inserted into the T-DNA 
region of a small plasmid through the use of restriction enzymes and ligases. 
This synthetic plasmid is added to A. tumefaciens colonies harboring natu- 
rally occurring Ti plasmids. By recombination, Ti plasmids containing the 
foreign gene are formed. These Ti vectors hold great promise as tools for 
exploring the genomes of plant cells and modifying plants to improve their 
agricultural value and crop yield. However, they are not suitable for trans- 
forming all types of plants. Ti-plasmid transfer is effective with dicots 
(broad-leaved plants such as grapes) and a few kinds of monocots but not as 
effective with economically important cereal monocots. 

Foreign DNA can be introduced into cereal monocots as well as dicots 
by applying intense electric fields, a technique called electroporation 
(Figure 5.40). First, the cellulose wall surrounding plant cells is removed 
by adding cellulase; this treatment produces protoplasts, plant cells with 
exposed plasma membranes. Electric pulses are then applied to a suspen- 
sion of protoplasts and plasmid DNA. Because high electric fields make 
membranes transiently permeable to large molecules, plasmid DNA mole- 
cules enter the cells. The cell wall is then allowed to reform, and the plant 
cells are again viable. Maize cells and carrot cells have been stably trans- 
formed in this way with the use of plasmid DNA that includes genes for 
resistance to herbicides. Moreover, the transformed cells efficiently express 
the plasmid DNA. Electroporation is also an effective means of delivering 
foreign DNA into animal cells and bacterial cells. 

The most effective means of transforming plant cells is through the use of 
“gene guns,” or bombardment-mediated transformation. DNA is coated onto 
1-m-diameter tungsten pellets, and these microprojectiles are fired at the 
target cells with a velocity greater than 400 ms~!. Despite its apparent crude- 
ness, this technique is proving to be the most effective way of transforming 
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FIGURE 5.38 Tumors in plants. Crown 
gall, a plant tumor, is caused by a bacterium 
(Agrobacterium tumefaciens) that carries a 
tumor-inducing plasmid (Ti plasmid). [From 
M. Escobar et al., Proc. Natl. Acad. Sci. U. S. A. 
98:13437-13442, 2001. Copyright © 2001 
National Academy of Sciences, U. S. A.] 
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FIGURE 5.39 Ti plasmids. Agrobacteria 
containing Ti plasmids can deliver foreign 
genes into some plant cells. [Information 
from M. Chilton. A vector for introducing 
new genes into plants. Copyright © 1983 by 
Scientific American, Inc. All rights reserved.] 
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FIGURE 5.40 Electroporation. Foreign 
DNA can be introduced into plant cells by 
electroporation, the application of intense 
electric fields to make their plasma 
membranes transiently permeable. 


plants, especially important crop species such as soybean, corn, wheat, and 
rice. The gene-gun technique affords an opportunity to develop genetically 
modified organisms (GMOs) with beneficial characteristics, such as the 
ability to grow in poor soils, resistance to natural climatic variation, resis- 
tance to pests, and the ability to fortify nutritional content. These crops 
might be most useful in developing countries. However, the use of GMOs is 
highly controversial, as some fear that their safety risks have not been ade- 
quately addressed. 

The first GMO to come to market was a tomato characterized by 
delayed ripening, rendering it ideal for shipment. Pectin is a polysaccharide 
that gives tomatoes their firmness and is naturally destroyed by the enzyme 
polygalacturonase. As pectin is destroyed, the tomatoes soften, making ship- 
ment difficult. DNA was introduced that disrupts the polygalacturonase 
gene. Less of the enzyme was produced, and the tomatoes stayed fresh 
longer. However, the tomato’s poor taste hindered its commercial success. 
An especially successful result of the use of Ti plasmid to modify crops is 
golden rice. Golden rice is a variety of genetically modified rice that con- 
tains the genes for B-carotene synthesis, a required precursor for vitamin A 
synthesis in humans. Consumption of this rice will benefit children and 
pregnant woman in parts of the world where rice is a dietary staple and vita- 
min A deficiency is common. 


Human gene therapy holds great promise for medicine 


e The field of gene therapy attempts to express specific genes within 

the human body in such a way that beneficial results are obtained. 
The gene targeted for expression may be already present or specially intro- 
duced. Alternatively, gene therapy may attempt to modify genes contain- 
ing sequence variations that have harmful consequences. A tremendous 
amount of research remains to be done before gene therapy becomes practi- 
cal. Nonetheless, considerable progress has been made. For example, some 
people lack functional genes for adenosine deaminase and succumb to infec- 
tions if exposed to a normal environment, a condition called severe combined 
immunodeficiency (SCID). Functional genes for this enzyme have been 
introduced by using gene-therapy vectors based on retroviruses. Although 
these vectors have produced functional enzymes and reduced the clinical 
symptoms, challenges remain. These challenges include increasing the 
longevity of the effects and eliminating unwanted side effects. Future 
research promises to transform gene therapy into an important tool for 
clinical medicine. 


SUMMARY 


5.1 The Exploration of Genes Relies on Key Tools 
The recombinant DNA revolution in biology is rooted in the repertoire 
of enzymes that act on nucleic acids. Restriction enzymes are a key 
group among them. These endonucleases recognize specific base 
sequences in double-helical DNA and cleave both strands of the duplex, 
forming specific fragments of DNA. These restriction fragments can be 
separated and displayed by gel electrophoresis. The pattern of these 
fragments on the gel is a fingerprint of a DNA molecule. A DNA frag- 
ment containing a particular sequence can be identified by hybridizing 
it with a labeled single-stranded DNA probe (Southern blotting). 
Rapid sequencing techniques have been developed to further the 
analysis of DNA molecules. DNA can be sequenced by controlled 


§.2 


5.3 


5.4 


interruption of replication. The fragments produced are separated by 
gel electrophoresis and visualized by autoradiography of a 3?P label at 
the 5’ end or by fluorescent tags. 

DNA probes for hybridization reactions, as well as new genes, can 
be synthesized by the automated solid-phase method. DNA chains as 
long as 100 nucleotides can be readily synthesized. The polymerase 
chain reaction makes it possible to greatly amplify specific segments of 
DNA in vitro. The region amplified is determined by the placement of 
a pair of primers that are added to the target DNA along with a ther- 
mostable DNA polymerase and deoxyribonucleoside triphosphates. 
The exquisite sensitivity of PCR makes it a choice technique in detect- 
ing pathogens and cancer markers, in genotyping, and in amplifying 
DNA from fossils that are many thousands of years old. 


Recombinant DNA Technology Has Revolutionized All Aspects of Biology 
New genes can be constructed in the laboratory, introduced into host 
cells, and expressed. Novel DNA molecules are made by joining frag- 
ments that have complementary cohesive ends produced by the action 
of a restriction enzyme. DNA ligase seals breaks in DNA chains. 
Vectors for propagating the DNA include plasmids, \ phage, and bac- 
terial and yeast artificial chromosomes. Specific genes can be cloned 
from a genomic library with the use of a DNA or RNA probe. Foreign 
DNA can be expressed after insertion into prokaryotic and eukaryotic 
cells by the appropriate vector. Specific mutations can be generated in 
vitro to engineer novel proteins. A mutant protein with a single amino 
acid substitution can be produced by priming DNA replication with an 
oligonucleotide encoding the new amino acid. Plasmids can be engi- 
neered to permit the facile insertion of a DNA cassette containing any 
desired mutation. The techniques of protein and nucleic acid chemistry 
are highly synergistic. Investigators now move back and forth between 
gene and protein with great facility. 


Complete Genomes Have Been Sequenced and Analyzed 

The sequences of many important genomes are known in their 
entirety. More than 10,000 bacterial and archaeal genomes have been 
sequenced, including those from key model organisms and impor- 
tant pathogens. The sequence of the human genome has now been 
completed with nearly full coverage and high precision. Only from 
20,000 to 25,000 protein-encoding genes appear to be present in the 
human genome, a substantially smaller number than earlier esti- 
mates. Comparative genomics has become a powerful tool for ana- 
lyzing individual genomes and for exploring evolution. Genomewide 
gene-expression patterns can be examined through the use of DNA 
microarrays. 


Eukaryotic Genes Can Be Quantitated and Manipulated with 
Considerable Precision 

Changes in gene expression can be readily determined by such tech- 
niques as quantitative PCR and hybridization to microarrays. The 
production of transgenic mice carrying mutations known to cause ALS 
in humans has been a source of considerable insight into the disease 
mechanism and its possible treatment. The functions of particular 
genes can be investigated by disruption. One method of disrupting the 
expression of a particular gene is through RNA interference, which 
depends on the introduction of specific double-stranded RNA mole- 
cules into eukaryotic cells. New DNA can be brought into plant cells 
by the soil bacterium Agvobacterium tumefaciens, which harbors Ti 


165 
Summary 


166 


CHAPTER 5 Exploring Genes 
and Genomes 


KEY TERMS 


plasmids. DNA can also be introduced into plant cells by applying 
intense electric fields, which render them transiently permeable to very 
large molecules, or by bombarding them with DNA-coated micropar- 
ticles. Gene therapy holds great promise for clinical medicine, but 


many challenges remain. 
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PROBLEMS 


1. It’s not the heat... Why is Tag polymerase especially 
useful for PCR? 


2. The right template. Ovalbumin is the major protein of 
egg white. The chicken ovalbumin gene contains eight 
exons separated by seven introns. Should ovalbumin cDNA 
or ovalbumin genomic DNA be used to form the protein in 


E. coli? Why? 


3. Handle with care. Ethidium bromide is a commonly 
used stain for DNA molecules after separation by gel elec- 
trophoresis. The chemical structure of ethidium bromide is 
shown here. Based on this structure, suggest how this stain 


binds to DNA. 
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4. Cleavage frequency. The restriction enzyme Alul cleaves 
at the sequence 5’-AGCT-3’, and NotI cleaves at 
5’-GCGGCCGC-3’. What would be the average distance 
between cleavage sites for each enzyme on digestion of 
double-stranded DNA? Assume that the DNA contains 
equal proportions of A, G, C, and T. 


5. The right cuts. Suppose that a human genomic library is 
prepared by exhaustive digestion of human DNA with the 
EcoRI restriction enzyme. Fragments averaging about 4 kb 
in length would be generated. Is this procedure suitable for 
cloning large genes? Why or why not? 


6. A revealing cleavage. Sickle-cell anemia arises from a 
mutation in the gene for the B chain of human hemoglobin. 
The change from GAG to GTG in the mutant eliminates a 
cleavage site for the restriction enzyme MstlII, which recog- 
nizes the target sequence CCTGAGG. These findings 
form the basis of a diagnostic test for the sickle-cell gene. 
Propose a rapid procedure for distinguishing between the 
normal and the mutant gene. Would a positive result prove 
that the mutant contains GTG in place of GAG? 


7. Sticky ends? The restriction enzymes KpnI and 
Acc65I recognize and cleave the same 6-bp sequence. 
However, the sticky end formed from KpnI cleavage cannot 


be ligated directly to the sticky end formed from Acc65I 
cleavage. Explain why. 
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8. Many melodies from one cassette. Suppose that you have 
isolated an enzyme that digests paper pulp and have 
obtained its CDNA. The goal is to produce a mutant that is 
effective at high temperature. You have engineered a pair of 
unique restriction sites in the cDNA that flank a 30-bp cod- 
ing region. Propose a rapid technique for generating many 
different mutations in this region. 


9. A blessing and a curse. The power of PCR can also create 
problems. Suppose someone claims to have isolated dino- 
saur DNA by using PCR. What questions might you ask to 
determine if it is indeed dinosaur DNA? 


10. Rich or poor? DNA sequences that are highly enriched 
in G-C base pairs typically have high melting tempera- 
tures. Moreover, once separated, single strands containing 
these regions can form rigid secondary structures. How 
might the presence of G—C-rich regions ina DNA template 
affect PCR amplification? 


11. Questions of accuracy. The stringency of PCR amplifi- 
cation can be controlled by altering the temperature at 
which the primers and the target DNA undergo hybridiza- 
tion. How would altering the temperature of hybridization 
affect the amplification? Suppose that you have a particular 
yeast gene A and that you wish to see if it has a counterpart 
in humans. How would controlling the stringency of the 
hybridization help you? 


12. Terra incognita. PCR is typically used to amplify 
DNA that lies between two known sequences. Suppose 
that you want to explore DNA on both sides of a single 
known sequence. Devise a variation of the usual PCR 
protocol that would enable you to amplify entirely new 
genomic terrain. 


13. A puzzling ladder. A gel pattern displaying PCR prod- 
ucts shows four strong bands. The four pieces of DNA have 
lengths that are approximately in the ratio of 1:2:3:4. The 
largest band is cut out of the gel, and PCR is repeated with 
the same primers. Again, a ladder of four bands is evident 
in the gel. What does this result reveal about the structure 
of the encoded protein? 


14. Chromosome walking. Propose a method for isolating a 
DNA fragment that is adjacent in the genome to a previ- 
ously isolated DNA fragment. Assume that you have access 
to a complete library of DNA fragments in a BAC vector 
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but that the sequence of the genome under study has not yet 
been determined. 


15. Probe design. Which of the following amino acid sequences 
would yield the most optimal oligonucleotide probe? 


Ala-Met-Ser-Leu-Pro-Trp 
Gly-Trp-Asp-Met-His-Lys 
Cys-Val-Trp-Asn-Lys-Ile 
Arg-Ser-Met-Leu-Gln-Asn 


16. Man’s best friend. Why might the genomic analysis of 
dogs be particularly useful for investigating the genes 
responsible for body size and other physical characteristics? 


17. Of mice and men. You have identified a gene that is 
located on human chromosome 20 and wish to identify its 
location within the mouse genome. On which chromosome 
would you be most likely to find the mouse counterpart of 
this gene? 


Chapter Integration Problems 


18. Designing primers I. A successful PCR experiment 
often depends on designing the correct primers. In particu- 
lar, the T,,, for each primer should be approximately the 
same. What is the basis of this requirement? 


19. Designing primers IT. You wish to amplify a segment of 
DNA from a plasmid template by PCR with the use of the 
following primers: 5’-GGATCGATGCTCGCGA-3' 
and 5'-AGGATCGGGTCGCGAG-3’. Despite repeated 
attempts, you fail to observe a PCR product of the expected 
length after electrophoresis on an agarose gel. Instead, you 
observe a bright smear on the gel with an approximate 
length of 25 to 30 base pairs. Explain these results. 


Chapter Integration and Data Interpretation Problem 


20. Any direction but east. A series of people are found to 
have difficulty eliminating certain types of drugs from 
their bloodstreams. The problem has been linked to a 
gene X, which encodes an enzyme Y. Six people were 
tested with the use of various techniques of molecular 
biology. Person A is a normal control, person B is asymp- 
tomatic but some of his children have the metabolic prob- 
lem, and persons C through F display the trait. Tissue 
samples from each person were obtained. Southern analy- 
sis was performed on the DNA after digestion with the 
restriction enzyme HindIII. Northern analysis of mRNA 
also was done. In both types of analysis, the gels were 
probed with labeled X cDNA. Finally, a western blot 
with an enzyme-linked monoclonal antibody was used to 
test for the presence of protein Y. The results are shown 
here. Why is person B without symptoms? Suggest pos- 
sible defects in the other people. 
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Data Interpretation Problems 


21. DNA diagnostics. Representations of sequencing chro- 
matograms for variants of the a chain of human hemoglobin 
are shown here. What is the nature of the amino acid change 
in each of the variants? The first triplet encodes valine. 


22. Two peaks. In the course of studying a gene and its pos- 
sible mutation in humans, you obtain genomic DNA sam- 
ples from a collection of persons and PCR amplify a region 
of interest within this gene. For one of the samples, you 
obtain the sequencing chromatogram shown here. Provide 
an explanation for the appearance of these data at position 
49 (indicated by the arrow): 


50 
ATTAGGNGGTATGTA 


Animated Techniques 

k Visit www.whfreeman.com/Berg7e to see animations 
of Dideoxy Sequencing of DNA, Polymerase Chain 
Reaction, Synthesizing an Oligonucleotide Array, 
Screening an Oligonucleotide Array for Patterns of Gene 
Expression, Plasmid Cloning, In Vitro Mutagenesis of 
Cloned Genes, Creating a Transgenic Mouse. [Courtesy of 
H. Lodish et al., Molecular Cell Biology, 5th ed. (W. H. 
Freeman and Company, 2004). | 
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Exploring Evolution and Bioinformatics 


Evolutionary relationships are manifest in protein 
sequences. The close kinship between human beings 
and chimpanzees, hinted at by the mutual interest shown 
by Jane Goodall and a chimpanzee in the photograph, 

is revealed in the amino acid sequences of myoglobin. 
The human sequence (red) differs from the chimpanzee 
sequence (blue) in only one amino acid in a protein 

chain of 153 residues. [(Left) Kennan Ward/Corbis.] 
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ike members of a human family, members of molecular families often 

have features in common. Such family resemblance is most easily 
detected by comparing three-dimensional structure, the aspect of a mole- 
cule most closely linked to function. Consider as an example ribonuclease 
from cows, which was introduced in our consideration of protein folding 
(Section 2.6). Comparing structures reveals that the three-dimensional 
structure of this protein and that of a human ribonuclease are quite similar 
(Figure 6.1). Although the degree of overlap between these two structures is 
not unexpected, given their nearly identical biological functions, similarities 
revealed by other such comparisons are sometimes surprising. For example, 
angiogenin, a protein that stimulates the growth of new blood vessels, is also 
structurally similar to ribonuclease—so similar that both angiogenin and 
ribonuclease are clearly members of the same protein family (Figure 6.2). 
Angiogenin and ribonuclease must have had a common ancestor at some 
earlier stage of evolution. 

Three-dimensional structures have been determined for only a small pro- 
portion of the total number of proteins. In contrast, gene sequences and the 
corresponding amino acid sequences are available for a great number of pro- 
teins, largely owing to the tremendous power of DNA cloning and sequencing 
techniques, including applications to complete-genome sequencing (Chapter 5). 
Evolutionary relationships also are manifest in amino acid sequences. For 
example, 35% of the amino acids in corresponding positions are identical in 
the sequences of bovine ribonuclease and angiogenin. Is this level sufficiently 
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hk Figure 6.1 Structures of 
ribonucleases from cows and human 
beings. Structural similarity often follows 
functional similarity. [Drawn from 8RAT.pdb. 
and 2RNF.pdb.] 


Angiogenin 


hk FIGURE 6.2 Structure of 
angiogenin. The protein angiogenin, 
identified on the basis of its ability to 
stimulate blood-vessel growth, is highly 
similar in three-dimensional structure to 
ribonuclease. [Drawn from 2ANG.pdb.] 


Human ribonuclease 


Bovine ribonuclease 


high to ensure an evolutionary relationship? If not, what level is required? In 
this chapter, we shall examine the methods that are used to compare amino 
acid sequences and to deduce such evolutionary relationships. 

Sequence-comparison methods are powerful tools in modern biochemistry. 
Sequence databases can be probed for matches to a newly elucidated sequence 
to identify related molecules. This information can often be a source of con- 
siderable insight into the function and mechanism of the newly sequenced 
molecule. When three-dimensional structures are available, they can be com- 
pared to confirm relationships suggested by sequence comparisons and to 
reveal others that are not readily detected at the level of sequence alone. 

By examining the footprints present in modern protein sequences, the 
biochemist can learn about events in the evolutionary past. Sequence com- 
parisons can often reveal pathways of evolutionary descent and estimated 
dates of specific evolutionary landmarks. This information can be used to 
construct evolutionary trees that trace the evolution of a particular protein or 
nucleic acid, in many cases from Archaea and Bacteria through Eukarya, 
including human beings. Molecular evolution can also be studied experimen- 
tally. In some cases, DNA from fossils can be amplified by PCR methods and 
sequenced, giving a direct view into the past. In addition, investigators can 
observe molecular evolution taking place in the laboratory, through experi- 
ments based on nucleic acid replication. The results of such studies are reveal- 
ing more about how evolution proceeds. 


6.1 Homologs Are Descended from a Common Ancestor 


The exploration of biochemical evolution consists largely of an attempt to 
determine how proteins, other molecules, and biochemical pathways have 
been transformed through time. The most fundamental relationship 
between two entities is homology; two molecules are said to be homologous if 
they have been derived from a common ancestor. Homologous molecules, 
or homologs, can be divided into two classes (Figure 6.3). Paralogs are homo- 
logs that are present within one species. Paralogs often differ in their 
detailed biochemical functions. Orthologs are homologs that are present 
within different species and have very similar or identical functions. 
Understanding the homology between molecules can reveal the evolution- 
ary history of the molecules as well as information about their function; if a 
newly sequenced protein is homologous to an already characterized protein, 
we have a strong indication of the new protein’s biochemical function. 

How can we tell whether two human proteins are paralogs or whether a 
yeast protein is the ortholog of ahuman protein? As will be discussed in Section 
6.2, homology is often detectable by significant similarity in nucleotide or amino 
acid sequence and almost always manifested in three-dimensional structure. 


FIGURE 6.3 Two classes of homologs. 
Homologs that perform identical or very 
similar functions in different species are 
called orthologs, whereas homologs that 
perform different functions within one 
species are called paralogs. 
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(stimulates blood-vessel growth) 


6.2 Statistical Analysis of Sequence Alignments 
Can Detect Homology 


A significant sequence similarity between two molecules implies that they 
are likely to have the same evolutionary origin and, therefore, similar three- 
dimensional structures, functions, and mechanisms. Both nucleic acid and 
protein sequences can be compared to detect homology. However, the pos- 
sibility exists that the observed agreement between any two sequences is 
solely a product of chance. Because nucleic acids are composed of fewer 
building blocks than proteins (4 bases versus 20 amino acids), the likelihood 
of random agreement between two DNA or RNA sequences is significantly 
greater than that for protein sequences. For this reason, detection of homol- 
ogy between protein sequences is typically far more effective. 

To illustrate sequence-comparison methods, let us consider a class of 
proteins called the globins. Myoglobin is a protein that binds oxygen in 
muscle, whereas hemoglobin is the oxygen-carrying protein in blood 
(Chapter 7). Both proteins cradle a heme group, an iron-containing organic 
molecule that binds the oxygen. Each human hemoglobin molecule is com- 
posed of four heme-containing polypeptide chains, two identical a chains 
and two identical B chains. Here, we shall consider only the a chain. To 
examine the similarity between the amino acid sequence of the human a 
chain and that of human myoglobin (Figure 6.4), we apply a method, referred 
to as a sequence alignment, in which the two sequences are systematically 
aligned with respect to each other to identify regions of significant overlap. 
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6.2 Analysis of Sequence Fragments 


FIGURE 6.4 Amino acid sequences of 
human hemoglobin (@ chain) and human 
myoglobin. a-Hemoglobin is composed of 
141 amino acids; myoglobin consists of 
153 amino acids. (One-letter abbreviations 
designating amino acids are used; see 
Table 2.2.) 
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FIGURE 6.5 Comparing the amino acid 
sequences of a-hemoglobin and 
myoglobin. (A) A comparison is made by 
sliding the sequences of the two proteins 
past each other, one amino acid at a time, 
and counting the number of amino acid 
identities between the proteins. (B) The two 
alignments with the largest number of 
matches are shown above the graph, which 
plots the matches as a function of alignment. 
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How can we tell where to align the two sequences? In the course of evo- 
lution, the sequences of two proteins that have an ancestor in common will 
have diverged in a variety of ways. Insertions and deletions may have 
occurred at the ends of the proteins or within the functional domains them- 
selves. Individual amino acids may have been mutated to other residues of 
varying degrees of similarity. To understand how the methods of sequence 
alignment take these potential sequence variations into account, let us first 
consider the simplest approach, where we slide one sequence past the other, 
one amino acid at a time, and count the number of matched residues, or 
sequence identities (Figure 6.5). For a-hemoglobin and myoglobin, the best 
alignment reveals 23 sequence identities, spread throughout the central 
parts of the sequences. 

However, careful examination of all the possible alignments and their 
scores suggests that important information regarding the relationship 
between myoglobin and hemoglobin a has been lost with this method. In 
particular, we see that another alignment, featuring 22 identities, is 
nearly as good. This alignment is shifted by six residues relative to the 
preceding alignment and yields identities that are concentrated toward 
the amino-terminal end of the sequences. By introducing a gap into one 
of the sequences, the identities found in both alignments will be repre- 
sented (Figure 6.6). Insertion of gaps allows the alignment method to 
compensate for the insertions or deletions of nucleotides that may have 
taken place in the gene for one molecule but not the other in the course 
of evolution. 
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38 identities: 38 x (+10) = 380 
1 gap: 1X (—25) =—25 
355 


The use of gaps substantially increases the complexity of sequence 
alignment because a vast number of possible gaps, varying in both position 
and length, must be considered throughout each sequence. Moreover, the 
introduction of an excessive number of gaps can yield an artificially high 
number of identities. Nevertheless, methods have been developed for the 
insertion of gaps in the automatic alignment of sequences. These methods 
use scoring systems to compare different alignments, including penalties for 
gaps to prevent the insertion of an unreasonable number of them. For 
example, in one scoring system, each identity between aligned sequences is 
counted as +10 points, whereas each gap introduced, regardless of size, 
counts for —25 points. For the alignment shown in Figure 6.6, there are 38 
identities (38 X 10 = 380) and 1 gap (1 X —25 = —25), producing a score 
of (380 + —25 = 355). Overall, there are 38 matched amino acids in an 
average length of 147 residues; thus, the sequences are 25.9% identical. 
Next, we must determine the significance of this score and level of identity. 


The statistical significance of alignments can be 
estimated by shuffling 


The similarities in sequence in Figure 6.5 appear striking, 
yet there remains the possibility that a grouping of 


i‘ 30: = 
sequence identities has occurred by chance alone. Because 


proteins are composed of the same set of 20 amino acid 
monomers, the alignment of any two unrelated proteins 
will yield some identities, particularly if we allow the 
introduction of gaps. Even if two proteins have identical 
amino acid composition, they may not be linked by evolu- 
tion. It is the order of the residues within their sequences 
that implies a relationship between them. Hence, we can 
assess the significance of our alignment by “shuffling,” or 
randomly rearranging, one of the sequences (Figure 6.7), 
repeating the sequence alignment, and determining a new 
alignment score. This process is repeated many times to 
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FIGURE 6.6 Alignment with gap insertion. 
The alignment of a-hemoglobin and 
myoglobin after a gap has been inserted 
into the hemoglobin a sequence. 
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FIGURE 6.7 The generation of a 
shuffled sequence. 


yield a histogram showing, for each possible score, the 200 


number of shuffled sequences that received that score 
(Figure 6.8). If the original score is not appreciably differ- 
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FIGURE 6.8 Statistical comparison of alignment 


ent from the scores from. the shuffled alignments, then WE “scores. Alignment scores are calculated for many shuffled 

cannot exclude the possibility that the original alignment sequences, and the number of sequences generating a particular 

is merely a consequence of chance. score is plotted against the score. The resulting plot is a distribution 
When this procedure is applied to the sequences of of alignment scores occurring by chance. The alignment score for 


myoglobin and a-hemoglobin, the authentic alignment 
(indicated by the red bar in Figure 6.8) clearly stands out. 
Its score is far above the mean for the alignment scores 


unshuffled a-hemoglobin and myoglobin (shown in red) is 
substantially greater than any of these scores, strongly suggesting 
that the sequence similarity is significant. 
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based on shuffled sequences. The probability that such a deviation occurred 
by chance alone is approximately 1 in 102°. Thus, we can comfortably con- 
clude that the two sequences are genuinely similar; the simplest explanation 
for this similarity is that these sequences are homologous—that is, the two 
molecules have descended from a common ancestor. 


Distant evolutionary relationships can be detected through 
the use of substitution matrices 


The scoring scheme heretofore described assigns points only to positions 
occupied by identical amino acids in the two sequences being compared. No 
credit is given for any pairing that is not an identity. However, as already 
discussed, two proteins related by evolution undergo amino acid substitu- 
tions as they diverge. A scoring system based solely on amino acid identity 
cannot account for these changes. To add greater sensitivity to the detection 
of evolutionary relationships, methods have been developed to compare two 
amino acids and assess their degree of similarity. 

Not all substitutions are equivalent. For example, amino acid changes 
can be classified as structurally conservative or nonconservative. A conser- 
vative substitution replaces one amino acid with another that is similar in size 
and chemical properties. Conservative substitutions may have only minor 
effects on protein structure and often can be tolerated without compromis- 
ing protein function. In contrast, in a nonconservative substitution, an amino 
acid is replaced by one that is structurally dissimilar. Amino acid changes 
can also be classified by the fewest number of nucleotide changes necessary 
to achieve the corresponding amino acid change. Some substitutions arise 
from the replacement of only a single nucleotide in the gene sequence; 
whereas others require two or three replacements. Conservative and single- 
nucleotide substitutions are likely to be more common than are substitu- 
tions with more radical effects. 

How can we account for the type of substitution when comparing 
sequences? We can approach this problem by first examining the substitu- 
tions that have been observed in proteins known to be evolutionarily related. 
From an examination of appropriately aligned sequences, substitution 
matrices have been deduced. A substitution matrix describes a scoring system 
for the replacement of any amino acid with each of the other 19 amino acids. 
In these matrices, a large positive score corresponds to a substitution that 
occurs relatively frequently, whereas a large negative score corresponds to a 
substitution that occurs only rarely. A commonly used substitution matrix, 
the Blosum-62 (for Blocks of amino acid substitution matrix), is illustrated in 
Figure 6.9. In this depiction, each column in this matrix represents one of 
the 20 amino acids, whereas the position of the single-letter codes within 
each column specifies the score for the corresponding substitution. Notice 
that scores corresponding to identity (the boxed codes at the top of each 
column) are not the same for each residue, owing to the fact that less fre- 
quently occurring amino acids such as cysteine (C) and tryptophan (W) will 
align by chance less often than the more common residues. Furthermore, 
structurally conservative substitutions such as lysine (K) for arginine (R) 
and isoleucine (I) for valine (V) have relatively high scores, whereas noncon- 
servative substitutions such as lysine for tryptophan result in negative scores 
(Figure 6.10). When two sequences are compared, each pair of aligned resi- 
dues is assigned a score based on the matrix. In addition, gap penalties are 
often assessed. For example, the introduction of a single-residue gap lowers 
the alignment score by 12 points and the extension of an existing gap costs 2 
points per residue. With the use of this scoring system, the alignment shown 
in Figure 6.6 receives a score of 115. In many regions, most substitutions are 
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FIGURE 6.9 A graphic view of the Blosum-62. This substitution matrix was derived by 
examining substitutions within aligned sequence blocks in related proteins. Amino acids 


are classified into four groups (charged, red; polar, green; large and hydrophobic, blue; 
other, black). Substitutions that require the change of only a single nucleotide are shaded. 
Identities are boxed. To find the score for a substitution of, for instance, a Y for an H, you 
find the Y in the column having H at the top and check the number at the left. In this case, 
the resulting score is 2. 
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FIGURE 6.10 Scoring of conservative 
and nonconservative substitutions. The 
Blosum-62 indicates that a conservative 
substitution (lysine for arginine) receives a 
positive score, whereas a nonconservative 
substitution (lysine for tryptophan) is scored 
negatively. The matrix is depicted as an 
abbreviated form of Figure 6.9. 
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alignment of a-hemoglobin and myoglobin 
with conservative substitutions indicated by 
yellow shading and identities by orange. 
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conservative (defined as those substitutions with scores greater than 0) and 
relatively few are strongly disfavored (Figure 6.11). 

This scoring system detects homology between less obviously related 
sequences with greater sensitivity than would a comparison of identities 
only. Consider, for example, the protein leghemoglobin, an oxygen-binding 
protein found in the roots of some plants. The amino acid sequence of leghe- 
moglobin from the herb lupine can be aligned with that of human myoglo- 
bin and scored by using either the simple scoring scheme based on identities 
only or the Blosum-62 (Figure 6.9). Repeated shuffling and scoring pro- 
vides a distribution of alignment scores (Figure 6.12). Scoring based solely 
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population of shuffled alignments. 


on identities indicates that the probability of the alignment between myo- 
globin and leghemoglobin occurring by chance alone is 1 in 20. Thus, 
although the level of similarity suggests a relationship, there is a 5% chance 
that the similarity is accidental on the basis of this analysis. In contrast, users 
of the substitution matrix are able to incorporate the effects of conservative 
substitutions. From such an analysis, the odds of the alignment occurring by 
chance are calculated to be approximately 1 in 300. Thus, an analysis per- 
formed with the substitution matrix reaches a much firmer conclusion about 
the evolutionary relationship between these proteins (Figure 6.13). 
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FIGURE 6.13 Alignment of human myoglobin and lupine leghemoglobin. The use of 
Blosum-62 yields the alignment shown between human myoglobin and lupine leghemoglobin, 
illustrating identities (orange boxes) and conservative substitutions (yellow). These sequences 
are 23% identical. 
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Experience with sequence analysis has led to the development of sim- 
pler rules of thumb. For sequences longer than 100 amino acids, sequence 
identities greater than 25% are almost certainly not the result of chance 
alone; such sequences are probably homologous. In contrast, if two 
sequences are less than 15% identical, their alignment alone is unlikely to 
indicate statistically significant similarity. For sequences that are between 
15 and 25% identical, further analysis is necessary to determine the statisti- 
cal significance of the alignment. It must be emphasized that the lack of a 
statistically significant degree of sequence similarity does not rule out homology. 
The sequences of many proteins that have descended from common ances- 
tors have diverged to such an extent that the relationship between the pro- 
teins can no longer be detected from their sequences alone. As we will see, 
such homologous proteins can often be detected by examining three- 
dimensional structures. 


Databases can be searched to identify homologous sequences 


When the sequence of a protein is first determined, comparing it with all 
previously characterized sequences can be a source of tremendous insight 
into its evolutionary relatives and, hence, its structure and function. Indeed, 
an extensive sequence comparison is almost always the first analysis per- 
formed on a newly elucidated sequence. The sequence-alignment methods 
just described are used to compare an individual sequence with all members 
of a database of known sequences. 

Database searches for homologous sequences are most often accom- 
plished by using resources available on the Internet at the National Center 
for Biotechnology Information (www.ncbi.nih.gov). The procedure used is 
referred to as a BLAST (Basic Local Alignment Search Tool) search. An 
amino acid sequence is typed or pasted into the Web browser, and a search 
is performed, most often against a nonredundant database of all known 
sequences. At the end of 2013, this database included more than 35 million 
sequences. A BLAST search yields a list of sequence alignments, each 
accompanied by an estimate giving the likelihood that the alignment 
occurred by chance (Figure 6.14). 

In 1995, investigators reported the first complete sequence of the 
genome of a free-living organism, the bacterium Haemophilus influenzae. 
With the sequences available, they performed a BLAST search with each 
deduced protein sequence. Of 1743 identified protein-coding regions, also 
called open reading frames (ORFs), 1007 (58%) could be linked to some pro- 
tein of known function that had been previously characterized in another 
organism. An additional 347 ORFs could be linked to sequences in the 
database for which no function had yet been assigned (“hypothetical pro- 
teins”). The remaining 389 sequences did not match any sequence present 
in the database at that time. Thus, investigators were able to identify likely 
functions for more than half the proteins within this organism solely by 
sequence comparisons. 


6.3 Examination of Three-Dimensional Structure 
Enhances Our Understanding of 
Evolutionary Relationships 


Sequence comparison is a powerful tool for extending our knowledge of 
protein function and kinship. However, biomolecules generally function as 
intricate three-dimensional structures rather than as linear polymers. 
Mutations occur at the level of sequence, but the effects of the mutations are 
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FIGURE 6.14 BLAST search results. Part of the results from a BLAST search of the 
nonredundant (nr) protein sequence database using the sequence of ribose 5-phosphate 
isomerase (also called phosphopentose isomerase, Chapter 20) from E. coli as a query. 
Among the thousands of sequences found is the orthologous sequence from humans, and 
the alignment between these sequences is shown (highlighted in yellow). The number of 
sequences with this level of similarity expected to be in the database by chance is 2 x 10727 
as shown by the E value (highlighted in red). Because this value is much less than 1, the 
observed sequence alignment is highly significant. 


at the level of function, and function is directly related to tertiary structure. 
Consequently, to gain a deeper understanding of evolutionary relationships 
between proteins, we must examine three-dimensional structures, especially 
in conjunction with sequence information. The techniques of structural 
determination were presented in Section 3.5. 


Tertiary structure is more conserved than primary structure 


Because three-dimensional structure is much more closely associated 
with function than is sequence, tertiary structure is more evolutionarily 
conserved than is primary structure. This conservation is apparent in the 
tertiary structures of the globins (Figure 6.15), which are extremely 
similar even though the similarity between human myoglobin and lupine 
leghemoglobin is just barely detectable at the sequence level (Figure 
6.12) and that between human a-hemoglobin and lupine leghemoglobin 
is not statistically significant (15% identity). This structural similarity 
firmly establishes that the framework that binds the heme group and 


facilitates the reversible binding of oxygen has been conserved over a 
long evolutionary period. 

Anyone aware of the similar biochemical functions of hemoglobin, myo- 
globin, and leghemoglobin could expect the structural similarities. In a 
growing number of other cases, however, a comparison of three-dimensional 
structures has revealed striking similarities between proteins that were not 
expected to be related, on the basis of their diverse functions. A case in point 
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Hemoglobin (a chain) Myoglobin Leghemoglobin 


is the protein actin, a major component of the cytoskeleton (Section 35.2), 
and heat shock protein 70 (Hsp70), which assists protein folding inside 
cells. These two proteins were found to be noticeably similar in structure 
despite only 16% sequence identity (Figure 6.16). On the basis of their 
three-dimensional structures, actin and Hsp70 are paralogs. The level of 
structural similarity strongly suggests that, despite their different biological 
roles in modern organisms, these proteins descended from a common ances- 
tor. As the three-dimensional structures of more proteins are determined, 
such unexpected kinships are being discovered with increasing frequency. 
The search for such kinships relies ever more frequently on computer-based 
searches that are able to compare the three-dimensional structure of any 
protein with all other known structures. 


Knowledge of three-dimensional structures can aid in the 
evaluation of sequence alignments 


The sequence-comparison methods described thus far treat all positions 
within a sequence equally. However, we know from examining families 


FIGURE 6.15 Conservation of 
three-dimensional structure. The tertiary 
structures of human hemoglobin (a chain), 
human myoglobin, and lupine leghemoglobin 
are conserved. Each heme group contains 
an iron atom to which oxygen binds. [Drawn 
from 1HBB.pdb, 1MBD.pdb, and 1GDJ.pdb.] 


FIGURE 6.16 Structures of actin 

and a large fragment of heat shock 
protein 70 (Hsp70). A comparison of the 
identically colored elements of secondary 
structure reveals the overall similarity in 
structure despite the difference in 
biochemical activities. [Drawn from 
1ATN.pdb and 1ATR.pdb.] 
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FIGURE 6.17 Sequence alignment 

of internal repeats. (A) The primary 
structure of the TATA-box-binding protein. 
(B) An alignment of the sequences of the 
two repeats of the TATA-box-binding 
protein. The amino-terminal repeat is shown 
in red and the carboxyl-terminal repeat in 
blue. (C) Structure of the TATA-box-binding 
protein. The amino-terminal domain is 
shown in red and the carboxyl-terminal 
domain in blue. [Drawn from 1VOK.pdb.] 


of homologous proteins for which at least one three-dimensional struc- 
ture is known that regions and residues critical to protein function are 
more strongly conserved than are other residues. For example, each type 
of globin contains a bound heme group with an iron atom at its center. 
A histidine residue that interacts directly with this iron atom (residue 64 
in human myoglobin) is conserved in all globins. After we have identi- 
fied key residues or highly conserved sequences within a family of pro- 
teins, we can sometimes identify other family members even when the 
overall level of sequence similarity is below statistical significance. Thus 
it may be useful to generate a sequence template—a map of conserved 
residues that are structurally and functionally important and are charac- 
teristic of particular families of proteins, which makes it possible to 
recognize new family members that might be undetectable by other 
means. A variety of other methods for sequence classification that take 
advantage of known three-dimensional structures also are being devel- 
oped. Still other methods are able to identify conserved residues within 
a family of homologous proteins, even without a known three-dimen- 
sional structure. These methods often use substitution matrices that 
differ at each position within a family of aligned sequences. Such meth- 
ods can often detect quite distant evolutionary relationships. 


Repeated motifs can be detected by aligning sequences with themselves 


More than 10% of all proteins contain sets of two or more domains that 
are similar to one another. Sequence search methods can often detect 
internally repeated sequences that have been characterized in other pro- 
teins. Often, however, repeated units do not correspond to previously 
identified domains. In these cases, their presence can be detected by 
attempting to align a given sequence with itself. The statistical signifi- 
cance of such repeats can be tested by aligning the regions in question as 
if these regions were sequences from separate proteins. For the TATA- 
box-binding protein (Figure 6.17A), a key protein in controlling gene 
transcription (Section 29.2), such an alignment is highly significant: 
30% of the amino acids are identical over 90 residues (Figure 6.17B). The 


(A) 1 MTDQGLEGSN PVDLSKHPSG IVPTLONIVS TVNLDCKLDL KAIALQARNA 
51 EYNPKRFAAV IMRIREPKTT ALIFASGKMV CTGAKSEDFS KMAARKYARI 
101 VQKLGFPAKF KDFKIQNIVG SCDVKFPIRL EGLAYSHAAF SSYEPELFPG 
151 LIYRMKVPKI VLLIFVSGKI VITGAKMRDE TYKAFENIYP VLSEFRKIQQ 
(B) 1 MTDQGLEGSNPVDLSKHPS 
20 GIVPTLONIVSTVNLDCKLDL AVIIMR|I R 
110. FKDFK: 6SCDVKFPIRILE cuzlyRM K 
66 EPKITTALI . MIVICITGAKISEIDIF SK ARKYARTV LGFPAK 
ke ‘| p | at Ps 4 
157 KITV TFV I|VIITGAKMRIDETY Kl FENIYPVLSEFRKIOQ 
(©) 


estimated probability of such an alignment occurring by chance is 1 in 
10!3. The determination of the three-dimensional structure of the 
TATA-box-binding protein confirmed the presence of repeated struc- 
tures; the protein is formed of two nearly identical domains (Figure 6.17C). 
The evidence is convincing that the gene encoding this protein evolved 
by duplication of a gene encoding a single domain. 


Convergent evolution illustrates common solutions to 
biochemical challenges 


Thus far, we have been exploring proteins derived from common ancestors— 
that is, through divergent evolution. Other cases have been found of pro- 
teins that are structurally similar in important ways but are not descended 
from a common ancestor. How might two unrelated proteins come to 
resemble each other structurally? Two proteins evolving independently 
may have converged on similar structural features to perform a similar 
biochemical activity. Perhaps that structure was an especially effective 
solution to a biochemical problem that organisms face. The process by which 
very different evolutionary pathways lead to the same solution is called con- 
vergent evolution. 

An example of convergent evolution is found among the serine prote- 
ases. These enzymes, to be considered in more detail in Chapter 9, cleave 
peptide bonds by hydrolysis. Figure 6.18 shows the structure of the active 
sites—that is, the sites on the proteins at which the hydrolysis reaction takes 
place—for two such enzymes, chymotrypsin and subtilisin. These active- 
site structures are remarkably similar. In each case, a serine residue, a histi- 
dine residue, and an aspartic acid residue are positioned in space in nearly 
identical arrangements. As we will see, this conserved spatial arrangement 
is critical for the activity of these enzymes and affords the same mechanistic 
solution to the problem of peptide hydrolysis. At first glance, this similarity 
might suggest that these proteins are homologous. However, striking differ- 
ences in the overall structures of these proteins make an evolutionary 
relationship extremely unlikely (Figure 6.19). Whereas chymotrypsin con- 
sists almost entirely of B sheets, subtilisin contains extensive a-helical 
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FIGURE 6.18 Convergent evolution of 
protease active sites. The relative positions 
of the three key residues shown are nearly 
identical in the active sites of the serine 
proteases chymotrypsin and subtilisin. 
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FIGURE 6.19 Structures of 
mammalian chymotrypsin and bacterial 
subtilisin. The overall structures are quite 
dissimilar, in stark contrast with the active 
sites, shown at the top of each structure. 
The B strands are shown in yellow and the 
a helices in blue. [Drawn from 1GCT.pdb. 
and 1SUP.pdb.] Chymotrypsin Subtilisin 


structure. Moreover, the key serine, histidine, and aspartic acid residues do 
not occupy similar positions or even appear in the same order within the two 
sequences. It is extremely unlikely that two proteins evolving from a com- 
mon ancestor could have retained similar active-site structures while other 
aspects of the structure changed so dramatically. 


Comparison of RNA sequences can be a source of insight into RNA 
secondary structures 


Homologous RNA sequences can be compared in a manner similar to 
that already described for protein sequences. Such comparisons can be a 
source of important insights into evolutionary relationships; in addition, 
they provide clues to the three-dimensional structure of the RNA itself. 
As noted in Chapter 4, single-stranded nucleic acid molecules fold back 
on themselves to form elaborate structures held together by Watson— 
Crick base-pairing and other interactions. In a family of sequences that 
form similar base-paired structures, base sequences may vary, but base- 
pairing ability is conserved. Consider, for example, a region from a large 
RNA molecule present in the ribosomes of all organisms (Figure 6.20). 
In the region shown, the E. coli sequence has a guanine (G) residue in 
position 9 and a cytosine (C) residue in position 22, whereas the human 
sequence has uracil (U) in position 9 and adenine (A) in position 22. 
Examination of the six sequences shown in Figure 6.20 reveals that the 


FIGURE 6.20 Comparison of RNA sequences. (A) A comparison of sequences in a part of U—A 
ribosomal RNA taken from a variety of species. (B) The implied secondary structure. Green hy, 
lines indicate positions at which Watson—Crick base-pairing is completely conserved in the (C -) 4 
sequences shown, whereas dots indicate positions at which Watson—Crick base-pairing is (C, G) 
conserved in most cases. 
NeN 
NeN 
A B 
(A) \9 |x (B) NeN 
Bacteria Escherichia coli CACACGGCGGCUGCUAACGUCCGUCGUGAA 9 NN 22 
Pseudomonas aeruginosa ACCACGGCGGGUGCUAACGUCCGUCGUGAA NeN 


ARCHAEA Halobacterium halobium = cCGGUGUGCGGGG-UAAGCCUGUGCACCGU 
Methanococcus vannielil GAGGGCAUACGGG-UAAGCUGUAUGUCCGA N—N 


NeN 
Homo sapiens GGGCCACUUUUGG-UAAGCAGAACUGGCEC 
EUKARYA = N—N 
Saccharomyces cerevisiae GGGCCAUUUUUGG-UAAGCAGAACUGGCGA 
N—N 
we Ss 
UN NO 


bases in positions 9 and 22, as well as several of the neighboring posi- 
tions, retain the ability to form Watson—Crick base pairs even though 
the identities of the bases in these positions vary. We can deduce that 
two segments with paired mutations that maintain base-pairing ability 
are likely to form a double helix. Where sequences are known for several 
homologous RNA molecules, this type of sequence analysis can often 
suggest complete secondary structures as well as some additional inter- 
actions. For this particular ribosomal RNA, the subsequent determina- 
tion of its three-dimensional structure (Section 30.3) confirmed the 
predicted secondary structure. 


6.4 Evolutionary Trees Can Be Constructed on the Basis 
of Sequence Information 


The observation that homology is often manifested as sequence similarity 
suggests that the evolutionary pathway relating the members of a family of 
proteins may be deduced by examination of sequence similarity. This 
approach is based on the notion that sequences that are more similar to one 
another have had less evolutionary time to diverge than have sequences 
that are less similar. This method can be illustrated by using the three glo- 
bin sequences in Figures 6.11 and 6.13, as well as the sequence for the 
human hemoglobin B chain. These sequences can be aligned with the addi- 
tional constraint that gaps, if present, should be at the same positions in all 
of the proteins. These aligned sequences can be used to construct an evolu- 
tionary tree in which the length of the branch connecting each pair of pro- 
teins is proportional to the number of amino acid differences between the 
sequences (Figure 6.21). 


Hemoglobin Hemoglobin 
Leghemoglobin Myoglobin a B 
Oo-c 
GF 200 - 
o 
ovo 
> 
S 400 
Ww 
c 
2 
= 600 FIGURE 6.21 An evolutionary tree 
= for globins. The branching structure 
E was deduced by sequence comparison, 
F 800 |— whereas the results of fossil studies 
provided the overall time scale showing 
when divergence occurred. 


Such comparisons reveal only the relative divergence times—for 
example, that myoglobin diverged from hemoglobin twice as long ago as 
the a chain diverged from the B chain. How can we estimate the approx- 
imate dates of gene duplications and other evolutionary events? 
Evolutionary trees can be calibrated by comparing the deduced branch 
points with divergence times determined from the fossil record. For 
example, the duplication leading to the two chains of hemoglobin 
appears to have occurred 350 million years ago. This estimate is sup- 
ported by the observation that jawless fish such as the lamprey, which 
diverged from bony fish approximately 400 million years ago, contain 
hemoglobin built from a single type of subunit (Figure 6.22). These 
methods can be applied to both relatively modern and very ancient 
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FIGURE 6.22 The lamprey. A jawless fish 
whose ancestors diverged from bony fish 
approximately 400 million years ago, the 
lamprey has hemoglobin molecules that 
contain only a single type of polypeptide 
chain. [Brent P. Kent.] 
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(A) (B) 
-— Eukaryotes 


| Archaea 


Bacillus halodurans 


© _>—— Escherichia coli 
Proteus pennen 
-—— Cyanidiales | |. Pseudomonas syringae 


(includes G. sulphuraria) '——. Pseudomonas putida 
Deinococcus geothermalis 3 
Galdieria sulphuraria ORF1_ > Bacteria 
Lie Galdieria sulphuraria ORF2 
Leptospirillum ferriphilum 
Thiomonas sp. 
——__ ; -— Acidithiobacillus ferrooxidans 
== Bacteria Roscomonas cervicalis 


= _>———__ Listeria welshimeri ; 
mee Fuka ryotes |________. Staphylococcus saprophyticus 
=— 


Archaea 


Methylobacterium extorquens 
Methylobacterium radiotolerans 
Psychrobacter arcticus 


FIGURE 6.23 Evidence of horizontal gene transfer. (A) The unicellular red alga Galaieria 
sulphuraria belongs to the order Cyanidiales, clearly within the eukaryotic branch of the 
evolutionary tree. (B) Within the completely sequenced G. sulphuraria genome, two ORFs 
encode proteins involved in transport of arsenate ions across membranes. Alignment of these 
ORFs against orthologs from a variety of species reveals that these pumps are most closely 
related to their bacterial counterparts, suggesting that a horizontal gene transfer event 
occurred during the evolution of this species. [(A) Information from Dr. Gerald Schénknecht; 

(B) Information from G. Schdénknecht et al. Science 339:1207-1210, 2018, Fig. 3.] 


molecules, such as the ribosomal RNAs that are found in all organisms. 
Indeed, such an RNA sequence analysis led to the realization that 
Archaea are a distinct group of organisms that diverged from Bacteria 
very early in evolutionary history. 


Horizontal gene transfer events may explain unexpected branches of 
the evolutionary tree 


Evolutionary trees that encompass orthologs of a particular protein across 
a range of species can lead to unexpected findings. For example, let us 
consider the unicellular red alga Galdieria sulphuraria, a remarkable 
eukaryote that can thrive in extreme environments, including at tempera- 
tures up to 56°C, at pH values between 0 and 4, and in the presence of 
high concentrations of toxic metals. G. sulphuraria belongs to the order 
Cyanidiales, clearly within the eukaryotic branch of the evolutionary tree 
(Figure 6.23A). However, the complete genome sequence of this organ- 
ism revealed that nearly 5% of the G. sulphuraria ORFs encode proteins 
that are more closely related to bacterial or archaeal, not eukaryotic, 
orthologs. Furthermore, the proteins that exhibited these unexpected 
evolutionary relationships possess functions that are likely to confer a 
survival advantage in extreme environments, such as the removal of metal 
ions from inside the cell (Figure 6.23B). One likely explanation for these 
observations is horizontal gene transfer, or the exchange of DNA between 
species that provides a selective advantage to the recipient. Amongst pro- 
karyotes, horizontal gene transfer is a well-characterized and important 
evolutionary mechanism. For example, as we shall discuss in Chapter 9, 
exchange of plasmid DNA between bacterial species likely facilitated the 
acquisition of restriction endonuclease activities. However, recent studies 
such as those on G. sulphuraria, made possible by the expansive growth of 
complete genome sequence information, suggest that horizontal gene 
transfer from prokaryotes to eukaryotes, between different domains of 
life, may also represent evolutionarily significant events. 


6.5 Modern Techniques Make the Experimental 
Exploration of Evolution Possible 


‘Two techniques of biochemistry have made it possible to examine the course 
of evolution more directly and not simply by inference. The polymerase 
chain reaction (Section 5.1) allows the direct examination of ancient DNA 
sequences, releasing us, at least in some cases, from the constraints of being 
able to examine existing genomes from living organisms only. Molecular 
evolution may be investigated through the use of combinatorial chemistry, the 
process of producing large populations of molecules en masse and selecting 
for a biochemical property. This exciting process provides a glimpse into the 
types of molecules that may have existed very early in evolution. 


Ancient DNA can sometimes be amplified and sequenced 


The tremendous chemical stability of DNA makes the molecule well suited 
to its role as the storage site of genetic information. So stable is the molecule 
that samples of DNA have survived for many thousands of years under 
appropriate conditions. With the development of PCR and advanced 
DNA-sequencing methods, such ancient DNA can be amplified and 
sequenced. This approach was first applied to mitochondrial DNA isolated 
from a Neanderthal fossil estimated at 38,000 years of age. Comparison of 
the complete Neanderthal mitochondrial sequence with those from Homo 
sapiens individuals revealed between 201 and 234 substitutions, consider- 
ably fewer than the approximately 1500 differences between human beings 
and chimpanzees over the same region. 

Remarkably, the complete genome sequences of a Neanderthal and a 
closely related hominin known as a Denisovan have been obtained using 
DNA isolated from nearly 50,000-year-old fossils. Comparison of these 
sequences suggests that the common ancestor of modern human beings and 
Neanderthals lived approximately 570,000 years ago, while the common 
ancestor between Neanderthals and Denisovans lived nearly 380,000 years 
ago. An evolutionary tree constructed from these data revealed that the 
Neanderthal was not an intermediate between chimpanzees and human 
beings but, instead, was an evolutionary “dead end” that became extinct 
(Figure 6.24). Further analysis of these sequences has enabled researchers to 
determine the extent of interbreeding between these groups, elucidate the 
geographic history of these populations, and make assertions about addi- 
tional ancestors whose DNA has not yet been sequenced. 

A few earlier studies claimed to determine the sequences of far more 
ancient DNA such as that found in insects trapped in amber, but these 
studies appear to have been flawed. The source of these sequences turned 
out to be contaminating modern DNA. Successful sequencing of ancient 
DNA requires sufficient DNA for reliable amplification and the rigorous 
exclusion of all sources of contamination. 


Molecular evolution can be examined experimentally 


Evolution requires three processes: (1) the generation of a diverse popula- 
tion, (2) the selection of members based on some criterion of fitness, and (3) 
reproduction to enrich the population in these more-fit members. Nucleic 
acid molecules are capable of undergoing all three processes in vitro under 
appropriate conditions. The results of such studies enable us to glimpse how 
evolutionary processes might have generated catalytic activities and specific 
binding abilities—important biochemical functions in all living systems. 

A diverse population of nucleic acid molecules can be synthesized in 
the laboratory by the process of combinatorial chemistry, which rapidly 
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FIGURE 6.24 Placing Neanderthals and 
Denisovans on an evolutionary tree. 
Comparison of DNA sequences revealed 
that neither Neanderthals nor the Denisovans 
are on the line of direct descent leading to 
Homo sapiens but, instead, branched off 
earlier and then became extinct. 
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FIGURE 6.25 Evolution in the 
laboratory. A collection of RNA molecules 
of random sequences is synthesized by 
combinatorial chemistry. This collection is 
selected for the ability to bind ATP by 
passing the RNA through an ATP affinity 
column (Section 3.1). The ATP-binding 
RNA molecules are released from the 
column by washing with excess ATP and 
then replicated. The process of selection 
and replication is then repeated several 
times. The final RNA products with 
significant ATP-binding ability are isolated 
and characterized. 


produces large populations of a particular type of molecule such as a nucleic 
acid. A population of molecules of a given size can be generated randomly 
so that many or all possible sequences are present in the mixture. When an 
initial population has been generated, it is subjected to a selection process 
that isolates specific molecules with desired binding or reactivity properties. 
Finally, molecules that have survived the selection process are replicated 
through the use of PCR; primers are directed toward specific sequences 
included at the ends of each member of the population. Errors that occur 
naturally in the course of the replication process introduce additional varia- 
tion into the population in each “generation.” Let us consider an application 
of this approach. Early in evolution, before the emergence of proteins, RNA 
molecules may have played all major roles in biological catalysis. To under- 
stand the properties of potential RNA catalysts, researchers have used the 
methods heretofore described to create an RNA molecule capable of bind- 
ing adenosine triphosphate and related nucleotides. An initial population of 
RNA molecules 169 nucleotides long was created; 120 of the positions dif- 
fered randomly, with equimolar mixtures of adenine, cytosine, guanine, and 
uracil. The initial synthetic pool that was used contained approximately 
10!4 RNA molecules. Note that this number is a very small fraction of the 
total possible pool of random 120-base sequences. From this pool, those 
molecules that bound to ATP, which had been immobilized on a column, 
were selected (Figure 6.25). 

The collection of molecules that were bound well by the ATP affinity 
column was replicated by reverse transcription into DNA, amplification by 
PCR, and transcription back into RNA. The somewhat error-prone reverse 
transcriptase may have introduced additional mutations into the population 
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in each cycle. The new population was subjected to additional rounds of 
selection for ATP-binding activity. After eight generations, members of the 
selected population were characterized by sequencing. Seventeen different 
sequences were obtained, 16 of which could form the structure shown in 
Figure 6.26. Each of these molecules bound ATP with dissociation con- 
stants less than 50 pM. 

The folded structure of the ATP-binding region from one of these RNAs 
was determined by nuclear magnetic resonance (NMR) methods (Section 
3.6). As expected, this 40-nucleotide molecule is composed of two Watson— 
Crick base-paired helical regions separated by an 11-nucleotide loop (Figure 
6.27A). This loop folds back on itself in an intricate way (Figure 6.27B) to 
form a deep pocket into which the adenine ring can fit (Figure 6.27C). Thus, 
a structure had evolved in vitro that was capable of a specific interaction. 


Synthetic oligonucleotides that can specifically bind ligands, such as 

the ATP-binding RNA molecules described above, are referred to 
as aptamers. In addition to their role in understanding molecular evolution, 
aptamers have shown promise as versatile tools for biotechnology and 
medicine. They have been developed for diagnostic applications, serving as 
sensors for ligands ranging from small organic molecules, such as cocaine, 
to larger proteins, such as thrombin. Several aptamers are also in clinical 
trials as therapies for diseases ranging from leukemia to diabetes. Macugen 
(pegaptanib sodium), an aptamer which binds to and inhibits the protein 
vascular endothelial growth factor (VEGF), has been approved for the 
treatment of age-related macular degeneration. 


(A) (B) 


ATP 
Loop 


| SUMMARY 


6.1 Homologs Are Descended from a Common Ancestor 
Exploring evolution biochemically often means searching for homol- 
ogy between molecules, because homologous molecules, or homologs, 
evolved from a common ancestor. Paralogs are homologous molecules 
that are found in one species and have acquired different functions 
through evolutionary time. Orthologs are homologous molecules that 
are found in different species and have similar or identical functions. 


6.2 Statistical Analysis of Sequence Alignments Can Detect Homology 
Protein and nucleic acid sequences are two of the primary languages of 
biochemistry. Sequence-alignment methods are the most powerful tools 
of the evolutionary detective. Sequences can be aligned to maximize their 
similarity, and the significance of these alignments can be judged by 
statistical tests. The detection of a statistically significant alignment 
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FIGURE 6.26 A conserved secondary 
structure. The secondary structure shown 
is common to RNA molecules selected for 
ATP binding. 
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FIGURE 6.27 An evolved ATP-binding 
RNA molecule. (A) The Watson-Crick 
base-pairing pattern of an RNA molecule 
selected to bind adenosine nucleotides. 
(B) The NMR structure of this RNA 
molecule reveals the deep pocket into 
which the ATP molecule is bound. 

(C) In this surface representation, the 
ATP molecule has been removed to 
enable visualization of the pocket. 
[Drawn from 1RAW.pdb.] 
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6.3 


6.4 


6.5 


between two sequences strongly suggests that two sequences are related 
by divergent evolution from a common ancestor. The use of substitution 
matrices makes the detection of more-distant evolutionary relationships 
possible. Any sequence can be used to probe sequence databases to iden- 
tify related sequences present in the same organism or in other organisms. 


Examination of Three-Dimensional Structure Enhances Our 
Understanding of Evolutionary Relationships 

The evolutionary kinship between proteins may be even more strikingly 
evident in the conserved three-dimensional structures. The analysis of 
three-dimensional structure in combination with the analysis of especially 
conserved sequences has made it possible to determine evolutionary rela- 
tionships that cannot be detected by other means. Sequence-comparison 
methods can also be used to detect imperfectly repeated sequences within 
a protein, indicative of linked similar domains. 


Evolutionary Trees Can Be Constructed on the Basis of 

Sequence Information 

Evolutionary trees can be constructed with the assumption that the num- 
ber of sequence differences corresponds to the time since the two sequences 
diverged. Construction of an evolutionary tree based on sequence com- 
parisons revealed approximate times for the gene-duplication events 
separating myoglobin and hemoglobin as well as the a and B subunits of 
hemoglobin. Evolutionary trees based on sequences can be compared with 
those based on fossil records. Horizontal gene transfer events can manifest 
as unexpected branches on the evolutionary tree. 


Modern Techniques Make the Experimental Exploration of 

Evolution Possible 

The exploration of evolution can also be a laboratory science. In favor- 
able cases, PCR amplification of well-preserved samples allows the 
determination of nucleotide sequences from extinct organisms. 
Sequences so determined can help authenticate parts of an evolutionary 
tree constructed by other means. Molecular evolutionary experiments 
performed in the test tube can examine how molecules such as ligand- 
binding RNA molecules might have been generated. 


homolog (p. 170) 
paralog (p. 170) 
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| PROBLEMS 


nonconservative substitution (p. 174) convergent evolution (p. 181) 
substitution matrix (p. 174) evolutionary tree (p. 183) 
BLAST search (p. 177) horizontal gene transfer (p. 184) 
sequence template (p. 180) combinatorial chemistry (p. 185) 
divergent evolution (p. 181) aptamers (p. 187) 


1. What’s the score? Using the identity-based scoring sys- 2. Sequence and structure. A comparison of the aligned 
tem (Section 6.2), calculate the score for the following align- amino acid sequences of two proteins each consisting of 150 
ment. Do you think the score is statistically significant? amino acids reveals them to be only 8% identical. However, 


(1) WYLGKITRMDAEVLLKKPTV. 
(2) WYFGKITRRESERLLLNPENP 


SLSVREGDSVO===+== HFKVLRI 
CLSVSDFDNAKGLNVKHYKIRK! 


SLNELVAYHRTASVSRTHTILLSDMNV 
DGLCHRLTNV 


SSLOOLVAYYSKHAI 


RDGHFLVTOCESSPGEF- their three-dimensional structures are very similar. Are 
RGTFLVRESETTKGAY- these two proteins related evolutionarily? Explain. 


DONGKYYLWAVK-FN- 3. It depends on how you count. Consider the following two 
LDSGGFYITSRTQFS- sequence alignments: 


(1) A-SNLFDIRLIG (2) ASNLFDIRLI-G 
GSNDFYEVKIMD GSNDFYEVKIMD 


Which alignment has a higher score if the identity-based 
scoring system (Section 6.2) is used? Which alignment has 
a higher score if the Blosum-62 substitution matrix 
(Figure 6.9) is used? 


4. Discovering a new base pair. Examine the ribosomal 
RNA sequences in Figure 6.20. In sequences that do not 
contain Watson—Crick base pairs, what base tends to be 
paired with G? Propose a structure for your new base pair. 


5. Overwhelmed by numbers. Suppose that you wish to syn- 
thesize a pool of RNA molecules that contain all four bases 
at each of 40 positions. How much RNA must you have in 
grams if the pool is to have at least a single molecule of each 
sequence? The average molecular weight of a nucleotide is 
330 g mol” !. 


6. Form follows function. The three-dimensional structure 
of biomolecules is more conserved evolutionarily than is 
sequence. Why? 


7. Shuffling. Using the identity-based scoring system 
(Section 6.2), calculate the alignment score for the align- 
ment of the following two short sequences: 


(1) ASNFLDKAGK 
(2) ATDYLEKAGK 


Generate a shuffled version of sequence 2 by randomly 
reordering these 10 amino acids. Align your shuffled 
sequence with sequence 1 without allowing gaps, and calcu- 
late the alignment score between sequence 1 and your 
shuffled sequence. 


8. Interpreting the score. Suppose that the sequences of two 
proteins each consisting of 200 amino acids are aligned and 
that the percentage of identical residues has been calcu- 
lated. How would you interpret each of the following results 
in regard to the possible divergence of the two proteins from 
a common ancestor? 


(a) 80% 
(b) 50% 
(c) 20% 
(d) 10%. 
9. Particularly unique. Consider the Blosum-62 matrix in 
Figure 6.9. Replacement of which three amino acids never 


yields a positive score? What features of these residues 
might contribute to this observation? 
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10. A set of three. The sequences of three proteins (A, B, 
and C) are compared with one another, yielding the follow- 
ing levels of identity: 


A B C 
A 100% 65% 15% 
65% 100% 55% 


C 15% 55% 100% 


Assume that the sequence matches are distributed uni- 
formly along each aligned sequence pair. Would you expect 
protein A and protein C to have similar three-dimensional 
structures? Explain. 


11. RNA alignment. Sequences of an RNA fragment from 
five species have been determined and aligned. Propose a 
likely secondary structure for these fragments. 


1) UUGGAGAUUCGGUAGAAUCUCCC 
2) GCCGGGAAUCGACAGAUUCCCCG 
3) CCCAAGUCCCGGCAGGGACUUAC 
4) CUCACCUGCCGAUAGGCAGGUCA 
(5) AAUACCACCCGGUAGGGUGGUUC 


wf. sm faa. pat 


12. The move the merrier. When RNA alignments are used to 
determine secondary structure, it is advantageous to have 
many sequences representing a wide variety of species. Why? 


13. To err is human. You have discovered a mutant form of a 
thermostable DNA polymerase with significantly reduced 
fidelity in adding the appropriate nucleotide to the growing 
DNA strand, compared with wild-type DNA polymerase. 
How might this mutant be useful in the molecular-evolution 
experiments described in Section 6.5? 


14. Generation to generation. When performing a 
molecular-evolution experiment, such as that described in 
Section 6.5, why is it important to repeat the selection and 
replication steps for several generations? 


15. BLAST away. Using the National Center for Bio- 
technology Information Web site (www.ncbi.nlm.nih.gov), 
find the sequence of the enzyme triose phosphate isomerase 
from E. coli strain K-12. Use this sequence as the query for 
a protein-protein BLAST search. In the output, find the 
alignment with the sequence of triose phosphate isomerase 
from human beings (Homo sapiens). How many identities 
are observed in the alignment? 
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he transition from anaerobic to aerobic life was a major step in evolution 

because it uncovered a rich reservoir of energy. Fifteen times as much 
energy is extracted from glucose in the presence of oxygen than in its absence. 
For single-celled and other small organisms, oxygen can be absorbed into 
actively metabolizing cells directly from the air or surrounding water. Vertebrates 
evolved two principal mechanisms for supplying their cells with an adequate 
supply of oxygen. The first is a circulatory system that actively delivers oxygen 
to cells throughout the body. The second is the use of the oxygen-transport and 
oxygen-storage proteins, hemoglobin and myoglobin. Hemoglobin, which is 
contained in red blood cells, is a fascinating protein, efficiently carrying oxygen 
from the lungs to the tissues while also contributing to the transport of carbon 
dioxide and hydrogen ions back to the lungs. Myoglobin, located in muscle, 
facilitates the diffusion of oxygen through the cell for the generation of cellular 
energy and provides a reserve supply of oxygen available in time of need. 

A comparison of myoglobin and hemoglobin illuminates some key aspects 
of protein structure and function. These two evolutionarily related proteins 
employ nearly identical structures for oxygen binding (Chapter 6). However, 
hemoglobin is a remarkably efficient oxygen carrier, able to use as much as 90% 
of its potential oxygen-carrying capacity effectively. Under similar conditions, 
myoglobin would be able to use only 7% of its potential capacity. What accounts 


Beta chain of hemoglobin 
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In the bloodstream, red cells carry oxygen from the lungs to the tissues, 
where demand is high. Hemoglobin, the protein that gives blood its red color, 
is responsible for the transport of oxygen via its four heme-bound subunits. 
Hemoglobin was one of the first proteins to have its structure determined; 
the folding of a single subunit is shown in this hand-drawn view. [Left, Andrew 


OUTLINE 


7.1. Myoglobin and Hemoglobin Bind 
Oxygen at lron Atoms in Heme 


7.2 Hemoglobin Binds Oxygen 
Cooperatively 


7.3 Hydrogen lons and Carbon Dioxide 
Promote the Release of Oxygen: 
The Bohr Effect 


7.4 Mutations in Genes Encoding 
Hemoglobin Subunits Can Result 
in Disease 


191 


192 
CHAPTER 7 ‘Hemoglobin: Portrait of a 
Protein in Action 


Myoglobin 


h FIGURE 7.1 Structure of 
myoglobin. Notice that myoglobin 
consists of a single polypeptide chain, 
formed of a helices connected by 
turns, with one oxygen-binding 
site. [Drawn from 1MBD.pdb.] 


for this dramatic difference? Myoglobin exists as a single polypeptide, whereas 
hemoglobin comprises four polypeptide chains. The four chains in hemoglobin 
bind oxygen cooperatively, meaning that the binding of oxygen to a site in one 
chain increases the likelihood that the remaining chains will bind oxygen. 
Furthermore, the oxygen-binding properties of hemoglobin are modulated by 
the binding of hydrogen ions and carbon dioxide in a manner that enhances 
oxygen-carrying capacity. Both cooperativity and the response to modulators 
are made possible by variations in the quaternary structure of hemoglobin 
when different combinations of molecules are bound. 

Hemoglobin and myoglobin have played important roles in the history 
of biochemistry. They were the first proteins for which three-dimensional 
structures were determined by x-ray crystallography. Furthermore, the pos- 
sibility that variations in protein sequence could lead to disease was first 
proposed and demonstrated for sickle-cell anemia, a blood disease caused 
by mutation of a single amino acid in one hemoglobin chain. Hemoglobin 
has been and continues to be a valuable source of knowledge and insight, 
both in itself and as a prototype for many other proteins that we will 
encounter throughout our study of biochemistry. 


7.1 Myoglobin and Hemoglobin Bind Oxygen at Iron 
Atoms in Heme 


Sperm whale myoglobin was the first protein for which the three-dimensional 
structure was determined. X-ray crystallographic studies pioneered by John 
Kendrew revealed the structure of this protein in the 1950s (Figure 7.1). 
Myoglobin consists largely of a helices that are linked to one another by 
turns to form a globular structure. 

Myoglobin can exist in an oxygen-free form called deoxymyoglobin or in a 
form with an oxygen molecule bound called oxymyoglobin. The ability of myo- 
globin and hemoglobin to bind oxygen depends on the presence of a heme 
molecule. As we shall discuss in Chapter 9, heme is one example of a prosthetic 
group, a molecule that binds tightly to a protein and is essential for its function. 


0 Oy, 20 


Propionate 


Vinyl gro’ 
mee ly! group 
Heme 
(Fe-protoporphyrin IX) 


The heme group gives muscle and blood their distinctive red color. It consists 
of an organic component and a central iron atom. The organic component, 
called protoporphyrin, is made up of four pyrrole rings linked by methine 
bridges to form a tetrapyrrole ring. Four methyl groups, two vinyl groups, 
and two propionate side chains are attached to the central tetrapyrrole. 


In deoxymyoglobin In oxymyoglobin 


The iron atom lies in the center of the protoporphyrin, bonded to the 
four pyrrole nitrogen atoms. Although the heme-bound iron can be in 
either the ferrous (Fe?*) or ferric (Fe**) oxidation state, only the Fe?* state 
is capable of binding oxygen. The iron ion can form two additional bonds, 
one on each side of the heme plane. These binding sites are called the fifth 
and sixth coordination sites. In myoglobin, the fifth coordination site is 
occupied by the imidazole ring of a histidine residue from the protein. This 
histidine is referred to as the proximal histidine. 

Oxygen binding occurs at the sixth coordination site. In deoxymyo- 
globin, this site remains unoccupied. The iron ion is slightly too large to 
fit into the well-defined hole within the porphyrin ring; it lies approx1- 
mately 0.4 A outside the porphyrin plane (Figure 7.2, left). Binding of the 
oxygen molecule at the sixth coordination site substantially rearranges the 
electrons within the iron so that the ion becomes effectively smaller, 
allowing it to move within the plane of the porphyrin (Figure 7.2, right). 
Remarkably, the structural changes that take place on oxygen binding 
were predicted by Linus Pauling, on the basis of magnetic measurements 
in 1936, nearly 25 years before the three-dimensional structures of myo- 
globin and hemoglobin were elucidated. 


Changes in heme electronic structure upon oxygen binding are the 
basis for functional imaging studies 


es The change in electronic structure that occurs when the iron ion 
moves into the plane of the porphyrin is paralleled by alterations in 
the magnetic properties of hemoglobin; these changes are the basis for func- 
tional magnetic resonance imaging ((MRI), one of the most powerful meth- 
ods for examining brain function. Nuclear magnetic resonance techniques 
detect signals that originate primarily from the protons in water molecules 
and are altered by the magnetic properties of hemoglobin. With the use of 
appropriate techniques, images can be generated that reveal differences in 
the relative amounts of deoxy- and oxyhemoglobin and thus the relative 
activity of various parts of the brain. When a specific part of the brain is 
active, blood vessels relax to allow more blood flow to that region. Thus, a 
more-active region of the brain will be richer in oxyhemoglobin. 

These noninvasive methods identify areas of the brain that process sen- 
sory information. For example, subjects have been imaged while breathing air 
that either does or does not contain odorants. When odorants are present, 
fMRI detects an increase in the level of hemoglobin oxygenation (and, hence, 


FIGURE 7.2 Oxygen binding changes 


the position of the iron ion. The iron ion 


lies slightly outside the plane of the 


porphyrin in deoxymyoglobin heme (left), 
but moves into the plane of the heme on 


oxygenation (right). 
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FIGURE 7.4 lron—-oxygen bonding. The 
interaction between iron and oxygen in 
myoglobin can be described as a 
combination of resonance structures, one 
with Fe2* and dioxygen and another with 
Fe’*+ and superoxide ion. 


FIGURE 7.5 Stabilizing bound oxygen. A 
hydrogen bond (dotted green line) donated 
by the distal histidine residue to the bound 
oxygen molecule helps stabilize 
oxymyoglobin. 


FIGURE 7.3 Functional magnetic 
resonance imaging of the brain. A functional 
magnetic resonance image reveals brain 
response to odorants. The light spots indicate 
regions of the brain activated by odorants. 
[From Osterbauer, R. et al., “Color of Scents: 
Chromatic Stimuli Modulate Odor Responses in 
the Human Brain.” Journal of Neurophysiology 
93 (2005) 3434-3441 |] 


of activity) in several regions of the brain (Figure 7.3). These regions are in the 
primary olfactory cortex, as well as in areas in which secondary processing of 
olfactory signals presumably takes place. Further analysis reveals the time 
course of activation of particular regions. Functional MRI shows tremendous 
potential for mapping regions and pathways engaged in processing sensory 
information obtained from all the senses. A seemingly incidental aspect of the 
biochemistry of hemoglobin has enabled observation of the brain in action. 


The structure of myoglobin prevents the release of 
reactive oxygen species 


Oxygen binding to iron in heme is accompanied by the partial transfer of an 
electron from the ferrous ion to oxygen. In many ways, the structure is best 
described as a complex between ferric ion (Fe**) and superoxide anion (O7), 
as illustrated in Figure 7.4. It is crucial that oxygen, when it is released, 
leaves as dioxygen rather than superoxide, for two important reasons. First, 
superoxide and other species generated from it are reactive oxygen species that 
can be damaging to many biological materials. Second, release of superoxide 
would leave the iron ion in the ferric state. This species, termed metmyoglo- 
bin, does not bind oxygen. Thus, potential oxygen-storage capacity is lost. 
Features of myoglobin stabilize the oxygen complex such that superoxide is 
less likely to be released. In particular, the binding pocket of myoglobin 
includes an additional histidine residue (termed the distal histidine) that 
donates a hydrogen bond to the bound oxygen molecule (Figure 7.5). The 
superoxide character of the bound oxygen species strengthens this interac- 
tion. Thus, the protein component of myoglobin controls the intrinsic reactivity 


Distal histidine 


of heme, making it more suitable for reversible oxygen binding. The distal 
histidine may also impair access of carbon monoxide to the heme, which 
binds tightly to the heme iron with dire consequences (p. 201). 


Human hemoglobin is an assembly of four myoglobin-like subunits 


The three-dimensional structure of hemoglobin from horse heart was solved 
by Max Perutz shortly after the determination of the myoglobin structure. 
Since then, the structures of hemoglobins from other species including 
humans have been determined. Hemoglobin consists of four polypeptide 
chains, two identical a chains and two identical B chains (Figure 7.6). Each 
of the subunits consists of a set of a helices in the same arrangement as the a 
helices in myoglobin (see Figure 6.15 for a comparison of the structures). 
The recurring structure is called a globin fold. Consistent with this structural 
similarity, alignment of the amino acid sequences of the a and £ chains of 
human hemoglobin with those of sperm whale myoglobin yields 25% and 
24% identity, respectively, and good conservation of key residues such as the 
proximal and distal histidines. Thus, the a and £ chains are related to each 
other and to myoglobin by divergent evolution (Section 6.2). 


hk FIGURE 7.6 Quaternary structure of deoxyhemoglobin. Hemoglobin, which is 
composed of two a chains and two 8 chains, functions as a pair of aB dimers. (A) A ribbon 
diagram. (B) A space-filling model. [Drawn from 1A3N.pdb.] 


The hemoglobin tetramer, referred to as hemoglobin A (HbA), is best 
described as a pair of identical aB dimers (a8, and a,8,) that associate to 
form the tetramer. In deoxyhemoglobin, these aB dimers are linked by an 
extensive interface, which includes the carboxyl terminus of each chain. The 
heme groups are well separated in the tetramer by iron—iron distances rang- 
ing from 24 to 40 A. 


7.2 Hemoglobin Binds Oxygen Cooperatively 


We can determine the oxygen-binding properties of each of these proteins 
by observing its oxygen-binding curve, a plot of the fractional saturation ver- 
sus the concentration of oxygen. The fractional saturation, Y, is defined as 
the fraction of possible binding sites that contain bound oxygen. The value 
of Y can range from 0 (all sites empty) to 1 (all sites filled). The concentra- 
tion of oxygen is most conveniently measured by its partial pressure, pO). 
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Torr 

A unit of pressure equal to that exerted by a 
column of mercury 1 mm high at O°C and 
standard gravity (1 mm Hg). Named after 
Evangelista Torricelli (1608-1647), inventor of 
the mercury barometer. 
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FIGURE 7.8 Oxygen binding by 
hemoglobin. This curve, obtained for 
hemoglobin in red blood cells, is shaped 
somewhat like an “S,” indicating that 
distinct, but interacting, oxygen-binding sites 
are present in each hemoglobin molecule. 
Half-saturation for hemoglobin is 26 torr. For 
comparison, the binding curve for myoglobin 
is shown as a dashed black curve. 


i 

2 

g 

=) 

5 

x 

Cc 

2 

G 

2c 
FIGURE 7.7 Oxygen binding by > Ps9 = 2 torr 
myoglobin. Half the myoglobin molecules 0.0 6 a "7 A | - 
have bound oxygen when the oxygen partial 

pO, (torr) 


pressure is 2 torr. 


For myoglobin, a binding curve indicating a simple chemical equilibrium is 
observed (Figure 7.7). Notice that the curve rises sharply as pO, increases 
and then levels off. Half-saturation of the binding sites, referred to as P., 
(for 50% saturated), is at the relatively low value of 2 torr (mm Hg), indicat- 
ing that oxygen binds with high affinity to myoglobin. 

In contrast, the oxygen-binding curve for hemoglobin in red blood cells 
shows some remarkable features (Figure 7.8). It does not look like a simple 
binding curve such as that for myoglobin; instead, it resembles an “S.”’ Such 
curves are referred to as sigmoid because of their S-like shape. In addition, 
oxygen binding for hemoglobin (P., = 26 torr) 1s significantly weaker than 
that for myoglobin. Note that this binding curve 1s derived from hemoglobin 
in red blood cells. 

A sigmoid binding curve indicates that a protein exhibits a special 
binding behavior. For hemoglobin, this shape suggests that the binding of 
oxygen at one site within the hemoglobin tetramer increases the likeli- 
hood that oxygen binds at the remaining unoccupied sites. Conversely, 
the unloading of oxygen at one heme facilitates the unloading of oxygen 
at the others. This sort of binding behavior 1s referred to as cooperative, 
because the binding reactions at individual sites in each hemoglobin mol- 
ecule are not independent of one another. We will return to the mecha- 
nism of this cooperativity shortly. 

What is the physiological significance of the cooperative binding of 
oxygen by hemoglobin? Oxygen must be transported in the blood from 
the lungs, where the partial pressure of oxygen is relatively high (approx- 
imately 100 torr), to the actively metabolizing tissues, where the partial 
pressure of oxygen is much lower (typically, 20 torr). Let us consider how 
the cooperative behavior indicated by the sigmoid curve leads to efficient 
oxygen transport (Figure 7.9). In the lungs, hemoglobin becomes nearly 
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saturated with oxygen such that 98% of the oxygen-binding sites are occu- 
pied. When hemoglobin moves to the tissues and releases O,,, the satura- 
tion level drops to 32%. Thus, a total of 98 — 32 = 66% of the potential 
oxygen-binding sites contribute to oxygen transport. The cooperative 
release of oxygen favors a more-complete unloading of oxygen in the tis- 
sues. If myoglobin were employed for oxygen transport, it would be 98% 
saturated in the lungs, but would remain 91% saturated in the tissues, and 
so only 98 — 91 = 7% of the sites would contribute to oxygen transport; 
myoglobin binds oxygen too tightly to be useful in oxygen transport. 
Nature might have solved this problem by weakening the affinity of myo- 
globin for oxygen to maximize the difference in saturation between 20 and 
100 torr. However, for such a protein, the most oxygen that could be 
transported from a region in which pO, 1s 100 torr to one in which it is 
20 torr is 63 — 25 = 38%, as indicated by the blue curve in Figure 7.9. 
Thus, the cooperative binding and release of oxygen by hemoglobin 
enables it to deliver nearly 10 times as much oxygen as could be delivered 
by myoglobin and more than 1.7 times as much as could be delivered by 
any noncooperative protein. 

Closer examination of oxygen concentrations in tissues at rest and dur- 
ing exercise underscores the effectiveness of hemoglobin as an oxygen car- 
rier (Figure 7.10). Under resting conditions, the oxygen concentration in 
muscle is approximately 40 torr, but during exercise the concentration is 
reduced to 20 torr. In the decrease from 100 torr in the lungs to 40 torr in 
resting muscle, the oxygen saturation of hemoglobin is reduced from 98% to 
77%, and so 98 — 77 = 21% of the oxygen is released over a drop of 60 torr. 
In a decrease from 40 torr to 20 torr, the oxygen saturation is reduced from 
77% to 32%, corresponding to an oxygen release of 45% over a drop of 20 
torr. Thus, because the change in oxygen concentration from rest to exercise 
corresponds to the steepest part of the oxygen-binding curve, oxygen is 
effectively delivered to tissues where it is most needed. In Section 7.3, we 
shall examine other properties of hemoglobin that enhance its physiological 
responsiveness. 
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Oxygen binding markedly changes the quaternary 
structure of hemoglobin 


The cooperative binding of oxygen by hemoglobin requires that the binding 
of oxygen at one site in the hemoglobin tetramer influence the oxygen- 
binding properties at the other sites. Given the large separation between the 
iron sites, direct interactions are not possible. Thus, indirect mechanisms 
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for coupling the sites must be at work. These mechanisms are intimately 
related to the quaternary structure of hemoglobin. 

Hemoglobin undergoes substantial changes in quaternary structure on 
oxygen binding: the a, 8, and a8, dimers rotate approximately 15 degrees 
with respect to one another (Figure 7.11). The dimers themselves are rela- 
tively unchanged, although there are localized conformational shifts. Thus, 
the interface between the a,B, and a8, dimers is most affected by this 
structural transition. In particular, the a,8, and a,f, dimers are freer to 
move with respect to one another in the oxygenated state than they are in the 
deoxygenated state. 
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hk FIGURE 7.11 Quaternary structural changes on oxygen binding by hemoglobin. 
Notice that, on oxygenation, one af dimer shifts with respect to the other by a rotation of 
15 degrees. [Drawn from 1A3N.pdb and 1LFQ.pdb.] 


The quaternary structure observed in the deoxy form of hemoglobin, 
deoxyhemoglobin, is often referred to as the T (for tense) state because it is 
quite constrained by subunit—subunit interactions. The quaternary struc- 
ture of the fully oxygenated form, oxyhemoglobin, is referred to as the R (for 
relaxed) state. In light of the observation that the R form of hemoglobin is 
less constrained, the tense and relaxed designations seem particularly apt. 
Importantly, in the R state, the oxygen-binding sites are free of strain and 
are capable of binding oxygen with higher affinity than are the sites in the 
T state. By triggering the shift of the hemoglobin tetramer from the T state to 
the R state, the binding of oxygen to one site increases the binding affinity of 
other sites. 


Hemoglobin cooperativity can be potentially explained 
by several models 


Two limiting models have been developed to explain the cooperative bind- 
ing of ligands to a multisubunit assembly such as hemoglobin. In the con- 
certed model, also known as the MWC model after Jacques Monod, Jeffries 
Wyman, and Jean-Pierre Changeux, who first proposed it, the overall assem- 
bly can exist only in two forms: the T state and the R state. The binding of 
ligands simply shifts the equilibrium between these two states (Figure 7.12). 
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Thus, as a hemoglobin tetramer binds each oxygen molecule, the probability 
that the tetramer is in the R state increases. Deoxyhemoglobin tetramers are 
almost exclusively in the T state. However, the binding of oxygen to one site 
in the molecule shifts the equilibrium toward the R state. If a molecule 
assumes the R quaternary structure, the oxygen affinity of its sites increases. 
Additional oxygen molecules are now more likely to bind to the three unoc- 
cupied sites. Thus, the binding curve for hemoglobin can be seen as a com- 
bination of the binding curves that would be observed if all molecules 
remained in the T state or if all of the molecules were in the R state. As 
oxygen molecules bind, the hemoglobin tetramers convert from the T state 
into the R state, yielding the sigmoid binding curve so important for effi- 
cient oxygen transport (Figure 7.13). 

In the concerted model, each tetramer can exist in only two states, the T 
state and the R state. In an alternative model, the sequential model, the bind- 
ing of a ligand to one site in an assembly increases the binding affinity of 
neighboring sites without inducing a full conversion from the T into the R 
state (Figure 7.14). 


FIGURE 7.14 Sequential model. The binding of a ligand changes the conformation of the 
subunit to which it binds. This conformational change induces changes in neighboring 
subunits that increase their affinity for the ligand. 


Is the cooperative binding of oxygen by hemoglobin better described 
by the concerted or the sequential model? Neither model in its pure form 
fully accounts for the behavior of hemoglobin. Instead, a combined 
model is required. Hemoglobin behavior is concerted in that the tetramer 
with three sites occupied by oxygen is almost always in the quaternary 
structure associated with the R state. The remaining open binding site 
has an affinity for oxygen more than 20-fold greater than that of fully 
deoxygenated hemoglobin binding its first oxygen. However, the behav- 
ior is not fully concerted, because hemoglobin with oxygen bound to only 
one of four sites remains primarily in the T-state quaternary structure. 
Yet, this molecule binds oxygen three times as strongly as does fully 
deoxygenated hemoglobin, an observation consistent only with a sequen- 
tial model. These results highlight the fact that the concerted and sequen- 
tial models represent idealized limiting cases, which real systems may 
approach but rarely attain. 
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FIGURE 7.12 Concerted model. All 
molecules exist either in the T state or in 
the R state. At each level of oxygen 
loading, an equilibrium exists between the 
T and R states. The equilibrium shifts from 
strongly favoring the T state with no oxygen 
bound to strongly favoring the R state 
when the molecule is fully loaded with 
oxygen. The R state has a greater affinity 
for oxygen than does the T state. 
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FIGURE 7.13 T-to-R transition. The 
binding curve for hemoglobin can be seen 
as a combination of the binding curves that 
would be observed if all molecules remained 
in the T state or if all of the molecules were 
in the R state. The sigmoidal curve is 
observed because molecules convert from 
the T state into the R state as oxygen 
molecules bind. 
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FIGURE 7.15 Conformational changes in 
hemoglobin. The movement of the iron ion 
on oxygenation brings the iron-associated 
histidine residue toward the porphyrin ring. 
The associated movement of the histidine- 
containing « helix alters the interface 
between the a8 dimers, instigating other 
structural changes. For comparison, the 
deoxyhemoglobin structure is shown in gray 
behind the oxyhemoglobin structure in red. 
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FIGURE 7.16 Oxygen binding by pure 
hemoglobin compared with hemoglobin 
in red blood cells. Pure hemoglobin binds 
oxygen more tightly than does hemoglobin 
in red blood cells. This difference is due to 
the presence of 2,3-bisphosphoglycerate 
(2,3-BPG) in red blood cells. 


Structural changes at the heme groups are transmitted to 
the a,8,—a,B, interface 


We now examine how oxygen binding at one site is able to shift the equi- 
librium between the T and R states of the entire hemoglobin tetramer. 
As in myoglobin, oxygen binding causes each iron atom in hemoglobin 
to move from outside the plane of the porphyrin into the plane. When 
the iron atom moves, the proximal histidine residue moves with it. This 
histidine residue is part of an a helix, which also moves (Figure 7.15). 
The carboxyl terminal end of this a helix lies in the interface between 
the two aB dimers. The change in position of the carboxyl terminal end 
of the helix favors the T-to-R transition. Consequently, the structural 
transition at the iron ion in one subunit is directly transmitted to the other 
subunits. The rearrangement of the dimer interface provides a pathway 
for communication between subunits, enabling the cooperative binding 
of oxygen. 


2,3-Bisphosphoglycerate in red cells is crucial in determining the 
oxygen affinity of hemoglobin 


For hemoglobin to function efficiently, the T state must remain stable until 
the binding of sufficient oxygen has converted it into the R state. In fact, 
however, the T state of hemoglobin is highly unstable, pushing the equilib- 
rium so far toward the R state that little oxygen would be released in physi- 
ological conditions. Thus, an additional mechanism is needed to properly 
stabilize the T state. This mechanism was discovered by comparing the 
oxygen-binding properties of hemoglobin in red blood cells with fully puri- 
fied hemoglobin (Figure 7.16). Pure hemoglobin binds oxygen much more 
tightly than does hemoglobin in red blood cells. This dramatic difference is 
due to the presence within these cells of 2, 3-bisphosphoglycerate (2,3-BPG; 
also known as 2,3-diphosphoglycerate or 2,3-DPG). 


= 30 p= 
O=.¢” H ie} O 
a s Ww 
Orne 0 So 
2 /\ 
00 


2,3-Bisphosphoglycerate 
(2,3-BPG) 


This highly anionic compound is present in red blood cells at approximately 
the same concentration as that of hemoglobin (~2 mM). Without 2,3-BPG, 
hemoglobin would be an extremely inefficient oxygen transporter, releasing 
only 8% of its cargo in the tissues. 

How does 2,3-BPG lower the oxygen affinity of hemoglobin so signifi- 
cantly? Examination of the crystal structure of deoxyhemoglobin in the 
presence of 2,3-BPG reveals that a single molecule of 2,3-BPG binds in 
the center of the tetramer, ina pocket present only inthe T form (Figure 7.17). 
On T-to-R transition, this pocket collapses and 2,3-BPG is released. Thus, 
in order for the structural transition from T to R to take place, the bonds 
between hemoglobin and 2,3-BPG must be broken. In the presence of 2,3- 
BPG, more oxygen-binding sites within the hemoglobin tetramer must 
be occupied in order to induce the T-to-R transition, and so hemoglobin 
remains in the lower-affinity T state until higher oxygen concentrations are 
reached. This mechanism of regulation is remarkable because 2,3-BPG does 
not in any way resemble oxygen, the molecule on which hemoglobin carries 
out its primary function. 2,3-BPG is referred to as an allosteric effector (from 


; N ) 
B, subunit o® ry 


the Greek allos, “other,” and stereos, “structure’’). Regulation by a molecule 
structurally unrelated to oxygen is possible because the allosteric effector 
binds to a site that is completely distinct from that for oxygen. We will 
encounter allosteric effects again when we consider enzyme regulation in 
Chapter 10. 


cs The binding of 2,3-BPG to hemoglobin has other crucial physiolog- 

ical consequences. The globin gene expressed by human fetuses 
differs from that expressed by adults; fetal hemoglobin tetramers include 
two a chains and two y chains. The y chain, a result of gene duplication, is 
72% identical in amino acid sequence with the B chain. One noteworthy 
change is the substitution of a serine residue for His 143 in the B chain, 
part of the 2,3-BPG-binding site. This change removes two positive 
charges from the 2,3-BPG-binding site (one from each chain) and reduces 
the affinity of 2,3-BPG for fetal hemoglobin. Consequently, the oxygen- 
binding affinity of fetal hemoglobin is higher than that of maternal (adult) 
hemoglobin (Figure 7.18). This difference in oxygen affinity allows oxy- 
gen to be effectively transferred from maternal to fetal red blood cells. We 
have here an example in which gene duplication and specialization pro- 
duced a ready solution to a biological challenge—ain this case, the transport 
of oxygen from mother to fetus. 


Carbon monoxide can disrupt oxygen transport by hemoglobin 


Carbon monoxide (CQO) is a colorless, odorless gas that binds to 
hemoglobin at the same site as oxygen, forming a complex termed 
carboxyhemoglobin. Formation of carboxyhemoglobin exerts devastating 
effects on normal oxygen transport in two ways. First, carbon monoxide 
binds to hemoglobin about 200-fold more tightly than does oxygen. 
Even at low partial pressures in the blood, carbon monoxide will displace 
oxygen from hemoglobin, preventing its delivery. Second, carbon mon- 
oxide bound to one site in hemoglobin will shift the oxygen saturation 
curve of the remaining sites to the left, forcing the tetramer into the 
R state. This results in an increased affinity for oxygen, preventing its 
dissociation at tissues. 
Exposure to carbon monoxide—from gas appliances and running auto- 
mobiles, for example—can cause carbon monoxide poisoning, in which 
patients exhibit nausea, vomiting, lethargy, weakness, and disorientation. 
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FIGURE 7.18 Oxygen affinity of fetal red 
blood cells. Fetal red blood cells have a 
higher oxygen affinity than do maternal red 
blood cells because fetal hemoglobin does 
not bind 2,3-BPG as well as maternal 
hemoglobin does. 
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One treatment for carbon monoxide poisoning is administration of 100% 
oxygen, often at pressures greater than atmospheric pressure (this treatment 
is referred to as hyperbaric oxygen therapy). With this therapy, the partial 
pressure of oxygen in the blood becomes sufficiently high to increase sub- 
stantially the displacement of carbon monoxide from hemoglobin. Exposure 
to high concentrations of carbon monoxide, however, can be rapidly fatal: in 
the United States, about 2,500 people die each year from carbon monoxide 
poisoning, about 500 of them from accidental exposures and nearly 2,000 
by suicide. 


7.3 Hydrogen lons and Carbon Dioxide Promote the 
Release of Oxygen: The Bohr Effect 


We have seen how cooperative release of oxygen from hemoglobin helps 
deliver oxygen to where it is most needed: tissues exhibiting low oxygen 
partial pressures. This ability is enhanced by the facility of hemoglobin to 
respond to other cues in its physiological environment that signal the need 
for oxygen. Rapidly metabolizing tissues, such as contracting muscle, 
generate large amounts of hydrogen ions and carbon dioxide (Chapter 16). 
To release oxygen where the need is greatest, hemoglobin has evolved to 
respond to higher levels of these substances. Like 2,3-BPG, hydrogen 
ions and carbon dioxide are allosteric effectors of hemoglobin that bind to 
sites on the molecule that are distinct from the oxygen-binding sites. The 
regulation of oxygen binding by hydrogen ions and carbon dioxide is 
called the Bohr effect after Christian Bohr, who described this phenomenon 
in 1904. 

The oxygen affinity of hemoglobin decreases as pH decreases from a 
value of 7.4 (Figure 7.19). Consequently, as hemoglobin moves into a region 
of lower pH, its tendency to release oxygen increases. For example, transport 
from the lungs, with pH 7.4 and an oxygen partial pressure of 100 torr, to 
active muscle, with a pH of 7.2 and an oxygen partial pressure of 20 torr, 
results in a release of oxygen amounting to 77% of total carrying capacity. 
Only 66% of the oxygen would be released in the absence of any change in 
pH. Structural and chemical studies have revealed much about the chemical 
basis of the Bohr effect. Several chemical groups within the hemoglobin 
tetramer are important for sensing changes in pH; all of these have pK, 
values near pH 7. Consider histidine B146, the residue at the C terminus of 
the B chain. In deoxyhemoglobin, the terminal carboxylate group of B146 
forms a ionic bond, also called a salt bridge, with a lysine residue in the a 
subunit of the other aB dimer. This interaction locks the side chain of histi- 
dine 8146 in a position from which it can participate in a salt bridge with 
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FIGURE 7.19 Effect of pH on the oxygen 

affinity of hemoglobin. Lowering the pH from 
7A (red curve) to 7.2 (blue curve) results in the 
release of O, from oxyhemoglobin. pO, (torr) 
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FIGURE 7.20 Chemical basis of the Bohr 
effect. In deoxyhemoglobin, three amino acid 
residues form two salt bridges that stabilize 
+ the T quaternary structure. The formation of 
C terminus one of the salt bridges depends on the 
presence of an added proton on histidine 
Added 8146. The proximity of the negative charge 
= on aspartate B94 in deoxyhemoglobin favors 
/ protonation of this histidine. Notice that the 
salt bridge between histidine B146 and 
B, His 146 ~ B, Asp 94 aspartate B94 is stabilized by a hydrogen 
bond (green dashed line). 


negatively charged aspartate B94 in the same chain, provided that the imid- 
azole group of the histidine residue is protonated (Figure 7.20). 

In addition to His B146, the a-amino groups at the amino termini of the 
a chain and the side chain of histidine 122 also participate in salt bridges in 
the T state. The formation of these salt bridges stabilizes the T state, leading to 
a greater tendency for oxygen to be released. For example, at high pH, the side 
chain of histidine B146 is not protonated and the salt bridge does not form. 
As the pH drops, however, the side chain of histidine B146 becomes proton- 
ated, the salt bridge with aspartate B94 forms, and the T state is stabilized. 

Carbon dioxide, a neutral species, passes through the red-blood-cell 
membrane into the cell. This transport is also facilitated by membrane 
transporters, including proteins associated with Rh blood types. Carbon 
dioxide stimulates oxygen release by two mechanisms. First, the presence 
of high concentrations of carbon dioxide leads to a drop in pH within the 
red blood cell (Figure 7.21). Carbon dioxide reacts with water to form car- 
bonic acid, H,CO,. This reaction is accelerated by carbonic anhydrase, an 
enzyme abundant in red blood cells that will be considered extensively in 
Chapter 9. H,CO, is a moderately strong acid with a pK, of 3.5. Thus, 
once formed, carbonic acid dissociates to form bicarbonate ion, HCO7, 
and H*, resulting in a drop in pH that stabilizes the T state by the mecha- 
nism discussed previously. 


Body tissue Blood capillary 


FIGURE 7.21 Carbon dioxide and pH. Carbon dioxide in the tissues diffuses into red blood 
cells. Inside a red blood cell, carbon dioxide reacts with water to form carbonic acid, in a 
reaction catalyzed by the enzyme carbonic anhydrase. Carbonic acid dissociates to form 
HCO; and H*, resulting in a drop in pH inside the red cell. 
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— PH 7.4, no CO, 
— PH 7.2, no CO, 
— PH 7.2, 40 torr CO 
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FIGURE 7.22 Carbon dioxide 

effects. The presence of carbon dioxide 
decreases the affinity of hemoglobin for 
oxygen even beyond the effect due to a 
decrease in pH, resulting in even more 
efficient oxygen transport from the tissues 
to the lungs. 


FIGURE 7.23 Transport of CO, from 
tissues to lungs. Most carbon dioxide is 
transported to the lungs in the form of 
HCO3 produced in red blood cells and then 
released into the blood plasma. A lesser 
amount is transported by hemoglobin in the 
form of an attached carbamate. 


In the second mechanism, a direct chemical interaction between carbon 
dioxide and hemoglobin stimulates oxygen release. The effect of carbon 
dioxide on oxygen affinity can be seen by comparing oxygen-binding 
curves in the absence and in the presence of carbon dioxide at a constant pH 
(Figure 7.22). In the presence of carbon dioxide at a partial pressure of 40 
torr at pH 7.2, the amount of oxygen released approaches 90% of the maxi- 
mum carrying capacity. Carbon dioxide stabilizes deoxyhemoglobin by 
reacting with the terminal amino groups to form carbamate groups, which 
are negatively charged, in contrast with the neutral or positive charges on 
the free amino groups. 
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The amino termini lie at the interface between the af dimers, and these 
negatively charged carbamate groups participate in salt-bridge interactions 
that stabilize the T state, favoring the release of oxygen. 

Carbamate formation also provides a mechanism for carbon dioxide 
transport from tissues to the lungs, but it accounts for only about 14% of the 
total carbon dioxide transport. Most carbon dioxide released from red blood 
cells is transported to the lungs in the form of HCO; produced from the 
hydration of carbon dioxide inside the cell (Figure 7.23). Much of the 
HCO, that is formed leaves the cell through a specific membrane-transport 
protein that exchanges HCO; from one side of the membrane for Cl~ from 
the other side. Thus, the serum concentration of HCO; increases. By this 
means, a large concentration of carbon dioxide is transported from tissues to 
the lungs in the form of HCO,. In the lungs, this process is reversed: 
HCO; is converted back into carbon dioxide and exhaled. Thus, carbon 
dioxide generated by active tissues contributes to a decrease in red-blood- 
cell pH and, hence, to oxygen release and is converted into a form that can 
be transported in the serum and released in the lungs. 


7.4 Mutations in Genes Encoding Hemoglobin Subunits 
Can Result in Disease 


In modern times, particularly after the sequencing of the human genome, it 
is routine to think of genetically encoded variations in protein sequence as a 
factor in specific diseases. The notion that diseases might be caused by 
molecular defects was proposed by Linus Pauling in 1949 (four years before 


Watson and Crick’s proposal of the DNA double helix) to explain the blood 
disease sickle-cell anemia. The name of the disorder comes from the abnor- 
mal sickle shape of red blood cells deprived of oxygen in people suffering 
from this disease (Figure 7.24). Pauling proposed that sickle-cell anemia 
might be caused by a specific variation in the amino acid sequence of one 
hemoglobin chain. Today, we know that this bold hypothesis is correct. In 
fact, approximately 7% of the world’s population are carriers of some disor- 
der of hemoglobin caused by a variation in its amino acid sequence. In 
concluding this chapter, we will focus on the two most important of these 
disorders, sickle-cell anemia and thalassemia. 


Sickle-cell anemia results from the aggregation of mutated 
deoxyhemoglobin molecules 


es People with sickled red blood cells experience a number of dangerous 

symptoms. Examination of the contents of these red cells reveals that 
the hemoglobin molecules have formed large fibrous aggregates (Figure 7.25). 
These fibers extend across the red blood cells, distorting them so that they 
clog small capillaries and impair blood flow. In addition, red cells from sickle 
cell patients are more adherent to the walls of blood vessels than those from 
normal individuals, prolonging the opportunity for capillary occlusion. The 
results may be painful swelling of the extremities and a higher risk of stroke 
or bacterial infection (due to poor circulation). The sickled red cells also do 
not remain in circulation as long as normal cells do, leading to anemia. 

What is the molecular defect associated with sickle-cell anemia? Vernon 
Ingram demonstrated in 1956 that a single amino acid substitution in the B 
chain of hemoglobin is responsible—namely, the replacement of a gluta- 
mate residue with valine in position 6. The mutated form is referred to as 
hemoglobin S (HbS). In people with sickle-cell anemia, both alleles of the 
hemoglobin B-chain gene (HbB) are mutated. The HbS substitution sub- 
stantially decreases the solubility of deoxyhemoglobin, although it does not 
markedly alter the properties of oxyhemoglobin. 

Examination of the structure of hemoglobin 5 reveals that the new valine 
residue lies on the surface of the T-state molecule (Figure 7.26). This new 
hydrophobic patch interacts with another hydrophobic patch formed by 
Phe 85 and Leu 88 of the B chain of a neighboring molecule to initiate the 
aggregation process. More-detailed analysis reveals that a single hemoglo- 
bin S fiber is formed from 14 chains of multiple interlinked hemoglobin 
molecules. Why do these aggregates not form when hemoglobin S is oxy- 
genated? Oxyhemoglobin S is in the R state, and residues Phe 85 and Leu 
88 on the B chain are largely buried inside the hemoglobin assembly. In the 


FIGURE 7.24 Sickled red blood cells. A 
micrograph showing a sickled red blood 
cell adjacent to normally shaped red blood 
cells. [Eye of Science/Photo Researchers.] 


ae 


FIGURE 7.25 Sickle-cell hemoglobin 
fibers. An electron micrograph depicting a 
ruptured sickled red blood cell with fibers of 
sickle-cell hemoglobin emerging. [Courtesy 
of Robert Josephs and Thomas E. Wellems, 
University of Chicago.] 


FIGURE 7.26 Deoxygenated 
hemoglobin S. The interaction between 

Val 6 (blue) on a B chain of one hemoglobin 
molecule and a hydrophobic patch formed 
by Phe 85 and Leu 88 (gray) on a B chain of 
another deoxygenated hemoglobin molecule 
leads to hemoglobin aggregation. The 
exposed Val 6 residues of other B chains 
participate in other such interactions in 
hemoglobin S fibers. [Drawn from 2HBS.pdb.] 
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FIGURE 7.28 Sickle-cell trait and 
malaria. A significant correlation is 
observed between regions with a high 
frequency of the HbS allele and regions 
with a high prevalence of malaria. 


FIGURE 7.27 The formation of HbS aggregates. The mutation to Val 6 in hemoglobin S is 
represented by the red triangles, while the hydrophobic patch formed by Phe 85 and Leu 88 
in deoxyhemoglobin is represented by the blue indentations. When HbS is in its deoxy form, it 
exhibits the complementary features necessary for aggregation. 


absence of a partner with which to interact, the surface Val residue in posi- 
tion 6 is benign (Figure 7.27). 


WY Approximately 1 in 100 West Africans suffer from sickle-cell anemia. 

Given the often devastating consequences of the disease, why is the HbS 
mutation so prevalent in Africa and in some other regions? Recall that both 
copies of the HbB gene are mutated in people with sickle-cell anemia. 
Individuals with one copy of the HbB gene and one copy of the HbS gene are 
said to have sickle-cell trait because they can pass the HbS gene to their off- 
spring. While sickle-cell trait is considered a benign condition, rare complica- 
tions have been identified, including an increased risk of exercise-related death 
in high-performance athletes. However, people with sickle-cell trait exhibit 
enhanced resistance to malaria, a disease carried by a parasite, Plasmodium 
falciparum, that lives within red blood cells at one stage in its life cycle. The dire 
effect of malaria on health and reproductive likelihood in historically endemic 
regions has favored people with sickle-cell trait, increasing the prevalence of 
the HbS allele (Figure 7.28). 
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Thalassemia is caused by an imbalanced production of 
hemoglobin chains 


cz Sickle-cell anemia is caused by the substitution of a single specific 

amino acid in one hemoglobin chain. Thalassemia, the other preva- 
lent inherited disorder of hemoglobin, is caused by the loss or substantial 
reduction of a single hemoglobin chain. The result is low levels of functional 
hemoglobin and a decreased production of red blood cells, which may lead 
to anemia, fatigue, pale skin, and spleen and liver malfunction. Thalassemia 
is a set of related diseases. In a-thalassemia, the a chain of hemoglobin is 
not produced in sufficient quantity. Consequently, hemoglobin tetramers 
form that contain only the B chain. These tetramers, referred to as hemoglo- 
bin H (HbH), bind oxygen with high affinity and no cooperativity. Thus, 
oxygen release in the tissues is poor. In B-thalassemia, the B chain of hemo- 
globin is not produced in sufficient quantity. In the absence of 8 chains, the 
a chains form insoluble aggregates that precipitate inside immature red 
blood cells. The loss of red blood cells results in anemia. The most severe 
form of B-thalassemia is called thalassemia major or Cooley anemia. 

Both a- and B-thalassemia are associated with many different genetic 
variations and display a wide range of clinical severity. The most severe 
forms of a-thalassemia are usually fatal shortly before or just after birth. 
However, these forms are relatively rare. An examination of the repertoire of 
hemoglobin genes in the human genome provides one explanation. Normally, 
humans have not two but four alleles for the a chain, arranged such that the 
two genes are located adjacent to each other on one end of each chromosome 
16. Thus, the complete loss of a-chain expression requires the disruption of 
four alleles. B- Thalassemia is more common because humans normally have 
only two alleles for the B chain, one on each copy of chromosome 11. 


The accumulation of free alpha-hemoglobin chains is prevented 


The presence of four genes expressing the a chain, compared with two for 
the B chain, suggests that the a chain would be produced in excess (given 
the overly simple assumption that protein expression from each gene is 
comparable). If this is correct, why doesn’t the excess a chain precipitate? 
One mechanism for maintaining a chains in solution was revealed by the 
discovery of an 11-kDa protein in red blood cells called a-hemoglobin 
stabilizing protein (AHSP). This protein forms a soluble complex specifi- 
cally with newly synthesized a-chain monomers. The crystal structure of a 
complex between AHSP and a-hemoglobin reveals that AHSP binds to the 
same face of a-hemoglobin as does B-hemoglobin (Figure 7.29). AHSP 
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hk. FIGURE 7.29 Stabilizing free 
a-hemoglobin. The structure of a complex 
between AHSP and a-hemoglobin is 
shown. In this complex, the iron atom is 
bound to oxygen and to the distal histidine. 
Notice that AHSP binds to the same 
surface of a-hemoglobin as does 
8-hemoglobin. [Drawn from 1Y01.pdb.] 
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binds the a chain in both the deoxygenated and oxygenated forms. In the 
complex with oxygen bound, the distal histidine, rather than the proximal 
histidine, binds the iron atom. 

AHSP serves to bind and ensure the proper folding of a-hemoglobin 
as it is produced. As B-hemoglobin is expressed, it displaces AHSP 
because the a-hemoglobin—B-hemoglobin dimer is more stable than the 
a-hemoglobin-AHSP complex. Thus, AHSP prevents the misfolding, 
accumulation, and precipitation of free w-hemoglobin. Studies are under 
way to determine if mutations in the gene encoding AHSP play a role in 
modulating the severity of B-thalassemia. 


Additional globins are encoded in the human genome 


a In addition to the gene for myoglobin, the two genes for w-hemoglobin, 

and the one for B-hemoglobin, the human haploid genome contains 
other globin genes. We have already encountered fetal hemoglobin, which 
contains the y chain in place of the B chain. Several other genes encode other 
hemoglobin subunits that are expressed during development, including the 
6 chain, the ¢ chain, and the ¢ chain. 

Examination of the human genome sequence has revealed two addi- 
tional globins. Both of these proteins are monomeric proteins, more similar 
to myoglobin than to hemoglobin. The first, neuroglobin, is expressed pri- 
marily in the brain and at especially high levels in the retina. Neuroglobin 
may play a role in protecting neural tissues from hypoxia (insufficient oxy- 
gen). The second, cytoglobin, is expressed more widely throughout the 
body. Structural and spectroscopic studies reveal that, in both neuroglobin 
and cytoglobin, the proximal and the distal histidines are coordinated to the 
iron atom in the deoxy form. Oxygen binding displaces the distal histidine. 
Future studies should more completely elucidate the functions of these 
members of the globin family. 


SUMMARY 


7.1. Myoglobin and Hemoglobin Bind Oxygen at Iron Atoms in Heme 
Myoglobin is a largely a-helical protein that binds the prosthetic 
group heme. Heme consists of protoporphyrin, an organic compo- 
nent with four linked pyrrole rings, and a central iron ion in the 
Fe? state. The iron ion is coordinated to the side chain of a histi- 
dine residue in myoglobin, referred to as the proximal histidine. 
One of the oxygen atoms in O, binds to an open coordination site on 
the iron. Because of partial electron transfer from the iron to the 
oxygen, the iron ion moves into the plane of the porphyrin on oxy- 
gen binding. Hemoglobin consists of four polypeptide chains, two a 
chains and two B chains. Each of these chains is similar in amino 
acid sequence to myoglobin and folds into a very similar three- 
dimensional structure. The hemoglobin tetramer is best described 
as a pair of af dimers. 


7.2. Hemoglobin Binds Oxygen Cooperatively 
The oxygen-binding curve for myoglobin reveals a simple equilib- 
rium binding process. Myoglobin is half-saturated with oxygen at 
an oxygen concentration of approximately 2 torr. The oxygen- 
binding curve for hemoglobin has an “S”-like (sigmoid) shape, 
indicating that the oxygen binding is cooperative. The binding of 
oxygen at one site within the hemoglobin tetramer affects the affin- 
ities of the other sites for oxygen. Cooperative oxygen binding and 


7.3 


7.4 


release significantly increase the efficiency of oxygen transport. 
The amount of the potential oxygen-carrying capacity utilized in 
transporting oxygen from the lungs (with a partial pressure of oxy- 
gen of 100 torr) to tissues (with a partial pressure of oxygen of 20 
torr) is 66% compared with 7% if myoglobin had been used as the 
oxygen carrier. 

The quaternary structure of hemoglobin changes on oxygen bind- 
ing. The structure of deoxyhemoglobin is referred to as the T state. The 
structure of oxyhemoglobin is referred to as the R state. The two aB 
dimers rotate by approximately 15 degrees with respect to one another 
in the transition from the T to the R state. Cooperative binding can be 
potentially explained by concerted and sequential models. In the con- 
certed model, each hemoglobin adopts either the T state or the R state; 
the equilibrium between these two states is determined by the number 
of occupied oxygen-binding sites. Sequential models allow intermedi- 
ate structures. Structural changes at the iron sites in response to oxygen 
binding are transmitted to the interface between af dimers, influenc- 
ing the T-to-R equilibrium. 

Red blood cells contain 2,3-bisphosphoglycerate in concentra- 
tions approximately equal to that for hemoglobin. 2,3-BPG binds 
tightly to the T state but not to the R state, stabilizing the T state and 
lowering the oxygen affinity of hemoglobin. Fetal hemoglobin binds 
oxygen more tightly than does adult hemoglobin owing to weaker 
2,3-BPG binding. This difference allows oxygen transfer from 
maternal to fetal blood. 


Hydrogen lons and Carbon Dioxide Promote the Release of Oxygen: 
The Bohr Effect 

The oxygen-binding properties of hemoglobin are markedly affected 
by pH and by the presence of carbon dioxide, a phenomenon known as 
the Bohr effect. Increasing the concentration of hydrogen ions—that 
is, decreasing pH—decreases the oxygen affinity of hemoglobin, 
owing to the protonation of the amino termini and certain histidine 
residues. The protonated residues help stabilize the T state. Increasing 
concentrations of carbon dioxide decrease the oxygen affinity of 
hemoglobin by two mechanisms. First, carbon dioxide is converted 
into carbonic acid, which lowers the oxygen affinity of hemoglobin by 
decreasing the pH inside the red blood cell. Second, carbon dioxide 
adds to the amino termini of hemoglobin to form carbamates. These 
negatively charged groups stabilize deoxyhemoglobin through ionic 
interactions. Because hydrogen ions and carbon dioxide are produced 
in rapidly metabolizing tissues, the Bohr effect helps deliver oxygen to 
sites where it is most needed. 


Mutations in Genes Encoding Hemoglobin Subunits 

Can Result in Disease 

Sickle-cell disease is caused by a mutation in the B chain of hemoglobin 
that substitutes a valine residue for a glutamate residue. As a result, a 
hydrophobic patch forms on the surface of deoxy (T-state) hemoglobin 
that leads to the formation of fibrous polymers. These fibers distort 
red blood cells into sickle shapes. Sickle-cell disease was the first dis- 
ease to be associated with a change in the amino acid sequence of a 
protein. Thalassemias are diseases caused by the reduced production 
of either the a or the 8 chain, yielding hemoglobin tetramers that con- 
tain only one type of hemoglobin chain. Such hemoglobin molecules 
are characterized by poor oxygen release and low solubility, leading to 
the destruction of red blood cells in the course of their development. 
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Summary 


210 Red-blood-cell precursors normally produce a slight excess of hemo- 

CHAPTER 7 Hemoglobin: Portrait of a globin a chains compared with B chains. To prevent the aggregation of 

Protein ncAGtoh the excess a chains, they produce a-hemoglobin stabilizing protein, 
which binds specifically to newly synthesized a-chain monomers to 
form a soluble complex. 


APPENDIX: Binding Models Can Be Formulated in Quantitative Terms: 


The Hill Plot and the Concerted Model 


The Hill Plot 


A useful way of quantitatively describing cooperative bind- 
ing processes such as that for hemoglobin was developed by 
Archibald Hill in 1913. Consider the hypothetical equilib- 
rium for a protein X binding a ligand 8: 


X + nS X(S)n (1) 


where n is a variable that can take on both integral and frac- 
tional values. The parameter n is a measure of the degree of 
cooperativity in ligand binding. For X = hemoglobin and 
S = O,, the maximum value of n is 4. The value of n = 4 
would apply if oxygen binding by hemoglobin were com- 
pletely cooperative. If oxygen binding were completely 
noncooperative, then n would be 1. 

Analysis of the equilibrium in equation 1 yields the fol- 
lowing expression for the fractional saturation, Y: 


[S]" 
[S]" + [5so]" 


where [S..)| is the concentration at which X is half-saturated. 
For hemoglobin, this expression becomes 


Oo," 
——— 
pO," + Po 


Y= 


where P., is the partial pressure of oxygen at which 
hemoglobin is half-saturated. This expression can be 
rearranged to: 


Y _ pO," 
1-Y — Pso' 
and so 
Y a 
oa(; = 5) os( Pat)” og(pOz) — nlog(Pso) 


This equation predicts that a plot of log (Y/1 — Y) versus 
log(P.,), called a Hill plot, should be linear with a slope 
of n. 

Hill plots for myoglobin and hemoglobin are shown in 
Figure 7.30. For myoglobin, the Hill plot is linear with a 
slope of 1. For hemoglobin, the Hill plot is not completely 
linear, because the equilibrium on which the Hill plot is 
based is not entirely correct. However, the plot is approxi- 
mately linear in the center with a slope of 2.8. The slope, 
often referred to as the Hill coefficient, is a measure of the 
cooperativity of oxygen binding. The utility of the Hill plot 
is that it provides a simply derived quantitative assessment 
of the degree of cooperativity in binding. With the use of 
the Hill equation and the derived Hill coefficient, a binding 
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myoglobin and hemoglobin. 


curve that closely resembles that for hemoglobin is pro- 
duced (Figure 7.31). 
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APPENDIX FIGURE 7.31 Oxygen-binding curves for several Hill 


coefficients. The curve labeled n = 2.8 closely resembles the curve 
for hemoglobin. 


The Concerted Model 


The concerted model can be formulated in quantitative 
terms. Only four parameters are required: (1) the number of 
binding sites (assumed to be equivalent) in the protein, 
(2) the ratio of the concentrations of the T and R states in 
the absence of bound ligands, (3) the affinity of sites in pro- 
teins in the R state for ligand binding, and (4) a measure of 
how much more tightly subunits in proteins in the R state 
bind ligands compared with subunits in the T state. The 
number of binding sites, n, is usually known from other 
information. For hemoglobin, n = 4. The ratio of the con- 
centrations of the T and R states with no ligands bound, L, 
is the allosteric constant: 


L = [To]/[Ro] 


where the subscript refers to the number of ligands bound 
(in this case, zero). The affinity of subunits in the R state is 
defined by the dissociation constant for a ligand binding to 
a single site in the R state, Kp. Similarly, the dissociation 
constant for a ligand binding to a single site in the T state is 
K.,. We can define the ratio of these two dissociation con- 
stant as 


C= Kr/Kr 


This is the measure of how much more tightly a subunit for 
a protein in the R state binds a ligand compared with a sub- 
unit for a protein in the T state. Note that c < 1 because Kp 
and K.; are dissociation constants and tight binding corre- 
sponds to a small dissociation constant. 

What is the ratio of the concentration of T-state pro- 
teins with one ligand bound to the concentration of R-state 
proteins with one ligand bound? The dissociation constant 
for a single site in the R state is Kp. For a protein with 
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n sites, there are n possible sites for the first ligand to bind. 
This statistical factor favors ligand binding compared with 
a single-site protein. Thus, [R,] = n[Ro|[S]/Kp. Similarly, 
[T,] = n[T I[S]/K>. Thus, 


n{Tol[SVKr_ (To) | 
n[Ro][S]/Kx  [Ro](Ke/Kr) 


[T1]/[Ri] = cL 
Similar analysis reveals that, for states with 7 ligands bound, 
[T,]/[R,] = cL. In other words, the ratio of the concentra- 
tions of the T state to the R state is reduced by a factor of c 
for each ligand that binds. 

Let us define a convenient scale for the concentration of 5: 


a = [S]/Kpr 


This definition is useful because it is the ratio of the concen- 
tration of S to the dissociation constant that determines the 
extent of binding. Using this definition, we see that 


Rol [S 
[Ri] = a = n[Rola 
Similarly, 
[Ty] = oe = ncL[Rola 


What is the concentration of R-state molecules with 
two ligands bound? Again, we must consider the statistical 
factor—that is, the number of ways in which a second 
ligand can bind to a molecule with one site occupied. The 
number of ways isn — 1. However, because which ligand is 
the “first” and which is the “second” does not matter, we 
must divide by a factor of 2. Thus, 
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We can derive similar equations for the case with 7 
ligands bound and for T states. 

We can now calculate the fractional saturation, Y. This 
is the total concentration of sites with ligands bound 
divided by the total concentration of potential binding sites. 
Thus, 


_ (RLF Ey + 20 Ry) + EE) + + mR) + TD 
m[Ro] + [To] + [Ry] + (T] +--+ [RAJ + CED 


Substituting into this equation, we find 
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n[R Ja + ne[T Ja + 2(n(n — 1)/2)[Rgla? 
+ 2(n(n — 1)/2)c?[T gla? ++. 
+ n[Rola” + ne"[T°})a” 
n((R,] + [TQ] + wl Rgl@ + ne[T lea +:-- 
+ [Rj Ja” + c[T,]a”) 


Y= 


Substituting [T)] 
ve a(1 + a)"~! + Lea(1 + ca)" ! 
(1+ a)" + L(1 + ca)” 


= L[R,] and summing these series yields 


We can now use this equation to fit the observed data for 
hemoglobin by varying the parameters L, c, and Kp (with 


Fraction 
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n= 4). Anexcellent fit is obtained with L = 9000, c = 0.014, 
and Kp = 2.5 torr (Figure 7.32). 

In addition to the fractional saturation, the concentra- 
tions of the species Ty, T,, T,, R,, Rz, and R, are shown. The 
concentrations of all other species are very low. The addition 
of concentrations is a major difference between the analysis 
using the Hill equation and this analysis of the concerted 
model. The Hill equation gives only the fractional saturation, 
whereas the analysis of the concerted model yields concentra- 
tions for all species. In the present case, this analysis yields 
the expected ratio of T-state proteins to R-state proteins at 
each stage of binding. This ratio changes from 9000 to 126 to 
1.76 to 0.025 to 0.00035 with zero, one, two, three, and four 
oxygen molecules bound. This ratio provides a quantitative 
measure of the switching of the population of hemoglobin 
molecules from the T state to the R state. 

The sequential model can also be formulated in quanti- 
tative terms. However, the formulation entails many more 
parameters, and many different sets of parameters often 
yield similar fits to the experimental data. 


APPENDIX FIGURE 7.32 Modeling oxygen binding with the concerted model. The 
fractional saturation (y) as a function pO,: L = 9000, c = 0.014, and Kg = 2.5 torr. The 
fraction of molecules in the T state with zero, one, and two oxygen molecules bound 

(Tg, T,, and T,) and the fraction of molecules in the R state with two, three, and four oxygen 
molecules bound (R,, Re, and R,) are shown. The fractions of molecules in other forms are 
too low to be shown. 
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1. Screening the biosphere. The first protein structure to 
have its structure determined was myoglobin from sperm 
whale. Propose an explanation for the observation that 
sperm whale muscle is a rich source of this protein. 


2. Hemoglobin content. The average volume of a red blood 
cell is 87 ym. The mean concentration of hemoglobin in 
red cells is 0.34 g ml~!. 


(a) What is the weight of the hemoglobin contained in an 
average red cell? 


(b) How many hemoglobin molecules are there in an aver- 
age red cell? Assume that the molecular weight of the 
human hemoglobin tetramer is 65 kDa. 


(c) Could the hemoglobin concentration in red cells be 
much higher than the observed value? (Hint: Suppose that 
a red cell contained a crystalline array of hemoglobin mol- 
ecules in a cubic lattice with 65 A sides.) 


3. Iron content. How much iron is there in the hemoglobin 
of a 70-kg adult? Assume that the blood volume is 70 ml 
kg”! of body weight and that the hemoglobin content of 
blood is 0.16 g ml~!. 


4. Oxygenating myoglobin. The myoglobin content of some 
human muscles is about 8 g kg~!. In sperm whale, the myo- 
globin content of muscle is about 80 g kg™!. 


(a) How much O, is bound to myoglobin in human muscle 
and in sperm whale muscle? Assume that the myoglobin is 
saturated with O,,, and that the molecular weights of human 
and sperm whale myoglobin are the same. 


(b) The amount of oxygen dissolved in tissue water (in equi- 
librium with venous blood) at 37°C is about 3.5 X 1075 M. 
What is the ratio of oxygen bound to myoglobin to that 
directly dissolved in the water of sperm whale muscle? 


5. Tuning proton affinity. The pK, ofan acid depends partly on 
its environment. Predict the effect of each of the following envi- 
ronmental changes on the pK, of a glutamic acid side chain. 


(a) A lysine side chain is brought into proximity. 


(b) The terminal carboxyl group of the protein is brought 
into proximity. 


(c) The glutamic acid side chain is shifted from the outside 
of the protein to a nonpolar site inside. 


6. Saving grace. Hemoglobin A inhibits the formation of 
the long fibers of hemoglobin S and the subsequent sickling 
of the red cell on deoxygenation. Why does hemoglobin A 
have this effect? 


7. Carrying a load. Suppose that you are climbing a high 
mountain and the oxygen partial pressure in the air is 
reduced to 75 torr. Estimate the percentage of the oxygen- 
carrying capacity that will be utilized, assuming that the pH 
of both tissues and lungs is 7.4 and that the oxygen concen- 
tration in the tissues is 20 torr. 


8. Bohr for me, not for thee. Does myoglobin exhibit a Bohr 
effect? Why or why not? 


9. High-altitude adaptation. After spending a day or more 
at high altitude (with an oxygen partial pressure of 75 torr), 
the concentration of 2,3-bisphosphoglycerate (2,3-BPG) in 
red blood cells increases. What effect would an increased 
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concentration of 2,3-BPG have on the oxygen-binding 
curve for hemoglobin? Why would this adaptation be ben- 
eficial for functioning well at high altitude? 


10. Blood doping. Endurance athletes sometimes try an 
illegal method of blood doping called autologous transfu- 
sion. Some blood from the athlete is removed well before 
competition, and then transfused back into the athlete just 
before competition. 


(a) Why might blood transfusion benefit the athlete? 


(b) With time, stored red blood cells become depleted in 
2,3-BPG. What might be the consequences of using such 
blood for a blood transfusion? 


11. I’ll take the lobster. Arthropods such as lobsters have 
oxygen carriers quite different from hemoglobin. The 
oxygen-binding sites do not contain heme but, instead, 
are based on two copper(I) ions. The structural changes 
that accompany oxygen binding are shown below. How 
might these changes be used to facilitate cooperative oxy- 
gen binding? 


kh \ Va pi 


12. A disconnect. With the use of site-directed mutagenesis, 
hemoglobin has been prepared in which the proximal 
histidine residues in both the a and the B subunits have 
been replaced by glycine. The imidazole ring from the his- 
tidine residue can be replaced by adding free imidazole in 
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solution. Would you expect this modified hemoglobin to 
show cooperativity in oxygen binding? Why or why not? 


Ow 
N NH 


Imidazole 


13. Successful substitution. Blood cells from some birds do 
not contain 2,3-bisphosphoglycerate but, instead, contain 
one of the compounds in parts a through d, which plays an 
analogous functional role. Which compound do you think 
is most likely to play this role? Explain briefly. 


(a) + /CHs 
HO ~~ \"CHs 
CH; 
Choline 
(b) r 
a eT ae Ae se aa ee 
H2N N 
H 
Spermine 
(c) -0;PO OH 
OPO; 
—O;PO OPO; 
—O;PO 
Inositol pentaphosphate 
d H 
(d) 
Indole 


14. Theoretical curves. (a) Using the Hill equation, plot an 
oxygen-binding curve for a hypothetical two-subunit 
hemoglobin with n = 1.8 and P,, = 10 torr. (b) Repeat, 
using the concerted model with n = 2, L = 1000, c = 0.01, 
and Kp = 1 torr. 


15. Parasitic effect. When P. falciparum lives inside red 
blood cells, the metabolism of the parasite tends to release 
acid. What effect is the presence of acid likely to have on the 


oxygen-carrying capacity of the red blood cells? On the 
likelihood that these cells sickle? 


Data Interpretation Problems 


16. Primitive oxygen binding. Lampreys are primitive 
organisms whose ancestors diverged from the ancestors of 
fish and mammals approximately 400 million years ago. 
Lamprey blood contains a hemoglobin related to mamma- 
lian hemoglobin. However, lamprey hemoglobin is mono- 
meric in the oxygenated state. Oxygen-binding data for 
lamprey hemoglobin are as follows: 


pO, Y pO, Y pO, Y 
0.1 .0060 2.0 .112 50.0  .889 
0.2 .0124 3.0 .170 60.0 .905 
0.3 .0190 4.0 .227 70.0  .917 
0.4 .0245 5.0 283 80.0 .927 
0.5 .0307 7.5 .420 90.0 .935 
0.6 .0380 10.0 .500 100 941 
0.7 ~~ .0430 15.0 .640 150 .960 
0.8 .0481 20.0 .721 200 .970 
0.9 .0530 30.0  .812 

1.0  .0591 40.0  .865 


(a) Plot these data to produce an oxygen-binding curve. 
At what oxygen partial pressure is this hemoglobin half- 
saturated? On the basis of the appearance of this curve, does 
oxygen binding seem to be cooperative? 


(b) Construct a Hill plot using these data. Does the Hill plot 
show any evidence for cooperativity? What is the Hill coefficient? 


(c) Further studies revealed that lamprey hemoglobin 
forms oligomers, primarily dimers, in the deoxygenated 
state. Propose a model to explain any observed cooperativ- 
ity in oxygen binding by lamprey hemoglobin. 


17. Leaning to the left or to the right. The illustration below 
shows several oxygen-dissociation curves. Assume that 
curve 3 corresponds to hemoglobin with physiological con- 
centrations of CO, and 2,3-BPG at pH 7. Which curves 
represent each of the following perturbations? 
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a) Decrease in CO, 
b) Increase in 2,3-BPG 


c) Increase in pH 
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d) Loss of quaternary structure 


Chapter Integration Problems 


18. Location is everything. 2,3-Bisphosphoglycerate lies in a 
central cavity within the hemoglobin tetramer, stabilizing 
the T state. What would be the effect of mutations that 
placed the BPG-binding site on the surface of hemoglobin? 


19. A therapeutic option. Hydroxyurea has been shown to 
increase the expression of fetal hemoglobin in adult red 
blood cells, by a mechanism that remains unclear. Explain 
why hydroxyurea can be a useful therapy for patients with 
sickle-cell anemia. 


Enzymes: Basic Concepts 
and Kinetics 


HO, 


The activity of an enzyme is responsible for the glow of the H 
luminescent jellyfish at left. The enzyme aequorin catalyzes the HO 

oxidation of a compound by oxygen in the presence of calcium 

to release CO, and light. [(Left) chain45154/Getty Images.] 


nzymes, the catalysts of biological systems, are remarkable molecular 

devices that determine the patterns of chemical transformations. They 
also mediate the transformation of one form of energy into another. About 
a quarter of the genes in the human genome encode enzymes, a testament to 
their importance to life. The most striking characteristics of enzymes are 
their catalytic power and specificity. Catalysis takes place at a particular site 
on the enzyme called the active site. 

Nearly all known enzymes are proteins. However, proteins do not have an 
absolute monopoly on catalysis; the discovery of catalytically active RNA 
molecules provides compelling evidence that RNA was a biocatalyst early in 
evolution. Proteins as a class of macromolecules are highly effective catalysts 
for an enormous diversity of chemical reactions because of their capacity to 
specifically bind a very wide range of molecules. By utilizing the full repertoire of 
intermolecular forces, enzymes bring substrates together in an optimal orien- 
tation, the prelude to making and breaking chemical bonds. They catalyze 
reactions by stabilizing transition states, the highest-energy species in reaction 
pathways. By selectively stabilizing a transition state, an enzyme determines 
which one of several potential chemical reactions actually takes place. 
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TABLE 8.1 Rate enhancement by selected enzymes 


Rate 
Nonenzymatic Uncatalyzed Catalyzed enhancement 
Enzyme half-life rate(k,,s') rate(ki,8') (Reg 8 1/Run 8!) 
OMP decarboxylase 78,000,000 years 2.8 X 10716 39 1.4 x 10!7 
Staphylococcal nuclease 130,000 years 7 1o- 95 BG < 10 
AMP nucleosidase 69,000 years 1.0 x 1071! 60 6.0 x 10!2 
Carboxypeptidase A 7.3 years 3.0 x 107° 578 OR anlOue 
Ketosteroid isomerase 7 weeks 1.7 X 1077 66,000 3.9 x 101! 
Triose phosphate 1.9 days 4.3 x 10~° 4,300 1.0 x 10° 
isomerase 
Chorismate mutase 7.4 hours 2.6 x 1075 50 1.9 x 10° 
Carbonic anhydrase 5 seconds 1.3 x 107! 1 x 10° 7.7 X 106 


Abbreviations: OMP, orotidine monophosphate; AMP, adenosine monophosphate. 
Source: After A. Radzicka and R. Wolfenden. Science 267:90—93, 1995. 


8.1 Enzymes Are Powerful and Highly Specific Catalysts 


Enzymes accelerate reactions by factors of as much as a million or more 
(Table 8.1). Indeed, most reactions in biological systems do not take place 
at perceptible rates in the absence of enzymes. Even a reaction as simple 
as the hydration of carbon dioxide is catalyzed by an enzyme—namely, 
carbonic anhydrase. The transfer of CO, from the tissues to the blood and 
then to the air in the alveolae of the lungs would be less complete in the 
absence of this enzyme (p. 203). In fact, carbonic anhydrase is one of the 
fastest enzymes known. Each enzyme molecule can hydrate 10° molecules 
of CO, per second. This catalyzed reaction 1s 10’ times as fast as the 
uncatalyzed one. We will consider the mechanism of carbonic anhydrase 
catalysis in Chapter 9. 

Enzymes are highly specific both in the reactions that they catalyze and 
in their choice of reactants, which are called substrates. An enzyme usually 
catalyzes a single chemical reaction or a set of closely related reactions. Let 
us consider proteolytic enzymes as an example. The biochemical function of 
these enzymes is to catalyze proteolysis, the hydrolysis of a peptide bond. 


“ N i + HO —— Ae O + *HsN i 
2 2 3 
ON Ne, ONC Now oe 
H | 2 ay H |: = = Nu 
O R fe) Ry 
Peptide Carboxyl Amino 


component component 


Most proteolytic enzymes also catalyze a different but related reaction in 
vitro—namely, the hydrolysis of an ester bond. Such reactions are more eas- 
ily monitored than is proteolysis and are useful in experimental investiga- 
tions of these enzymes. 


Ry (e) Ri 0 HO. 
a SR, + HO ae +  R + Ht 
O O 
Ester Acid Alcohol 


Proteolytic enzymes differ markedly in their degree of substrate speci- 
ficity. Papain, which is found in papaya plants, is quite undiscriminating: it 
will cleave any peptide bond with little regard to the identity of the adjacent 
side chains. This lack of specificity accounts for its use in meat-tenderizing 


sauces. The digestive enzyme trypsin, on the other hand, is quite specific 
and catalyzes the splitting of peptide bonds only on the carboxyl side of 
lysine and arginine residues (Figure 8.1A). Thrombin, an enzyme that par- 
ticipates in blood clotting (Section 10.4), is even more specific than trypsin. 
It catalyzes the hydrolysis of Arg—Gly bonds in particular peptide sequences 
only (Figure 8.1B). 

DNA polymerase I, a template-directed enzyme (Section 28.3), is 
another highly specific catalyst. DNA polymerase adds nucleotides to the 
strand being synthesized in a sequence determined by the sequence of 
nucleotides in another DNA strand that serves as a template. DNA poly- 
merase I is remarkably precise in carrying out the instructions given by the 
template. It inserts the wrong nucleotide into a new DNA strand less than 
one in a thousand times. The specificity of an enzyme is due to the precise 
interaction of the substrate with the enzyme. This precision is a result of the 
intricate three-dimensional structure of the enzyme protein. 


Many enzymes require cofactors for activity 


The catalytic activity of many enzymes depends on the presence of small 
molecules termed cofactors, although the precise role varies with the cofac- 
tor and the enzyme. Generally, these cofactors are able to execute chemical 
reactions that cannot be performed by the standard set of twenty amino 
acids. An enzyme without its cofactor is referred to as an apoenzyme; the 
complete, catalytically active enzyme is called a holoenzyme. 


Apoenzyme + cofactor = holoenzyme 


Cofactors can be subdivided into two groups: (1) metals and (2) small 
organic molecules called coenzymes (Table 8.2). Often derived from vita- 
mins, coenzymes can be either tightly or loosely bound to the enzyme. 
Tightly bound coenzymes are called prosthetic groups. Loosely associated 
coenzymes are more like cosubstrates because, like substrates and products, 
they bind to the enzyme and are released from it. The use of the same coen- 
zyme by a variety of enzymes sets coenzymes apart from normal substrates, 
however, as does their source in vitamins (Section 15.4). Enzymes that use 
the same coenzyme usually perform catalysis by similar 
mechanisms. In Chapter 9, we will examine the impor- 
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FIGURE 8.1 Enzyme specificity. (A) Trypsin 
cleaves on the carboxyl side of arginine 
and lysine residues, whereas (B) thrombin 
cleaves Arg-Gly bonds in particular 
sequences only. 


TABLE 8.2 Enzyme cofactors 


Enzyme 


book, we will see how coenzymes and their enzyme part- 


ners operate in their biochemical context. Coenzyme 


Enzymes can transform energy from one form 
into another 


A key activity in all living systems is the conversion of one 
form of energy into another. For example, in photosynthe- 
sis, light energy is converted into chemical-bond energy. 
In cellular respiration, which takes place in mitochondria, 
the free energy contained in small molecules derived from 
food is converted first into the free energy of an ion gradi- 
ent and then into a different currency—the free energy of 
adenosine triphosphate. Given their centrality to life, 
it should come as no surprise that enzymes play vital roles 
in energy transformation. As we will see, enzymes per- 
form fundamental roles in photosynthesis and cellular 
respiration. Other enzymes can then use the chemical- 
bond energy of ATP in diverse ways. For instance, the 
enzyme myosin converts the energy of ATP into the 


Thiamine pyrophosphate 
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Pyridoxal phosphate 
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Pyruvate dehydrogenase 
Monoamine oxidase 
Lactate dehydrogenase 
Glycogen phosphorylase 
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Pyruvate carboxylase 
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mechanical energy of contracting muscles (Section 9.4 and Chapter 35). 
Pumps in the membranes of cells and organelles, which can be thought 
of as enzymes that move substrates rather than chemically alter them, 
use the energy of ATP to transport molecules and ions across the 
membrane (Chapter 13). The chemical and electrical gradients resulting 
from the unequal distribution of these molecules and ions are themselves 
forms of energy that can be used for a variety of purposes, such as sending 
nerve impulses. 

The molecular mechanisms of these energy-transducing enzymes are 
being unraveled. We will see in subsequent chapters how unidirectional 
cycles of discrete steps—binding, chemical transformation, and release— 
lead to the conversion of one form of energy into another. 


8.2 Gibbs Free Energy Is a Useful Thermodynamic 
Function for Understanding Enzymes 


Enzymes speed up the rate of chemical reactions, but the properties of the 
reaction—whether it can take place at all and the degree to which the 
enzyme accelerates the reaction—depend on energy differences between 
reactants and products. Gibbs free energy (G), which was touched on in 
Chapter 1, is a thermodynamic property that is a measure of useful energy, 
or the energy that is capable of doing work. To understand how enzymes 
operate, we need to consider only two thermodynamic properties of the 
reaction: (1) the free-energy difference (AG) between the products and reac- 
tants and (2) the energy required to initiate the conversion of reactants into 
products. The former determines whether the reaction will take place spon- 
taneously, whereas the latter determines the rate of the reaction. Enzymes 
affect only the latter. Let us review some of the principles of thermodynam- 
ics as they apply to enzymes. 


The free-energy change provides information about the spontaneity 
but not the rate of a reaction 


As discussed in Chapter 1, the free-energy change of a reaction (AG) tells us 
if the reaction can take place spontaneously: 


1. A reaction can take place spontaneously only if AG is negative. Such reac- 
tions are said to be exergonic. 


2. A system is at equilibrium and no net change can take place if AG is zero. 


3. A reaction cannot take place spontaneously if AG is positive. An input of 
free energy is required to drive such a reaction. These reactions are termed 
endergonic. 


4. The AG of a reaction depends only on the free energy of the products (the 
final state) minus the free energy of the reactants (the initial state). The AG of 
a reaction is independent of the molecular mechanism of the transforma- 
tion. For example, the AG for the oxidation of glucose to CO, and H,O is 
the same whether it takes place by combustion or by a series of enzyme- 
catalyzed steps in a cell. 


5. The AG provides no information about the rate of a reaction. A negative 
AG indicates that a reaction can take place spontaneously, but it does not 
signify whether it will proceed at a perceptible rate. As will be discussed 
shortly (Section 8.3), the rate of a reaction depends on the free energy of 
activation (AG*), which is largely unrelated to the AG of the reaction. 


The standard free-energy change of a reaction is related to the 219 
equilibrium constant 8.2 Free Energy 


As for any reaction, we need to be able to determine AG for an enzyme- 
catalyzed reaction to know whether the reaction is spontaneous or requires 
an input of energy. To determine this important thermodynamic parameter, 
we need to take into account the nature of both the reactants and the prod- 
ucts as well as their concentrations. 

Consider the reaction 


A+ B=—C+D 
The AG of this reaction is given by 


—e [C][D] 
AG = AG? + RTIn aay (1) 


in which AG*° is the standard free-energy change, R is the gas constant, T is 
the absolute temperature, and [A], [B], [C], and [D] are the molar concen- 
trations (more precisely, the activities) of the reactants. AG® is the free- 
energy change for this reaction under standard conditions—that is, when 
each of the reactants A, B, C, and D is present at a concentration of 1.0 M 
(for a gas, the standard state is usually chosen to be 1 atmosphere). Thus, 
the AG of a reaction depends on the nature of the reactants (expressed in the 
AG*® term of equation 1) and on their concentrations (expressed in the loga- 
rithmic term of equation 1). 
A convention has been adopted to simplify free-energy calculations Units of energy 
for biochemical reactions. The standard state is defined as having a A kilojoule (kJ) is equal to 1000 J. 
pH of 7. Consequently, when H* is a reactant, its activity has the PIOUS) tie Biman Ghehergy Needed 
value 1 (corresponding to a pH of 7) in equations 1 and 3 (below). The ee) Sei an ee ecanhele 


ects E : . of 1 meter. 
activity of water also is taken to be 1 in these equations. The standard A kilocalorie (kcal) is equal to 1000 cal. 


free-energy change at pH 7, denoted by the symbol AG®’, will be used A calorie (cal) is equivalent to the amount 
throughout this book. The kilojoule (abbreviated kJ) and the kilocalorie of heat required to raise the temperature 
(kcal) will be used as the units of energy. One kilojoule is equivalent to of 1 gram of water from 14.5°C to 15.5°C. 
0.239 kilocalorie. ee 


A simple way to determine AG®’ is to measure the concentrations 
of reactants and products when the reaction has reached equilibrium. 
At equilibrium, there is no net change in reactants and products; in 
essence, the reaction has stopped and AG = 0. At equilibrium, equation 1 
then becomes 


[C][D] 
0 = AG’ + RTIn ——— (2) 
[A][B] 
and so 
[C][D] 
AG’ = —RT ln —— (3) 
[A][B] 
The equilibrium constant under standard conditions, K’., is defined as 
[C][D] 
= 4 
2 TAB] oY 
Substituting equation 4 into equation 3 gives 
AG*' = —-RTIn K'eq (5) 


which can be rearranged to give 


K'eq = @ AO'/RT (6) 


TABLE 8.3 Relation between AG°’ 
and K’ ,, (at 25°C) 


Substituting R = 8.315 X 10-3 kJ mol~! deg! and T = 298 K (corre- 
sponding to 25° C) gives 

Keg =e AO/2.47 (7) 
where AG°’ is here expressed in kilojoules per mole because of the choice of 
the units for R in equation 7. Thus, the standard free energy and the equi- 
librium constant of a reaction are related by a simple expression. For exam- 
ple, an equilibrium constant of 10 gives a standard free-energy change of 
—5.69 kJ mol~! (—1.36 kcal mol~!) at 25° C (Table 8.3). Note that, for each 
10-fold change in the equilibrium constant, the AG®°’ changes by 5.69 kJ 
mol! (1.36 kcal mol). 

As an example, let us calculate AG®°’ and AG for the isomerization of 
dihydroxyacetone phosphate (DHAP) to glyceraldehyde 3-phosphate 
(GAP). This reaction takes place in glycolysis (Chapter 16). At equilibrium, 
the ratio of GAP to DHAP is 0.0475 at 25°C (298 K) and pH 7. Hence, 
K'.q = 0.0475. The standard free-energy change for this reaction is then 
calculated from equation 5: 


AG? = —RTInK'cq 


—8.315 X 10° X 298 X In (0.0475) 


+7.53 kJ mol~! (41.80 kcal mol7') 


Under these conditions, the reaction is endergonic. DHAP will not sponta- 
neously convert into GAP. 

Now let us calculate AG for this reaction when the initial concentration 
of DHAP is 2 X 10~*M and the initial concentration of GAP is 3 X 107~°M. 
Substituting these values into equation 1 gives 
3 x 10°°M 
2x 10-4*M 


= 7.53kJ mol! — 10.42 kJ mol”! 


AG = 7.53kJ mol”! + RT In 


= —2.89 kJ mol! (—0.69 kcal mol!) 


This negative value for the AG indicates that the isomerization of 
DHAP to GAP is exergonic and can take place spontaneously when these 
species are present at the preceding concentrations. Note that AG for this 
reaction is negative, although AG*®’ is positive. It is important to stress that 
whether the AG for a reaction is larger, smaller, or the same as AG°’ depends 

on the concentrations of the reactants and products. The criterion of spon- 
taneity for a reaction is AG, not AG®’. This point is important because 
reactions that are not spontaneous based on AG®’ can be made sponta- 
neous by adjusting the concentrations of reactants and products. This 
principle is the basis of the coupling of reactions to form metabolic 
pathways (Chapter 15). 


Enzymes alter only the reaction rate and not the 
reaction equilibrium 


Because enzymes are such superb catalysts, it is tempting to ascribe to 


AG” 
Reg kJ mol7! kcal mol~! 
1075 28.53 6.82 
Oo 22.84 5.46 
1073 17.11 4.09 
iO iil 42 DTS) 
107! 5.69 1.36 
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FIGURE 8.2 Enzymes accelerate the reaction 
rate. The same equilibrium point is reached but 


much more quickly in the presence of an enzyme. 
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them powers that they do not have. An enzyme cannot alter the laws of 
thermodynamics and consequently cannot alter the equilibrium of a 
chemical reaction. Consider an enzyme-catalyzed reaction, the conver- 
sion of substrate, 5, into product, P. Figure 8.2 shows the rate of product 
formation with time in the presence and absence of enzyme. Note that 


the amount of product formed is the same whether or not the enzyme 
is present but, in the present example, the amount of product formed in 
seconds when the enzyme is present might take hours (or centuries, see 
Table 8.1) to form if the enzyme were absent. 

Why does the rate of product formation level off with time? The reaction 
has reached equilibrium. Substrate S$ is still being converted into product P, 
but P is being converted into S at a rate such that the amount of P present 
stays the same. 

Let us examine the equilibrium in a more quantitative way. Suppose 
that, in the absence of enzyme, the forward rate constant (kp) for the conver- 
sion of S into P is 10~*s~! and the reverse rate constant (Rp) for the conver- 
sion of P into S is 107° s~!. The equilibrium constant K is given by the ratio 
of these rate constants: 


10° ¢3 2 
10° °s! 
[P] kp 1074 
K 100 
[S] kg 10° 


The equilibrium concentration of P is 100 times that of 5, whether or not 
enzyme is present. However, it might take a very long time to approach this 
equilibrium without enzyme, whereas equilibrium would be attained rap- 
idly in the presence of a suitable enzyme (Table 8.1). Enzymes accelerate the 
attainment of equilibria but do not shift their positions. The equilibrium position 
is a function only of the free-energy difference between reactants and products. 


8.3 Enzymes Accelerate Reactions by Facilitating the 
Formation of the Transition State 


The free-energy difference between reactants and products accounts for the 
equilibrium of the reaction, but enzymes accelerate how quickly this equi- 
librium is attained. How can we explain the rate enhancement in terms of 
thermodynamics? To do so, we have to consider not the end points of the 
reaction but the chemical pathway between the end points. 

A chemical reaction of substrate 5 to form product P goes through a 
transition state X* that has a higher free energy than does either S or P. 


S—>x?—>P 


The double dagger denotes the transition state. The transition state is a 
transitory molecular structure that is no longer the substrate but is not yet 
the product. The transition state is the least-stable and most-seldom- 
occupied species along the reaction pathway because it is the one with the 
highest free energy. The difference in free energy between the transition 
state and the substrate is called the Gibbs free energy of activation or simply 
the activation energy, symbolized by AG* (Figure 8.3). 


AGt = Gy! — Gs 


Note that the energy of activation, or AG?*, does not enter into the final AG 
calculation for the reaction, because the energy required to generate the 
transition state is released when the transition state forms the product. The 
activation-energy barrier immediately suggests how an enzyme enhances 
the reaction rate without altering AG of the reaction: enzymes function to 
lower the activation energy, or, in other words, enzymes facilitate the forma- 
tion of the transition state. 


221 
8.3 The Transition State 


Transition state, Xt 


AG* (uncatalyzed) 
Pa ~ 
gt AG! (catalyzed) 
i M 


Substrate 


AG 
for the 
reaction 


Free energy —> 


Product 


Reaction progress —> 


FIGURE 8.3 Enzymes decrease the 
activation energy. Enzymes accelerate 
reactions by decreasing AG#, the free 
energy of activation. 
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CHAPTER 8 Enzymes: Basic Concepts 
and Kinetics 


“| think that enzymes are molecules that are 
complementary in structure to the activated 
complexes of the reactions that they 
catalyze, that is, to the molecular 
configuration that is intermediate between 
the reacting substances and the products of 
reaction for these catalyzed processes. The 
attraction of the enzyme molecule for the 
activated complex would thus lead to a 
decrease in its energy and hence to a 
decrease in the energy of activation of the 
reaction and to an increase in the rate of 
reaction.” 
—Linus Pauling 
Nature 161:707, 1948 


Maximal velocity Se 


Reaction velocity — 


Substrate concentration — 


FIGURE 8.4 Reaction velocity versus 
substrate concentration in an enzyme- 
catalyzed reaction. An enzyme-catalyzed 
reaction approaches a maximal velocity. 


One approach to understanding the increase in reaction rates achieved 
by enzymes is to assume that the transition state (X*) and the substrate (S) 
are in equilibrium. 


S == xt—_>p 


in which K* is the equilibrium constant for the formation of X* and v is the 
rate of formation of product from X*. The rate of the reaction v is propor- 
tional to the concentration of X*, 


v x [X*], 


because only X* can be converted into product. The concentration of X* at 
equilibrium is in turn related to the free-energy difference AG* between X* 
and §; the greater the difference in free energy between these two states, the 
smaller the amount of X*. Thus, the overall rate of reaction V depends on 
AG*. Specifically, 


V = v[X*] = “sje 8ovRt 

In this equation, k is Boltzmann’s constant, and h is Planck’s constant. The 
value of RT/h at 25°C is 6.6 X 10!2s~!. Suppose that the free energy of acti- 
vation is 28.53 kJ mol~! (6.82 kcal mol~!). If we were to substitute this value 
of AG in equation 7 (as shown in Table 8.3), this free-energy difference 
would result when the ratio [X*]/[S] is 10~>. If we assume for simplicity’s 
sake that [S] = 1 M, then the reaction rate Vis 6.2 X 10’ s~!. If AG* were 
lowered by 5.69 kJ mol~! (1.36 kcal mol~*), the ratio [X*]/[S] would then be 
10~‘*, and the reaction rate would be 6.2 X 108 s~!. A decrease of 5.69 kJ 
mol~! in AG# yields a 10-fold larger V. A relatively small decrease in AG* 
(20% in this particular reaction) results in a much greater increase in V. 

Thus, we see the key to how enzymes operate: enzymes accelerate reactions 
by decreasing AG*, the activation energy. The combination of substrate and 
enzyme creates a reaction pathway whose transition-state energy is lower 
than that of the reaction in the absence of enzyme (Figure 8.3). Because the 
activation energy is lower, more molecules have the energy required to reach 
the transition state. Decreasing the activation barrier is analogous to lower- 
ing the height of a high-jump bar; more athletes will be able to clear the bar. 
The essence of catalysis is facilitating the formation of the transition state. 


The formation of an enzyme-substrate complex is the first step in 
enzymatic catalysis 


Much of the catalytic power of enzymes comes from their binding to and 
then altering the structure of the substrate to promote the formation of the 
transition state. Thus, the first step in catalysis is the formation of an 
enzyme-substrate (ES) complex. Substrates bind to a specific region of the 
enzyme called the active site. Most enzymes are highly selective in the sub- 
strates that they bind. Indeed, the catalytic specificity of enzymes depends 
in part on the specificity of binding. 

What is the evidence for the existence of an enzyme—substrate complex? 


1. The first clue was the observation that, at a constant concentration of 
enzyme, the reaction rate increases with increasing substrate concentration 
until a maximal velocity is reached (Figure 8.4). In contrast, uncatalyzed 
reactions do not show this saturation effect. The fact that an enzyme-catalyzed 
reaction has a maximal velocity suggests the formation of a discrete ES complex. 
Ata sufficiently high substrate concentration, all the catalytic sites are filled, 
or saturated, and so the reaction rate cannot increase. Although indirect, the 
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hk FIGURE 8.5 Structure of an 
enzyme-substrate complex. (Left) The 
enzyme cytochrome P450 is illustrated 
bound to its substrate camphor. (Right) 
Notice that, in the active site, the substrate 
is surrounded by residues from the 
enzyme. Note also the presence of a heme 
cofactor. [Drawn from 2CPP.pdb.] 


ability to saturate an enzyme with substrate is the most general evidence for 
the existence of ES complexes. 


2. The spectroscopic characteristics of many enzymes and substrates change 
on the formation of an ES complex. These changes are particularly striking 
if the enzyme contains a colored prosthetic group (Problem 39). 


3. X-ray crystallography has provided high-resolution images of substrates 
and substrate analogs bound to the active sites of many enzymes (Figure 8.5). 
In Chapter 9, we will take a close look at several of these complexes. 


The active sites of enzymes have some common features 


The active site of an enzyme is the region that binds the substrates (and the 
cofactor, if any). It also contains the amino acid residues that directly partici- 
pate in the making and breaking of bonds. These residues are called the cata- 
lytic groups. In essence, the interaction of the enzyme and substrate at the active 
site promotes the formation of the transition state. The active site is the region 
of the enzyme that most directly lowers the AG* of the reaction, thus provid- 
ing the rate-enhancement characteristic of enzyme action. Recall from 
Chapter 2 that proteins are not rigid structures, but are flexible 
and exist in an array of conformations. Thus, the interaction 
of the enzyme and substrate at the active site and the formation of 
the transition state is a dynamic process. Although enzymes differ 
widely in structure, specificity, and mode of catalysis, a number of 
generalizations concerning their active sites can be stated: 


(A) 


1. The active site is a three-dimensional cleft, or crevice, formed 
by groups that come from different parts of the amino acid 
sequence: indeed, residues far apart in the amino acid sequence 
may interact more strongly than adjacent residues in the sequence, 
which may be sterically constrained from interacting with one 
another. In lysozyme, an enzyme that degrades the cell walls of 
some bacteria, the important groups in the active site are contrib- 
uted by residues numbered 35, 52, 62, 63, 101, and 108 in the 

' (B) N 
sequence of 129 amino acids (Figure 8.6). : 35 


52 62,63 


101 108 129 


FIGURE 8.6 Active sites may include distant residues. 
(A) Ribbon diagram of the enzyme lysozyme with several 
components of the active site shown in color. (B) A 
schematic representation of the primary structure of 


2. The active site takes up a small part of the total volume of an enzyme. 
Although most of the amino acid residues in an enzyme are not in 
contact with the substrate, the cooperative motions of the entire 


enzyme help to correctly position the catalytic residues at the active 
site. Experimental attempts to reduce the size of a catalytically active 
enzyme show that the minimum size requires about 100 amino acid 


lysozyme shows that the active site is composed of 
residues that come from different parts of the polypeptide 
chain. [Drawn from 6LYZ.pdb.] 
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FIGURE 8.7 Hydrogen bonds between 
an enzyme and substrate. The enzyme 
ribonuclease forms hydrogen bonds with 
the uridine component of the substrate. 
[Information from F. M. Richards, 

H. W. Wyckoff, and N. Allewell. In The 
Neurosciences: Second Study Program, 

F. O. Schmidt, Ed. (Rockefeller University 
Press, 1970), p. 970.] 
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residues. In fact, nearly all enzymes are made up of more than 100 amino acid 
residues, which gives them a mass greater than 10 kDa and a diameter of more 
than 25 A, suggesting that all amino acids in the protein, not just those at the 
active site, are ultimately required to form a functional enzyme. 


3. Active sites are unique microenvironments. In all enzymes of known struc- 
ture, active sites are shaped like a cleft, or crevice, to which the substrates 
bind. Water is usually excluded unless it is a reactant. The nonpolar micro- 
environment of the cleft enhances the binding of substrates as well as 
catalysis. Nevertheless, the cleft may also contain polar residues, some of 
which may acquire special properties essential for substrate binding or 
catalysis. The internal positions of these polar residues are biologically cru- 
cial exceptions to the general rule that polar residues are located on the 
surface of proteins, exposed to water. 


4. Substrates are bound to enzymes by multiple weak attractions. The nonco- 
valent interactions in ES complexes are much weaker than covalent bonds, 
which have energies between —210 and —460 kJ mol! (between —50 and 
—110 kcal mol~'). In contrast, ES complexes usually have equilibrium con- 
stants that range from 10~* to 107° M, corresponding to free energies of 
interaction ranging from about —13 to —50 kJ mol~! (from —3 to —12 kcal 
mol~!). As discussed in Section 1.3, these weak reversible contacts are 
mediated by electrostatic interactions, hydrogen bonds, and van der Waals 
forces. Van der Waals forces become significant in binding only when 
numerous substrate atoms simultaneously come close to many enzyme 
atoms through the hydrophobic effect. Hence, the enzyme and substrate 
should have complementary shapes. The directional character of hydrogen 
bonds between enzyme and substrate often enforces a high degree of speci- 
ficity, as seen in the RNA-degrading enzyme ribonuclease (Figure 8.7). 


5. The specificity of binding depends on the precisely defined arrangement of atoms 
in an active site. Because the enzyme and the substrate interact by means of 
short-range forces that require close contact, a substrate must have a matching 
shape to fit into the site. Emil Fischer proposed the lock-and-key analogy in 
1890 (Figure 8.8), which was the model for enzyme-—substrate interaction for 
several decades. We now know that enzymes are flexible and that the shapes of 
the active sites can be markedly modified by the binding of substrate, a process 
of dynamic recognition called induced fit (Figure 8.9). Moreover, the substrate 
may bind to only certain conformations of the enzyme, in what is called confor- 
mation selection. Thus, the mechanism of catalysis is dynamic, involving struc- 
tural changes with multiple intermediates of both reactants and the enzyme. 


a 


Substrate Substrate 
+_— 
Active site 
ES complex ES complex 
Enzyme Enzyme 
FIGURE 8.8 Lock-and-key model of enzyme-substrate FIGURE 8.9 Induced-fit model of enzyme-substrate binding. 
binding. In this model, the active site of the unbound enzyme In this model, the enzyme changes shape on substrate binding. The 


is complementary in shape to the substrate. 
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active site forms a shape complementary to the substrate only after 
the substrate has been bound. 


The binding energy between enzyme and substrate is 
important for catalysis 


Enzymes lower the activation energy, but where does the energy to lower the 
activation energy come from? Free energy is released by the formation of a 
large number of weak interactions between a complementary enzyme and its 
substrate. The free energy released on binding is called the binding energy. 
Only the correct substrate can participate in most or all of the interactions with 
the enzyme and thus maximize binding energy, accounting for the exquisite 
substrate specificity exhibited by many enzymes. Furthermore, the full com- 
plement of such interactions is formed only when the substrate is converted into the 
transition state. Thus, the maximal binding energy is released when the 
enzyme facilitates the formation of the transition state. The energy released by 
the interaction between the enzyme and the substrate can be thought of as 
lowering the activation energy. The interaction of the enzyme with the sub- 
strate and reaction intermediates is fleeting, with molecular movements 
resulting in the optimal alignment of functional groups at the active site so that 
maximum binding energy occurs only between the enzyme and the transition 
state, the least-stable reaction intermediate. However, the transition state is 
too unstable to exist for long. It collapses to either substrate or product, but 
which of the two accumulates is determined only by the energy difference 
between the substrate and the product—that is, by the AG of the reaction. 


8.4 The Michaelis-Menten Model Accounts for the Kinetic 
Properties of Many Enzymes 


The study of the rates of chemical reactions 1s called kinetics, and the study of the 
rates of enzyme-catalyzed reactions is called enzyme kinetics. A kinetic descrip- 
tion of enzyme activity will help us understand how enzymes function. We 
begin by briefly examining some of the basic principles of reaction kinetics. 


Kinetics is the study of reaction rates 
What do we mean when we say the “rate” of a chemical reaction? Consider 
a simple reaction: 
AP 
The rate V is the quantity of A that disappears in a specified unit of time. It 


is equal to the rate of the appearance of P, or the quantity of P that appears 
in a specified unit of time. 


V = —AA/AT = AP/AT (8) 


If A is yellow and P is colorless, we can follow the decrease in the concentra- 
tion of A by measuring the decrease in the intensity of yellow color with 
time. Consider only the change in the concentration of A for now. The rate 
of the reaction is directly related to the concentration of A by a proportion- 
ality constant, k, called the rate constant. 


V =R[A] (9) 


Reactions that are directly proportional to the reactant concentration are 
called first-order reactions. First-order rate constants have the units of s~!. 
Many important biochemical reactions include two reactants. For 
example, 
2A—>P 
or 
A+ BSP 
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8.4 Michaelis-Menten Model 
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FIGURE 8.10 Determining the relation 
between initial velocity and substrate 
concentration. (A) The amount of product 
formed at different substrate concentrations 
is plotted as a function of time. The initial 
velocity (Vq) for each substrate 
concentration is determined from the slope 
of the curve at the beginning of a reaction, 
when the reverse reaction is insignificant. 
(B) The values for initial velocity determined 
in part A are then plotted, with error bars, 
against substrate concentration. (C) The 
data points are connected to clearly reveal 
the relationship of initial velocity to 
substrate concentration. 


They are called bimolecular reactions and the corresponding rate equations 
often take the form 
V = R[A]’ (10) 
and 
V = R[A][B] (11) 
The rate constants, called second-order rate constants, have the units 
M!s1, 

Sometimes, second-order reactions can appear to be first-order reac- 
tions. For instance, in reaction 11, if B is present in excess and A is present 
at low concentrations, the reaction rate will be first order with respect to A 
and will not appear to depend on the concentration of B. These reactions are 
called pseudo-first-order reactions, and we will see them a number of times in 
our study of biochemistry. 

Interestingly enough, under some conditions, a reaction can be zero 
order. In these cases, the rate is independent of reactant concentrations. 
Enzyme-catalyzed reactions can approximate zero-order reactions under 
some circumstances (p. 228). 


The steady-state assumption facilitates a description of enzyme kinetics 


The simplest way to investigate the reaction rate is to follow the increase in 
reaction product as a function of time. First, the extent of product forma- 
tion is determined as a function of time for a series of substrate concentra- 
tions (Figure 8.10A). As expected, in each case, the amount of product 
formed increases with time, although eventually a time is reached when 
there is no net change in the concentration of S or P. The enzyme is still 
actively converting substrate into product and vice versa, but the reaction 
equilibrium has been attained. However, enzyme kinetics is more readily 
comprehended if we consider only the forward reaction. We can define the 
rate of catalysis Vo, or the initial rate of catalysis, as the number of moles of 
product formed per second when the reaction is just beginning—that is, 
when t ~ 0 (Figure 8.10A). These experiments are repeated three to five 
times with each substrate concentration to insure the accuracy of and assess 
the variability of the values attained. Next, we plot V, versus the substrate 
concentration [S], assuming a constant amount of enzyme, showing the 
data points with error bars (Figure 8.10B). Finally, the data points are con- 
nected, yielding the results shown in Figure 8.10C. The rate of catalysis 
rises linearly as substrate concentration increases and then begins to level 
off and approach a maximum at higher substrate concentrations. For con- 
venience, we will show idealized data without error bars, throughout the 
text, but it is important to keep in mind that in reality, all experiments are 
repeated multiple times. 

In 1913, Leonor Michaelis and Maud Menten proposed a simple model 
to account for these kinetic characteristics. The critical feature in their treat- 
ment is that a specific ES complex is a necessary intermediate in catalysis. 
The model proposed is 


E+S — ES ==E+P 
“1 =§ 
An enzyme E combines with substrate 5 to form an ES complex, with a rate 
constant k,. The ES complex has two possible fates. It can dissociate to E 
and 5, with a rate constant k_,, or it can proceed to form product P, with a 
rate constant k,. The ES complex can also be reformed from E and P by the 
reverse reaction with a rate constant k_ >. However, as before, we can sim- 
plify these reactions by considering the rate of reaction at times close to zero 


(hence, V,)) when there is negligible product formation and thus no back 
reaction (k_, [E][P] ~ 0). 


E+S=—=ES—>E+P (12) 
-1 

Thus, for the graph in Figure 8.11, V, 1s determined for each substrate 
concentration by measuring the rate of product formation at early times 
before P accumulates (Figure 8.10A). 

We want an expression that relates the rate of catalysis to the concentra- 
tions of substrate and enzyme and the rates of the individual steps. Our 
starting point is that the catalytic rate is equal to the product of the concen- 
tration of the ES complex and k,. 


Vo = k2[ES] (13) 

Now we need to express [ES] in terms of known quantities. The rates of 
formation and breakdown of ES are given by 

Rate of formation of ES = k,[E][S] (14) 

Rate of breakdown of ES = (k_; + ko)[ES] (15) 


We will use the steady-state assumption to simplify matters. In a steady state, 
the concentrations of intermediates—in this case, [ES]—-stay the same even if 
the concentrations of starting materials and products are changing. This steady 
state is reached when the rates of formation and breakdown of the ES complex 
are equal. Setting the right-hand sides of equations 14 and 15 equal gives 


Ri[E][S] = (k-1 + Rp) ES] (16) 
By rearranging equation 16, we obtain 
[EJ[S]/[ES] = (k-1 + ko) /Ri (17) 


Equation 17 can be simplified by defining a new constant, K,,, called the 
Michaelis constant: 
_ Ry + ko 
ky 
Note that K,, has the units of concentration and is independent of enzyme 
and substrate concentrations. As will be explained, K,, is an important 
characteristic of enzyme-—substrate interactions. 
Inserting equation 18 into equation 17 and solving for [ES] yields 
[E][S] 
ES] = —— 19 

[ES] = = (19) 
Now let us examine the numerator of equation 19. Because the substrate is usu- 
ally present at a much higher concentration than that of the enzyme, the con- 
centration of uncombined substrate [S] is very nearly equal to the total substrate 
concentration. The concentration of uncombined enzyme [EF] is equal to the 
total enzyme concentration [E]- minus the concentration of the ES complex: 


[E] = [E}7 — [ES] (20) 
Substituting this expression for [E] in equation 19 gives 


([E]r — LES))IS] 


Kyu (18) 


[ES] = (21) 
Ku 
Solving equation 21 for [ES] gives 
_ [E]r[S]/Km 
Pel 14 [S] /Ko (22) 


V ae Vinax 


Vinax/2 


Reaction velocity (Vo) — 


Substrate concentration [S] —> 


FIGURE 8.11 Michaelis-Menten 

kinetics. A plot of the reaction velocity (Vo) 
as a function of the substrate concentration 
[S] for an enzyme that obeys Michaelis— 
Menten kinetics shows that the maximal 
velocity (V.na,) is approached asymptotically. 
The Michaelis constant (Ky) is the substrate 
concentration yielding a velocity of V,.../2. 
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or 
[S] 

ES] = [E 23 

ES] = TElne e (23) 

By substituting this expression for [ES] into equation 13, we obtain 

[S] 

= 4 

Vo = Rol[E}r iS) Ry (24) 


The maximal rate, | ae is attained when the catalytic sites on the 


enzyme are saturated with substrate—that is, when [ES] = [E]-.. Thus, 
Via = ko[E]7 (25) 


Substituting equation 25 into equation 24 yields the Michaelis-Menten 
equation: 
[S] 
Vo Vmax] ie RG (26) 

This equation accounts for the kinetic data given in Figure 8.11. At very 
low substrate concentration, when [5] is much less than Kyy, Vo = (Vinax/Kyy) 
[5]; that is, the reaction is first order with the rate directly proportional to the 
substrate concentration. At high substrate concentration, when [5S] is much 
greater than K\,, V) = V_,a,; that is, the rate is maximal. The reaction is zero 
order, independent of substrate concentration. 

The significance of K,, is clear when we set [S] = K,, in equation 26. 
When [S] = K,, then V, = V,,,,/2- Thus, Ky, is equal to the substrate con- 
centration at which the reaction rate is half its maximal value. As we will see, 
K,, 1s an important characteristic of an enzyme-catalyzed reaction and is 


significant for its biological function. 


Variations in K,, can have physiological consequences 


e The physiological consequence of K, is illustrated by the sensitivity 

of some persons to ethanol. Such persons exhibit facial flushing and 
rapid heart rate (tachycardia) after ingesting even small amounts of alcohol. 
In the liver, alcohol dehydrogenase converts ethanol into acetaldehyde. 


Alcohol 
dehydrogenase 
ee 


CH,CH,OH + NAD* CH,CHO + NADH + H* 
Ethanol Acetaldehyde 


Normally, the acetaldehyde, which is the cause of the symptoms when present 
at high concentrations, is processed to acetate by aldehyde dehydrogenase. 


Aldehyde 
dehydrogenase 
——— 


CH,CHO + NAD* + H,O CH,COO7 + NADH + 2H* 


Acetate 


Most people have two forms of the aldehyde dehydrogenase, a low K,, 
mitochondrial form and a high K,, cytoplasmic form. In susceptible per- 
sons, the mitochondrial enzyme is less active owing to the substitution of a 
single amino acid, and acetaldehyde is processed only by the cytoplasmic 
enzyme. Because this enzyme has a high K,,, it achieves a high rate of 
catalysis only at very high concentrations of acetaldehyde. Consequently, 
less acetaldehyde is converted into acetate; excess acetaldehyde escapes into 
the blood and accounts for the physiological effects. 


Ky and V,,,, values can be determined by several means 


K\, 1s equal to the substrate concentration that yields V,_,,,/2; however V,...5 
like perfection, is only approached but never attained. How, then, can we 


experimentally determine K,, and V,,,,, and how do these parameters enhance 


our understanding of enzyme-catalyzed reactions? The Michaelis constant, 
K,,, and the maximal rate, V,_,,,, can be readily derived from rates of catalysis 
measured at a variety of substrate concentrations if an enzyme operates accord- 
ing to the simple scheme given in equation 26. The derivation of K,, and V,.,., 
is most commonly achieved with the use of curve-fitting programs on a com- 
puter. However, an older method, although rarely used because the data points 
at high and low concentrations are weighted differently and thus sensitive to 
errors, is a source of further insight into the meaning of K,, and V,..,.. 

Before the availability of computers, the determination of K,, and V,,.. 
values required algebraic manipulation of the Michaelis-Menten equation. 
The Michaelis-Menten equation is transformed into one that gives a 
straight-line plot that yields values for V,,,, and K,. Taking the reciprocal 
of both sides of equation 26 gives 


1_ Km 1, 1 
Vo Vmax 5 V rasa 
A plot of 1/V, versus 1/{S], called a Lineweaver—Burk or double- 


reciprocal plot, yields a straight line with a y-intercept of 1/V,,,, anda 
slope of K,,/V,,,, (Figure 8.12). The intercept on the x-axis is —1/K),. 


(27) 


Ky and V,,,, values are important enzyme characteristics 


The K,, values of enzymes range widely (Table 8.4). For most enzymes, K), 
lies between 107! and 1077 M. The K,,, value for an enzyme depends on the 
particular substrate and on environmental conditions such as pH, temperature, 
and ionic strength. The Michaelis constant, K,,, as already noted, is equal to 
the concentration of substrate at which half the active sites are filled. Thus, K,, 
provides a measure of the substrate concentration required for significant 
catalysis to take place. For many enzymes, experimental evidence suggests that 
the K),, value provides an approximation of the substrate concentration in vivo, 
which in turn suggests that most enzymes evolved to have a K,, approximately 
equal to the substrate concentration commonly available. Why might it be 
beneficial to have a K,, value approximately equal to the commonly available 
substrate concentration? If the normal concentration of substrate is near K, ,, 
the enzyme will display significant activity and yet the activity will be sensitive 
to changes in environmental conditions—that is, changes in substrate concen- 
tration. At values below K, ,, enzymes are very sensitive to changes in substrate 
concentration but display little activity. At substrate values well above K,,, 
enzymes have great catalytic activity but are insensitive to changes in substrate 
concentration. Thus, with the normal substrate concentration being approxi- 
mately K,,, the enzymes have significant activity (1/2 V,,,,,) but are still sensi- 
tive to changes in substrate concentration. 

Under certain circumstances, K), reflects the 
strength of the enzyme-substrate interaction. In 


229 
8.4 Michaelis-Menten Model 


1/Vo Slope =Ku/Vinax 


Intercept = —1/Ky 


Pe Intercept = 1/V/nax 


0 1/[S] 


FIGURE 8.12 A double-reciprocal or 
Lineweaver-Burk plot. A double- 
reciprocal plot of enzyme kinetics is 
generated by plotting 1/V, as a function of 
1/[S]. The slope is Ky/Vina,, the intercept 
on the vertical axis is 1/V,,,,, and the 
intercept on the horizontal axis is —1/Kyy. 


TABLE 8.4 K,, values of some enzymes 


: : ‘ Enz Substrat K M 
equation 18, K), is defined as (k_, + k,)/kj. —— acalata m (HM) 
Consider a case in which k_, is much greater than — Chymotrypsin Acetyl-L-tryptophanamide a 
k,. Under such circumstances, the ES complex dis- Lysozyme : Hexa-N-acetylglucosamine : 

: : B-Galactosidase Lactose 4000 

sociates to E and 5 much more rapidly than prod- eee ee 208 

. ee reonine deaminase reonine 
uct is formed. Under these conditions (k_; >> k2) ee tere CO Sano 
2 

eee k_1 28 Penicillinase Benzylpenicillin 50 

M~ k 1 ( ) Pyruvate carboxylase Pyruvate 400 

x ‘ : ae HCO; 1000 

Equation 28 describes the dissociation constant ae i 

of the ES complex. Arginine-tRNA synthetase Arginine 3 
[E][S] ke tRNA 0.4 

Kina (29) ATP 300 
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TABLE 8.5 Turnover numbers of 
some enzymes 


Turnover number 


Enzyme (per second) 
Carbonic anhydrase 600,000 
3-Ketosteroid isomerase 280,000 
Acetylcholinesterase 25,000 
Penicillinase 2,000 
Lactate dehydrogenase 1,000 
Chymotrypsin 100 
DNA polymerase I 15 
Tryptophan synthetase 2) 
Lysozyme 0.5 


In other words, K,; is equal to the dissociation constant of the ES complex 
if k, is much smaller than k_,. When this condition is met, K), is a measure 
of the strength of the ES complex: a high K,, indicates weak binding; a low 
Ky, indicates strong binding. It must be stressed that K,, indicates the 
affinity of the ES complex only when k_, is much greater than k,. 

The maximal rate, V vas reveals the turnover number of an enzyme, 
which is the number of substrate molecules converted into product by an 
enzyme molecule in a unit time when the enzyme is fully saturated with 
substrate. It is equal to the rate constant k, which is also called k.,,. The 
maximal rate, V___, reveals the turnover number of an enzyme if the con- 
centration of active sites [E], is known, because 


ane = Reat LE} 7 (30) 
and thus 
Reat = Vienax/ LE (31) 
For example, a 10~° M solution of carbonic anhydrase catalyzes the forma- 
tion of 0.6 M H,CO, per second when the enzyme is fully saturated with 
substrate. Hence, ko 18 6 X 10° s—!. This turnover number is one of the 
largest known. Each Gatalyzed reaction takes place in a time equal to, on 
average, 1/k,,,, which is 1.7 ys for carbonic anhydrase. The turnover num- 
bers of most enzymes with their physiological substrates range from 1 to 104 
per second (Table 8.5). 
K,,and V,,,,, also permit the determination of rs) the fraction of active sites 
filled. This relation of fre to Ky, and V,,, 1s given by the following equation: 


4 [S] 


tS Vee 81+ Ku es 


k.at/Ky is a measure of catalytic efficiency 


When the substrate concentration is much greater than K,,, the rate of 
catalysis is equal to V...., which is a function of k,,,, the turnover number, 
as already described. However, most enzymes are not normally saturated 
with substrate. Under physiological conditions, the [S]/K), ratio is typically 
between 0.01 and 1.0. When [S] < Kyy, the enzymatic rate is much less 
than k,,, because most of the active sites are unoccupied. Is there a number 
that characterizes the kinetics of an enzyme under these more typical cel- 
lular conditions? Indeed there is, as can be shown by combining equations 
13 and 19 to give 


Vo= [E][S] (33) 


Kut 
When [S] < Ky, the concentration of free enzyme [EF], is nearly equal 
to the total concentration of enzyme [E].,; so 


Reat 
Vo= > IlElr (34) 


Thus, when [5S] < Ky, the enzymatic velocity depends on the values of 

Rat! Kyp [5], and [E].. Under these conditions, k.,./K,, 1s the rate constant 
for the interaction of S and E. The rate constant k engl called the specific- 
ity constant, is a measure of catalytic efficiency because it takes into account 
both the rate of catalysis with a particular substrate (k,,,) and the nature of 
the enzyme-substrate interaction (K,,). For instance, by using k,,,/Ky, 
values, we can compare an enzyme’s preference for different substrates. 
Table 8.6 shows the k,,,/Ky, values for several different substrates of chy- 
motrypsin. Chymotrypsin clearly has a preference for cleaving next to 
bulky, hydrophobic side chains. 


TABLE 8.6 Substrate preferences of chymotrypsin 


Amino acid in ester Amino acid side chain Reat/ Ky (87! M1) 
Glycine —H 13x 107! 
CH, 
l 
Valine ae 2.0 
CH, 
Norvaline —CH,CH,CH, 3.6 X 102 
Norleucine —CH,CH,CH,CH, 3.0 X 103 


Phenylalanine -c,—{ 1.0 x 105 


Source: Information from A. Fersht, Structure and Mechanism in Protein Science: A Guide to Enzyme 
Catalysis and Protein Folding (W. H. Freeman and Company, 1999), Table 7.3. 


How efficient can an enzyme be? We can approach this question by 
determining whether there are any physical limits on the value of k.,,/Ky,. 
Note that the k cat Ky ratio depends on k,, k_,, andk cat 28 Can be shown by 
substituting for K,,. 


Reatky Reat 

Peat! Kk ky 7 Reat (., 2 ke . Pi ey 
Suppose that the rate of formation of product (k,,,) is much faster than the 
rate of dissociation of the ES complex (k_,). The value of k.,,/K, then 
approaches k,. Thus, the ultimate limit on the value of k.,,/Ky, 1s set by k,, 
the rate of formation of the ES complex. This rate cannot be faster than the 
diffusion-controlled encounter of an enzyme and its substrate. Diffusion limits 
the value of k, and so it cannot be higher than between 10% and 10’s~! M71. 
Hence, the upper limit on k,,,/Ky, is between 108 and 10? s"! M4, 

The k,,,/ Ky, ratios of the enzymes superoxide dismutase, acetylcholin 
esterase, and triose phosphate isomerase are between 10° and 10? s~! M71. 
Enzymes that have k,,,/Ky, ratios at the upper limits have attained kinetic 
perfection. Their catalytic velocity is restricted only by the rate at which they 
encounter substrate in the solution (‘Table 8.7). Any further gain in catalytic 
rate can come only by decreasing the time for diffusion of the substrate into 
the enzyme’s immediate environment. Remember that the active site is only 
a small part of the total enzyme structure. Yet, for catalytically perfect 
enzymes, every encounter between enzyme and substrate is productive. In 
these cases, there may be attractive electrostatic forces on the enzyme that 
entice the substrate to the active site. These forces are sometimes referred to 
poetically as Circe effects. 

The diffusion of a substrate throughout a solution can also be partly 
overcome by confining substrates and products in the limited volume of a 
multienzyme complex. Indeed, some series of enzymes are organized into 
complexes so that the product of one enzyme is very rapidly found by the 
next enzyme. In effect, products are channeled from one enzyme to the 
next, much as in an assembly line. 


Most biochemical reactions include multiple substrates 


Most reactions in biological systems start with two substrates and yield two 
products. They can be represented by the bisubstrate reaction: 


A+B==P+Q 


Many such reactions transfer a functional group, such as a phosphory] or 
an ammonium group, from one substrate to the other. Those that are 
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TABLE 8.7 Enzymes for which k,,,/Ky 
is close to the diffusion- 
controlled rate of encounter 


Enzyme Beat! Ky (s~! M~1) 
Acetylcholinesterase 1.6 X 108 
Carbonic anhydrase 8.3 X 107 
Catalase 4x 107 
Crotonase DBS 110? 
Fumarase 1.6 x 108 
Triose phosphate De SAMO 
isomerase 
B-Lactamase 1x 108 
Superoxide dismutase 7X 10° 


Source: Information from A. Fersht, Structure and 
Mechanism in Protein Science: A Guide to Enzyme 
Catalysis and Protein Folding (W. H. Freeman and 
Company, 1999), Table 4.5. 


Circe effect 
The utilization of attractive forces to lure a 
substrate into a site in which it undergoes a 
transformation of structure, as defined by 
William P. Jencks, an enzymologist, who 
coined the term. 

A goddess of Greek mythology, Circe 
lured Odysseus’s men to her house and then 
transformed them into pigs. 
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oxidation—reduction reactions transfer electrons between substrates. 
Multiple substrate reactions can be divided into two classes: sequential reac- 
tions and double-displacement reactions. 


Sequential reactions. In sequential reactions, all substrates must bind to the 
enzyme before any product 1s released. Consequently, in a bisubstrate reac- 
tion, a ternary complex of the enzyme and both substrates forms. Sequential 
mechanisms are of two types: ordered, in which the substrates bind the 
enzyme in a defined sequence, and random. 

Many enzymes that have NAD* or NADH as a substrate exhibit the 
ordered sequential mechanism. Consider lactate dehydrogenase, an impor- 
tant enzyme in glucose metabolism (Section 16.1). This enzyme reduces 
pyruvate to lactate while oxidizing NADH to NAD*. 


= Oo.” (e) 
Oo. .O Scw 
Nc* | 
| + NADH + H* HO ; H + NAD+ 
Cc 
A 
o~ CH; CH; 
Pyruvate Lactate 


In the ordered sequential mechanism, the coenzyme always binds first and 
the lactate is always released first. This sequence can be represented by 
using a notation developed by W. Wallace Cleland: 


NADH Pyruvate Lactate © NAD* 


Enzyme i a a Enzyme 


E (NADH) (pyruvate) ——— E (lactate) (NAD*) 


The enzyme exists as a ternary complex consisting of, first, the enzyme and 
substrates and, after catalysis, the enzyme and products. 

In the random sequential mechanism, the order of the addition of sub- 
strates and the release of products is random. An example of a random 
sequential reaction is the formation of phosphocreatine and ADP from cre- 
atine and ATP which is catalyzed by creatine kinase (Section 15.2). 


+ 2 
NH> NH, O 
aS ae: 
O. Si O. Co 
ao “ie “SNH, + ATP — ae aie SS + ADP 
fe) CH; fe) CH; 
Creatine Phosphocreatine 


Either creatine or ATP may bind first, and either phosphocreatine or ADP 
may be released first. Phosphocreatine is an important energy source in mus- 
cle. Sequential random reactions also can be depicted in the Cleland notation. 


ATP Creatine Phosphocreatine ADP 


iY 


E (creatine) E (phosphocreatine) 
| | (ATP) : (ADP) 


Creatine ATP ADP _ Phosphocreatine 


Enzyme >— Enzyme 


Although the order of certain events is random, the reaction still passes through 
the ternary complexes including, first, substrates and, then, products. 


Double-displacement (ping-pong) reactions. In double-displacement, 
or ping-pong, reactions, one or more products are released before all sub- 
strates bind the enzyme. The defining feature of double-displacement 
reactions is the existence of a substituted enzyme intermediate, in which 
the enzyme is temporarily modified. Reactions that shuttle amino 
groups between amino acids and a-ketoacids are classic examples of 
double-displacement mechanisms. The enzyme aspartate aminotrans- 
ferase catalyzes the transfer of an amino group from aspartate to 
a-ketoglutarate. 


; ee -00C 
ye CH, coo- 
/ / ha 
H2C H2C H2C 
H2C 
H + IN ——_ \ + \ H 
C C—COO- C—cCOO- Cx. 
+HN~  ~COO- 7 Jf +H;N~  ~COO- 
Aspartate a-Ketoglutarate Oxaloacetate Glutamate 


The sequence of events can be portrayed as the following Cleland notation: 
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Aspartate Oxaloacetate a-Ketoglutarate Glutamate 
Enzyme | l | l Enzyme 
— (E-NH3) ————= _(E-NH3) (E-NH3) ——— 


(aspartate) (oxaloacetate) (a-ketoglutarate) 


After aspartate binds to the enzyme, the enzyme accepts aspartate’s amino 
group to form the substituted enzyme intermediate. The first product, oxa- 
loacetate, subsequently departs. The second substrate, a-ketoglutarate, 
binds to the enzyme, accepts the amino group from the modified enzyme, 
and is then released as the final product, glutamate. In the Cleland notation, 
the substrates appear to bounce on and off the enzyme much as a Ping-Pong 
ball bounces on a table. 


Allosteric enzymes do not obey Michaelis-Menten kinetics 


The Michaelis-Menten model has greatly assisted the development of 
enzymology. Its virtues are simplicity and broad applicability. However, the 
Michaelis-Menten model cannot account for the kinetic properties of many 
enzymes. An important group of enzymes that do not obey Michaelis— 
Menten kinetics are the allosteric enzymes. These enzymes consist of multi- 
ple subunits and multiple active sites. 

Allosteric enzymes often display sigmoidal plots of the reaction velocity 
V, versus substrate concentration [5] (Figure 8.13), rather than the hyper- 
bolic plots predicted by the Michaelis-Menten equation (Figure 8.11). In 
allosteric enzymes, the binding of substrate to one active site can alter the 
properties of other active sites in the same enzyme molecule. A possible 
outcome of this interaction between subunits is that the binding of substrate 
becomes cooperative; that is, the binding of substrate to one active site facili- 
tates the binding of substrate to the other active sites. Such cooperativity 
results in a sigmoidal plot of V, versus [S]. In addition, the activity of an 
allosteric enzyme may be altered by regulatory molecules that reversibly 
bind to specific sites other than the catalytic sites. The catalytic properties 
of allosteric enzymes can thus be adjusted to meet the immediate needs of a 
cell. For this reason, allosteric enzymes are key regulators of metabolic path- 
ways (Chapter 10). Recall that we have already met an allosteric protein, 
hemoglobin, in Chapter 7. 


E 
(glutamate) 


Reaction velocity Vp — 


Substrate concentration [S] — 


FIGURE 8.13 Kinetics for an allosteric 
enzyme. Allosteric enzymes display a 
sigmoidal dependence of reaction velocity 
on substrate concentration. 


(A) Substrate 


(B) Competitive 


ies 


Uncompetitive 
inhibitor 
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((@) Substrate \ 


(D) Substrate 
Noncompetitive 


inhibitor 
‘ 


FIGURE 8.14 Distinction between 
reversible inhibitors. (A) Enzyme-substrate 
complex; (B) a competitive inhibitor binds 

at the active site and thus prevents the 
substrate from binding; (C) an uncompetitive 
inhibitor binds only to the enzyme-substrate 
complex; (D) a noncompetitive inhibitor does 
not prevent the substrate from binding. 


H 
H5N N N 
ye 
HN ZA 
N 
Oo N 
“ui” 


Dihydrofolate 


HN N N 
N = 
ZO N 
NH, N 
HC” 


Methotrexate 


FIGURE 8.15 Enzyme inhibitors. The substrate dihydrofolate and its 
structural analog methotrexate. Regions with structural differences 


are shown in red. 
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The activity of many enzymes can be inhibited by the binding of specific 
small molecules and ions. This means of inhibiting enzyme activity serves as 
a major control mechanism in biological systems, typified by the regulation of 
allosteric enzymes. In addition, many drugs and toxic agents act by inhibiting 
enzymes (Chapter 36). This type of enzyme inhibition is not usually the result 
of evolutionary forces, as it is for allosteric enzymes, but rather due to design 
of inhibitors by scientists or simple chance discovery of inhibitory molecules. 
Examining inhibition can bea source of insight into the mechanism of enzyme 
action: specific inhibitors can often be used to identify residues critical for 
catalysis. Transition-state analogs are especially potent inhibitors. 

Enzyme inhibition can be either irreversible or reversible. An irreversible 
inhibitor dissociates very slowly from its target enzyme because it has 
become tightly bound to the enzyme, either covalently or noncovalently. 
Some irreversible inhibitors are important drugs. Penicillin acts by cova- 
lently modifying the enzyme transpeptidase, thereby preventing the syn- 
thesis of bacterial cell walls and thus killing the bacteria (p. 239). Aspirin 
acts by covalently modifying the enzyme cyclooxygenase, reducing the 
synthesis of signaling molecules in inflammation. 

Reversible inhibition, in contrast with irreversible inhibition, is characterized 
by a rapid dissociation of the enzyme—inhibitor complex. In the type of revers- 
ible inhibition called competitive inhibition, an enzyme can bind substrate 
(forming an ES complex) or inhibitor (EI) but not both (ESI, enzyme— 
substrate-inhibitor complex). The competitive inhibitor often resembles the 
substrate and binds to the active site of the enzyme (Figure 8.14). The substrate 
is thereby prevented from binding to the same active site. A competitive inhibitor 
diminishes the vate of catalysis by reducing the proportion of enzyme molecules 
bound to a substrate. At any given inhibitor concentration, competitive inhibi- 
tion can be relieved by increasing the substrate concentration. Under these 
conditions, the substrate successfully competes with the inhibitor for the active 
site. Methotrexate is an especially potent competitive inhibitor of the enzyme 
dihydrofolate reductase, which plays a role in the biosynthesis of purines 
and pyrimidines. Methotrexate is a structural analog of 
dihydrofolate, a substrate for dihydrofolate reductase 
(Figure 8.15). What makes it such a potent competitive 
inhibitor is that it binds to the enzyme 1000 times as tightly 
as the natural substrate binds, and it inhibits nucleotide 
base synthesis. It is used to treat cancer (Section 25.3). 

Uncompetitive inhibition is essentially substrate- 
dependent inhibition in that the inhibitor binds only to the 
enzyme-substrate complex. The binding site of an uncom- 
petitive inhibitor is created only on interaction of the enzyme 
and substrate (Figure 8.14C). Uncompetitive inhibition 
cannot be overcome by the addition of more substrate. 

In noncompetitive inhibition, the inhibitor and substrate 
can bind simultaneously to an enzyme molecule at different 
binding sites (Figure 8.14D). Unlike uncompetitive inhibi- 
tion, anoncompetitive inhibitor can bind free enzyme or the 
enzyme-substrate complex. A noncompetitive inhibitor 
acts by decreasing the concentration of functional enzyme 
rather than by diminishing the proportion of enzyme mol- 
ecules that are bound to substrate. The net effect is to 
decrease the turnover number. Noncompetitive inhibition, 
like uncompetitive inhibition, cannot be overcome by 


increasing the substrate concentration. A more complex pattern, called mixed 
inhibition, is produced when a single inhibitor both hinders the binding of sub- 
strate and decreases the turnover number of the enzyme. 


The different types of reversible inhibitors are kinetically distinguishable 


How can we determine whether a reversible inhibitor acts by competitive, 
uncompetitive, or noncompetitive inhibition? Let us consider only enzymes that 
exhibit Michaelis-Menten kinetics. Measurements of the rates of catalysis at 
different concentrations of substrate and inhibitor serve to distinguish the three 
types of inhibition. In competitive inhibition, the inhibitor competes with the 
substrate for the active site. The dissociation constant for the inhibitor is given by 


Kj = (E][1/(ET) 


The smaller the K,, the more potent the inhibition. The hallmark of competitive 
inhibition is that it can be overcome by a sufficiently high concentration sub- 
strate (Figure 8.16). The effect of a competitive inhibitor is to increase the 
apparent value of K,,, meaning that more substrate is needed to obtain the 
same reaction rate. This new value of K, ,, called K 4 is numerically equal to 


Ky = Ku(1 + [1]/Ki) 


where [I] 1s the concentration of inhibitor and K, is the dissociation constant 
for the enzyme—inhibitor complex. In the presence of a competitive inhibi- 
tor, an enzyme will have the same V,,.. as in the absence of an inhibitor. At 
a sufficiently high concentration, virtually all the active sites are filled with 
substrate, and the enzyme is fully operative. 

Competitive inhibitors are commonly used as drugs. Drugs such as ibu- 
profen are competitive inhibitors of enzymes that participate in signaling 
pathways in the inflammatory response. Statins are drugs that reduce high 
cholesterol levels by competitively inhibiting a key enzyme in cholesterol 
biosynthesis (Section 26.3). 

In uncompetitive inhibition, the , 
inhibitor binds only to the ES com- Fe{—s<oEs41—>eaP 
plex. This enzyme-substratesnhibi- 
tor complex, ESI, does not go on to Kl 
form any product. Because some 
unproductive ESI complex will 
always be present, V,_... will be lower 
in the presence of inhibitor than in its 
absence (Figure 8.17). The uncom- 
petitive inhibitor lowers the apparent 
value of K,, because the inhibitor 
binds to ES to form ESI, depleting 
ES. To maintain the equilibrium 
between E and ES, more 5 binds to 
E, increasing the apparent value of k, 0 
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and thereby reducing the apparent 
value of K,, (see equation 18). Thus, 
a lower concentration of S is required 
to form half of the maximal concen- 
tration of ES. The herbicide glypho- 
sate, also known as Roundup, is an 
uncompetitive inhibitor of an 
enzyme in the biosynthetic pathway 
for aromatic amino acids. 

In noncompetitive inhibition (Fig- 
ure 8.18), substrate can bind either 
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FIGURE 8.17 Kinetics of an 
uncompetitive inhibitor. The reaction 
pathway shows that the inhibitor binds only 
to the enzyme-substrate complex. 
Consequently, V.,,, cannot be attained, 
even at high substrate concentrations. The 
apparent value for K,, is lowered, becoming 
smaller as more inhibitor is added. 
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FIGURE 8.16 Kinetics of a competitive 
inhibitor. As the concentration of a 
competitive inhibitor increases, higher 
concentrations of substrate are required to 
attain a particular reaction velocity. The 
reaction pathway suggests how sufficiently 
high concentrations of substrate can 
completely relieve competitive inhibition. 
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FIGURE 8.18 Kinetics of a 
noncompetitive inhibitor. The reaction 
pathway shows that the inhibitor binds both 
to free enzyme and to an enzyme-substrate 
complex. Consequently, as with uncompetitive 
competition, V,,,, cannot be attained. In 
pure noncompetitive inhibition, K,, remains 
unchanged, and so the reaction rate 
increases more slowly at low substrate 
concentrations than is the case for 
uncompetitive competition. 
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FIGURE 8.19 Competitive inhibition 
illustrated on a double-reciprocal plot. A 
double-reciprocal plot of enzyme kinetics in 
the presence and absence of a competitive 
inhibitor illustrates that the inhibitor has no 
effect on V_.,, but increases Kj). 
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FIGURE 8.20 Uncompetitive inhibition 
illustrated by a double-reciprocal plot. 
An uncompetitive inhibitor does not affect 
the slope of the double-reciprocal plot. V,,,, 
and K,, are reduced by equivalent amounts. 
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FIGURE 8.21 Noncompetitive inhibition 
illustrated on a double-reciprocal plot. 
A double-reciprocal plot of enzyme kinetics 
in the presence and absence of a pure 
noncompetitive inhibitor shows that Ky is 
unaltered and V,_., is decreased. 


to the enzyme or the enzyme-—inhibitor complex. However, the enzyme— 
inhibitor-substrate complex does not proceed to form product. In pure 
noncompetitive inhibition, the K; for the inhibitor binding to E is the same 
as for binding to ES complex. The value of V,,,, is decreased to a new value 
called Vitis, whereas the value of K,, is unchanged. The maximal velocity 


in the presence of a pure noncompetitive inhibitor, V22E, is given by 
V, 
yare. = ——= 36 


Why is V_.,,, lowered though K), remains unchanged? In essence, the 
inhibitor simply lowers the concentration of functional enzyme. The 
resulting solution behaves as a more dilute solution of enzyme does. 
Noncompetitive inhibition cannot be overcome by increasing the substrate 
concentration. Doxycycline, an antibiotic, functions at low concentra- 
tions as a noncompetitive inhibitor of a proteolytic enzyme (collagenase). 
It is used to treat periodontal disease. Some of the toxic effects of lead 
poisoning may be due to lead’s ability to act as a noncompetitive inhibi- 
tor of a host of enzymes. Lead reacts with crucial sulfhydryl groups in 
these enzymes. 

Double-reciprocal plots are especially useful for distinguishing between 
competitive, uncompetitive, and noncompetitive inhibitors. In competitive 
inhibition, the intercept on the y-axis of the plot of 1/V, versus 1/[S] is the 
same in the presence and in the absence of inhibitor, although the slope is 
increased (Figure 8.19). The intercept is unchanged because a competitive 
inhibitor does not alter V,,,,,. The increase in the slope of the 1/V, versus 
1/[S] plot indicates the strength of binding of a competitive inhibitor. In the 
presence of a competitive inhibitor, equation 27 is replaced by 


a +a + O)(z) (37) 
Vo Vewax Venax K; [5] 

In other words, the slope of the plot is increased by the factor (1 + [I]/K.) 
in the presence of a competitive inhibitor. Consider an enzyme with a Ky of 
10~* M. In the absence of inhibitor, when Vy = Vmax/2 when [S] = 10-4 M. 
In the presence of a 2 X 10~% M competitive inhibitor that is bound to 
the enzyme with a K, of 10~% M, the apparent K,, (K Ne) will be equal to 
K,,(1 + [IJ/K,), or 3 X 107* M. Substitution of these values into equation 
37 gives when V, = V,,,,/4, when [S] = 10-4 M. The presence of the 
competitive inhibitor thus cuts the reaction rate in half at this substrate 
concentration. 

In uncompetitive inhibition (Figure 8.20), the inhibitor combines only 
with the enzyme-substrate complex. The equation that describes the dou- 
ble-reciprocal plot for an uncompetitive inhibitor is 


1 Kw 1, 1 1] 
Vo 7 Vina [S] | (1 7 RD) (38) 


The slope of the line, Kyy / on is the same as that for the uninhibited 
enzyme, but the intercept on the y-axis will be increased by 1 + [I]/K.. 
Consequently, the lines in double-reciprocal plots will be parallel. 

In pure noncompetitive inhibition (Figure 8.21), the inhibitor can com- 
bine with either the enzyme or the enzyme-substrate complex with the 
same dissociation constant. The value of V,,, 1s decreased to the new value 
V2FF., and so the intercept on the vertical axis is increased (equation 36). The 
new slope, which is equal to Ky / VR, is larger by the same factor. In con- 
trast with V__,., Ky, 1s not affected by pure noncompetitive inhibition. 


max’ 


Irreversible inhibitors can be used to map the active site 


In Chapter 9, we will examine the chemical details of how enzymes function. 
The first step in obtaining the chemical mechanism of an enzyme is to deter- 
mine what functional groups are required for enzyme activity. How can we 
ascertain what these functional groups are? X-ray crystallography of the 
enzyme bound to its substrate or substrate analog provides one approach. 
Irreversible inhibitors that covalently bond to the enzyme provide an alter- 
native and often complementary approach: the inhibitors modify the func- 
tional groups, which can then be identified. Irreversible inhibitors can be 
divided into three categories: group-specific reagents, reactive substrate 
analogs (also called affinity labels), and suicide inhibitors. 

Group-specific reagents react with specific side chains of amino acids. An 
example of a group-specific reagent is diisopropylphosphofluoridate (DIPF). 
DIPF modifies only 1 of the 28 serine residues in the proteolytic enzyme chymo- 
trypsin and yet inhibits the enzyme, implying that this serine residue is espe- 
cially reactive. We will see in Chapter 9 that this serine residue is indeed located 
at the active site. DIPF also revealed a reactive 
serine residue in acetylcholinesterase, an enzyme 
important in the transmission of nerve impulses 
(Figure 8.22). Thus, DIPF and similar com- 
pounds that bind and inactivate acetylcholines- 
terase are potent nerve gases. Most group-specific OH 
reagents do not display the exquisite specificity 
shown by DIPF. Consequently, more specific 
means of modifying the active site are required. 

Affinity labels, or reactive substrate analogs, are 
molecules that are structurally similar to the sub- 
strate for an enzyme and that covalently bind to 
active-site residues. They are thus more specific Acetylcholin- BIbE 
for the enzyme’s active site than are group-specific esterase 
reagents. Tosyl-L-phenylalanine chloromethyl 
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FIGURE 8.22 Enzyme inhibition by diisopropylphosphofluoridate (DIPF), a 


ketone (TPCK) is a substrate analog for chymo- group-specific reagent. DIPF can inhibit an enzyme by covalently modifying 


trypsin (Figure 8.23). TPCK binds at the active —_a crucial serine residue. 
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Tosyl-.-phenylalanine chloromethyl ketone (TPCK) R 


FIGURE 8.23 Affinity labeling. (A) Tosyl-L- 
phenylalanine chloromethyl ketone (TPCk) 
is a reactive analog of the normal substrate 
for the enzyme chymotrypsin. (B) TPCK 
binds at the active site of chymotrypsin and 
modifies an essential histidine residue. 
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FIGURE 8.24 Bromoacetol phosphate, 
an affinity label for triose phosphate 


isomerase (TPI). Bromoacetol phosphate, 
an analog of dihydroxyacetone phosphate, 
binds at the active site of the enzyme and 
covalently modifies a glutamic acid residue 


required for enzyme activity. 
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site and then reacts irreversibly with a histidine 


% residue at that site, inhibiting the enzyme. The 


/ \ compound 3-bromoacetol phosphate is an 
<> (~ wa c—O + Br affinity label for the enzyme triose phosphate 
O 


isomerase (TPI). It mimics the normal sub- 
strate, dihydroxyacetone phosphate, by bind- 
; ing at the active site; then it covalently modifies 
Inactivated ar. : 
enzyme the enzyme such that the enzyme is irreversibly 
inhibited (Figure 8.24). 
Suicide inhibitors, or mechanism-based inhibi- 
tors, are modified substrates that provide the 
most specific means for modifying an enzyme’s active site. The inhibitor binds 
to the enzyme as a substrate and is initially processed by the normal catalytic 
mechanism. The mechanism of catalysis then generates a chemically reactive 
intermediate that inactivates the enzyme through covalent modification. The 
fact that the enzyme participates in its own irreversible inhibition strongly sug- 
gests that the covalently modified group on the enzyme is vital for catalysis. An 
example is N,N-dimethylpropargylamine, an inhibitor of the enzyme mono- 
amine oxidase (MAO). A flavin prosthetic group of monoamine oxidase oxi- 
dizes the N, N-dimethylpropargylamine, which in turn inactivates the enzyme 
by binding to N-5 of the flavin prosthetic group (Figure 8.25). Monoamine 
oxidase deaminates neurotransmitters such as dopamine and serotonin, lower- 
ing their levels in the brain. Parkinson disease is associated with low levels of 
dopamine, and depression is associated with low levels of serotonin. N,N- 
Dimethylpropargylamine and (— )deprenyl, another suicide inhibitor of mono- 
amine oxidase, are used to treat Parkinson disease and depression. 
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FIGURE 8.25 Mechanism-based (suicide) inhibition. Monoamine oxidase, an enzyme 
important for neurotransmitter synthesis, requires the cofactor FAD (flavin adenine dinucleotide). 
N,N-Dimethylpropargylamine inhibits monoamine oxidase by covalently modifying the flavin 
prosthetic group only after the inhibitor has been oxidized. The N-5 flavin adduct is stabilized 
by the addition of a proton. R represents the remainder of the flavin prosthetic group. 


Penicillin irreversibly inactivates a key enzyme in bacterial 
cell-wall synthesis 


es Penicillin, the first antibiotic discovered, provides us with another 
example of a clinically useful suicide inhibitor. Penicillin consists of a 
thiazolidine ring fused to a B-lactam ring to which a variable R group is attached 
by a peptide bond (Figure 8.26A). In benzylpenicillin, for example, R is a 
benzyl group (Figure 8.26B). This structure can undergo a variety of rearrange- 
ments, and, in particular, the B-lactam ring is very labile. Indeed, this instabil- 
ity is closely tied to the antibiotic action of penicillin, as will be evident shortly. 
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FIGURE 8.26 The reactive site of penicillin is the peptide bond of its B-lactam ring. 
(A) Structural formula of penicillin. (B) Representation of benzylpenicillin. 


How does penicillin inhibit bacterial growth? Let us consider 
Staphylococcus aureus, the most common cause of staph infections. Penicillin 
interferes with the synthesis of the S. aureus cell wall. The S. aureus cell wall 
is made up of a macromolecule, called a peptidoglycan (Figure 8.27), which 
consists of linear polysaccharide chains that are cross-linked by short pep- 
tides (pentaglycines and tetrapeptides). The enormous peptidoglycan mol- 
ecule confers mechanical support and prevents bacteria from bursting in 
response to their high internal osmotic pressure. Glycopeptide transpepti- 
dase catalyzes the formation of the cross-links that make the peptidoglycan 
so stable (Figure 8.28). Bacterial cell walls are distinctive in containing D 
amino acids, which form cross-links by a mechanism different from that 
used to synthesize proteins. 

Penicillin inhibits the cross-linking transpeptidase by the Trojan horse 
stratagem. The transpeptidase normally forms an acyl intermediate with the 
penultimate D-alanine residue of the D-Ala-D-Ala peptide (Figure 8.29). 
This covalent acyl-enzyme intermediate then reacts with the amino group 
of the terminal glycine in another peptide to form the cross-link. Penicillin 
is welcomed into the active site of the transpeptidase because it mimics the 
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FIGURE 8.28 Formation of cross-links in S. aureus peptidoglycan. The terminal amino 
group of the pentaglycine bridge in the cell wall attacks the peptide bond between two 
D-alanine residues to form a cross-link. 
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FIGURE 8.27 Schematic representation 
of the peptidoglycan in Staphylococcus 
aureus. The sugars are shown in yellow, the 
tetrapeptides in red, and the pentaglycine 
bridges in blue. The cell wall is a single, 
enormous, bag-shaped macromolecule 
because of extensive cross-linking. 
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FIGURE 8.29 Transpeptidation reaction. 
An acyl-enzyme intermediate is formed in 


the transpeptidation reaction leading to 
cross-link formation. 
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D-Ala-D-Ala moiety of the normal substrate (Figure 8.30). Bound penicillin 
then forms a covalent bond with a serine residue at the active site of the 
enzyme. This penicilloyl-enzyme does not react further. Hence, the transpep- 
tidase is irreversibly inhibited and cell-wall synthesis cannot take place. 
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FIGURE 8.30 Conformations of penicillin and a normal substrate. The conformation of 
penicillin in the vicinity of its reactive peptide bond (A) resembles the postulated conformation 
of the transition state of R-b-Ala-b-Ala (B) in the transpeptidation reaction. [Information from 
B. Lee, J. Mol. Biol. 61:463-469, 1971.] 


Why is penicillin such an effective inhibitor of the transpeptidase? The 
highly strained, four-membered B-lactam ring of penicillin makes it espe- 
cially reactive (Figure 8.26). On binding to the transpeptidase, the serine 
residue at the active site attacks the carbonyl carbon atom of the lactam ring 
to form the penicilloyl-serine derivative (Figure 8.31). Because the peptidase 
participates in its own inactivation, penicillin acts as a suicide inhibitor. 


Penicillin | os 
wil i 


Glycopeptide 
transpeptidase 


Penicilloyl-enzyme complex 
(enzymatically inactive) 


FIGURE 8.31 Formation of a penicilloyl-enzyme derivative. Penicillin reacts irreversibly 
with the transpeptidase to inactivate the enzyme. 


Transition-state analogs are potent inhibitors of enzymes 


We turn now to compounds that provide the most intimate views of the cata- 
lytic process itself. Linus Pauling proposed in 1948 that compounds resem- 
bling the transition state of a catalyzed reaction should be very effective 
inhibitors of enzymes. These mimics are called transition-state analogs. The 
inhibition of proline racemase is an instructive example. The racemization 
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FIGURE 8.32 Inhibition by transition-state analogs. (A) The isomerization of L-proline to 
b-proline by proline racemase proceeds through a planar transition state in which the 
a-carbon atom is trigonal rather than tetrahedral. (B) Pyrrole 2-carboxylic acid, a transition- 
state analog because of its trigonal geometry, is a potent inhibitor of proline racemase. 


of proline proceeds through a transition state in which the tetrahedral 
a-carbon atom has become trigonal (Figure 8.32). In the trigonal form, 
all three bonds are in the same plane; C, also carries a net negative 
charge. This symmetric carbanion can be reprotonated on one side to 
give the L isomer or on the other side to give the D isomer. This picture 
is supported by the finding that the inhibitor pyrrole 2-carboxylate binds 
to the racemase 160 times as tightly as does proline. The a-carbon atom 
of this inhibitor, like that of the transition state, is trigonal. An analog that 
also carries a negative charge on C,, would be expected to bind even more 
tightly. In general, highly potent and specific inhibitors of enzymes can 
be produced by synthesizing compounds that more closely resemble the 
transition state than the substrate itself. The inhibitory power of transi- 
tion-state analogs underscores the essence of catalysis: selective binding of 
the transition state. 


Catalytic antibodies demonstrate the importance of selective binding 
of the transition state to enzymatic activity 


Recall that antibodies bind precisely to their targets (antigens), and that 
specific antibodies can be generated against any antigen (Section 3.3). 
Antibodies that recognize transition states should function as catalysts, 
if our understanding of the importance of the transition state to catalysis 
is correct. The preparation of an antibody that catalyzes the insertion of 
a metal ion into a porphyrin nicely illustrates the validity of this 
approach. Ferrochelatase, the final enzyme in the biosynthetic pathway 
for the production of heme, catalyzes the insertion of Fe** into proto- 
porphyrin IX (Section 24.4). The nearly planar porphyrin must be bent 
for iron to enter. 

The challenge was to find a transition-state analog for this metallation 
reaction that could be used as an antigen (immunogen) to generate an anti- 
body. The solution came from studies showing that an alkylated porphy- 
rin, N-methylmesoporphyrin, is a potent inhibitor of ferrochelatase 
(Figure 8.33). This compound resembles the transition state because 
N-alkylation forces the porphyrin to be bent. Moreover, N-alkylporphyrins 
were known to chelate metal ions 10* times as fast as their unalkylated 
counterparts do. Bending increases the exposure of the pyrrole nitrogen 
lone pairs of electrons to solvent, which enables the binding of the iron ion. FIGURE 8.33 N-Methylmesoporphyrin is 
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rin to facilitate the entry of a metal ion. On average, an antibody molecule ferrochelatase proceeds through a 
metallated 80 porphyrin molecules per hour, a rate only 10-fold less than transition state in which the porphyrin is 
that of ferrochelatase, and 2500-fold faster than the uncatalyzed reaction. bent. N-Methylmesoporphyrin, a bent 
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FIGURE 8.34 Single molecule studies 
can reveal molecular heterogeneity. 

(A) Complex biomolecules, such as 
enzymes, display molecular heterogeneity. 
(B) When measuring an enzyme property 
using ensemble methods, an average value 
of all of the enzymes present is the result. 
(C) Single enzyme studies reveal molecular 
heterogeneity, with the various forms 
showing different properties. 


evidence that enzymes function by assuming a conformation in the active 
site that is complementary in structure to the transition state. The power of 
transition-state analogs is now evident: (1) they are sources of insight into cata- 
lytic mechanisms, (2) they can serve as potent and specific inhibitors of 
enzymes, and (3) they can be used as immunogens to generate a wide range of 
novel catalysts. 


8.6 Enzymes Can Be Studied One Molecule at a Time 


Most experiments that are performed to determine an enzyme characteristic 
require an enzyme preparation in a buffered solution. Even a few microliters 
of such a solution will contain millions of enzyme molecules. Much that we 
have learned about enzymes thus far has come from such experiments, 
called ensemble studies. A basic assumption of ensemble studies is that all of 
the enzyme molecules are the same or very similar. When we determine an 
enzymatic property such as the value of K,, in ensemble studies, that value 
is of necessity an average value of all of the enzyme molecules present. 
However, as discussed in Chapter 2, we now know that molecular heteroge- 
neity, the ability of a molecule, over time, to assume several different struc- 
tures that differ slightly in stability, is an inherent property of all large 
biomolecules (p. 53). How can we tell if this molecular heterogeneity affects 
enzyme activity? 

By way of example, consider a hypothetical situation. A Martian vis- 
its Earth to learn about higher education. The spacecraft hovers high 
above a university, and our Martian meticulously records how the stu- 
dent population moves about campus. Much information can be gath- 
ered from such studies: where students are likely to be at certain times on 
certain days, which buildings are used when and by how many. Now, 
suppose our visitor developed a high-magnification camera that could 
follow one student throughout the day. Such data would provide a much 
different perspective on college life: What does this student eat? To 
whom does she talk? How much time does she spend studying? This new 
in singulo method, examining one individual at a time, yields a great deal 
of new information but also illustrates a potential pitfall of studying indi- 
viduals, be they students or enzymes: How can we be certain that the 
student or molecule is representative and not an outlier? This pitfall can 
be overcome by studying enough individuals to satisfy statistical analysis 
for validity. 

Let us leave our Martian to his observations, and consider a more bio- 
chemical situation. Figure 8.34A shows an enzyme that displays molecular 
heterogeneity, with three active forms that catalyze the same reaction but 
at different rates. These forms have slightly different stabilities, but ther- 
mal noise is sufficient to interconvert the forms. Each form is present as a 
fraction of the total enzyme population as indicated. If we were to perform 
an experiment to determine enzyme activity under a particular set of con- 
ditions with the use of ensemble methods, we would get a single value, 
which would represent the average of the heterogeneous assembly 
(Figure 8.34B). However, were we to perform a sufficient number of 
single-molecule experiments, we would discover that the enzyme has three 
different molecular forms with very different activities (Figure 8.34C). 
Moreover, these different forms would most likely correspond to impor- 
tant biochemical differences. 

The development of powerful techniques—such as patch-clamp 
recording, single-molecule fluorescence, and optical tweezers—has 
enabled biochemists to look into the workings of individual molecules. 


We will examine single-molecule studies of membrane channels with the 
use of patch-clamp recording (Section 13.4), ATP-synthesizing complexes 
with the use of single-molecule fluorescence (Section 18.4) and molecular 
motors with the use of an optical trap (Section 34.2). We are now able to 
observe events at a molecular level that reveal rare or transient structures 
and fleeting events in a reaction sequence, as well as to measure mechani- 
cal forces affecting or generated by an enzyme. Single-molecule studies 
open a new vista on the function of enzymes in particular and on all large 
biomolecules in general. 


SUMMARY 


8.1 Enzymes Are Powerful and Highly Specific Catalysts 
Most catalysts in biological systems are enzymes, and nearly all enzymes 
are proteins. Enzymes are highly specific and have great catalytic power. 
They can enhance reaction rates by factors of 10° or more. Many 
enzymes require cofactors for activity. Such cofactors can be metal ions 
or small, vitamin-derived organic molecules called coenzymes. 


8.2 Gibbs Free Energy Is a Useful Thermodynamic Function for 

Understanding Enzymes 

Free energy (G) is the most valuable thermodynamic function for 
understanding the energetics of catalysis. A reaction can take place 
spontaneously only if the change in free energy (AG) is negative. The 
free-energy change of a reaction that takes place when reactants and 
products are at unit activity is called the standard free-energy change 
(AG°). Biochemists use AG®’, the standard free-energy change at pH 7. 
Enzymes do not alter reaction equilibria; rather, they increase the rate 
at which equilibrium is attained. 


8.3 Enzymes Accelerate Reactions by Facilitating the Formation of the 
Transition State 
Enzymes serve as catalysts by decreasing the free energy of activation 
of chemical reactions. Enzymes accelerate reactions by providing a 
reaction pathway in which the transition state (the highest-energy spe- 
cies) has a lower free energy and hence is more rapidly formed than in 
the uncatalyzed reaction. 

The first step in catalysis is the formation of an enzyme-substrate 
complex. Substrates are bound to enzymes at active-site clefts from 
which water is largely excluded when the substrate is bound. The 
specificity of enzyme-substrate interactions arises mainly from hydro- 
gen bonding, which is directional, and from the shape of the active site, 
which rejects molecules that do not have a sufficiently complementary 
shape. Enzymes facilitate formation of the transition state by a dynamic 
process in which the substrate binds to specific conformations of the 
enzyme, accompanied by conformational changes at active sites that 
result in catalysis. 


8.4 The Michaelis-Menten Model Accounts for the Kinetic Properties of 
Many Enzymes 
The kinetic properties of many enzymes are described by the Michaelis— 
Menten model. In this model, an enzyme (E) combines with a substrate 
(S) to form an enzyme-substrate (ES) complex, which can proceed to 
form a product (P) or to dissociate into E and 8. 
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The rate of formation of product V) is given by the Michaelis-Menten 
equation: 


[S] 
Vo Vmax §) ra Re 
in which V,.,,,, 1s the reaction rate when the enzyme is fully saturated 
with substrate and K, ,, the Michaelis constant, is the substrate con- 
centration at which the reaction rate is half maximal. The maximal 
rate, Vay? is equal to the product of ko, or Hoy and the total concen- 
tration of enzyme. The kinetic constant k,,, called the turnover 
number, is the number of substrate molecules converted into prod- 
uct per unit time at a single catalytic site when the enzyme is fully 
saturated with substrate. Turnover numbers for most enzymes are 
between 1 and 104 per second. The ratio of k.,,/K,, provides a mea- 
sure of enzyme efficiency and specificity. 

Allosteric enzymes constitute an important class of enzymes 
whose catalytic activity can be regulated. These enzymes, which 
do not conform to Michaelis-Menten kinetics, have multiple 
active sites. These active sites display cooperativity, as evidenced 
by a sigmoidal dependence of reaction velocity on substrate 
concentration. 


cat 


Enzymes Can Be Inhibited by Specific Molecules 

Specific small molecules or ions can inhibit even nonallosteric enzymes. 
In irreversible inhibition, the inhibitor is covalently linked to the 
enzyme or bound so tightly that its dissociation from the enzyme is 
very slow. Covalent inhibitors provide a means of mapping the 
enzyme’s active site. In contrast, reversible inhibition is characterized 
by a more rapid and less stable interaction between enzyme and 
inhibitor. A competitive inhibitor prevents the substrate from bind- 
ing to the active site. It reduces the reaction velocity by diminishing 
the proportion of enzyme molecules that are bound to substrate. 
Competitive inhibition can be overcome by raising the substrate con- 
centration. In uncompetitive inhibition, the inhibitor combines only 
with the enzyme-substrate complex. In noncompetitive inhibition, 
the inhibitor decreases the turnover number. Uncompetitive and non- 
competitive inhibition cannot be overcome by raising the substrate 
concentration. 

The essence of catalysis is selective stabilization of the transi- 
tion state. Hence, an enzyme binds the transition state more tightly 
than it binds the substrate. Transition-state analogs are stable com- 
pounds that mimic key features of this highest-energy species. 
They are potent and specific inhibitors of enzymes. Proof that 
transition-state stabilization is a key aspect of enzyme activity 
comes from the generation of catalytic antibodies. Transition-state 
analogs are used as antigens, or immunogens, in generating cata- 
lytic antibodies. 


Enzymes Can Be Studied One Molecule at a Time 

Many enzymes are now being studied in singulo, at the level of a 
single molecule. Such studies are important because they yield 
information that is difficult to obtain in studies of populations of 
molecules. Single-molecule methods reveal a distribution of enzyme 
characteristics rather than an average value as is acquired with the 
use of ensemble methods. 
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APPENDIX: Enzymes are Classified on the Basis of the Types of Reactions 


That They Catalyze 


Many enzymes have common names that provide little 
information about the reactions that they catalyze. For 
example, a proteolytic enzyme secreted by the pancreas is 
called trypsin. Most other enzymes are named for their 
substrates and for the reactions that they catalyze, with the 
suffix “ase” added. Thus, a peptide hydrolase is an enzyme 
that hydrolyzes peptide bonds, whereas ATP synthase is an 
enzyme that synthesizes ATP. 


To bring some consistency to the classification of 
enzymes, in 1964 the International Union of Biochemistry 
established an Enzyme Commission to develop a nomen- 
clature for enzymes. Reactions were divided into six major 
groups numbered 1 through 6 (Table 8.8). These groups 
were subdivided and further subdivided so that a 
four-number code preceded by the letters EC for Enzyme 
Commission could precisely identify all enzymes. 


Consider as an example nucleoside monophosphate 
(NMP) kinase, an enzyme that we will examine in detail in 
Section 9.4. It catalyzes the following reaction: 


ATP + NMP == ADP + NDP 


TABLE 8.8 Six major classes of enzymes 


NMP kinase transfers a phosphoryl group from ATP to 
any nucleotide monophosphate (NMP) to form a nucleo- 
side diphosphate (NDP) and ADP. Consequently, it is a 
transferase, or member of group 2. Many groups other than 
phosphoryl groups, such as sugars and single-carbon units, 
can be transferred. Transferases that shift a phosphoryl 
group are designated 2.7. Various functional groups can 
accept the phosphoryl group. If a phosphate is the acceptor, 
the transferase is designated 2.7.4. The final number desig- 
nates the acceptor more precisely. In regard to NMP kinase, 
a nucleoside monophosphate is the acceptor, and the 
enzyme’s designation is EC 2.7.4.4. Although the common 
names are used routinely, the classification number is used 
when the precise identity of the enzyme is not clear from 
the common name alone. 


Class Type of reaction Example Chapter 

1. Oxidoreductases Oxidation—reduction Lactate dehydrogenase 16 

2. Transferases Group transfer Nucleoside monophosphate 9 

kinase (NMP kinase) 

3. Hydrolases Hydrolysis reactions (transfer of functional Chymotrypsin 9 
groups to water) 

4. Lyases Addition or removal of groups to form Fumarase 7 
double bonds 

5. Isomerases Isomerization (intramolecular group transfer) Triose phosphate isomerase 16 

6. Ligases Ligation of two substrates at the expense Aminoacyl-tRNA synthetase 30 
of ATP hydrolysis 

KEY TERMS 


enzyme (p. 216) 
substrate (p. 216) 
cofactor (p. 217) 
apoenzyme (p. 217) 


(p. 227) 


holoenzyme (p. 217) 
coenzyme (p. 217) 
prosthetic group (p. 217) 
free enerey (p. 218) (5. 230) 
transition state (p. 221) 

free energy of activation (p. 221) 
active site (p. 222) 


induced fit (p. 224) 


K, (the Michaelis constant) 


V nax (Maximal rate) (p. 228) 

Michaelis—Menten equation (p. 228) 

Lineweaver—Burk equation (double- 
reciprocal plot) (p. 229) 

turnover number (p. 230) 


Riat/ Ky tatio (the specificity constant) 


sequential reaction (p. 232) 


double-displacement (ping-pong) 
reaction (p. 233) 


allosteric enzyme (p. 233) 

competitive inhibition (p. 234) 

uncompetitive inhibition (p. 234) 

noncompetitive inhibition (p. 234) 

group-specific reagent (p. 237) 

affinity label (reactive substrate analog) 
(p. 237) 

mechanism-based (suicide) inhibition 
(p. 238) 

transition-state analog (p. 240) 


catalytic antibody (abzyme) 
(p. 241) 
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PROBLEMS 


1. Raisons d’étre. What are the two properties of enzymes 
that make them especially useful catalysts? 


2. Partners. What does an apoenzyme require to become a 
holoenzyme? 


3. Different partners. What are the two main types of cofac- 
tors? 


4. One a day. Why are vitamins necessary for good health? 


5. A function of state. What is the fundamental mechanism 
by which enzymes enhance the rate of chemical reactions? 


6. Nooks and crannies. What is the structural basis for 
enzyme specificity? 


7. Give with one hand, take with the other. Why does the 
activation energy of a reaction not appear in the final AG of 
the reaction? 


8. The more things change, the move they stay the same. 
Suppose that, in the absence of enzyme, the forward rate 
constant (k,) for the conversion of S into P is 107*s7! and 
the reverse rate constant (kp) for the conversion of P into S 
is 107° s71, 

10-457! 

Sa 

10 "s 
(a) What is the equilibrium for the reaction? What is 
the AG°’? 


(b) Suppose an enzyme enhances the rate of the reaction 
100 fold. What are the rate constants for the enzyme- 
catalyzed reaction? The equilibrium constant? The AG°’? 


9. Mountain climbing. Proteins are thermodynamically 
unstable. The AG of the hydrolysis of proteins is quite 
negative, yet proteins can be quite stable. Explain this 
apparent paradox. What does it tell you about protein 
synthesis? 


10. Protection. Suggest why the enzyme lysozyme, which 
degrades cell walls of some bacteria, is present in tears. 


11. Mutual attraction. What is meant by the term binding 
energy? 


12. Catalytically binding. What is the role of binding 
energy in enzyme catalysis? 


13. Sticky situation. What would be the result of an enzyme 
having a greater binding energy for the substrate than for 
the transition state? 


14. Stability matters. Transition-state analogs, which 
can be used as enzyme inhibitors and to generate cata- 
lytic antibodies, are often difficult to synthesize. Suggest 
a reason. 


15. Match’em. Match the K ” values with the appropriate 
AG*’ values. 


Ki, AG?'(kJ mol~!) 
1 28.53 

fl 10-5 —11.42 

(c) 104 5.69 

(d) 10? 0 

(e) 1071 —22.84 


16. Free energy! Assume that you have a solution of 0.1 M 
glucose 6-phosphate. To this solution, you add the enzyme 
phosphoglucomutase, which catalyzes the following reaction: 


Phosphoglucomutase 


Glucose 6-phosphate glucose 1-phosphate 


The AG” for the reaction is +7.5 kJ mol~!(+1.8 kcal mol~!). 


(a) Does the reaction proceed as written? If so, what are the 
final concentrations of glucose 6-phosphate and glucose 
1-phosphate? 


(b) Under what cellular conditions could you produce glu- 
cose 1-phosphate at a high rate? 
17. Free energy, too! Consider the following reaction: 


Phosphoglucomutase 


Glucose 6-phosphate glucose 1-phosphate 


After reactant and product were mixed and allowed to 
reach equilibrium at 25°C, the concentration of each com- 
pound was measured: 


[Glucose 1-phosphate].g = 0.01 M 

[Glucose 6-phosphate].4 = 0.19 M 
Calculate ae and AG*’. 
18. Keeping busy. Many isolated enzymes, if incubated at 
37°C, will be denatured. However, if the enzymes are incu- 


bated at 37°C in the presence of substrate, the enzymes are 
catalytically active. Explain this apparent paradox. 


19. Active yet responsive. What is the biochemical advan- 
tage of having a K,, approximately equal to the substrate 
concentration normally available to an enzyme? 


20. Affinity or not affinity? That is the question. The 
affinity between a protein and a molecule that binds to 
the protein is frequently expressed in terms of a dissocia- 
tion constant K 4. 


Protein—small molecule complex = = protein + small molecule 


[ protein ][small molecule] 
Kg = 


[Protein — small molecule complex] 


Does K,, measure the affinity of the enzyme complex? 
Under what circumstances might K,, approximately 
equal K,? 


21. Angry biochemists. Many biochemists go bananas, and 
justifiably, when they see a Michaelis-Menten plot like the 
one shown below. To see why, determine the V, as a fraction 
of V,.4x When the substrate concentration is equal to 10 Ky, 
and 20 K,,. Please control your outrage. 


IS] 


22. Hydrolytic driving force. The hydrolysis of pyrophos- 
phate to orthophosphate is important in driving forward 
biosynthetic reactions such as the synthesis of DNA. This 
hydrolytic reaction is catalyzed in E. coli by a pyrophospha- 
tase that has a mass of 120 kDa and consists of six identical 
subunits. For this enzyme, a unit of activity is defined as the 
amount of enzyme that hydrolyzes 10 mol of pyrophos- 
phate in 15 minutes at 37°C under standard assay condi- 
tions. The purified enzyme has a V,,,,. of 2800 units per 
milligram of enzyme. 


(a) How many moles of substrate is hydrolyzed per second 
per milligram of enzyme when the substrate concentration 
is much greater than K),? 


(b) How many moles of active sites are there in 1 mg of 
enzyme? Assume that each subunit has one active site. 


(c) What is the turnover number of the enzyme? Compare 
this value with others mentioned in this chapter. 


23. Destroying the Trojan horse. Penicillin is hydrolyzed 
and thereby rendered inactive by penicillinase (also known 
as B-lactamase), an enzyme present in some penicillin- 
resistant bacteria. The mass of this enzyme in Staphylococcus 
aureus is 29.6 kDa. The amount of penicillin hydrolyzed in 
1 minute in a 10-ml solution containing 10~° g of purified 
penicillinase was measured as a function of the concentra- 
tion of penicillin. Assume that the concentration of penicil- 
lin does not change appreciably during the assay. 


[Penicillin] Amount hydrolyzed 
pM (nmol) 
1 0.11 
3 0.25 
5 0.34 
10 0.45 
30 0.58 
50 0.61 
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(a) Plot Vj versus [S] and 1/V, versus 1/[S] for these data. 
Does penicillinase appear to obey Michaelis-Menten kinet- 
ics? If so, what is the value of Ky,’ 


(b) What is the value of V__..? 


(c) What is the turnover number of penicillinase under 
these experimental conditions? Assume one active site per 
enzyme molecule. 


24. Counterpoint. Penicillinase (B-lactamase) hydrolyzes 
penicillin. Compare penicillinase with glycopeptide trans- 
peptidase. 


25. A different mode. The kinetics of an enzyme is measured 
as a function of substrate concentration in the presence and 
absence of 100 2M inhibitor. 


(a) What are the values of V__,, and K,, in the presence of 
this inhibitor? 


(b) What type of inhibition is it? 
(c) What is the dissociation constant of this inhibitor? 


Velocity (mol minute™!) 


(S] 


(uM) No inhibitor Inhibitor 
3 10.4 2.4 
5 14.5 2.9 
10 22.5 4.5 
30 33.8 6.8 
90 40.5 8.1 


(d) If[S] = 30 pM, what fraction of the enzyme molecules 
have a bound substrate in the presence and in the absence of 
100 pM inhibitor? 


26. A fresh view. The plot of 1/V, versus 1/[S] is some- 
times called a Lineweaver—Burk plot. Another way of 
expressing the kinetic data is to plot Vy versus V,/[S], 
which is known as an Eadie—Hofstee plot. 


(a) Rearrange the Michaelis-Menten equation to give V, 
as a function of V,/[S]. 


(b) What is the significance of the slope, the y-intercept, 
and the x-intercept in a plot of V, versus V)/[S]? 


(c) Sketch a plot of Vy versus V,/[S] in the absence of an 
inhibitor, in the presence of a competitive inhibitor, and in 
the presence of a noncompetitive inhibitor. 


27. Defining attributes. What is the defining characteristic 
for an enzyme catalyzing a sequential reaction? A double- 
displacement reaction? 


28. Competing substrates. Suppose that two substrates, A 
and B, compete for an enzyme. Derive an expression relat- 
ing the ratio of the rates of utilization of A and B, V,/V,' to 
the concentrations of these substrates and their values of 
k.,, and K,,. (Hint: Express V, as a function of k,,,/K), for 
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substrate A, and do the same for V;.) Is specificity deter- 
mined by K,, alone? 


29. A tenacious mutant. Suppose that a mutant enzyme 
binds a substrate 100 times as tightly as does the native 
enzyme. What is the effect of this mutation on catalytic rate 
if the binding of the transition state is unaffected? 


30. More Michaelis-Menten. For an enzyme that follows 
simple Michaelis-Menten kinetics, what is the value of 
Vinax if Vo is equal to 1 mol minute! at 10 Ky)? 


31. Controlled paralysis. Succinylcholine is a fast-acting, short- 
duration muscle relaxant that is used when a tube is inserted 
into a patient’s trachea or when a bronchoscope is used to 
examine the trachea and bronchi for signs of cancer. Within 
seconds of the administration of succinylcholine, the patient 
experiences muscle paralysis and is placed ona respirator while 
the examination proceeds. Succinylcholine is a competitive 
inhibitor of acetylcholinesterase, a nervous system enzyme, 
and this inhibition causes paralysis. However, succinylcholine 
is hydrolyzed by blood-serum cholinesterase, which shows a 
broader substrate specificity than does the nervous system 
enzyme. Paralysis lasts until the succinylcholine is hydrolyzed 
by the serum cholinesterase, usually several minutes later. 


(a) Asa safety measure, serum cholinesterase is measured 
before the examination takes place. Explain why this mea- 
surement is good idea. 


(b) What would happen to the patient if the serum cholin- 
esterase activity were only 10 units of activity per liter 
rather than the normal activity of about 80 units? 


(c) Some patients have a mutant form of the serum cholin- 
esterase that displays a K,, of 10 mM, rather than the 
normal 1.4 mM. What will be the effect of this mutation on 
the patient? 


Data Interpretation Problems 


32. A natural attraction, but more complicated. You have 
isolated two versions of the same enzyme, a wild type anda 
mutant differing from the wild type at a single amino acid. 
Working carefully but expeditiously, you then establish the 
following kinetic characteristics of the enzymes. 


Maximum velocity Ky, 
Wild type 100 pmol/min 10 mM 
Mutant 1 pmol/min 0.1mM 


(a) With the assumption that the reaction occurs in two 
steps in which k_, is much larger than k,, which enzyme 
has the higher affinity for substrate? 


(b) What is the initial velocity of the reaction catalyzed by 
the wild-type enzyme when the substrate concentration is 
10 mM? 


(c) Which enzyme alters the equilibrium more in the direc- 
tion of product? 


33. Ky, matters. The amino acid asparagine is required by 
cancer cells to proliferate. Treating patients with the enzyme 
asparaginase is sometimes used as a chemotherapy treat- 
ment. Asparaginase hydrolyzes asparagine to aspartate and 
ammonia. The adjoining illustration shows the Michaelis— 
Menten curves for two asparaginases from different sources, 
as well as the concentration of asparagine in the environ- 
ment (indicated by the arrow). Which enzyme would 
make a better chemotherapeutic agent? 


Asparaginase 1 


Asparaginase 2 


Vo 


t IS] 


34. Enzyme specificity. Catalysis of the cleavage of peptide 
bonds in small peptides by a proteolytic enzyme is described 
in the following table. 


Substrate Ky, (mM) Reat (s~') 
EMTAVG 4.0 24 
EMTAVA 1,5 30 
EMTAVF 0.5 18 


The arrow indicates the peptide bond cleaved in each case. 


(a) If a mixture of these peptides were presented to the 
enzyme with the concentration of each peptide being the 
same, which peptide would be digested most rapidly? Most 
slowly? Briefly explain your reasoning, if any. 


(b) The experiment is performed again on another peptide 
with the following results. 


EMTNF 9 18 


On the basis of these data, suggest the features of the amino 
acid sequence that dictate the specificity of the enzyme. 


35. Varying the enzyme. For a one-substrate, enzyme- 
catalyzed reaction, double-reciprocal plots were deter- 
mined for three different enzyme concentrations. Which of 
the following three families of curve would you expect to be 
obtained? Explain. 


1/Vo 1/Vo 1/Vo 


| 1/[S] | 1/[S] | 1/[S] 


36. Mental experiment. Picture in your mind the velocity 
vs. substrate concentration curve for a typical Michaelis- 
Menten enzyme. Now, imagine that the experimental con- 
ditions are altered as described below. For each of the condi- 
tions described, fill in the table indicating precisely (when 
possible) the effect on V... and K,, of the imagined 
Michaelis-Menten enzyme. 


Experimental condition Vv Ky 


a. Twice as much enzyme is used. 

b. Half as much enzyme is used 

c. A competitive inhibitor is present. 

d. An uncompetitive inhibitor is present. 


e. A pure non-competitive inhibitor is present. 


37. Too much of a good thing. A simple Michaelis-Menten 
enzyme, in the absence of any inhibitor, displayed the 
following kinetic behavior. 


V, 


max 


Reaction velocity 
Vo— 


[Ss] — 


(a) Draw a double-reciprocal plot that corresponds to the 
velocity-versus-substrate curve. 


(b) Suggest a plausible explanation for these kinetic results. 


38. Rate-limiting step. In the conversion of A into D in the 
following biochemical pathway, enzymes E,, Ep, and Eo 
have the K,, values indicated under each enzyme. If all of the 
substrates and products are present at a concentration of 
10-*M and the enzymes have approximately the same V__. 
which step will be rate limiting and why? 


ax’? 


Ea ER Ec 
Ky= 102M 10-*M 10-*M 


39. Colored luminosity Tryptophan synthetase, a bacterial 
enzyme that contains a pyridoxal phosphate (PLP) pros- 
thetic group, catalyzes the synthesis of L-tryptophan from 
L-serine and an indole derivative. The addition of L-serine 
to the enzyme produces a marked increase in the fluores- 
cence of the PLP group, as the adjoining graph shows. The 
subsequent addition of indole, the second substrate, reduces 
this fluorescence to a level even lower than that produced 
by the enzyme alone. How do these changes in fluorescence 
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support the notion that the enzyme interacts directly with 
its substrates? 


+ Serine 
a 


Enzyme alone 


Fluorescence intensity — 


+ Serine 
and indole 


| | 
450 500 550 


Wavelength (nm) 


Chapter Integration Problems 


40. Titration experiment. The effect of pH on the activity of 
an enzyme was examined. At its active site, the enzyme has 
an ionizable group that must be negatively charged for sub- 
strate binding and catalysis to take place. The ionizable 
group has a pK, of 6.0. The substrate is positively charged 
throughout the pH range of the experiment. 


E7~+St == ES*—E-+Pt 
+ 
Ht 


| 


EH 


(a) Draw the V,-versus-pH curve when the substrate con- 
centration is much greater than the enzyme K,,. 


(b) Draw the V,-versus-pH curve when the substrate con- 
centration is much less than the enzyme Ky). 


(c) At which pH will the velocity equal one-half of the 
maximal velocity attainable under these conditions? 


41. A question of stability. Pyridoxal phosphate (PLP) is a 
coenzyme for the enzyme ornithine aminotransferase. The 
enzyme was purified from cells grown in PLP-deficient 
media as well as from cells grown in media that contained 
pyridoxal phosphate. The stability of the two different 
enzyme preparations was then measured by incubating the 
enzyme at 37°C for different lengths of time and then assay- 
ing for the amount of enzyme activity remaining. The fol- 
lowing results were obtained. 


250 
CHAPTER 8 Enzymes: Basic Concepts and Kinetics 


(a) Why does the amount of active enzyme decrease with 
the time of incubation? 


(b) Why does the amount of enzyme from the PLP- 
deficient cells decline more rapidly? 


100% 


Enzyme activity 
remaining —> 


+PLP 
PLP 


0% 


Time —> 


42. Not just for enzymes. Kinetics is useful for studying 
reactions of all types, not just those catalyzed by enzymes. 
In Chapters 4 and 5, we learned that DNA could be 
reversibly melted. When melted double-stranded DNA is 
allowed to renature, the process can be described as con- 
sisting of two steps, a slow second order reaction followed 
by a rapid first order reaction. Explain what is occurring 
in each step. 


Catalytic Strategies 


CHAPTER 


Chess and enzymes have in common the use of strategy, consciously thought out 
in the game of chess and selected by evolution for the action of an enzyme. The 
three amino acid residues at the right, denoted by the white bonds, constitute a 
catalytic triad found in the active site of a class of enzymes that cleave peptide 
bonds. The substrate, represented by the molecule with the black bonds, is as 
hopelessly trapped as the king in the photograph of a chess match at the left and 
is sure to be cleaved. [Photograph courtesy of Wendie Berg.] 


hat are the sources of the catalytic power and specificity of enzymes? 

This chapter presents the catalytic strategies used by four classes of 
enzymes: serine proteases, carbonic anhydrases, restriction endonucleases, 
and myosins. Each class catalyzes reactions that require the addition of 
water to a substrate. The mechanisms of these enzymes have been revealed 
through the use of incisive experimental probes, including the techniques of 
protein structure determination (Chapter 3) and site-directed mutagenesis 
(Chapter 5). The mechanisms illustrate many important principles of 
catalysis. We shall see how these enzymes facilitate the formation of the 
transition state through the use of binding energy and induced fit as well as 
additional specific catalytic strategies. 

Each of the four classes of enzymes in this chapter illustrates the use of 
such strategies to solve a different problem. For serine proteases, exempli- 
fied by chymotrypsin, the challenge is to promote a reaction that is almost 
immeasurably slow at neutral pH in the absence of a catalyst. For carbonic 
anhydrases, the challenge is to achieve a high absolute rate of reaction, suit- 
able for integration with other rapid physiological processes. For restriction 
endonucleases such as EcoRV, the challenge is to attain a high degree of 
specificity. Finally, for myosins, the challenge is to utilize the free energy 


OUTLINE 


9.1 Proteases Facilitate a 
Fundamentally Difficult Reaction 


9.2 Carbonic Anhydrases Make a Fast 
Reaction Faster 


9.3 Restriction Enzymes Catalyze 
Highly Specific DNA-Cleavage 
Reactions 


9.4 Myosins Harness Changes in 
Enzyme Conformation to Couple 
ATP Hydrolysis to Mechanical 
Work 
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associated with the hydrolysis of adenosine triphosphate (ATP) to drive 
other processes. Each of the examples selected is a member of a large protein 
class. For each of these classes, comparison between class members reveals 
how enzyme active sites have evolved and been refined. Structural and 
mechanistic comparisons of enzyme action are thus the sources of insight 
into the evolutionary history of enzymes. In addition, our knowledge of 
catalytic strategies has been used to develop practical applications, includ- 
ing potent drugs and specific enzyme inhibitors. Finally, although we shall 
not consider catalytic RNA molecules explicitly in this chapter, the princi- 
ples also apply to these catalysts. 


A few basic catalytic principles are used by many enzymes 


In Chapter 8, we learned that enzymatic catalysis begins with substrate bind- 
ing. The binding energy is the free energy released in the formation of a large 
number of weak interactions between the enzyme and the substrate. The use 
of this binding energy is the first common strategy used by enzymes. We can 
envision this binding energy as serving two purposes: it establishes substrate 
specificity and increases catalytic efficiency. Only the correct substrate can 
participate in most or all of the interactions with the enzyme and thus maxi- 
mize binding energy, accounting for the exquisite substrate specificity exhib- 
ited by many enzymes. Furthermore, the full complement of such interactions 
is formed only when the combination of enzyme and substrate is in the transi- 
tion state. Thus, interactions between the enzyme and the substrate stabilize 
the transition state, thereby lowering the free energy of activation. The bind- 
ing energy can also promote structural changes in both the enzyme and the 
substrate that facilitate catalysis, a process referred to as induced fit. 

In addition to the first strategy involving binding energy, enzymes com- 
monly employ one or more of the following four additional strategies to 
catalyze specific reactions: 


1. Covalent Catalysis. In covalent catalysis, the active site contains a reac- 
tive group, usually a powerful nucleophile, that becomes temporarily cova- 
lently attached to a part of the substrate in the course of catalysis. The 
proteolytic enzyme chymotrypsin provides an excellent example of this 
strategy (Section 9.1). 


2. General Acid—Base Catalysis. In general acid—base catalysis, a molecule 
other than water plays the role of a proton donor or acceptor. Chymotrypsin 
uses a histidine residue as a base catalyst to enhance the nucleophilic power of 
serine (Section 9.1), whereas a histidine residue in carbonic anhydrase facili- 
tates the removal of a hydrogen ion from a zinc-bound water molecule to 
generate hydroxide ion (Section 9.2). For myosins, a phosphate group of the 
ATP substrate serves as a base to promote its own hydrolysis (Section 9.3). 


3. Catalysis by Approximation. Many reactions have two distinct substrates, 
including all four classes of hydrolases considered in detail in this chapter. 
In such cases, the reaction rate may be considerably enhanced by bringing 
the two substrates together along a single binding surface on an enzyme. 
For example, carbonic anhydrase binds carbon dioxide and water in adja- 
cent sites to facilitate their reaction (Section 9.2). 


4. Metal Ion Catalysis. Metal ions can function catalytically in several 
ways. For instance, a metal ion may facilitate the formation of nucleophiles 
such as hydroxide ion by direct coordination. A zinc(II) ion serves this pur- 
pose in catalysis by carbonic anhydrase (Section 9.2). Alternatively, a metal 
ion may serve as an electrophile, stabilizing a negative charge on a reaction 
intermediate. A magnesium(II) ion plays this role in EcoRV (Section 9.3). 


Finally, a metal ion may serve as a bridge between enzyme and substrate, 
increasing the binding energy and holding the substrate in a conformation 
appropriate for catalysis. This strategy is used by myosins (Section 9.4) and, 
indeed, by almost all enzymes that utilize ATP as a substrate. 


9.1 Proteases Facilitate a Fundamentally Difficult 
Reaction 


Peptide bond hydrolysis is an important process in living systems 
(Chapter 23). Proteins that have served their purpose must be degraded so 
that their constituent amino acids can be recycled for the synthesis of new 
proteins. Proteins ingested in the diet must be broken down into small pep- 
tides and amino acids for absorption in the gut. Furthermore, as described 
in detail in Chapter 10, proteolytic reactions are important in regulating the 
activity of certain enzymes and other proteins. 

Proteases cleave proteins by a hydrolysis reaction—the addition of a 
molecule of water to a peptide bond: 


1 
fe) 
C R i 
RY wo 2 +H,O = Rr—G - + R)>——NHst 
H fe) 


Although the hydrolysis of peptide bonds is thermodynamically favorable, 
such reactions are extremely slow. In the absence of a catalyst, the half-life 
for the hydrolysis of a typical peptide at neutral pH is estimated to be 
between 10 and 1000 years. Yet, peptide bonds must be hydrolyzed within 
milliseconds in some biochemical processes. 

The chemical nature of peptide bonds is responsible for their kinetic 
stability. Specifically, the resonance structure that accounts for the planarity 
of peptide bonds (Section 2.2) also makes them resistant to hydrolysis. This 
resonance structure endows them with partial double-bond character: 
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The carbon-nitrogen bond is strengthened by its double-bond character. 
Furthermore, the carbonyl carbon atom is less electrophilic and less suscep- 
tible to nucleophilic attack than are the carbonyl carbon atoms in more 
reactive compounds such as carboxylate esters. Consequently, to promote 
peptide-bond cleavage, an enzyme must facilitate nucleophilic attack at a 
normally unreactive carbonyl group. 


Chymotrypsin possesses a highly reactive serine residue 


A number of proteolytic enzymes participate in the breakdown of proteins 
in the digestive systems of mammals and other organisms. One such 
enzyme, chymotrypsin, cleaves peptide bonds selectively on the carboxy]l- 
terminal side of the large hydrophobic amino acids such as tryptophan, 
tyrosine, phenylalanine, and methionine (Figure 9.1). Chymotrypsin is a 
good example of the use of covalent catalysis. The enzyme employs a power- 
ful nucleophile to attack the unreactive carbonyl carbon atom of the sub- 
strate. This nucleophile becomes covalently attached to the substrate briefly 
in the course of catalysis. 
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FIGURE 9.1 Specificity of 
chymotrypsin. Chymotrypsin cleaves 
proteins on the carboxyl side of aromatic 
or large hydrophobic amino acids 
(shaded orange). The likely bonds cleaved 
by chymotrypsin are indicated in red. 


FIGURE 9.2 An unusually reactive serine 
residue in chymotrypsin. Chymotrypsin 

is inactivated by treatment with 
diisopropylphosphofluoridate (DIPF), 

which reacts only with serine 195 among 
28 possible serine residues. 
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What is the nucleophile that chymotrypsin employs to attack the sub- 
strate carbonyl carbon atom? A clue came from the fact that chymotrypsin 
contains an extraordinarily reactive serine residue. Chymotrypsin mole- 
cules treated with organofluorophosphates such as diisopropylphospho- 
fluoridate (DIPF) lost all activity irreversibly (Figure 9.2). Only a single 
residue, serine 195, was modified. This chemical modification reaction sug- 
gested that this unusually reactive serine residue plays a central role in the 
catalytic mechanism of chymotrypsin. 


Chymotrypsin action proceeds in two steps linked by a covalently 
bound intermediate 


A study of the kinetics of chymotrypsin provided a second clue to its 
catalytic mechanism. Enzyme kinetics are often easily monitored by having 
the enzyme act on a substrate analog that forms a colored product. For 
chymotrypsin, such a chromogenic substrate is N-acetyl-L-phenylalanine 
p-nitrophenyl ester. This substrate is an ester rather than an amide, but 
many proteases will also hydrolyze esters. One of the products formed by 
chymotrypsin’s cleavage of this substrate is p-nitrophenolate, which has a 
yellow color (Figure 9.3). Measurements of the absorbance of light revealed 
the amount of p-nitrophenolate being produced. 

Under steady-state conditions, the cleavage of this substrate obeys 
Michaelis-Menten kinetics with a K\, of 20 uM anda k.,, of 77s '. The 
initial phase of the reaction was examined by using the stopped-flow 
method, which makes it possible to mix enzyme and substrate and monitor 
the results within a millisecond. This method revealed an initial rapid burst 
of colored product, followed by its slower formation as the reaction reached 


I H O + HO — 
Kee IN Oe 
n | 
ZO 
N 
O 
N-Acetyl-.-phenylalanine p-nitrophenyl ester p-Nitrophenolate 


FIGURE 9.3 Chromogenic substrate. N-Acetyl-L-phenylalanine p-nitrophenyl ester yields a 
yellow product, p-nitrophenolate, on cleavage by chymotrypsin. p-Nitrophenolate forms by 
deprotonation of p-nitrophenol at pH 7. 


the steady state (Figure 9.4). These results suggest that hydrolysis proceeds 
in two phases. In the first reaction cycle that takes place immediately after 


ou : : i Steady-state phase 
mixing, only the first phase must take place before the colored product is os 
released. In subsequent reaction cycles, both phases must take place. Note 3 
that the burst is observed because the first phase is substantially more rapid g 8 
than the second phase for this substrate. Ss 

The two phases are explained by the formation of a covalently bound $5 

enzyme-substrate intermediate (Figure 9.5). First, the acyl group of the 2 | f= phase 
substrate becomes covalently attached to the enzyme as p-nitrophenolate = 
(or an amine if the substrate is an amide rather than an ester) is released. = 
The enzyme-—acyl group complex is called the acyl-enzyme intermediate. Milliseconds after mixing — 
Second, the acyl-enzyme intermediate is hydrolyzed to release the FIGURE 9.4 Kinetics of chymotrypsin 
carboxylic acid component of the substrate and regenerate the free catalysis. Two phases are evident in the 
enzyme. Thus, one molecule of p-nitrophenolate is produced rapidly cleaving of N-acetyl-L-phenylalanine 
from each enzyme molecule as the acyl-enzyme intermediate is formed. p-nitrophenyl ester by chymotrypsin: a 


rapid burst phase (pre-steady-state) and a 


However, it takes longer for the enzyme to be “reset” by the hydrolysis Steddy-iale phase 


of the acyl-enzyme intermediate, and both phases are required for 
enzyme turnover. 
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FIGURE 9.5 Covalent catalysis. Hydrolysis by chymotrypsin takes place in two phases: 
(A) acylation to form the acyl-enzyme intermediate followed by (B) deacylation to regenerate 
the free enzyme. 


Serine is part of a catalytic triad that also includes 
histidine and aspartate 


The three-dimensional structure of chymotrypsin revealed that this 
enzyme is roughly spherical and comprises three polypeptide chains, 
linked by disulfide bonds. It is synthesized as a single polypeptide, 
termed chymotrypsinogen, which is activated by the proteolytic cleavage of 
the polypeptide to yield the three chains (Section 10.4). The active site of 
chymotrypsin, marked by serine 195, lies in a cleft on the surface of the 
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hk FIGURE 9.6 Location of the active 
site in chymotrypsin. Chymotrypsin 
consists of three chains, shown in ribbon 
form in orange, blue, and green. The side 
chains of the catalytic triad residues are 
shown as ball-and-stick representations. 
Notice these side chains, including serine 
195, lining the active site in the upper half 
of the structure. A/so notice two intrastrand 
and two interstrand disulfide bonds in 
various locations throughout the molecule. 
[Drawn from 1GCT.pdb.] 
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enzyme (Figure 9.6). The structure of the active site explained the special 
reactivity of serine 195 (Figure 9.7). The side chain of serine 195 is hydro- 
gen bonded to the imidazole ring of histidine 57. The —NH group of this 
imidazole ring is, in turn, hydrogen bonded to the carboxylate group of 
aspartate 102. This constellation of residues is referred to as the catalytic 
triad. How does this arrangement of residues lead to the high reactivity of 
serine 195? The histidine residue serves to position the serine side chain 
and to polarize its hydroxyl group so that it is poised for deprotonation. In 
the presence of the substrate, the histidine residue accepts the proton 
from the serine 195 hydroxyl group. In doing so, the histidine acts as a 
general base catalyst. The withdrawal of the proton from the hydroxyl 
group generates an alkoxide ion, which is a much more powerful nucleo- 
phile than is an alcohol. The aspartate residue helps orient the histidine 
residue and make it a better proton acceptor through hydrogen bonding 
and electrostatic effects. 
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FIGURE 9.7 The catalytic triad. The catalytic triad, shown on the left, converts serine 195 
into a potent nucleophile, as illustrated on the right. 


These observations suggest a mechanism for peptide hydrolysis 
(Figure 9.8). After substrate binding (step 1), the reaction begins with the 
oxygen atom of the side chain of serine 195 making a nucleophilic attack on 
the carbonyl carbon atom of the target peptide bond (step 2). There are now 
four atoms bonded to the carbonyl carbon, arranged as a tetrahedron, 
instead of three atoms in a planar arrangement. This inherently unstable 
tetrahedral intermediate bears a formal negative charge on the oxygen atom 
derived from the carbonyl group. This charge is stabilized by interactions 
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FIGURE 9.8 Peptide hydrolysis by chymotrypsin. The mechanism of peptide hydrolysis 
illustrates the principles of covalent and acid-base catalysis. The reaction proceeds in eight 
steps: (1) substrate binding, (2) nucleophilic attack of serine on the peptide carbonyl group, (3) 
collapse of the tetrahedral intermediate, (4) release of the amine component, (5) water binding, 
(6) nucleophilic attack of water on the acyl-enzyme intermediate, (7) collapse of the tetrahedral 
intermediate; and (8) release of the carboxylic acid component. The dashed green lines 
represent hydrogen bonds. 


with NH groups from the protein in a site termed the oxyanion hole 
(Figure 9.9). These interactions also help stabilize the transition state that 
precedes the formation of the tetrahedral intermediate. This tetrahedral 
intermediate collapses to generate the acyl-enzyme (step 3). This step is 
facilitated by the transfer of the proton being held by the positively charged 
histidine residue to the amino group formed by cleavage of the peptide 
bond. The amine component is now free to depart from the enzyme (step 4), 
completing the first stage of the hydrolytic reaction—acylation of the 
enzyme. Such acyl-enzyme intermediates have even been observed using 
X-ray crystallography by trapping them through adjustment of conditions 
such as the nature of the substrate, pH, or temperature. 

The next stage—deacylation—begins when a water molecule takes the 
place occupied earlier by the amine component of the substrate (step 5). The 
ester group of the acyl-enzyme is now hydrolyzed by a process that essentially 
repeats steps 2 through 4. Now acting as a general acid catalyst, histidine 
57 draws a proton away from the water molecule. The resulting OH™ ion 
attacks the carbonyl carbon atom of the acyl group, forming a tetrahedral 
intermediate (step 6). This structure breaks down to form the carboxylic acid 
product (step 7). Finally, the release of the carboxylic acid product (step 8) 
readies the enzyme for another round of catalysis. 
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FIGURE 9.9 The oxyanion hole. The 
structure stabilizes the tetrahedral 
intermediate of the chymotrypsin reaction. 
Notice that hydrogen bonds (shown in green) 
link peptide NH groups and the negatively 
charged oxygen atom of the intermediate. 
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FIGURE 9.10 Specificity pocket of 
chymotrypsin. Notice that this pocket is 
lined with hydrophobic residues and is 
deep, favoring the binding of residues with 
long hydrophobic side chains such as 
phenylalanine (shown in green). The active- 
site serine residue (serine 195) is positioned 
to cleave the peptide backbone between 
the residue bound in the pocket and the 
next residue in the sequence. The key 
amino acids that constitute the binding 
site are identified. 


k FIGURE 9.12 Structural similarity of 
trypsin and chymotrypsin. An overlay of 
the structure of chymotrypsin (red) on that 
of trypsin (blue) is shown. Notice the high 
degree of similarity. Only a-carbon-atom 
positions are shown. The mean deviation in 
position between corresponding a-carbon 
atoms is 1.7 A. [Drawn from 5PTP.pdb and 
1GCT.pdb.] 
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This mechanism accounts for all characteristics of chymotrypsin 
action except the observed preference for cleaving the peptide bonds 
just past residues with large, hydrophobic side chains. Examination of 
the three-dimensional structure of chymotrypsin with substrate analogs 
and enzyme inhibitors revealed the presence of a deep hydrophobic 
pocket, called the S, pocket, into which the long, uncharged side chains 
of residues such as phenylalanine and tryptophan can fit. The binding of 
an appropriate side chain into this pocket positions the adjacent peptide 
bond into the active site for cleavage (Figure 9.10). The specificity of 
chymotrypsin depends almost entirely on which amino acid is directly 
on the amino-terminal side of the peptide bond to be cleaved. Other 
proteases have more-complex specificity patterns. Such enzymes have 
additional pockets on their surfaces for the recognition of other residues 
in the substrate. Residues on the amino-terminal side of the scissile 
bond (the bond to be cleaved) are labeled P,, P,, P3, and so forth, head- 
ing away from the scissile bond (Figure 9.11). Likewise, residues on the 
carboxy] side of the scissile bond are labeled P,’, P,, P;’, and so forth. 
The corresponding sites on the enzyme are referred toas5,,5,or5,',5,’, 
and so forth. 
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FIGURE 9.11 Specificity nomenclature for protease-substrate interactions. The potential 
sites of interaction of the substrate with the enzyme are designated P (shown in red), and 
corresponding binding sites on the enzyme are designated S. The scissile bond (also shown 

in red) is the reference point. 


Catalytic triads are found in other hydrolytic enzymes 


md Many other peptide-cleaving proteins have subsequently been 

found to contain catalytic triads similar to that discovered in chymo- 
trypsin. Some, such as trypsin and elastase, are obvious homologs of 
chymotrypsin. The sequences of these proteins are approximately 40% 
identical with that of chymotrypsin, and their overall structures are quite 
similar (Figure 9.12). These proteins operate by mechanisms identical 
with that of chymotrypsin. However, the three enzymes differ markedly 
in substrate specificity. Chymotrypsin cleaves at the peptide bond after 
residues with an aromatic or long nonpolar side chain. Trypsin cleaves at 
the peptide bond after residues with long, positively charged side 
chains—namely, arginine and lysine. Elastase cleaves at the peptide bond 
after amino acids with small side chains—such as alanine and serine. 
Comparison of the S, pockets of these enzymes reveals that these different 
specificities ave due to small structural differences. In trypsin, an aspartate 
residue (Asp 189) is present at the bottom of the 5, pocket in place of a 
serine residue in chymotrypsin. The aspartate residue attracts and stabi- 
lizes a positively charged arginine or lysine residue in the substrate. In 
elastase, two residues at the top of the pocket in chymotrypsin and tryp- 
sin are replaced by much bulkier valine residues (Val 190 and Val 216). 
These residues close off the mouth of the pocket so that only small side 
chains can enter (Figure 9.13). 
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FIGURE 9.13 The S, pockets of chymotrypsin, trypsin, and elastase. Certain residues 
play key roles in determining the specificity of these enzymes. The side chains of these 
residues, as well as those of the active-site serine residues, are shown in color. 


Other members of the chymotrypsin family include a collection of pro- 
teins that take part in blood clotting, to be discussed in Chapter 10, as well 
as the tumor marker protein prostate-specific antigen (PSA). In addition, 
a wide range of proteases found in bacteria, viruses, and plants belong to 
this clan. 

Other enzymes that are not homologs of chymotrypsin have been found 
to contain very similar active sites. As noted in Chapter 6, the presence of 
very similar active sites in these different protein families is a consequence 
of convergent evolution. Subtilisin, a protease in bacteria such as Bacillus 
amyloliquefaciens, is a particularly well characterized example. The active 
site of this enzyme includes both the catalytic triad and the oxyanion hole. 
However, one of the NH groups that forms the oxyanion hole comes from 
the side chain of an asparagine residue rather than from the peptide back- 
bone (Figure 9.14). Subtilisin is the founding member of another large 
family of proteases that includes representatives from Archaea, Bacteria, 
and Eukarya. 

Finally, other proteases have been discovered that contain an active-site 
serine or threonine residue that is activated not by a histidine—aspartate pair 
but by a primary amino group from the side chain of lysine or by the 
N-terminal amino group of the polypeptide chain. 

Thus, the catalytic triad in proteases has emerged at least three times 
in the course of evolution. We can conclude that this catalytic strategy 
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FIGURE 9.14 The catalytic triad and 
oxyanion hole of subtilisin. Notice the 
two enzyme NH groups (both in the 
backbone and in the side chain of Asn 155) 
located in the oxyanion hole. The NH 
groups will stabilize a negative charge that 
develops on the peptide bond attacked by 
nucleophilic serine 221 of the catalytic triad. 


FIGURE 9.15 Site-directed mutagenesis 
of subtilisin. Residues of the catalytic triad 
were mutated to alanine, and the activity of 
the mutated enzyme was measured. 
Mutations in any component of the catalytic 
triad cause a dramatic loss of enzyme 
activity. Note that the activity is displayed on 
a logarithmic scale. The mutations are 


identified as follows: the first letter is the one- 


letter abbreviation for the amino acid being 
altered; the number identifies the position of 
the residue in the primary structure; and the 
second letter is the one-letter abbreviation 
for the amino acid replacing the original one. 
Uncat. refers to the estimated rate for the 
uncatalyzed reaction. 
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must be an especially effective approach to the hydrolysis of peptides and 
related bonds. 


The catalytic triad has been dissected by site-directed mutagenesis 


How can we test the validity of the mechanism proposed for the catalytic 
triad? One way is to test the contribution of individual amino acid residues 
to the catalytic power of a protease by using site-directed mutagenesis 
(Section 5.2). Subtilisin has been extensively studied by this method. Each 
of the residues within the catalytic triad, consisting of aspartic acid 32, his- 
tidine 64, and serine 221, has been individually converted into alanine, and 
the ability of each mutant enzyme to cleave a model substrate has been 
examined (Figure 9.15). 

As expected, the conversion of active-site serine 221 into alanine dra- 
matically reduced catalytic power; the value of k,,, fell to less than one mil- 
lionth of its value for the wild-type enzyme. The value of K,, was essentially 
unchanged; its increase by no more than a factor of two indicated that sub- 
strate continued to bind normally. The mutation of histidine 64 to alanine 
reduced catalytic power to a similar degree. The conversion of aspartate 32 
into alanine reduced catalytic power by less, although the value of k.,, still 
fell to less than 0.005% of its wild-type value. The simultaneous conversion 
of all three residues into alanine was no more deleterious than the conversion 
of serine or histidine alone. These observations support the notion that the 
catalytic triad and, particularly, the serine—histidine pair act together to gen- 
erate a nucleophile of sufficient power to attack the carbonyl! carbon atom of 
a peptide bond. Despite the reduction in their catalytic power, the mutated 
enzymes still hydrolyze peptides a thousand times as fast as buffer at pH 8.6. 

Site-directed mutagenesis also offered a way to probe the importance of 
the oxyanion hole for catalysis. The mutation of asparagine 155 to glycine 
eliminated the side-chain NH group from the oxyanion hole of subtilisin. 
The elimination of the NH group reduced the value of k,,, to 0.2% of its 
wild-type value but increased the value of K,, by only a factor of two. 
These observations demonstrate that the NH group of the asparagine resi- 
due plays a significant role in stabilizing the tetrahedral intermediate and 
the transition state leading to it. 


Cysteine, aspartyl, and metalloproteases are other major classes of 
peptide-cleaving enzymes 


a Not all proteases utilize strategies based on activated serine residues. 
Classes of proteins have been discovered that employ three alternative 
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approaches to peptide-bond hydrolysis (Figure 9.16). These classes are the 
(1) cysteine proteases, (2) aspartyl proteases, and (3) metalloproteases. In 
each case, the strategy is to generate a nucleophile that attacks the peptide 
carbonyl group (Figure 9.17). 

The strategy used by the cysteine proteases is most similar to that used by 
the chymotrypsin family. In these enzymes, a cysteine residue, activated by 
a histidine residue, plays the role of the nucleophile that attacks the peptide 
bond (Figure 9.17) in a manner quite analogous to that of the serine residue 
in serine proteases. Because the sulfur atom in cysteine is inherently a better 
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FIGURE 9.16 Three classes of 
proteases and their active sites. These 
examples of a cysteine protease, an 
aspartyl protease, and a metalloprotease 
use a histidine-activated cysteine residue, 
an aspartate-activated water molecule, and 
a metal-activated water molecule, 
respectively, as the nucleophile. The two 
halves of renin are in blue and red to 
highlight the approximate twofold symmetry 
of aspartyl proteases. Notice how different 
these active sites are despite the similarity 
in the reactions they catalyze. [Drawn from 
1PPN.pdb.; 1HRN. pdb; 1LND.pdb.] 


FIGURE 9.17 The activation strategies 
for three classes of proteases. The 
peptide carbonyl group is attacked by (A) a 
histidine-activated cysteine in the cysteine 
proteases, (B) an aspartate-activated water 
molecule in the aspartyl proteases, and (C) 
a metal-activated water molecule in the 
metalloproteases. For the metalloproteases, 
the letter B represents a base (often 
glutamate) that helps deprotonate the 
metal-bound water. 
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hk FIGURE 9.18 HIV protease, a 
dimeric aspartyl protease. The protease 
is a dimer of identical subunits, shown in 
blue and yellow, consisting of 99 amino 
acids each. Notice the placement of active- 
site aspartic acid residues, one from each 
chain, which are shown as ball-and-stick 
structures. The flaps will close down on the 
binding pocket after substrate has been 
bound. [Drawn from 3PHV.pdb.] 


nucleophile than is the oxygen atom in serine, cysteine proteases appear to 
require only this histidine residue in addition to cysteine and not the full 
catalytic triad. A well-studied example of these proteins is papain, an 
enzyme purified from the fruit of the papaya. Mammalian proteases homol- 
ogous to papain have been discovered, most notably the cathepsins, proteins 
having a role in the immune system and other systems. The cysteine-based 
active site arose independently at least twice in the course of evolution; the 
caspases, enzymes that play a major role in apoptosis (a genetically pro- 
grammed cell death pathway), have active sites similar to that of papain, but 
their overall structures are unrelated. 

The second class comprises the aspartyl proteases. The central feature of 
the active sites is a pair of aspartic acid residues that act together to allow a 
water molecule to attack the peptide bond. One aspartic acid residue (in its 
deprotonated form) activates the attacking water molecule by poising it for 
deprotonation. The other aspartic acid residue (in its protonated form) 
polarizes the peptide carbonyl group so that it is more susceptible to attack 
(Figure 9.17). Members of this class include renin, an enzyme involved in 
the regulation of blood pressure, and the digestive enzyme pepsin. These 
proteins possess approximate twofold symmetry. A likely scenario is that 
two copies of a gene for the ancestral enzyme fused to form a single gene that 
encoded a single-chain enzyme. Each copy of the gene would have contrib- 
uted an aspartate residue to the active site. The individual chains are now 
joined to make a single chain in most aspartyl proteases, whereas the prote- 
ases present in human immunodeficiency virus (HIV) and other retrovi- 
ruses comprise dimers of identical chains (Figure 9.18). This observation is 
consistent with the idea that larger aspartyl proteases may have evolved by 
fusion of separate subunits. 

The metalloproteases constitute the final major class of peptide-cleaving 
enzymes. The active site of such a protein contains a bound metal ion, 
almost always zinc, that activates a water molecule to act as a nucleophile to 
attack the peptide carbonyl group. The bacterial enzyme thermolysin and 
the digestive enzyme carboxypeptidase A are classic examples of the zinc 
proteases. Thermolysin, but not carboxypeptidase A, is a member of a large 
and diverse family of homologous zinc proteases that includes the matrix 
metalloproteases, enzymes that catalyze the reactions in tissue remodeling 
and degradation. 

In each of these three classes of enzymes, the active site includes features 
that act to (1) activate a water molecule or another nucleophile, (2) polarize 
the peptide carbonyl group, and (3) stabilize a tetrahedral intermediate 
(Figure 9.17). 


Protease inhibitors are important drugs 


Several important drugs are protease inhibitors. For example, captopril, 

used to regulate blood pressure, is one of many inhibitors of the angio- 
tensin-converting enzyme (ACE), a metalloprotease. Indinavir (Crixivan), 
retrovir, and more than 20 other compounds used in the treatment of AIDS 
are inhibitors of HIV protease (Figure 9.18), an aspartyl protease. HIV prote- 
ase cleaves multidomain viral proteins into their active forms; blocking this 
process completely prevents the virus from being infectious. HIV protease 
inhibitors, in combination with inhibitors of other key HIV enzymes, dra- 
matically reduced deaths due to AIDS, assuming that the cost of the treatment 
could be covered (Figure 36.21). In many cases, these drugs have converted 
AIDS from a death sentence to a treatable chronic disease. 
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Indinavir resembles the peptide substrate of the HIV protease. 
Indinavir is constructed around an alcohol that mimics the tetrahedral 
intermediate; other groups are present to bind into the 5,, 5,,5,’, and 5," 
recognition sites on the enzyme (Figure 9.19). X-ray crystallographic 
studies revealed that, in the active site, indinavir adopts a confirmation 
that approximates the twofold symmetry of the enzyme (Figure 9.20). 
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FIGURE 9.19 Indinavir, an HIV protease 
inhibitor. The structure of indinavir (Crixivan) 
is shown in comparison with that of a peptide 
substrate of HIV protease. The scissile bond 
in the substrate is highlighted in red. 


hk FIGURE 9.20 HIV protease-indinavir 
complex. (Left) The HIV protease is shown 
with the inhibitor indinavir bound at the 
active site. Notice the twofold symmetry of 
the enzyme structure. (Right) The drug has 
been rotated to reveal its approximately 
twofold symmetric conformation. [Drawn 
from 1HSH.pdb.] 
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The active site of HIV protease is covered by two flexible flaps that fold 
down on top of the bound inhibitor. The OH group of the central alcohol 
interacts with the two aspartate residues of the active site. In addition, two 
carbonyl groups of the inhibitor are hydrogen bonded to a water molecule 
(not shown in Figure 9.20), which, in turn, is hydrogen bonded to a pep- 
tide NH group in each of the flaps. This interaction of the inhibitor with 
water and the enzyme is not possible within cellular aspartyl proteases 
such as renin. Thus, the interaction may contribute to the specificity of 
indinavir for HIV protease. To prevent side effects, protease inhibitors 
used as drugs must be specific for one enzyme without inhibiting other 
proteins within the body. 


9.2 Carbonic Anhydrases Make a Fast Reaction Faster 


Carbon dioxide is a major end product of aerobic metabolism. In mammals, 
this carbon dioxide is released into the blood and transported to the lungs 
for exhalation. While in the red blood cells, carbon dioxide reacts with water 
(Section 7.3). The product of this reaction is a moderately strong acid, car- 
bonic acid (pK, = 3.5), which is converted into bicarbonate ion (HCO; ) on 
the loss of a proton. 


? ; ‘ : 
1 1 
C + H,0 — Cc — C.. +4 Ht 
I kt HO™ OH Ho~ “0 
Carbonic Bicarbonate 
acid ion 


Even in the absence of a catalyst, this hydration reaction proceeds at 
a moderately fast pace. At 37°C near neutral pH, the second-order rate 
constant k, is 0.0027 M~!s~!. This value corresponds to an effective first- 
order rate constant of 0.15 s~! in water ((H,O] = 55.5 M). The reverse 
reaction, the dehydration of HCO;j, is even more rapid, with a rate constant 
of Rg = 50 s_!. These rate constants correspond to an equilibrium con- 
stant of K, = 5,4 x 107° and a ratio of [CO,] to [H,CO,] of 340 : 1 at 
equilibrium. 


es Carbon dioxide hydration and HCO; dehydration are often cou- 

pled to rapid processes, particularly transport processes. Thus, 
almost all organisms contain enzymes, referred to as carbonic anhydrases, 
that increase the rate of reaction beyond the already reasonable spontane- 
ous rate. For example, carbonic anhydrases dehydrate HCO; in the blood 
to form CO, for exhalation as the blood passes through the lungs. 
Conversely, they convert CO, into HCO; to generate the aqueous humor 
of the eye and other secretions. Furthermore, both CO, and HCO; are 
substrates and products for a variety of enzymes, and the rapid intercon- 
version of these species may be necessary to ensure appropriate substrate 
levels. So important are these enzymes in human beings that mutations in 
some carbonic anhydrases have been found to be associated with osteope- 
trosis (excessive formation of dense bones accompanied by anemia) and 
mental retardation. 

Carbonic anhydrases accelerate CO, hydration dramatically. The most- 
active enzymes hydrate CO, at rates as high as k,,, = 10°s—!, or a million 
times a second per enzyme molecule. Fundamental physical processes such 
as diffusion and proton transfer ordinarily limit the rate of hydration, and so 
the enzymes employ special strategies to attain such prodigious rates. 


Carbonic anhydrase contains a bound zinc ion 
essential for catalytic activity 


Less than 10 years after the discovery of carbonic anhydrase in 1932, this 
enzyme was found to contain a bound zinc ion. Moreover, the zinc ion 
appeared to be necessary for catalytic activity. This discovery, remarkable at 
the time, made carbonic anhydrase the first known zinc-containing enzyme. 
At present, hundreds of enzymes are known to contain zinc. In fact, more 
than one-third of all enzymes either contain bound metal ions or require the 
addition of such ions for activity. Metal ions have several properties that 
increase chemical reactivity: their positive charges, their ability to form 
strong yet kinetically labile bonds, and, in some cases, their capacity to be 
stable in more than one oxidation state. The chemical reactivity of metal 
ions explains why catalytic strategies that employ metal ions have been 
adopted throughout evolution. 

X-ray crystallographic studies have supplied the most-detailed and 
direct information about the zinc site in carbonic anhydrase. At least seven 
carbonic anhydrases, each with its own gene, are present in human beings. 
They are all clearly homologous, as revealed by substantial sequence iden- 
tity. Carbonic anhydrase IJ, a major protein component of red blood cells, 
has been the most extensively studied (Figure 9.21). It is also one of the 
most active carbonic anhydrases. 


Zinc is found only in the +2 state in biological systems. A zinc atom is 
essentially always bound to four or more ligands; in carbonic anhydrase, 
three coordination sites are occupied by the imidazole rings of three histi- 
dine residues and an additional coordination site is occupied by a water 
molecule (or hydroxide ion, depending on pH). Because the molecules 
occupying the coordination sites are neutral, the overall charge on the 
Zn(His), unit remains +2. 


Catalysis entails zinc activation of a water molecule 


How does this zinc complex facilitate carbon dioxide hydration? A major 
clue comes from the pH profile of enzymatically catalyzed carbon dioxide 
hydration (Figure 9.22). 

At pH 8, the reaction proceeds near its maximal rate. As the pH 
decreases, the rate of the reaction drops. The midpoint of this transition is 
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k& FIGURE 9.21 The structure 
of human carbonic anhydrase II 
and its zinc site. (Left) Notice 
that the zinc ion is bound to the 
imidazole rings of three histidine 
residues as well as to a water 
molecule. (Right) Notice the 
location of the zinc site in a 
cleft near the center of the 
enzyme. [Drawn from 1CA2.pdb.] 
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FIGURE 9.22 Effect of pH on carbonic 
anhydrase activity. Changes in pH alter 
the rate of carbon dioxide hydration 
catalyzed by carbonic anhydrase Il. The 
enzyme is maximally active at high pH. 
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FIGURE 9.24 Carbon dioxide binding 
site. Crystals of carbonic anhydrase were 
exposed to carbon dioxide gas at high 
pressure and low temperature, and x-ray 
diffraction data were collected. The electron 
density for carbon dioxide, clearly visible 
adjacent to the zinc and its bound water, 
reveals the carbon dioxide binding 

site. [Information from J. F. Domsic, et al., 

J. Biol. Chem. 283:30766-30771, 2008.] 
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FIGURE 9.23 The pK, of zinc-bound water. Binding to zinc lowers the pK, of water from 
15.7 to 7. 


near pH 7, suggesting that a group that loses a proton at pH 7 (pK, = 7) 
plays an important role in the activity of carbonic anhydrase. Moreover, the 
curve suggests that the deprotonated (high pH) form of this group partici- 
pates more effectively in catalysis. Although some amino acids, notably 
histidine, have pK, values near 7, a variety of evidence suggests that the 
group responsible for this transition is not an amino acid but is the zinc-bound 
water molecule. 

The binding of a water molecule to the positively charged zinc center 
reduces the pK, of the water molecule from 15.7 to 7 (Figure 9.23). With 
the pK, lowered, the water molecule can more easily lose a proton at neutral 
pH, generating a substantial concentration of hydroxide ion (bound to 
the zinc atom). A zinc-bound hydroxide ion (OH ) is a potent nucleophile 
able to attack carbon dioxide much more readily than water does. Adjacent 
to the zinc site, carbonic anhydrase also possesses a hydrophobic patch that 
serves as a binding site for carbon dioxide (Figure 9.24). Based on these 


FIGURE 9.25 Mechanism of carbonic eae @||-~ 
anhydrase. The zinc-bound hydroxide HO 
mechanism for the hydration of carbon . 0 
dioxide reveals one aspect of metal ion O | 
catalysis. The reaction proceeds in four Hp —C Ho ,C 
steps: (1) water deprotonation, (2) carbon | - @ Ih 
dioxide binding, (8) nucleophilic attack by mt 000ClU SS In2+ 
hydroxide on carbon dioxide, and (4) Hie” x His His XoHis 
‘ His 5 His 
displacement of bicarbonate ion by water. 


observations, a simple mechanism for carbon dioxide hydration can be 
proposed (Figure 9.25): 


1. The zinc ion facilitates the release of a proton from a water molecule, 
which generates a hydroxide ion. 


2. The carbon dioxide substrate binds to the enzyme’s active site and is 
positioned to react with the hydroxide ion. 


3. The hydroxide ion attacks the carbon dioxide, converting it into bicar- 
bonate ion, HCO... 


4. The catalytic site is regenerated with the release of HCO; and the bind- 
ing of another molecule of water. 


Thus, the binding of a water molecule to the zinc ion favors the formation 
of the transition state by facilitating proton release and by positioning the 
water molecule to be in close proximity to the other reactant. 

Studies of a synthetic analog model system provide evidence for the mech- 
anism’s plausibility. A simple synthetic ligand binds zinc through four 


(A) 


nitrogen atoms (compared with three histidine nitrogen atoms in the 
enzyme), as shown in Figure 9.26. One water molecule remains bound to 
the zinc ion in the complex. Direct measurements reveal that this water 
molecule has a pK, value of 8.7, not as low as the value for the water mole- 
cule in carbonic anhydrase but substantially lower than the value for free 
water. At pH 9.2, this complex accelerates the hydration of carbon dioxide 
more than 100-fold. Although its rate of catalysis is much less efficient than 
catalysis by carbonic anhydrase, the model system strongly suggests that 
the zinc-bound hydroxide mechanism is likely to be correct. Carbonic 
anhydrases have evolved to employ the reactivity intrinsic to a zinc-bound 
hydroxide ion as a potent catalyst. 


A proton shuttle facilitates rapid regeneration of the 
active form of the enzyme 


As noted earlier, some carbonic anhydrases can hydrate carbon dioxide at 
rates as high as a million times a second (10° s~!). The magnitude of this 
rate can be understood from the following observations. In the first step of 
a carbon dioxide hydration reaction, the zinc-bound water molecule must 
lose a proton to regenerate the active form of the enzyme (Figure 9.27). The 
rate of the reverse reaction, the protonation of the zinc-bound hydroxide 
ion, is limited by the rate of proton diffusion. Protons diffuse very rapidly 


with second-order rate constants near 10!1 M~! s~!. Thus, the backward 
rate constant k_, must be less than 10!' M~! s~!. Because the equilibrium 
constant K is equal to k,/k_,, the forward rate constant is given by k, = 
K-k_,. Thus, ifk_, =10''M~!s~! and K = 107’ M (because pK, = 7), 
then k, must be less than or equal to 10* s~!. In other words, the rate of 
proton diffusion limits the rate of proton release to less than 10* s~! for a 
group with pK, = 7. However, if carbon dioxide is hydrated at a rate of LO" 
s_!, then every step in the mechanism (Figure 9.25) must take place at least 
this fast. How is this apparent paradox resolved? 

The answer became clear with the realization that the highest rates 
of carbon dioxide hydration require the presence of buffer, suggesting that 
the buffer components participate in the reaction. The buffer can bind 
or release protons. The advantage is that, whereas the concentrations of 


FIGURE 9.26 A synthetic analog model 
system for carbonic anhydrase. (A) An 
organic compound, capable of binding zinc, 
was synthesized as a model for carbonic 
anhydrase. The zinc complex of this ligand 
accelerates the hydration of carbon dioxide 
more than 100-fold under appropriate 
conditions. (B) The structure of the presumed 
active complex showing zinc bound to the 
ligand and to one water molecule. 


FIGURE 9.27 Kinetics of water 
deprotonation. The kinetics of deprotonation 
and protonation of the zinc-bound water 
molecule in carbonic anhydrase. 
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FIGURE 9.28 The effect of buffer on 
deprotonation. The deprotonation of the 
zinc-bound water molecule in carbonic 
anhydrase is aided by buffer component B. 


FIGURE 9.29 The effect of buffer 
concentration on the rate of carbon 
dioxide hydration. The rate of carbon 
dioxide hydration increases with the 
concentration of the buffer 
1,2-dimethylbenzimidazole. The buffer 
enables the enzyme to achieve its high 
catalytic rates. 


FIGURE 9.30 Histidine proton shuttle. 

(1) Histidine 64 abstracts a proton from the 
zinc-bound water molecule, generating a 
nucleophilic hydroxide ion and a protonated 
histidine. (2) The buffer (B) removes a 
proton from the histidine, regenerating the 
unprotonated form. 
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protons and hydroxide ions are limited to 10~7 M at neutral pH, the con- 
centration of buffer components can be much higher, of the order of 
several millimolar. If the buffer component BH* has a pK, of 7 (match- 
ing that for the zinc-bound water molecule), then the equilibrium con- 
stant for the reaction in Figure 9.28 is 1. The rate of proton abstraction is 
given by k,’ - [B]. The second-order rate constants k,’ and k_,’ will be 
limited by buffer diffusion to values less than approximately 10° M~!s~!. 
Thus, buffer concentrations greater than [B] = 10~°M (or 1 mM) may be 
high enough to support carbon dioxide hydration rates of 10° M7! s~! 
because k,' -[B] = (10? M~!s~!)-(1073M) = 10°s~!. The prediction that 
the rate increases with increasing buffer concentration has been confirmed 
experimentally (Figure 9.29). 
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The molecular components of many buffers are too large to reach the 
active site of carbonic anhydrase. Carbonic anhydrase II has evolved a pro- 
ton shuttle to allow buffer components to participate in the reaction from 
solution. The primary component of this shuttle is histidine 64. This resi- 
due transfers protons from the zinc-bound water molecule to the protein 
surface and then to the buffer (Figure 9.30). Thus, catalytic function has 
been enhanced through the evolution of an apparatus for controlling proton 
transfer from and to the active site. Because protons participate in many 
biochemical reactions, the manipulation of the proton inventory within 
active sites is crucial to the function of many enzymes and explains the 
prominence of acid—base catalysis. 


9.3 Restriction Enzymes Catalyze Highly Specific 
DNA-Cleavage Reactions 


We next consider a hydrolytic reaction that results in the cleavage of DNA. 
Bacteria and archaea have evolved mechanisms to protect themselves from 
viral infections. Many viruses inject their DNA genomes into cells; once 
inside, the viral DNA hijacks the cell’s machinery to drive the production of 
viral proteins and, eventually, of progeny virus. Often, a viral infection 
results in the death of the host cell. A major protective strategy for the host 
is to use restriction endonucleases (restriction enzymes) to degrade the viral 
DNA on its introduction into a cell. These enzymes recognize particular 
base sequences, called recognition sequences or recognition sites, in their target 
DNA and cleave that DNA at defined positions. We have already consid- 
ered the utility of these important enzymes for dissecting genes and 
genomes (Section 5.1). The most well-studied class of restriction enzymes 
comprises the type II restriction enzymes, which cleave DNA within their 
recognition sequences. Other types of restriction enzymes cleave DNA at 
positions somewhat distant from their recognition sites. 

Restriction endonucleases must show tremendous specificity at two lev- 
els. First, they must not degrade host DNA containing the recognition 
sequences. Second, they must cleave only DNA molecules that contain 
recognition sites (hereafter referred to as cognate DNA) without cleaving 
DNA molecules that lack these sites. How do these enzymes manage to 
degrade viral DNA while sparing their own? In E. coli, the restriction endo- 
nuclease EcoRV cleaves double-stranded viral DNA molecules that contain 
the sequence 5'-GATATC-3’ but leaves intact host DNA containing hun- 
dreds of such sequences. We shall return to the strategy by which host cells 
protect their own DNA at the end of this section. 

Restriction enzymes must cleave DNA only at recognition sites, without 
cleaving at other sites. Suppose that a recognition sequence is six base pairs 
long. Because there are 4°, or 4096, sequences having six base pairs, the con- 
centration of sites that must not be cleaved will be approximately 4000-fold 
higher than the concentration of sites that should be cleaved. Thus, to keep 
from damaging host-cell DNA, restriction enzymes must cleave cognate 
DNA molecules much more than 4000 times as efficiently as they cleave non- 
specific sites. We shall return to the mechanism used to achieve the necessary 
high specificity after considering the chemistry of the cleavage process. 


Cleavage is by in-line displacement of 3’-oxygen from 
phosphorus by magnesium-activated water 


A restriction endonuclease catalyzes the hydrolysis of the phosphodiester 
backbone of DNA. Specifically, the bond between the 3’-oxygen atom and 
the phosphorus atom is broken. The products of this reaction are DNA 
strands with a free 3’-hydroxyl group and a 5’'-phosphoryl group at the 
cleavage site (Figure 9.31). This reaction proceeds by nucleophilic attack at 
the phosphorus atom. We will consider two alternative mechanisms, 
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FIGURE 9.31 Hydrolysis of a 
phosphodiester bond. All restriction 
enzymes catalyze the hydrolysis of DNA 
phosphodiester bonds, leaving a 
phosphoryl group attached to the 5’ end. 
The bond that is cleaved is shown in red. 


base 
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suggested by analogy with the proteases. The restriction endonuclease 
might cleave DNA by mechanism 1 through a covalent intermediate, 
employing a potent nucleophile (Nu), or by mechanism 2 through direct 
hydrolysis: 


Mechanism 1 (covalent intermediate) 


0-oO 
Oo-O oad 
Om Paw 
wr + enzyme——NuH =—— enzyme——Nu OR2 + ROH 
RO OR, 
O-O 
a / 0 ~O 
Bs —, I 
enzyme—Nu OR, + H2O0 =——— enzyme—NuH + Paw 
R,O OH 
Mechanism 2 (direct hydrolysis) 
o-P 0” P 
~” + HO —=— ROH+  P_ 
R,0~ “OR, HO OR> 


Each mechanism postulates a different nucleophile to attack the phosphorus 
atom. In either case, each reaction takes place by in-line displacement: 


R10 ORi OR, 

Nu + P—L Nu----- i = L N—P gif 
RO : YOR 
R30 R,O0 OR; tas 


The incoming nucleophile attacks the phosphorus atom, and a pentacoordi- 
nate transition state is formed. This species has a trigonal bipyramidal 
geometry centered at the phosphorus atom, with the incoming nucleophile 
at one apex of the two pyramids and the group that is displaced (the leaving 
group, L) at the other apex. Note that the displacement inverts the stereo- 
chemical conformation at the tetrahedral phosphorous atom, analogous to 
the interconversion of the R and 5 configurations around a tetrahedral car- 
bon center (Section 2.1). 

The two mechanisms differ in the number of times that the displace- 
ment takes place in the course of the reaction. In the first type of mecha- 
nism, a nucleophile in the enzyme (analogous to serine 195 in chymotryp- 
sin) attacks the phosphate group to form a covalent intermediate. In a 
second step, this intermediate is hydrolyzed to produce the final products. 
In this case, two displacement reactions take place at the phosphorus atom. 
Consequently, the stereochemical configuration at the phosphorus atom 
would be inverted and then inverted again, and the overall configuration 
would be retained. In the second type of mechanism, analogous to that 
used by the aspartyl- and metalloproteases, an activated water molecule 
attacks the phosphorus atom directly. In this mechanism, a single displace- 
ment reaction takes place at the phosphorus atom. Hence, the stereochem- 
ical configuration at the phosphorus atom is inverted after cleavage. To 
determine which mechanism is correct, we examine the stereochemistry at 
the phosphorus atom after cleavage. 

A difficulty is that the stereochemistry is not easily observed, because two 
of the groups bound to the phosphorus atom are simple oxygen atoms, iden- 
tical with each other. This difficulty can be circumvented by replacing one 
oxygen atom with sulfur (producing a species called a phosphorothioate). 


Cleavage site 


- = H2 0 


adenine 


Let us consider EcoRV endonuclease. This enzyme cleaves the phosphodi- 
ester bond between the T and the A at the center of the recognition sequence 
5’-GATATC-3’. The first step is to synthesize an appropriate substrate for 
EcoRV containing phosphorothioates at the sites of cleavage (Figure 9.32). 
The reaction is then performed in water that has been greatly enriched in 
180 to allow the incoming oxygen atom to be marked. The location of the 
'8Q label with respect to the sulfur atom indicates whether the reaction 
proceeds with inversion or retention of stereochemistry. This experiment 
revealed that the stereochemical configuration at the phosphorus atom was 
inverted only once with cleavage. This result is consistent with a direct attack 
by water at the phosphorus atom and rules out the formation of any cova- 
lently bound intermediate (Figure 9.33). 


Restriction enzymes require magnesium for catalytic activity 


Many enzymes that act on phosphate-containing substrates require 
Mg?? or some other similar divalent cation for activity. One or more Mg? 
(or similar) cations are essential to the function of restriction endonucleases. 
What are the functions of these metal ions? 

Direct visualization of the complex between EcoRV endonuclease and 
cognate DNA molecules in the presence of Mg** by crystallization has not 
been possible, because the enzyme cleaves the substrate under these circum- 
stances. Nonetheless, metal ion complexes can be visualized through several 
approaches. In one approach, crystals of EcoRV endonuclease are prepared 
bound to oligonucleotides that contain the enzyme’s recognition sequence. 
These crystals are grown in the absence of magnesium to prevent cleavage; 
after their preparation, the crystals are soaked in solutions containing the 
metal. Alternatively, crystals have been grown with the use of a mutated 
form of the enzyme that is less active. Finally, Mg?* can be replaced by 
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FIGURE 9.32 Labeling with 
phosphorothioates. Phosphorothioate 
groups, in which one of the nonbridging 
oxygen atoms is replaced by a sulfur atom, 
can be used to label specific sites in the 
DNA backbone to determine the overall 
stereochemical course of a displacement 
reaction. Here, a phosphorothioate is 
placed at sites that can be cleaved by 
EcoRV endonuclease. 


FIGURE 9.33 Stereochemistry of 
cleaved DNA. Cleavage of DNA by EcoRV 
endonuclease results in overall inversion of 
the stereochemical configuration at the 
phosphorus atom, as indicated by the 
stereochemistry of the phosphorus atom 
bound to one bridging oxygen atom, one 
16Q, one 180, and one sulfur atom. Two 
possible products are shown, only one of 
which is observed, indicating direct attack 
of water at the phosphorous atom. 
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FIGURE 9.34 A magnesium ion-binding 
site in EcoRV endonuclease. The 
magnesium ion helps to activate a water 
molecule and positions it so that it can 
attack the phosphorus atom. 


FIGURE 9.35 Structure of the 
recognition site of EcoRV 

endonuclease. (A) The sequence of the 
recognition site, which is symmetric around 
the axis of rotation designated in green. 

(B) The inverted repeat within the recognition 
sequence of EcoRV (and most other 
restriction endonucleases) endows the DNA 
site with twofold rotational symmetry. 


metal ions such as Ca’* that bind but do not result in much catalytic 
activity. In all cases, no cleavage takes place, and so the locations of the metal 
ion-binding sites are readily determined. 

As many as three metal ions have been found to be present per active 
site. The roles of these multiple metal ions are still under investigation. One 
ion-binding site is occupied in essentially all structures. This metal ion is 
coordinated to the protein through two aspartate residues and to one of the 
phosphate-group oxygen atoms near the site of cleavage. This metal ion 
binds the water molecule that attacks the phosphorus atom, helping to posi- 
tion and activate it in a manner similar to that for the Zn?* ion of carbonic 
anhydrase (Figure 9.34). 


The complete catalytic apparatus is assembled only within 
complexes of cognate DNA molecules, ensuring specificity 


We now return to the question of specificity, the defining feature of restric- 
tion enzymes. The recognition sequences for most restriction endonucleases 
are inverted repeats. This arrangement gives the three-dimensional structure 
of the recognition site a twofold rotational symmetry (Figure 9.35). 

The restriction enzymes display a corresponding symmetry: they are 
dimers whose two subunits are related by twofold rotational symmetry. 
The matching symmetry of the recognition sequence and the enzyme 
facilitates the recognition of cognate DNA by the enzyme. This similarity 
in structure has been confirmed by the determination of the structure of the 
complex between EcoRV endonuclease and DNA fragments containing its 
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k FIGURE 9.36 EcoRV embracing a cognate DNA molecule. (A) This view of the 
structure of EcoRV endonuclease bound to a cognate DNA fragment is down the helical axis of 
the DNA. The two protein subunits are in yellow and blue, and the DNA backbone is in red. 
Notice that the twofold axes of the enzyme dimer and the DNA are aligned. One of the DNA- 
binding loops (in green) of EcoRV endonuclease is shown interacting with the base pairs of its 
cognate DNA-binding site. Key amino acid residues are shown hydrogen bonding with (B) a CG 
base pair and (C) an AT base pair. [Drawn from 1RVB.pdb.] 


recognition sequence (Figure 9.36). The enzyme surrounds 
the DNA ina tight embrace. 

An enzyme’s binding affinity for substrates often deter- 
mines specificity. Surprisingly, however, binding studies 
performed in the absence of magnesium have demonstrated 
that the EcoRV endonuclease binds to all sequences, both 
cognate and noncognate, with approximately equal affinity. 
Why, then, does the enzyme cleave only cognate sequences? 
The answer lies in a unique set of interactions between the 
enzyme and a cognate DNA sequence. 

Within the 5’-GATATC-3' sequence, the G and A bases 
at the 5’ end of each strand and their Watson—Crick partners 
directly contact the enzyme by hydrogen bonding with resi- 
dues that are located in two loops, one projecting from the —_ igure 9.37 Distortion of the recognition site. The DNA is 
surface of each enzyme subunit (Figure 9.36). The most represented as a ball-and-stick model. The path of the DNA helical 
striking feature of this complex is the distortion of the DNA, _ axis, shown in red, is substantially distorted on binding to the 
which is substantially kinked in the center (Figure 9.37). The — ©"zyme. For the B form of DNA, the axis is straight (not shown). 
central two TA base pairs in the recognition sequence play a 
key role in producing the kink. They do not make contact with the enzyme 
but appear to be required because of their ease of distortion. The 5'-TA-3’ 
sequence is known to be among the most easily deformed base pairs. 

The structures of complexes formed with noncognate DNA 
fragments are strikingly different from those formed with cognate DNA: 
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the noncognate DNA conformation is not substantially 
distorted (Figure 9.38). This lack of distortion has important 
consequences with regard to catalysis. No phosphate is posi- 
tioned sufficiently close to the active-site aspartate residues to 
complete a magnesium ion-binding site (Figure 9.34). Hence, 
the nonspecific complexes do not bind the magnesium ions 
and the complete catalytic apparatus is never assembled. 
The distortion of the substrate and the subsequent binding 
of the magnesium ion account for the catalytic specificity of 
more than a millionfold that is observed for EcoRV endon- 
culease. Thus, enzyme specificity may be determined by the 
specificity of enzyme action rather than the specificity of sub- 
strate binding. 

We can now see the role of binding energy in this strategy 
for attaining catalytic specificity. The distorted DNA makes 
additional contacts with the enzyme, increasing the binding 
energy. However, the increase in binding energy is canceled 
by the energetic cost of distorting the DNA from its relaxed 
conformation (Figure 9.39). Thus, for EcoRV endonuclease, there is little 
difference in binding affinity for cognate and nonspecific DNA frag- 
ments. However, the distortion in the cognate complex dramatically 
affects catalysis by completing the magnesium ion-binding site. This 
example illustrates how enzymes can utilize available binding energy to 
deform substrates and poise them for chemical transformation. Interactions 
that take place within the distorted substrate complex stabilize the transi- 
tion state leading to DNA hydrolysis. 


hk FIGURE 9.38 Nonspecific and 
cognate DNA within EcoRV 
endonuclease. A comparison of the 
positions of the nonspecific (orange) and 
the cognate DNA (red) within EcoRV. Notice 
that, in the nonspecific complex, the DNA 
backbone is too far from the enzyme to 
complete the magnesium ion-binding 

sites. [Drawn from 1RVB.pdb.] 


Enzyme + 
nonspecific DNA 


Enzyme + 


cognate DNA Host-cell DNA is protected by the addition of methyl 


groups to specific bases 


How does a host cell harboring a restriction enzyme pro- 


>| Se Su tect its own DNA? The host DNA is methylated on 
o ne: 73 specific adenine bases within host recognition sequences 
2 ES ES by other enzymes called methylases (Figure 9.40). An 
Z| a a endonuclease will not cleave DNA if its recognition 
sequence is methylated. For each restriction endonucle- 

Nonepeditie Cognate Catalytically 25% the host cell produces a corresponding methylase 


that marks the host DNA at the appropriate methyla- 
tion site. These pairs of enzymes are referred to as 
restriction-modification systems. 

The distortion in the DNA explains how methylation 
blocks catalysis and protects host-cell DNA. The host E. 


complex complex competent 


DNA distortion 


FIGURE 9.39 Greater binding energy of EcoRV endonuclease 


bound to cognate versus noncognate DNA. The additional 
interactions between EcoRV endonuclease and cognate DNA 
increase the binding energy, which can be used to drive DNA 
distortions necessary for forming a catalytically competent 
complex. 


Cleaved 


FIGURE 9.40 Protection by 

methylation. The recognition sequence for 
EcoRV endonuclease (left) and the sites of 
methylation (right) in DNA protected from 
the catalytic action of the enzyme. 
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coli adds a methyl group to the amino group of the adenine 
nucleotide at the 5’ end of the recognition sequence. The 
presence of the methyl group blocks the formation of a 
hydrogen bond between the amino group and the side-chain 
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carbonyl group of asparagine 185 (Figure 9.41). This asparagine residue is EcoRV 
closely linked to the other amino acids that form specific contacts with the 

DNA. The absence of the hydrogen bond disrupts other interactions between 

the enzyme and the DNA substrate, and the distortion necessary for cleavage 

will not take place. 


Type Il restriction enzymes have a catalytic core in common and are 
probably related by horizontal gene transfer 


iz Type II restriction enzymes are prevalent in Archaea and Bacteria. 

What can we tell of the evolutionary history of these enzymes? 
Comparison of the amino acid sequences of a variety of type II restriction 
endonucleases did not reveal significant sequence similarity between most 
pairs of enzymes. However, a careful examination of three-dimensional 
structures, taking into account the location of the active sites, revealed the Thymine Adenine 
presence of a core structure conserved in the different enzymes. 

These observations indicate that many type II restriction enzymes are 
indeed evolutionarily related. Analyses of the sequences in greater detail 
suggest that bacteria may have obtained genes encoding these enzymes from ‘irveition ot hvdkdent bonds batwean 
other species by horizontal gene transfer, the passing of pieces of DNA (such EcoRV endonuclease and cognate DNA 
as plasmids) between species that provide a selective advantage in a particu- molecules and prevents their hydrolysis. 
lar environment. For example, EcoRI (from E. coli) and RsrI (from 
Rhodobacter sphaeroides) are 50% identical in sequence over 266 amino 
acids, clearly indicative of a close evolutionary relationship. However, these 
species of bacteria are not closely related. Thus, these species appear to have 
obtained the gene for these restriction endonucleases from a common source more 
recently than the time of their evolutionary divergence. Moreover, the codons 
used by the gene encoding EcoRI endonuclease to specify given amino acids 
are strikingly different from the codons used by most E. coli genes, which 
suggests that the gene did not originate in E. coli. 

Horizontal gene transfer may be a common event. For example, genes 
that inactivate antibiotics are often transferred, leading to the transmission of 
antibiotic resistance from one species to another. For restriction-modification 
systems, protection against viral infections may have favored horizontal 
gene transfer. 


Methylated DNA 


FIGURE 9.41 Methylation of adenine. 
The methylation of adenine blocks the 


9.4 Myosins Harness Changes in Enzyme Conformation 
to Couple ATP Hydrolysis to Mechanical Work 


The final enzymes that we will consider are the myosins. These enzymes 
catalyze the hydrolysis of adenosine triphosphate (ATP) to form adenosine 
diphosphate (ADP) and inorganic phosphate (P:) and use the energy associ- 
ated with this thermodynamically favorable reaction to drive the motion of 
molecules within cells. 
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Adenosine triphosphate (ATP) Inorganic phosphate (P;) Adenosine diphosphate (ADP) 


For example, when we lift a book, the energy required comes from ATP 
hydrolysis catalyzed by myosin in our muscles. Myosins are found in all 
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Globular ATPase domains 


FIGURE 9.42 Elongated structure of 
muscle myosin. An electron micrograph 
showing myosin from mammalian muscle. 
This dimeric protein has an elongated 
structure with two globular ATPase domains 
per dimer. [Courtesy of Dr. Paula Flicker, 

Dr. Theo Walliman, and Dr. Peter Vibert.] 


h& FIGURE 9.43 Myosin-ATP complex structure. An overlay 
of the structures of the ATPase domain from Dictyostelium 
discoideum myosin with no ligands bound (blue) and the complex 


eukaryotes and the human genome encodes more than 40 different myosins. 
Myosins generally have elongated structures with globular domains that actu- 
ally carry out ATP hydrolysis (Figure 9.42). In this chapter, we will focus on 
the globular ATPase domains, particularly the strategies that allow myosins to 
hydrolyze ATP in a controlled manner and to use the free energy associated 
with this reaction to promote substantial conformational changes within the 
myosin molecule. These conformational changes are amplified by other struc- 
tures in the elongated myosin molecules to transport proteins or other cargo 
substantial distances within cells. In Chapter 35, we will examine the action of 
myosins and other molecular-motor proteins in much more detail. 

As will be discussed in Chapter 15, ATP is used as the major currency 
of energy inside cells. Many enzymes use ATP hydrolysis to drive other 
reactions and processes. In almost all cases, an enzyme that hydrolyzed 
ATP without any such coupled processes would simply drain the energy 
reserves of a cell without benefit. 


ATP hydrolysis proceeds by the attack of water on 
the gamma phosphoryl group 


In our examination of the mechanism of restriction enzymes, we learned 
that an activated water molecule performs a nucleophilic attack on phos- 
phorus to cleave the phosphodiester backbone of DNA. The cleavage of 
ATP by myosins follows an analogous mechanism. To understand the myo- 
sin mechanism in more detail, we must first examine the structure of the 
myosin ATPase domain. 

The structures of the ATPase domains of several different myosins have 
been examined. One such domain, that from the soil-living amoeba 
Dictyostelium discoideum, an organism that has been extremely useful for 
studying cell movement and molecular-motor proteins, has been studied in 
great detail. The crystal structure of this protein fragment in the absence 
of nucleotides revealed a single globular domain comprising approximately 
750 amino acids. A water-filled pocket is present toward the center of the 
structure, suggesting a possible nucleotide-binding site. Crystals of this pro- 
tein were soaked in a solution containing ATP and the structure was exam- 
ined again. Remarkably, this structure revealed intact ATP bound in the 
active site with very little change in the overall structure and without evidence 
of significant hydrolysis (Figure 9.43). The ATP is also bound to a Mg?" ion. 

Kinetic studies of myosins, as well as many other 
enzymes having ATP or other nucleoside triphosphates as 
a substrate, reveal that these enzymes are essentially inac- 
tive in the absence of divalent metal ions such as magne- 


of this protein with ATP and magnesium bound (red). Notice that sium (Mg’*) or manganese (Mn?*) but acquire activity on 
the two structures are extremely similar to one another. [Drawn the addition of these ions. In contrast with the enzymes 


from 1FMV.pdb and 1FMW.pdbj. 
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discussed so far, the metal is not a component of the active 
site. Rather, nucleotides such as ATP bind these ions, and 
it is the metal ion—nucleotide complex that is the true sub- 
strate for the enzymes. The dissociation constant for the 
ATP-Mg?* complex is approximately 0.1 mM, and thus, 
given that intracellular Mg?* concentrations are typically 
in the millimolar range, essentially all nucleoside triphos- 
phates are present as NT P—Mg’* complexes. Magnesium 
or manganese complexes of nucleoside triphosphates are the 
true substrates for essentially all NTP-dependent enzymes. 
The nucleophilic attack by a water molecule on the 
y-phosphory] group requires some mechanism to activate the 
water, suchas abasicresidueora bound metal ion. Examination 


of the myosin—ATP complex structure shows no basic resi- 
due in an appropriate position and reveals that the bound 
Mg?" ion is too far away from the phosphoryl group to play 
this role. These observations suggest why this ATP complex 
is relatively stable; the enzyme is not in a conformation that is 
competent to catalyze the reaction. This observation suggests 
that the domain must undergo a conformational change to 
catalyze the ATP-hydrolysis reaction. 


Ser 236 


Formation of the transition state for ATP hydrolysis is 
associated with a substantial conformational change 


Vanadium ion 


Thr 186 


; : . FIGURE 9.44 Myosin ATPase Transition-State Analog. The 
The catalytically competent conformation of the myosin __ structure of the transition-state analog formed by treating the 


ATPase domain must bind and stabilize the transition myosin ATPase domain with ADP and vanadate (VO,,°-) in the 
state of the reaction. In analogy with restriction enzymes, —_ Presence of magnesium. Notice that the vanadium ion is 


we expect that ATP hydrolysis includes a pentacoordinate 
transition state. 
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Such pentacoordinate structures based on phosphorus are too unstable to 
be readily observed. However, transition-state analogs in which other 
atoms replace phosphorus are more stable. The transition metal vana- 
dium, in particular, forms similar structures. The myosin ATPase 
domain can be crystallized in the presence of ADP and vanadate, VO}~. 
The result is the formation of a complex that closely matches the expected 
transition-state structure (Figure 9.44). As expected, the vanadium atom 
is coordinated to five oxygen atoms, including one oxygen atom from 
ADP diametrically opposite an oxygen atom that is analogous to the 
attacking water molecule in the transition state. The Mg?" ion is coordi- 
nated to one oxygen atom from the vanadate, one oxygen atom from the 
ADP, two hydroxyl groups from the enzyme, and two water molecules. 
In this position, this ion does not appear to play any direct role in activat- 
ing the attacking water. However, an additional residue from the enzyme, 
Ser 236, is well positioned to play a role in catalysis (Figure 9.44). In the 
proposed mechanism of ATP hydrolysis based on this structure, the 
water molecule attacks the y-phosphoryl group, with the hydroxyl group 
of Ser 236 facilitating the transfer of a proton from the attacking water to 
the hydroxyl group of Ser 236, which, in turn, is deprotonated by one of 
the oxygen atoms of the y-phosphoryl group (Figure 9.45). Thus, in 
effect, the ATP serves as a base to promote its own hydrolysis. 


coordinated to five oxygen atoms including one from ADP. The 
positions of two residues that bind magnesium as well as Ser 236, 
a residue that appears to play a direct role in catalysis, are 

shown. [Drawn from 1VOM.pdb] 


FIGURE 9.45 Facilitating water 

attack. The water molecule attacking the 
y-phosphoryl group of ATP is deprotonated 
by the hydroxyl group of Ser 236, which, in 
turn, is deprotonated by one of the oxygen 
atoms of the y-phosphoryl group forming 
the H,PO; product. 
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h& FIGURE 9.46 Myosin conformational 
changes. A comparison of the overall 
structures of the myosin ATPase domain 
with ATP bound (shown in red) and that 
with the transition-state analog ADP- 
vanadate (shown in blue). Notice the large 
conformational change of a region at the 
carboxyl-terminus of the domain, some 
parts of which move as much as 25 A. 
[Drawn from 1FMW.pdb and 1VOM.pdb]. 


Comparison of the overall structures of the myosin ATPase domain 
complexed with ATP and with the ADP-—vanadate reveals some remarkable 
differences. Relatively modest structural changes occur in and around the 
active site. In particular, a stretch of amino acids moves closer to the nucleo- 
tide by approximately 2 A and interacts with the oxygen atom that corre- 
sponds to the attacking water molecule. These changes help facilitate the 
hydrolysis reaction by stabilizing the transition state. However, examina- 
tion of the overall structure shows even more striking changes. 

A region comprising approximately 60 amino acids at the carboxyl- 
terminus of the domain adopts a different configuration in the ADP— 
vanadate complex, displaced by as much as 25 A from its position in the ATP 
complex (Figure 9.46). This displacement tremendously amplifies the rela- 
tively subtle changes that take place in the active site. The effect of this 
motion is amplified even more as this carboxyl-terminal domain is connected 
to other structures within the elongated structures typical of myosin mole- 
cules (Figure 9.42). Thus, the conformation that is capable of promoting the 
ATP hydrolysis reaction is itself substantially different from other conforma- 
tional changes that take place in the course of the catalytic cycle. 


The altered conformation of myosin persists for a 
substantial period of time 


Myosins are slow enzymes, typically turning over approximately once per 
second. What steps limit the rate of turnover? In an experiment that was 
particularly revealing, the hydrolysis of ATP was catalyzed by the myosin 
ATPase domain from mammalian muscle. The reaction took place in water 
labeled with !8O to track the incorporation of solvent oxygen into the reac- 
tion products. The fraction of oxygen in the phosphate product was ana- 
lyzed. In the simplest case, the phosphate would be expected to contain one 
oxygen atom derived from water and three initially present in the terminal 
phosphoryl group of ATP. 
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Instead, between two and three of the oxygen atoms in the phosphate were 
found, on average, to be derived from water. These observations indicate that 
the ATP hydrolysis reaction within the enzyme active site is reversible. Each 
molecule of ATP is cleaved to ADP and P. and then re-formed from these 
products several times before the products are released from the enzyme 
(Figure 9.47). At first glance, this observation is startling because ATP hydro- 
lysis is a very favorable reaction with an equilibrium constant of approximately 
140,000. However, this equilibrium constant applies to the molecules free in 
solution, not within the active site of an enzyme. Indeed, more-extensive 
analysis suggests that this equilibrium constant on the enzyme is approxi- 
mately 10, indicative of a general strategy used by enzymes. Enzymes catalyze 
reactions by stabilizing the transition state. The structure of this transition state 
is intermediate between the enzyme-bound reactants and the enzyme-bound 
products. Many of the interactions that stabilize the transition state will help 
equalize the stabilities of the reactants and the products. Thus, the equilibrium 
constant between enzyme-bound reactants and products is often close to 1, regard- 
less of the equilibrium constant for the reactants and products free in solution. 

These observations reveal that the hydrolysis of ATP to ADP and P. is 
not the rate-limiting step for the reaction catalyzed by myosin. Instead, the 
release of the products, particularly P., from the enzyme is rate limiting. 
The fact that a conformation of myosin with ATP hydrolyzed but still 
bound to the enzyme persists for a significant period of time is critical for 
coupling conformational changes that take place in the course of the reac- 
tion to other processes. 


Scientists can watch single molecules of myosin move 


Myosin molecules function to use the free energy of hydrolysis of ATP to 
drive macroscopic motion. Myosin molecules move along a filamentous 
protein termed actin, as we will discuss in more detail in Chapter 35. Using 
a variety of physical methods, scientists have been able to watch single myo- 
sin in action. For example, a myosin family member termed myosin V can be 
labelled with fluorescent tags so that it can be localized when fixed on a 
surface with a precision of less than 15 A. When this myosin is placed on a 
surface coated with actin filaments, each molecule remains in a fixed posi- 
tion. However, when ATP is added, each molecule moves along the surface. 
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FIGURE 9.47 Reversible hydrolysis of 
ATP within the myosin active site. For 
myosin, more than one atom of oxygen from 
water is incorporated in inorganic phosphate. 
The oxygen atoms are incorporated in cycles 
of hydrolysis of ATP to ADP and inorganic 
phosphate, phosphate rotation within the 


active site, and reformation of ATP now 
containing oxygen from water. 
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hk FIGURE 9.49 The core domain of 
NMP kinases. Notice the P-loop shown in 
green. The dashed lines represent the 
remainder of the protein structure. [Drawn 
from 1GKY.pdb.] 


FIGURE 9.48 Single molecule motion. (A) A trace of the position of a single dimeric myosin 
V molecule as it moves across a surface coated with actin filaments. (B) A model of how the 
dimeric molecule moves in discrete steps with an average size of 74 + 5 nm. [Data from 

A. Yildiz, et al., Science 300(6628):2061-2065, 2003.] 


Tracking individual molecules reveals that each moves in steps of approxi- 
mately 74 nm as shown in Figure 9.48. The observation of steps of a fixed 
size as well as the determination of this step size helps reveal details of the 
mechanism of action of these tiny molecular motors. 


Myosins are a family of enzymes containing P-loop structures 


Ae X-ray crystallography has yielded the three-dimensional structures of 

a number of different enzymes that share key structural characteristics 
and, almost certainly, an evolutionary history with myosin. In particular, a 
conserved NTP-binding core domain is present. This domain consists of a 
central 8 sheet, surrounded on both sides by « helices (Figure 9.49). A char- 
acteristic feature of this domain is a loop between the first B strand and the 
first helix. This loop typically has several glycine residues that are often 
conserved between more closely related members of this large and diverse 
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a subunit of transducin B subunit of ATP synthase 


hk FIGURE 9.50 Three proteins containing P-loop NTPase domains. Notice the 
conserved domains shown with the inner surfaces of the ribbons in purple and the 
P-loops in green. [Drawn from 4AKE.pdb; 1TND.pdb; 1BMF.pdb.] 


family. The loop is often referred to as the P-loop because it interacts with 
phosphoryl groups on the bound nucleotide. P-loop NTPase domains are 
present in a remarkably wide array of proteins, many of which participate in 
essential biochemical processes. Examples include ATP synthase, the key 
enzyme responsible for ATP generation; signal-transduction proteins such 
as G proteins; proteins essential for translating mRNA into proteins, such 
as elongation factor Tu; and DNA- and RNA-unwinding helicases. The 
wide utility of P-loop NTPase domains is perhaps best explained by their 
ability to undergo substantial conformational changes on nucleoside tri- 
phosphate binding and hydrolysis. We shall encounter these domains 
throughout the book and shall observe how they function as springs, 
motors, and clocks. To allow easy recognition of these domains in the book, 
they will be depicted with the inner surfaces of the ribbons in a ribbon dia- 
gram shown in purple and the P-loop shown in green (Figure 9.50). 


SUMMARY 


Enzymes adopt conformations that are structurally and chemically comple- 
mentary to the transition states of the reactions that they catalyze. Sets of 
interacting amino acid residues make up sites with the special structural and 
chemical properties necessary to stabilize the transition state. Enzymes use 
five basic strategies to form and stabilize the transition state. The first 
(1) involves the use of binding energy to promote both specificity and 
catalysis. The others are (2) covalent catalysis, (3) general acid—base cataly- 
sis, (4) catalysis by approximation, and (5) metal ion catalysis. The four 
classes of enzymes examined in this chapter catalyze the addition of water to 
their substrates but have different requirements for catalytic speed, speci- 
ficity, and coupling to other processes. 


9.1 Proteases Facilitate a Fundamentally Difficult Reaction 

The cleavage of peptide bonds by chymotrypsin is initiated by the attack 
by a serine residue on the peptide carbonyl group. The attacking 
hydroxyl group is activated by interaction with the imidazole group of a 
histidine residue, which is, in turn, linked to an aspartate residue. This 
Ser-His-Asp catalytic triad generates a powerful nucleophile. The prod- 
uct of this initial reaction is a covalent intermediate formed by the 
enzyme and an acyl group derived from the bound substrate. The 
hydrolysis of this acyl-enzyme intermediate completes the cleavage pro- 
cess. The tetrahedral intermediates for these reactions have a negative 
charge on the peptide carbonyl oxygen atom. This negative charge is 
stabilized by interactions with peptide NH groups in a region on the 
enzyme termed the oxyanion hole. 

Other proteases employ the same catalytic strategy. Some of these 
proteases, such as trypsin and elastase, are homologs of chymotrypsin. 
Other proteases, such as subtilisin, contain a very similar catalytic triad 
that has arisen by convergent evolution. Active-site structures that dif- 
fer from the catalytic triad are present in a number of other classes of 
proteases. These classes employ a range of catalytic strategies but, in 
each case, a nucleophile is generated that is sufficiently powerful to 
attack the peptide carbonyl group. In some enzymes, the nucleophile is 
derived from a side chain whereas, in others, an activated water mole- 
cule attacks the peptide carbony] directly. 


9.2 Carbonic Anhydrases Make a Fast Reaction Faster 
Carbonic anhydrases catalyze the reaction of water with carbon 
dioxide to generate carbonic acid. The catalysis can be extremely 


281 


Summary 


282 
CHAPTER 9 Catalytic Strategies 


9.3 


9.4 


fast: some carbonic anhydrases hydrate carbon dioxide at rates as 
high as 1 million times per second. A tightly bound zinc ion 1s a cru- 
cial component of the active sites of these enzymes. Each zinc ion 
binds a water molecule and promotes its deprotonation to generate a 
hydroxide ion at neutral pH. This hydroxide ion attacks carbon 
dioxide to form bicarbonate ion, HCO j. Because of the physiologi- 
cal roles of carbon dioxide and bicarbonate ions, speed is of the 
essence for this enzyme. To overcome limitations imposed by the rate 
of proton transfer from the zinc-bound water molecule, the most- 
active carbonic anhydrases have evolved a proton shuttle to transfer 
protons to a buffer. 


Restriction Enzymes Catalyze Highly Specific 

DNA-Cleavage Reactions 

A high level of substrate specificity is often the key to biological func- 
tion. Restriction endonucleases that cleave DNA at specific recogni- 
tion sequences discriminate between molecules that contain these 
recognition sequences and those that do not. Within the enzyme-— 
substrate complex, the DNA substrate is distorted in a manner that 
generates a magnesium ion-binding site between the enzyme and 
DNA. The magnesium ion binds and activates a water molecule, 
which attacks the phosphodiester backbone. 

Some enzymes discriminate between potential substrates 
by binding them with different affinities. Others may bind many 
potential substrates but promote chemical reactions efficiently only 
on specific molecules. Restriction endonucleases such as EcoRV 
endonuclease employ the latter mechanism. Only molecules con- 
taining the proper recognition sequence are distorted in a manner 
that allows magnesium ion binding and, hence, catalysis. Restriction 
enzymes are prevented from acting on the DNA of a host cell by the 
methylation of key sites within its recognition sequences. The added 
methyl groups block specific interactions between the enzymes and 
the DNA such that the distortion necessary for cleavage does not 
take place. 


Myosins Harness Changes in Enzyme Conformation to 

Couple ATP Hydrolysis to Mechanical Work 

Finally, myosins catalyze the hydrolysis of adenosine triphosphate 
(ATP) to form adenosine diphosphate (ADP) and inorganic 
phosphate (P.). The conformations of myosin ATPase domains free 
of bound nucleotides and with bound ATP are quite similar. 
Through the use of ADP and vanadate (VO?-), an excellent mimic 
of the transition state for ATP hydrolysis bound to the myosin 
ATPase domain can be produced. The structure of this complex 
reveals that dramatic conformational changes take place on forma- 
tion of this species from the ATP complex. These conformational 
changes are used to drive substantial motions in molecular motors. 
The rate of ATP hydrolysis by myosin is relatively low and is limited 
by the rate of product release from the enzyme. The hydrolysis of 
ATP to ADP and P, within the enzyme is reversible with an equilib- 
rium constant of approximately 10, compared with an equilibrium 
constant of 140,000 for these species free in solution. Myosins are 
examples of P-loop NTPase enzymes, a large collection of protein 
families that play key roles ina range of biological processes by virtue 
of the conformational changes that they undergo with various nucle- 
otides bound. 
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binding energy (p. 252) 

induced fit (p. 252) 

covalent catalysis (p. 252) 

general acid-base catalysis (p. 252) 
catalysis by approximation (p. 252) 
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PROBLEMS 


1. No burst. Examination of the cleavage of the amide sub- 
strate, A, by chymotrypsin with the use of stopped-flow 
kinetic methods reveals no burst. The reaction is monitored 
by noting the color produced by the release of the amino 
part of the substrate (highlighted in orange). Why is no 
burst observed? 
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2. Contributing to your own demise. Consider the subtilisin 
substrates A and B. 


Phe-Ala-Gln-Phe-X Phe-Ala-His-Phe-X 
A B 


These substrates are cleaved (between Phe and X) by native 
subtilisin at essentially the same rate. However, the His 
64-to-Ala mutant of subtilisin cleaves substrate B more 
than 1000-fold as rapidly as it cleaves substrate A. Propose 
an explanation. 


3. 1 + 1 #2. Consider the following argument. In sub- 
tilisin, mutation of Ser 221 to Ala results in a 10°-fold 
decrease in activity. Mutation of His 64 to Ala results in a 
similar 10°-fold decrease. Therefore, simultaneous muta- 
tion of Ser 221 to Ala and His 64 to Ala should result in a 
10° x 10° = 10!-fold reduction in activity. Is this reduc- 
tion correct? Why or why not? 


4. Adding a charge. In chymotrypsin, a mutant was con- 
structed with Ser 189, which is in the bottom of the sub- 
strate-specificity pocket, changed to Asp. What effect 
would you predict for this Ser 189—>Asp 189 mutation? 


5. Conditional results. In carbonic anhydrase II, mutation 
of the proton-shuttle residue His 64 to Ala was expected to 
result in a decrease in the maximal catalytic rate. However, 
in buffers such as imidazole with relatively small molecular 
components, no rate reduction was observed. In buffers 
with larger molecular components, significant rate reduc- 
tions were observed. Propose an explanation. 


6. How many sites? A researcher has isolated a restriction 
endonuclease that cleaves at only one particular 10-base- 
pair site. Would this enzyme be useful in protecting cells 
from viral infections, given that a typical viral genome is 
50,000 base pairs long? Explain. 


7. Is faster better? Restriction endonucleases are, in gen- 
eral, quite slow enzymes with typical turnover numbers of 
1 s~!. Suppose that endonucleases were faster, with turn- 
over numbers similar to those for carbonic anhydrase (10° 
s_'), such that they act faster than do methylases. Would 
this increased rate be beneficial to host cells, assuming that 
the fast enzymes have similar levels of specificity? 


8. Adopting a new gene. Suppose that one species of bacte- 
ria obtained one gene encoding a restriction endonuclease 
by horizontal gene transfer. Would you expect this acquisi- 
tion to be beneficial? 


9. Chelation therapy. Treatment of carbonic anhydrase 
with high concentrations of the metal chelator EDTA (eth- 
ylenediaminetetraacetic acid) results in the loss of enzyme 
activity. Propose an explanation. 


10. An aldehyde inhibitor. Elastase is specifically inhibited 
by an aldehyde derivative of one of its substrates: 


Cc H 

N-Acetyl-Pro-Ala-Pro—N a 
H 
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(a) Which residue in the active site of elastase is most likely 
to form a covalent bond with this aldehyde? 


(b) What type of covalent link would be formed? 
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11. Identify the enzyme. Consider the structure of molecule 
A. Which enzyme discussed in the chapter do you think 
molecule A will most effectively inhibit? 
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12. Acid test. At pH 7.0, carbonic anhydrase exhibits a 
Rat Of 600,000 s_'. Estimate the value expected for Roop at 


pH 6.0. 


13. Restriction. To terminate a reaction in which a restric- 
tion enzyme cleaves DNA, researchers often add high 
concentrations of the metal chelator EDTA (ethylenedi- 


aminetetraacetic acid). Why does the addition of EDTA 
terminate the reaction? 


14. Labeling strategy. ATP is added to the myosin ATPase 
domain in water labeled with '°O. After 50% of the ATP 
has been hydrolyzed, the remaining ATP is isolated and 
found to contain !8O. Explain. 


15. Viva la resistance. Many patients become resistant to 
HIV protease inhibitors with the passage of time owing to 


mutations in the HIV gene that encodes the protease. 
Mutations are not found in the aspartate residue that inter- 
acts with the drugs. Why not? 


16. More than one way to skin k,,,. Serine 236 in Dictyostelium 
discoideum myosin has been mutated to alanine. The 
mutated protein showed modestly reduced ATPase activ- 
ity. Analysis of the crystal structure of the mutated protein 
revealed that a water molecule occupied the position of the 
hydroxyl group of the serine residue in the wild-type pro- 
tein. Propose a mechanism for the ATPase activity of the 
mutated enzyme. 


17. A power struggle. The catalytic power of an enzyme can 
be defined as the ratio of the rate of the enzyme catalyzed 
reaction to that for the uncatalyzed reaction. Using the 
information in Figure 9.15 for subtilisin and in Figure 9.22 
for carbonic anhydrase, calculate the catalytic powers for 
these two enzymes. 


18. Wounded but not dead. How much activity (in terms of 
relative k,,, values) does the version of subtilisin with all 
three residues in the catalytic triad mutated have compared 
to uncatalyzed reaction? Propose an explanation. 


Mechanism Problem 


19. Complete the mechanism. On the basis of the informa- 
tion provided in Figure 9.17, complete the mechanisms for 
peptide-bond cleavage by (a) a cysteine protease, (b) an 
aspartyl protease, and (c) a metalloprotease. 
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Like motor traffic, metabolic pathways flow more efficiently when 
regulated by signals. Cytidine triphosphate (CTP), the final 
product of a multistep pathway, controls flux through the 
pathway by inhibiting the committed step catalyzed by aspartate 
transcarbamoylase (ATCase). [(Left) Michael Winokur 
Photography/Getty Images. ] 


OUTLINE 

he activity of enzymes must often be regulated so that they function at 

the proper time and place. This regulation is essential for coordination 10.1 Aspartate Transcarbamoylase Is 
of the vast array of biochemical processes taking place at any instant in an Allosterically Inhibited by the End 
organism. Enzymatic activity is regulated in five principal ways: Product of Its Pathway 

. , . . a . 10.2 lsozymes Provide a Means of 

1. Allosteric Control. Allosteric proteins contain distinct regulatory sites Regulation Specific to Distinct 
and multiple functional sites. The binding of small signal molecules at Tissues and Developmental 
regulatory sites controls the activity of these proteins. Moreover, allosteric Stages 


proteins show the property of cooperativity: activity at one functional site 
affects the activity at others. Proteins displaying allosteric control are thus 
information transducers: their activity can be modified in response to signal 
molecules or to information shared among active sites. This chapter exam- 10.4 Many Enzymes Are Activated by 
ines one of the best-understood allosteric proteins: the enzyme aspartate Specific Proteolytic Cleavage 
transcarbamoylase (ATCase). Catalysis by aspartate transcarbamoylase of 

the first step in pyrimidine biosynthesis is inhibited by cytidine triphos- 

phate, the final product of that biosynthesis, in an example of feedback 

inhibition. We have already examined an allosteric protein—hemoglobin, 

the oxygen transport protein in the blood (Chapter 7). 


10.3 Covalent Modification Is a Means 
of Regulating Enzyme Activity 


2. Multiple Forms of Enzymes. Isozymes, or isoenzymes, provide an avenue 
for varying regulation of the same reaction to meet the specific physiological 
needs in the particular tissue at a particular time. Isozymes are homologous 
enzymes within a single organism that catalyze the same reaction but differ 
285 


286 
CHAPTER 10 Regulatory Strategies 


FIGURE 10.1 ATCase reaction. Aspartate 
transcarbamoylase catalyzes the 
committed step, the condensation of 
aspartate and carbamoyl phosphate to 
form N-carbamoylaspartate, in pyrimidine 
synthesis. 


slightly in structure and more obviously in K,, and V aoe values as well as in 
regulatory properties. Often, isozymes are expressed in a distinct tissue or 


organelle or at a distinct stage of development. 


3. Reversible Covalent Modification. The catalytic properties of many enzymes 
are markedly altered by the covalent attachment of a modifying group, com- 
monly a phosphoryl group. ATP serves as the phosphoryl donor in these reac- 
tions, which are catalyzed by protein kinases. The removal of phosphoryl 
groups by hydrolysis is catalyzed by protein phosphatases. This chapter 
considers the structure, specificity, and control of protein kinase A (PKA), a 
ubiquitous eukaryotic enzyme that regulates diverse target proteins. 


4. Proteolytic Activation. The enzymes controlled by some of these regula- 
tory mechanisms cycle between active and inactive states. A different regu- 
latory strategy is used to irreversibly convert an inactive enzyme into an 
active one. Many enzymes are activated by the hydrolysis of a few peptide 
bonds or even one such bond in inactive precursors called zymogens or 
proenzymes. This regulatory mechanism generates digestive enzymes such 
as chymotrypsin, trypsin, and pepsin. Blood clotting is due to a remarkable 
cascade of zymogen activations. Active digestive and clotting enzymes are 
switched off by the irreversible binding of specific inhibitory proteins that 
are irresistible lures to their molecular prey. 


5. Controlling the Amount of Enzyme Present. Enzyme activity can also be 
regulated by adjusting the amount of enzyme present. This important form 
of regulation usually takes place at the level of transcription. We will con- 
sider the control of gene transcription in Chapters 29, 30, and 31. 


To begin, we will consider the principles of allostery by examining the 
enzyme aspartate transcarbamoylase. 


10.1 Aspartate Transcarbamoylase Is Allosterically 
Inhibited by the End Product of Its Pathway 


Aspartate transcarbamoylase catalyzes the first step in the biosynthesis of 
pyrimidines: the condensation of aspartate and carbamoyl phosphate to 
form N-carbamoylaspartate and orthophosphate (Figure 10.1). This reac- 
tion is the committed step in the pathway, which consists of 10 reactions, that 
will ultimately yield the pyrimidine nucleotides uridine triphosphate (UTP) 
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Cytidine triphosphate (CTP) 


and cytidine triphosphate (CTP). How is this enzyme regulated to generate 
precisely the amount of pyrimidines needed by the cell? 

ATCase is inhibited by CTP, the final product of the AT Case-initiated 
pathway. The rate of the reaction catalyzed by AT Case is fast at low concen- 
trations of CTP but slows as CTP concentration increases (Figure 10.2). 
Thus, the pathway continues to make new pyrimidines until sufficient 
quantities of CTP have accumulated. The inhibition of ATCase by CTP is 
an example of feedback inhibition, the inhibition of an enzyme by the 
end product of the pathway. Feedback inhibition by CTP ensures that 
N-carbamoylaspartate and subsequent intermediates in the pathway are not 
needlessly formed when pyrimidines are abundant. 

The inhibitory ability of CTP is remarkable because CTP is structurally 
quite different from the substrates of the reaction (Figure 10.1). Thus CTP 
must bind to a site distinct from the active site at which substrate binds. 
Such sites are called allosteric or regulatory sites. CTP is an example of an 
allosteric inhibitor. In ATCase (but not all allosterically regulated enzymes), the 
catalytic sites and the regulatory sites are on separate polypeptide chains. 


Allosterically regulated enzymes do not follow 
Michaelis-Menten kinetics 


Allosteric enzymes are distinguished by their response to changes in sub- 
strate concentration in addition to their susceptibility to regulation by other 
molecules. Let us examine the rate of product formation as a function of 
substrate concentration for ATCase (Figure 10.3). The curve differs from 
that expected for an enzyme that follows Michaelis-Menten kinetics. The 
observed curve is referred to as sigmoidal because it resembles the letter 
“S.” The vast majority of allosteric enzymes display sigmoidal kinetics. 
Recall from the discussion of hemoglobin that sigmoidal curves result from 
cooperation between subunits: the binding of substrate to one active site in 
a molecule increases the likelihood that substrate will bind to other active 
sites. To understand the basis of sigmoidal enzyme kinetics and inhibition 
by CTP, we need to examine the structure of ATCase. 


ATCase consists of separable catalytic and regulatory subunits 


What is the evidence that ATCase has distinct regulatory and catalytic sites? 
ATCase can be literally separated into regulatory (r) and catalytic (c) subunits 
by treatment witha mercurial compound suchas p-hydroxymercuribenzoate, 
which reacts with sulfhydryl groups (Figure 10.4). Ultracentrifugation 
(p. 74) following treatment with mercurials revealed that AT Case is com- 
posed of two kinds of subunits (Figure 10.5). The subunits can be readily 
separated by ion-exchange chromatography because they differ markedly 
in charge (p. 67) or by centrifugation in a sucrose density gradient 
because they differ in size (p. 67). These size differences are manifested 
in the sedimentation coefficients: that of the native enzyme is 11.65, 
whereas those of the dissociated subunits are 2.85 and 5.88. The attached 
p-mercuribenzoate groups can be removed from the separated subunits 
by adding an excess of mercaptoethanol, providing isolated subunits 
for study. 

The larger subunit is the catalytic subunit. This subunit has catalytic activity 
but displays the hyperbolic kinetics of Michaelis-Menten enzymes rather than 
sigmoidal kinetics. Furthermore, the isolated catalytic subunit is unresponsive 
to CTP. The isolated smaller subunit can bind CTP, but has no catalytic activ- 
ity. Hence, that subunit is the regulatory subunit. The catalytic subunit (c3) 
consists of three chains (34 kDa each), and the regulatory subunit (r,) consists 
of two chains (17 kDa each). The catalytic and regulatory subunits combine 
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FIGURE 10.2 CTP inhibits 

ATCase. Cytidine triphosphate, an end 
product of the pyrimidine-synthesis 
pathway, inhibits aspartate 
transcarbamoylase despite having little 
structural similarity to reactants or 
products. 
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FIGURE 10.3 ATCase displays sigmoidal 
kinetics. A plot of product formation as a 
function of substrate concentration 
produces a sigmoidal curve because the 
binding of substrate to one active site 
increases the activity at the other active 
sites. Thus, the enzyme shows 
cooperativity. 
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FIGURE 10.4 Modification of cysteine 
residues. p-Hydroxymercuribenzoate 
reacts with crucial cysteine residues in 
aspartate transcarbamoylase. 
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FIGURE 10.5 Ultracentrifugation studies 
of ATCase. Sedimentation velocity 
patterns of (A) native ATCase and (B) the 
enzyme after treatment with 
p-hydroxymercuribenzoate show that the 
enzyme can be dissociated into regulatory 
(r) and catalytic (c) subunits. [Data from 

J. C. Gerhart and H. K. Schachman, 
Biochemistry 4:1054-1062, 1965.] 


(A) Zinc _ 


r chain 


K FIGURE 10.6 Structure of 
ATCase. (A) The quaternary structure of 
aspartate transcarbamoylase as viewed 
from the top. The drawing in the center is a 
simplified representation of the relations 
between subunits. A single catalytic trimer 
[catalytic (c) chains, shown in yellow] is 
visible; in this view, the second trimer is 
hidden below the one visible. Notice that 
each r chain interacts with a c chain 
through the zinc domain. (B) A side view of 
the complex. [Drawn from 1RAI.pdb.] 
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rapidly when they are mixed. The resulting complex has the same structure, 
C6Ig, as the native enzyme: two catalytic trimers and three regulatory dimers. 


2 ¢3 el 3 12 — core 


Most strikingly, the reconstituted enzyme has the same allosteric and 
kinetic properties as those of the native enzyme. Thus, ATCase is composed 
of discrete catalytic and regulatory subunits, and the interaction of the sub- 
units in the native enzyme produces its regulatory and catalytic properties. The 
fact that the enzyme can be separated into isolated catalytic and regulatory 
subunits, which can be reconstituted back to the functional enzyme, allows 
for a variety of experiments to characterize the allosteric properties of the 
enzyme (Problems 33 and 34). 


Allosteric interactions in ATCase are mediated by large changes 
in quaternary structure 


What are the subunit interactions that account for the properties of ATCase? 
Significant clues have been provided by the three-dimensional structure of 
ATCase in various forms. Two catalytic trimers are stacked one on top of 
the other, linked by three dimers of the regulatory chains (Figure 10.6). 
There are significant contacts between the catalytic and the regulatory 
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subunits: each r chain within a regulatory dimer interacts with a c chain 
within a catalytic trimer. The c chain makes contact with a structural 
domain in the r chain that is stabilized by a zinc ion bound to four cysteine 
residues. The zinc ion is critical for the interaction of the r chain with the c 
chain. The mercurial compound p-hydroxymercuribenzoate is able to dis- 
sociate the catalytic and regulatory subunits because mercury binds strongly 
to the cysteine residues, displacing the zinc and preventing interaction with 
the c chain. 

To locate the active sites, the enzyme is crystallized in the presence of 
N-(phosphonacetyl)-L-aspartate (PALA), a bisubstrate analog (an analog 
of the two substrates) that resembles an intermediate along the pathway of 
catalysis (Figure 10.7). PALA is a potent competitive inhibitor of ATCase 
that binds to and blocks the active sites. The structure of the ATCase— 
PALA complex reveals that PALA binds at sites lying at the boundaries 
between pairs of c chains within a catalytic trimer (Figure 10.8). Each cata- 
lytic trimer contributes three active sites to the complete enzyme. Further 
examination of the ATCase-PALA complex reveals a remarkable change in 
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FIGURE 10.7 PALA, a bisubstrate 
analog. (Top) Nucleophilic attack by the 
amino group of aspartate on the carbony! 
carbon atom of carbamoyl phosphate 
generates an intermediate on the pathway 
to the formation of N-carbamoylaspartate. 
(Bottom) N-(Phosphonacetyl)-L-aspartate 
(PALA) is an analog of the reaction 
intermediate and a potent competitive 
inhibitor of aspartate transcarbamoylase. 


kK FIGURE 10.8 The active 
site of ATCase. Some of the 
crucial active-site residues are 
shown binding to the inhibitor 
PALA (shaded gray). Notice that 
the active site is composed 
mainly of residues from one c 
chain, but an adjacent c chain 
also contributes important 
residues (boxed in green). [Drawn 
from 8ATC.pdb.] 
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FIGURE 10.9 The T-to-R state transition 
in ATCase. Aspartate transcarbamoylase 
exists in two conformations: a compact, 
relatively inactive form called the tense (T) 
state and an expanded form called the 
relaxed (R) state. Notice that the structure 
of ATCase changes dramatically in the 
transition from the T state to the R State. 
PALA binding stabilizes the R state. 


J 


6A 
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quaternary structure on binding of PALA. The two catalytic trimers move 
12 A farther apart and rotate approximately 10 degrees about their common 
threefold axis of symmetry. Moreover, the regulatory dimers rotate approx- 
imately 15 degrees to accommodate this motion (Figure 10.9). The enzyme 
literally expands on PALA binding. In essence, ATCase has two distinct 
quaternary forms: one that predominates in the absence of substrate or sub- 
strate analogs and another that predominates when substrates or analogs are 
bound. We call these forms the T (for tense) state and the R (for relaxed) 
state, respectively, as we did for the two quaternary states of hemoglobin. 

How can we explain the enzyme’s sigmoidal kinetics in light of the struc- 
tural observations? Like hemoglobin, the enzyme exists in an equilibrium 
between the T state and the R state. 


R= T 


In the absence of substrate, almost all the enzyme molecules are in the T 
state because the T state is more stable than the R state. The ratio of the 
concentration of enzyme in the T state to that in the R state is called the 
allosterts contstant (L). For most allosteric enzymes, L is on the order of 10? 
to 10”. 


The T state has a low affinity for substrate and hence shows a low catalytic 
activity. The occasional binding of a substrate molecule to one active site in 
an enzyme increases the likelihood that the entire enzyme shifts to the R 
state with its higher binding affinity. The addition of more substrate has two 
effects. First, it increases the probability that each enzyme molecule will 
bind at least one substrate molecule. Second, it increases the average num- 
ber of substrate molecules bound to each enzyme. The presence of addi- 
tional substrate will increase the fraction of enzyme molecules in the more 
active R state because the position of the equilibrium depends on the number 
of active sites that are occupied by substrate. We considered this property, 
called cooperativity because the subunits cooperate with one another, when 
we discussed the sigmoidal oxygen-binding curve of hemoglobin. The 
effects of substrates on allosteric enzymes are referred to as homotropic 
effects (from the Greek homés, “same’”’). 

This mechanism for allosteric regulation is referred to as the concerted 
model because the change in the enzyme is “all or none”; the entire enzyme 
is converted from T into R, affecting all of the catalytic sites equally. In 
contrast, the sequential model assumes that the binding of ligand to one site 
on the complex can affect neighboring sites without causing all subunits 


FIGURE 10.10 Basis for the sigmoidal 

curve. The generation of the sigmoidal curve 
by the property of cooperativity can be 
understood by imagining an allosteric enzyme 
as a mixture of two Michaelis-Menten enzymes, 
one with a high value of Ky, that corresponds to 
the T state and another with a low value of Ky, 
that corresponds to the R state. As the 
concentration of substrate is increased, the 
equilibrium shifts from the T state to the R 
state, which results in a steep rise in activity 
with respect to substrate concentration. 


R-state curve 
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to undergo the T-to-R transition (p. 199). Although the concerted model 
explains the behavior of ATCase well, most other allosteric enzymes have 
features of both models. 

The sigmoidal curve for ATCase can be pictured as a composite of two 
Michaelis-Menten curves, one corresponding to the less-active T state and 
the other to the more-active R state. At low concentrations of substrate, the 
curve closely resembles that of the T state enzyme. As the substrate concen- 
tration is increased, the curve progressively shifts to resemble that of the R 
state enzyme (Figure 10.10). 

What is the biochemical advantage of sigmoidal kinetcs? Allosteric 
enzymes transition from a less active state to a more active state within a 
narrow range of substrate concentration. The benefit of this behavior is 
illustrated in Figure 10.11, which compares the kinetics of a Michaelis- 
Menten enzyme (blue curve) to that of an allosteric enzyme (red curve). In 
this example, the Michaelis-Menten enzyme requires an approximately 
27-fold increase in substrate concentration to increase V, from 0.1 V,,,,, to 
0.8 V..4- In contrast, the allosteric enzyme requires only about a 4-fold 
increase in substrate concentration to attain the same increase in velocity. 
The activity of allosteric enzymes is move sensitive to changes in substrate con- 
centration near Ky, than are Michaelis—Menten enzymes with the same V,,_,,. 
This sensitivity is called a threshold effect: below a certain substrate concen- 
tration, there is little enzyme activity. However, after the threshold has been 
reached, enzyme activity increases rapidly. In other words, much like an “on 
or off” switch, cooperativity ensures that most of the enzyme is either on 
(R state) or off (T state). The vast majority of allosteric enzymes display 
sigmoidal kinetics. 


Allosteric regulators modulate the T-to-R equilibrium 


We now turn our attention to the effects of pyrimidine nucleotides. 
As noted earlier, CTP inhibits the action of ATCase. X-ray studies of 
ATCase in the presence of CTP reveal (1) that the enzyme is in the T state 
when bound to CTP and (2) that a binding site for this nucleotide exists 
in each regulatory chain in a domain that does not interact with the cata- 
lytic subunit (Figure 10.12). Each active site is more than 50 A from the 
nearest CTP-binding site. The question naturally arises, How can CTP 
inhibit the catalytic activity of the enzyme when it does not interact with 
the catalytic chain? 

The quaternary structural changes observed on substrate-analog 
binding suggest a mechanism for inhibition by CTP (Figure 10.13). The 
binding of the inhibitor CTP to the T state shifts the T-to-R equilibrium in 
favor of the T state, decreasing net enzyme activity. CTP increases the 
allosteric coefficient from 200 in its absence to 1250 when all of the regu- 
latory sites are occupied by CTP. The binding of CTP makes it more 
difficult for substrate binding to convert the enzyme into the R state. 
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FIGURE 10.11 Allosteric enzymes 
display threshold effects. As the T-to-R 
transition occurs, the velocity increases 
over a narrower range of substrate 
concentration for an allosteric enzyme (red 
curve) than for a Michaelis-Menten enzyme 


(blue curve). 
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FIGURE 10.12 CTP stabilizes the T 
state. The binding of CTP to the regulatory 
subunit of aspartate transcarbamoylase 
stabilizes the T state. 
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FIGURE 10.13 The R state and the T 
state are in equilibrium. Even in the 
absence of any substrate or regulators, 
aspartate transcarbamoylase exists in 
equilibrium between the R and the T states. 
Under these conditions, the T state is 
favored by a factor of approximately 200. 
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FIGURE 10.14 Effect of CTP and ATP on 
ATCase kinetics. CTP stabilizes the 

T state of aspartate transcarbamoylase, 
making it more difficult for substrate binding 
to convert the enzyme into the R state. As 
a result, the curve is shifted to the right, as 
shown in red. ATP is an allosteric activator 
of aspartate transcarbamoylase because it 
stabilizes the R state, making it easier for 
substrate to bind. As a result, the curve is 
shifted to the left, as shown in blue. 
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Consequently, CTP increases the initial phase of the sigmoidal curve 
(Figure 10.14). More substrate is required to attain a given reaction rate. 
UTP, the immediate precursor to CTP, also regulates ATCase. While 
unable to inhibit the enzyme alone, UTP synergistically inhibits ATCase 
in the presence of CTP. 

Interestingly, ATP, too, is an allosteric effector of ATCase, binding to 
the same site as CTP. However, ATP binding stabilizes the R state, low- 
ering the allosteric coefficient from 200 to 70 and thus increasing the 
reaction rate at a given aspartate concentration (Figure 10.14). At high 
concentrations of ATP, the kinetic profile shows a less-pronounced 
sigmoidal behavior. Because ATP and CTP bind at the same site, high 
levels of ATP prevent CTP from inhibiting the enzyme. The effects of 
nonsubstrate molecules on allosteric enzymes (such as those of CTP and 
ATP on ATCase) are referred to as heterotropic effects (from the Greek 
héteros, “different”). Substrates generate the sigmoidal curve (homo- 
tropic effects), whereas regulators shift the K,, (heterotropic effects). 
Note, however, that both types of effect are generated by altering the 
T/R ratio. 

The increase in ATCase activity in response to increased ATP concen- 
tration has two potential physiological ramifications. First, high ATP con- 
centration signals a high concentration of purine nucleotides in the cell; the 
increase in ATCase activity will tend to balance the purine and pyrimidine 
pools. Second, a high concentration of ATP indicates that energy is avail- 
able for mRNA synthesis and DNA replication and leads to the synthesis of 
pyrimidines needed for these processes. 


10.2 lsozymes Provide a Means of Regulation 
Specific to Distinct Tissues and 
Developmental Stages 


Isozymes, or isoenzymes, are enzymes that differ in amino acid sequence 
yet catalyze the same reaction. Typically, these enzymes display different 
kinetic parameters, such as K,,, or respond to different regulatory mol- 
ecules. They are encoded by different genes, which usually arise through 
gene duplication and divergence. Isozymes can often be distinguished 
from one another by physical properties such as electrophoretic mobility. 
Isoform is a more generic term used when the protein in question is not 
an enzyme. 

The existence of isozymes permits the fine-tuning of metabolism to meet 
the needs of a given tissue or developmental stage. Consider the example of 
lactate dehydrogenase (LDH), an enzyme that catalyzes a step in anaerobic 
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glucose metabolism and glucose synthesis. Human beings have two isozy- 
mic polypeptide chains for this enzyme: the H isozyme is highly expressed 
in heart muscle and the M isozyme is expressed in skeletal muscle. The 
amino acid sequences are 75% identical. Each functional enzyme is tetra- 
meric, and many different combinations of the two isozymic polypeptide 
chains are possible. The H, isozyme, found in the heart, has a higher affin- 
ity for substrates than does the M, isozyme. The two isozymes also differ 
in that high levels of pyruvate allosterically inhibit the H, but not the M, 
isozyme. The other combinations, such as H,M, have intermediate 
properties. We will consider these isozymes in their biological context in 
Chapter 16. 

The M, isozyme functions optimally in the anaerobic environment 
of hard-working skeletal muscle, whereas the H, isozyme does so in the 
aerobic environment of heart muscle. Indeed, the proportions of these 
isozymes change throughout the development of the rat heart as the 
tissue switches from an anaerobic environment to an aerobic one 
(Figure 10.15A). Figure 10.15B shows the tissue-specific forms of 
lactate dehydrogenase in adult rat tissues. Essentially all of the enzymes 
that we will encounter in later chapters, including allosteric enzymes, exist 
in isozymic forms. 


e The appearance of some isozymes in the blood is a sign of tissue 
damage, useful for clinical diagnosis. For instance, an increase in 
serum levels of H, relative to H;M 1s an indication that a myocardial infarc- 
tion, or heart attack, has damaged heart-muscle cells, leading to the release 
of cellular material. 


10.3 Covalent Modification Is a Means of Regulating 
Enzyme Activity 


The covalent attachment of a molecule to an enzyme or protein can modify 
its activity. In these instances, a donor molecule provides the functional 
moiety being attached. Most modifications are reversible. Phosphorylation 
and dephosphorylation are common means of covalent modification. The 
attachment of acetyl groups to lysine residues by acetyltransferases and 
their removal by deacetylases are another example. Histones—proteins that 
are packaged with DNA into chromosomes—are extensively acetylated and 
deacetylated in vivo on lysine residues (Section 31.3). More heavily acety- 
lated histones are associated with genes that are being actively transcribed. 
Although protein acetylation was originally discovered as a modification of 
histones, we now know that it is a major means of regulation, with more 
than 2000 different proteins in mammalian cells regulated by acetylation. 


FIGURE 10.15 Isozymes of lactate 
dehydrogenase. (A) The rat heart lactate 
dehydrogenase (LDH) isozyme profile 
changes in the course of development. The 
H isozyme is represented by squares and 
the M isozyme by circles. The negative and 
positive numbers denote the days before 
and after birth, respectively. (B) LDH isozyme 
content varies by tissue. [(A) Data from 
W.-H. Li, Molecular Evolution (Sinauer, 1997), 
p. 283; (B) after K. Urich, Comparative Animal 
Biochemistry (Springer Verlag, 1990), p. 542.] 
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TABLE 10.1 Common covalent modifications of protein activity 


Example of 
Modification Donor molecule modified protein Protein function 
Phosphorylation ATP Glycogen Glucose homeostasis; 
phosphorylase energy transduction 
Acetylation Acetyl CoA Histones DNA packing; 
transcription 
Myristoylation Myristoyl CoA Src Signal transduction 
ADP ribosylation NADt RNA polymerase Transcription 
Farnesylation Farnesyl Ras Signal transduction 
pyrophosphate 
y-Carboxylation HCO,~ Thrombin Blood clotting 
Sulfation 3’-Phosphoadenosine- Fibrinogen Blood-clot formation 
5’-phosphosulfate 
Ubiquitination Ubiquitin Cyclin Control of cell cycle 


Protein acetylation appears to be especially important in the regulation 
of metabolism. The acetyltransferase and deacetylase enzymes are them- 
selves regulated by phosphorylation, showing that the covalent modifica- 
tion of a protein can be controlled by the covalent modification of the modi- 
fying enzymes. 

Modification is not readily reversible in some cases. The irreversible 
attachment of a lipid group causes some proteins in signal-transduction 
pathways, such as Ras (a GTPase) and 5rc (a protein tyrosine kinase), to 
become affixed to the cytoplasmic face of the plasma membrane. Fixed in 
this location, the proteins are better able to receive and transmit information 
that is being passed along their signaling pathways (Chapter 14). Mutations 
in both Ras and Src are seen in a wide array of cancers. The attachment of 
the small protein ubiquitin can signal that a protein is to be destroyed, the 
ultimate means of regulation (Chapter 23). The protein cyclin must be 
ubiquitinated and destroyed before a cell can enter anaphase and proceed 
through the cell cycle. 

Virtually all the metabolic processes that we will examine are regulated 
in part by covalent modification. Indeed, the allosteric properties of many 
enzymes are altered by covalent modification. Table 10.1 lists some of the 
common covalent modifications. 


Kinases and phosphatases control the extent of protein 
phosphorylation 


We will see phosphorylation used as a regulatory mechanism in virtually 
every metabolic process in eukaryotic cells. Indeed, as much as 30% of eukary- 
otic proteins are phosphorylated. The enzymes catalyzing phosphorylation 
reactions are called protein kinases. These enzymes constitute one of the 
largest protein families known: there are more than 500 homologous kinases 
in human beings. This multiplicity of enzymes allows regulation to be fine- 
tuned according to a specific tissue, time, or substrate. 

ATP is the most common donor of phosphoryl groups. The terminal 
(y) phosphoryl group of ATP is transferred to a specific amino acid of the 
acceptor protein or enzyme. In eukaryotes, the acceptor residue is com- 
monly one of the three containing a hydroxyl group in its side chain. 
Transfers to serine and threonine residues are handled by one class of 
protein kinases and to tyrosine residues by another. Tyrosine kinases, 
which are unique to multicellular organisms, play pivotal roles in growth 
regulation, and mutations in these enzymes are commonly observed in 
cancer cells. 
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Table 10.2 lists a few of the known serine and threonine protein kinases. 
The acceptors in protein-phosphorylation reactions are located inside cells, 
where the phosphoryl-group donor ATP is abundant. Proteins that are 
entirely extracellular are not regulated by reversible phosphorylation. 

Protein kinases vary in their degree of specificity. Dedicated protein 
kinases phosphorylate a single protein or several closely related ones. 
Multifunctional protein kinases modify many different targets; they have a 
wide reach and can coordinate diverse processes. Comparisons of amino 
acid sequences of many phosphorylation sites show that a multifunctional 
kinase recognizes related sequences. For example, the consensus sequence 
recognized by protein kinase A is Arg-Arg-X-Ser-Z or Arg-Arg-X-Thr-Z, 
in which X is a small residue, Z is a large hydrophobic one, and Ser or Thr 
is the site of phosphorylation. However, this sequence is not absolutely 
required. Lysine, for example, can substitute for one of the arginine residues 
but with some loss of affinity. Thus, the primary determinant of specificity is 
the amino acid sequence surrounding the serine or threonine phosphorylation 
site. However, distant residues can contribute to specificity. For instance, a 
change in protein conformation can open or close access to a possible phos- 
phorylation site. 

Protein phosphatases reverse the effects of kinases by catalyzing the 
removal of phosphoryl groups attached to proteins. The enzyme hydrolyzes 
the bond attaching the phosphoryl group. 


TABLE 10.2 Examples of serine and threonine kinases and their activating signals 


Signal Enzyme 
Cyclic nucleotides Cyclic AMP-dependent protein kinase 
Cyclic GMP-dependent protein kinase 
Cat and calmodulin Ca** —calmodulin protein kinase 
Phosphorylase kinase or glycogen synthase kinase 2 
AMP AMP-activated kinase 
Diacylglycerol Protein kinase C 


Metabolic intermediates and § Many target-specific enzymes, such as pyruvate dehydrogenase 
other “local” effectors kinase and branched-chain ketoacid dehydrogenase kinase 


Source: Information from D. Fell, Understanding the Control of Metabolism (Portland Press, 1997), Table 7.2. 
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The unmodified hydroxyl-containing side chain is regenerated and ortho- 
phosphate (P;) is produced. This family of enzymes, of which there are 
about 200 members in human beings, plays a vital role in cells because the 
enzymes turn off the signaling pathways that are activated by kinases. One 
class of highly conserved phosphatase called PP2A suppresses the cancer- 
promoting activity of certain kinases. 

Importantly, the phosphorylation and dephosphorylation reactions are 
not the reverse of one another; each is essentially irreversible under physi- 
ological conditions. Furthermore, both reactions take place at negligible 
rates in the absence of enzymes. Thus, phosphorylation of a protein sub- 
strate will take place only through the action of a specific protein kinase and 
at the expense of ATP cleavage, and dephosphorylation will take place only 
through the action of a phosphatase. The result is that target proteins cycle 
unidirectionally between unphosphorylated and phosphorylated forms. The 
rate of cycling between the phosphorylated and the dephosphorylated states 
depends on the relative activities of kinases and phosphatases. 


Phosphorylation is a highly effective means of regulating 
the activities of target proteins 


Phosphorylation is a common covalent modification of proteins in all forms 
of life, which leads to the question, What makes protein phosphorylation 
so valuable in regulating protein function that its use is ubiquitous? 
Phosphorylation is a highly effective means of controlling the activity of 
proteins for several reasons: 


1. The free energy of phosphorylation is large. Of the —50 kJ mol7! 
(—12 kcal mol~!) provided by ATP, about half is consumed in making 
phosphorylation irreversible; the other half is conserved in the phosphory- 
lated protein. A free-energy change of 5.69 kJ mol! (1.36 kcal mol~!) 
corresponds to a factor of 10 in an equilibrium constant. Hence, phosphory- 
lation can change the conformational equilibrium between different func- 
tional states by a large factor, of the order of 10+. In essence, the energy 
expenditure allows for a stark shift from one conformation to another. 


2. A phosphoryl group adds two negative charges to a modified protein. These 
new charges may disrupt electrostatic interactions in the unmodified pro- 
tein and allow new electrostatic interactions to be formed. Such structural 
changes can markedly alter substrate binding and catalytic activity. 


3. A phosphoryl group can form three or more hydrogen bonds. The tetrahe- 
dral geometry of a phosphoryl group makes these bonds highly directional, 
allowing for specific interactions with hydrogen-bond donors. 


4, Phosphorylation and dephosphorylation can take place in less than a second 
or over a span of hours. The kinetics can be adjusted to meet the timing 
needs of a physiological process. 


5. Phosphorylation often evokes highly amplified effects. A single activated 
kinase can phosphorylate hundreds of target proteins in a short interval. If 
the target protein is an enzyme, it can in turn transform a large number of 
substrate molecules. 


6. ATP is the cellular energy currency (Chapter 15). The use of this com- 
pound as a phosphoryl-group donor links the energy status of the cell to the 
regulation of metabolism. 


Cyclic AMP activates protein kinase A by altering the 
quaternary structure 


Let us examine a specific protein kinase that helps animals cope with stress- 
ful situations. The “flight or fight” response is common to many animals 
presented with a dangerous or exciting situation. Muscle becomes primed 
for action. This priming is the result of the activity of a particular protein 
kinase. In this case, the hormone epinephrine (adrenaline) triggers the for- 
mation of cyclic AMP (cAMP), an intracellular messenger formed by the 
cyclization of ATP. Cyclic AMP subsequently activates a key enzyme: 
protein kinase A (PKA). The kinase alters the activities of target proteins by 
phosphorylating specific serine or threonine residues. The striking finding 
is that most effects of cAMP in eukaryotic cells are achieved through the 
activation of PKA by cAMP. 

PKA provides a clear example of the integration of allosteric regulation 
and phosphorylation. PKA is activated by cAMP concentrations near 10 nM. 
The quaternary structure is reminiscent of that of ATCase. Like that 
enzyme, PKA in muscle consists of two kinds of subunits: a 49-kDa regula- 
tory (R) subunit and a 38-kDa catalytic (C) subunit. In the absence of 
cAMP, the regulatory and catalytic subunits form an R,C,, complex that is 
enzymatically inactive (Figure 10.16). The binding of two molecules of 
cAMP to each of the regulatory subunits leads to the dissociation of RC, 
into an R, subunit and two C subunits. These free catalytic subunits are 
enzymatically active. Thus, the binding of cAMP to the regulatory subunit 
relieves its inhibition of the catalytic subunit. PKA, like most other kinases, 
exists in isozymic forms for fine-tuning regulation to meet the needs of a 
specific cell or developmental stage. In mammals, four isoforms of the R 
subunit and three of the C subunit are encoded in the genome. 

How does the binding of cAMP activate the kinase? Each R chain con- 
tains the sequence Arg-Arg-Gly-Ala-Ile, which matches the consensus 
sequence for phosphorylation except for the presence of alanine in place of 
serine. In the R,C, complex, this pseudosubstrate sequence of R occupies the 
catalytic site of C, thereby preventing the entry of protein substrates 
(Figure 10.16). The binding of cAMP to the R chains allosterically moves 
the pseudosubstrate sequences out of the catalytic sites. The released C 
chains are then free to bind and phosphorylate substrate proteins. 
Interestingly, the cAMP-binding domain of the R subunit is highly conserved 
and found in all organisms. 
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FIGURE 10.16 
Regulation of protein 
kinase A. The binding of 
four molecules of cAMP 
activates protein kinase 
A by dissociating the 
inhibited holoenzyme 
(R,C,) into a regulatory 
subunit (R5) and two 
catalytically active 
subunits (C). Each R 
chain includes 
cAMP-binding domains 
and a pseudosubstrate 
sequence. 


FIGURE 10.17 Protein kinase A bound to 
an inhibitor. This ribbon model shows a 
complex of the catalytic subunit of protein 
kinase A with an inhibitor (yellow) bearing a 
pseudosubstrate sequence. Notice that the 
inhibitor binds to the active site, a cleft 
between the domains of the enzyme. The 
bound ATP (purple)-Mn2* (green) is in the 
active site adjacent to the site to which the 
inhibitor is bound. [Drawn from 1ATP.pdb.] 
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ATP and the target protein bind to a deep cleft in the catalytic subunit 
of protein kinase A 


WY X-ray crystallography reveals the three-dimensional structure of the 
catalytic subunit of PKA bound to ATP and a 20-residue peptide 
inhibitor. The 350-residue catalytic subunit of PKA has two lobes, an 
amino terminal N-lobe and a carboxyl terminal C-lobe (Figure 10.17). 
ATP and part of the inhibitor fill a deep cleft between the lobes. The 
N-lobe makes many contacts with ATP—Mn?", whereas the C-lobe binds 
the peptide and contributes the key catalytic residues. As with other 
kinases, the two lobes move closer to one another on substrate binding; 
mechanisms that restrict this domain closure provide a means of regulat- 
ing protein kinase activity. The PKA structure has broad significance 
because residues 40 to 280 constitute a conserved catalytic core, called the 
kinase fold, that is common to essentially all known protein kinases. We see 
here an example of a successful biochemical solution to a problem (in this 
case, protein phosphorylation) being employed many times in the course 
of evolution. 

The bound peptide in this crystal occupies the active site because it 
contains the pseudosubstrate sequence Arg-Arg-Asn-Ala-lle 
(Figure 10.18). The structure of the complex reveals the interactions by 
which the enzyme recognizes the consensus sequence. The guanidinium 
group of the first arginine residue forms an ion pair with the carboxylate 
side chain of a glutamate residue (Glu 127) of the enzyme. The second 
arginine likewise interacts with two other carboxylate groups. The 
nonpolar side chain of isoleucine, which matches Z in the consensus 
sequence (p. 295), fits snugly in a hydrophobic groove formed by two 
leucine residues of the enzyme. 
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FIGURE 10.18 Binding of pseudosubstrate to protein kinase A. Notice that the 
inhibitor makes multiple contacts with the enzyme. The two arginine side chains of the 
pseudosubstrate ionically interact with three glutamate carboxylate groups (green dashed 
lines). Hydrophobic interactions also are important in the recognition of substrate. The 
isoleucine residue of the pseudosubstrate is in contact with a pair of leucine residues of 
the enzyme. 


10.4 Many Enzymes are Activated by Specific 
Proteolytic Cleavage 


We turn now to a different mechanism of enzyme regulation. Many enzymes 
acquire full enzymatic activity as they spontaneously fold into their character- 
istic three-dimensional forms. In contrast, the folded forms of other enzymes 
are inactive until the cleavage of one or a few specific peptide bonds. The inac- 
tive precursor is called a zymogen or a proenzyme. An energy source such as 
ATP is not needed for cleavage. Therefore, in contrast with reversible regula- 
tion by phosphorylation, even proteins located outside cells can be activated 
by this means. Another noteworthy difference is that proteolytic activation, 
in contrast with allosteric control and reversible covalent modification, is 
irreversible, taking place just once in the life of an enzyme molecule. 

Specific proteolysis is a common means of activating enzymes and other 
proteins in biological systems. For example: 


1. The digestive enzymes that hydrolyze foodstuffs are synthesized as zymo- 
gens in the stomach and pancreas (Table 10.3). 


TABLE 10.3 Gastric and pancreatic zymogens 


Site of synthesis Zymogen Active enzyme 
Stomach Pepsinogen Pepsin 

Pancreas Chymotrypsinogen Chymotrypsin 
Pancreas Trypsinogen Trypsin 

Pancreas Procarboxypeptidase Carboxypeptidase 


2. Blood clotting is mediated by a cascade of proteolytic activations that 
ensures a rapid and amplified response to trauma. 


3. Some protein hormones are synthesized as inactive precursors. For exam- 
ple, insulin is derived from proinsulin by proteolytic removal of a peptide. 


4. The fibrous protein collagen, the major constituent of skin and bone, is 
derived from procollagen, a soluble precursor. 


5. Many developmental processes are controlled by the activation of zymogens. 
For example, in the metamorphosis of a tadpole into a frog, large amounts of 
collagen are resorbed from the tail in the course of a few days. Likewise, much 
collagen is broken down in a mammalian uterus after delivery. The conversion 
of procollagenase into collagenase, the active protease responsible for collagen 
breakdown, is precisely timed in these remodeling processes. 


6. Programmed cell death, or apoptosis, is mediated by proteolytic enzymes 
called caspases, which are synthesized in precursor form as procaspases. 
When activated by various signals, caspases function to cause cell death in 
most organisms, ranging from C. elegans to human beings. Apoptosis 
provides a means of sculpting the shapes of body parts in the course of 
development and a means of eliminating damaged or infected cells. 


We next examine the activation and control of zymogens, using enzymes 
responsible for digestion and blood-clot formation as examples. 


Chymotrypsinogen is activated by specific cleavage of a single 
peptide bond 


Chymotrypsin is a digestive enzyme that hydrolyzes proteins. Chymotrypsin, 
whose mechanism of action was described in detail in Chapter 9, specifically 
cleaves peptide bonds on the carboxy] side of amino acid residues with large, 
hydrophobic R groups (Table 8.6). Its inactive precursor, chymotrypsinogen, 
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FIGURE 10.20 Proteolytic activation of 
chymotrypsinogen. The three chains of 
a-chymotrypsin are linked by two interchain 
disulfide bonds (A to B, and B to C). The 
approximate positions of disulfide bonds 
are shown. 


Ribosomes attached to 
endoplasmic reticulum 


Zymogen 
granule 


FIGURE 10.19 Secretion of zymogens by an 
acinar cell of the pancreas. Zymogens are 
synthesized on ribosomes attached to the 
endoplasmic reticulum. They are subsequently 
processed in the Golgi apparatus and 
packaged into zymogen or secretory granules. 
With the proper signal, the granules fuse with 
the plasma membrane, discharging their 
contents into the lumen of the pancreatic 
ducts. Cell cytoplasm is depicted as pale 
green. Membranes and lumen are shown as 
dark green. 


Lumen 
is synthesized in the pancreas, as are several other zymogens and digestive 
enzymes. Indeed, the pancreas is one of the most active organs in synthesizing 
and secreting proteins. The enzymes and zymogens are synthesized in the 
acinar cells of the pancreas and stored inside membrane-bounded granules 
(Figure 10.19). The zymogen granules accumulate at the apex of the acinar 
cell; when the cell is stimulated by a hormonal signal or a nerve impulse, the 
contents of the granules are released into a duct leading into the duodenum. 

Chymotrypsinogen, a single polypeptide chain consisting of 245 amino 
acid residues, is virtually devoid of enzymatic activity. It is converted into a 
fully active enzyme when the peptide bond joining arginine 15 and isoleu- 
cine 16 is cleaved by trypsin (Figure 10.20). The resulting active enzyme, 
called -chymotrypsin, then acts on other 7-chymotrypsin molecules by 
removing two dipeptides to yield a-chymotrypsin, the stable form of the 
enzyme. The three resulting chains in a-chymotrypsin remain linked to one 
another by two interchain disulfide bonds. The striking feature of this acti- 
vation process is that cleavage of a single specific peptide bond transforms the 
protein from a catalytically inactive form into one that is fully active. 


Proteolytic activation of chymotrypsinogen leads to the 
formation of a substrate-binding site 


How does cleavage of a single peptide bond activate the zymogen? The 
cleavage of the peptide bond between amino acids 15 and 16 triggers key 
conformational changes, which were revealed by the elucidation of the 
three-dimensional structure of chymotrypsinogen. 


1. The newly formed amino-terminal group of isoleucine 16 turns inward and 
forms an ionic bond with aspartate 194 in the interior of the chymotrypsin 
molecule (Figure 10.21). 


2. This electrostatic interaction triggers a number of conformational changes. 
Methionine 192 moves from a deeply buried position in the zymogen to the 
surface of the active enzyme, and residues 187 and 193 move farther apart from 


each other. These changes result in the formation of the substrate- 
specificity site for aromatic and bulky nonpolar groups. One side of 
this site is made up of residues 189 through 192. This cavity for 
binding part of the substrate is not fully formed in the zymogen. 


3. The tetrahedral transition state generated by chymotrypsin has 
an oxyanion (a negatively charged carbonyl oxygen atom) that is 
stabilized by hydrogen bonds with two NH groups of the main chain 
of the enzyme (Figure 9.9). One of these NH groups is not appropri- 
ately located in chymotrypsinogen, and so the site stabilizing the 
oxyanion (the oxyanion hole, p. 257) is incomplete in the zymogen. 


4. The conformational changes elsewhere in the molecule are 
very small. Thus, the switching on of enzymatic activity in a protein 
can be accomplished by discrete, highly localized conformational 
changes that are triggered by the hydrolysis of a single peptide bond. 


The generation of trypsin from trypsinogen leads to the 
activation of other zymogens 


The structural changes accompanying the activation of trypsinogen, 
the precursor of the proteolytic enzyme trypsin, are different 
from those in the activation of chymotrypsinogen. Four regions 
of the polypeptide are very flexible in the zymogen, whereas they 
have a well-defined conformation in trypsin. The resulting struc- 
tural changes also complete the formation of the oxyanion hole. 
The digestion of proteins and other molecules in the duodenum 
requires the concurrent action of several enzymes, because each is 
specific for a limited number of side chains. Thus, the zymogens 
must be switched on at the same time. Coordinated control is 
achieved by the action of trypsin as the common activator of all the 
pancreatic zymogens—trypsinogen, chymotrypsinogen, proelastase, 
procarboxypeptidase, and prolipase, the inactive precursor of a lipid- 
degrading enzyme. To produce active trypsin, the cells that line the 
duodenum display a membrane-embedded enzyme, enteropeptidase, 
which hydrolyzes a unique lysine-isoleucine peptide bond in 
trypsinogen as the zymogen enters the duodenum from the pancreas. 
The small amount of trypsin produced in this way activates more 
trypsinogen and the other zymogens (Figure 10.22). Thus, the 
formation of trypsin by enteropeptidase is the master activation step. 
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FIGURE 10.22 Zymogen activation by proteolytic cleavage. Enteropeptidase initiates the 
activation of the pancreatic zymogens by activating trypsin, which then activates other 


zymogens. Active enzymes are shown in yellow; zymogens are shown in orange. 
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FIGURE 10.21 Conformations of 
chymotrypsinogen and 

chymotrypsin. The electrostatic interaction 
between the a-amino group of isoleucine 
16 and the carboxylate of aspartate 194, 
essential for the structure of active 
chymotrypsin, is possible only only after 
cleavage of the peptide bond between 
isoleucine and arginine in 
chymotrypsinogen. [Information from Gregory 
A. Petsko and Dagmar Ringe, Protein 
Structure and Function (Sinauer, 2009), p. 
3-16, Figure 3-31.] 
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FIGURE 10.23 Interaction of trypsin with 
its inhibitor. Structure of a complex of 
trypsin (yellow) and pancreatic trypsin 
inhibitor (red). Notice that lysine 15 of the 
inhibitor penetrates into the active site of 
the enzyme. There it forms an ionic bond 
with aspartate 189 in the active site. Also 
notice that bound inhibitor and the free 
inhibitor are almost identical in 

structure. [Drawn from 1BPI.pdb.] 


Some proteolytic enzymes have specific inhibitors 


The conversion of a zymogen into a protease by cleavage of a single peptide 
bond is a precise means of switching on enzymatic activity. However, this acti- 
vation step is irreversible, and so a different mechanism is needed to terminate 
proteolysis. Specific protease inhibitors accomplish this task. Serpins, serine 
protease inhibitors, are an example of one such family of inhibitors. For exam- 
ple, pancreatic trypsin inhibitor, a 6-kDa protein, inhibits trypsin by binding 
very tightly to its active site. The dissociation constant of the complex is 0.1 
pM, which corresponds to a standard free energy of binding of about —75 kJ 
mol”! (—18 kcal mol~'). In contrast with nearly all known protein assemblies, 
this complex is not dissociated into its constituent chains by treatment with 
denaturing agents such as 8 M urea or 6 M guanidine hydrochloride (p. 47). 
The reason for the exceptional stability of the complex is that pancre- 
atic trypsin inhibitor is a very effective substrate analog. X-ray analyses 
show that the inhibitor lies in the active site of the enzyme, positioned such 
that the side chain of lysine 15 of this inhibitor interacts with the aspartate 
side chain in the specificity pocket of trypsin. In addition, there are many 
hydrogen bonds between the main chain of trypsin and that of its inhibitor. 
Furthermore, the carbonyl group of lysine 15 and the surrounding atoms of 
the inhibitor fit snugly in the active site of the enzyme. Comparison of the 
structure of the inhibitor bound to the enzyme with that of the free inhibitor 
reveals that the structure is essentially unchanged on binding to the enzyme 
(Figure 10.23). Thus, the inhibitor is preorganized into a structure that is 
highly complementary to the enzyme’s active site. Indeed, the peptide bond 
between lysine 15 and alanine 16 in pancreatic trypsin inhibitor is cleaved 
but at a very slow rate: the half-life of the trypsin—inhibitor complex is sev- 
eral months. In essence, the inhibitor is a substrate, but its intrinsic struc- 
ture is so nicely complementary to the enzyme’s active site that it binds very 
tightly, rarely progressing to the transition state and is turned over slowly. 


Trypsin-pancreatic trypsin Free pancreatic 
inhibitor complex trypsin inhibitor 


e The amount of trypsin is much greater than the amount of inhibitor. 
Why does trypsin inhibitor exist? Recall that trypsin activates other 
zymogens. Consequently, the prevention of even small amounts of trypsin 
from initiating the cascade while the zymogens are still in the pancreas or 


pancreatic ducts is vital. Trypsin inhibitor binds to any prematurely activated 303 
trypsin molecules in the pancreas or pancreatic ducts. This inhibition prevents 10.4 Activation by Proteolytic Cleavage 
severe damage to those tissues, which could lead to acute pancreatitis. 


e Pancreatic trypsin inhibitor is not the only important protease 

inhibitor. A 53-kDa plasma protein, @,-antitrypsin (also called 

a@-antiproteinase), protects tissues from digestion by elastase, a secretory 

product of neutrophils (white blood cells that engulf bacteria). Antielastase 

would be a more accurate name for this inhibitor, because it blocks 

elastase much more effectively than it blocks trypsin. Like pancreatic 

trypsin inhibitor, @,-antitrypsin blocks the action of target enzymes by 

binding nearly irreversibly to their active sites. Genetic disorders leading 

to a deficiency of @,-antitrypsin illustrate the physiological importance 

of this inhibitor. For example, the substitution of lysine for glutamate at FIGURE 10.24 Oxidation of methionine 

residue 53 in the type Z mutant slows the secretion of this inhibitor from to methionine sulfoxide. 

liver cells. Serum levels of the inhibitor are about 15% of normal in 

people homozygous for this defect. The consequence is that excess elas- 

tase destroys alveolar walls in the lungs by digesting elastic fibers and 

other connective-tissue proteins. 

The resulting clinical condition is called emphysema 

(also known as chronic obstructive pulmonary disease nqpinsic PATHWAY 

[COPD]). People with emphysema must breathe much 

harder than normal people to exchange the same voll- Damaged surface 

ume of air because their alveoli are much less resilient | 

than normal. Cigarette smoking markedly increases the ininoaeh 

likelihood that even a type Z heterozygote will develop Kallikrein 

emphysema. The reason is that smoke oxidizes methio- . 

nine 358 of the inhibitor (Figure 10.24), a residue essen- al Xlle 

; oe ; a eee A EXTRINSIC PATHWAY 

tial for binding elastase. Indeed, this methionine side xl x, 

chain is the bait that selectively traps elastase. The a 

methionine sulfoxide oxidation product, in contrast, does IX WX 

not lure elastase, a striking consequence of the insertion 

of just one oxygen atom into a protein and a remarkable “VIIa 
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example of the effect of human behavior on bioche- 
mistry. We will consider another protease inhibitor, Xx i) Xx 
antithrombin III, when we examine the control of blood 

clotting. ‘Na 


Blood clotting is accomplished by a cascade ay 
of zymogen activations 


. : Bi FINAL Fibrinogen 
e Enzymatic cascades are often employed in bio SGMNMON () 


chemical systems to achieve a rapid response. In PATHWAY 
a cascade, an initial signal institutes a series of steps, 
each of which is catalyzed by an enzyme. At each step, _ ree 
the signal is amplified. For instance, ifa signal molecule by thrombin 
activates an enzyme that in turn activates 10 enzymes 
and each of the 10 enzymes in turn activates 10 addi- aay is 

‘ oon . ‘ interplay of the intrinsic, extrinsic, and final common pathways. The 

tional enzymes, after four steps the original signal will intrinsic pathway begins with the activation of factor XII (Hageman 
have been amplified 10,000-fold. Hemostasis, the pro- factor) by contact with abnormal surfaces produced by injury. The 
cess of blood clot formation and dissolution, requires a _ extrinsic pathway is triggered by trauma, which releases tissue factor 
cascade of zymogen activations: the activated form of (TF). TF forms a complex with VII, which initiates a cascade-activating 
one clotting factor catalyzes the activation of the next thrombin. Inactive forms of clotting factors are shown in red; their 


. eee activated counterparts (indicated by the subscript “a”) are in yellow. 
(Figure 10.25). Thus, Very small amounts of the initial Stimulatory proteins that are not themselves enzymes are shown in 


factors suffice to trigger the cascade, ensuring a rapid blue boxes. A striking feature of this process is that the activated form 
response to trauma. of one clotting factor catalyzes the activation of the next factor. 


FIGURE 10.25 Blood-clotting cascade. A fibrin clot is formed by the 


Kringle pastry 


FIGURE 10.27 The calcium-binding 
region of prothrombin. Prothrombin binds 
calcium ions with the modified amino acid 
y-carboxyglutamate (red). [Drawn from 
2PF2.pdb_] 
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Two means of initiating blood clotting have been described, the intrinsic 
pathway and the extrinsic pathway. The intrinsic clotting pathway is acti- 
vated by exposure of anionic surfaces upon rupture of the endothelial lining 
of the blood vessels. The extrinsic pathway, which appears to be most cru- 
cial in blood clotting, is initiated when trauma exposes tissue factor (TF), an 
integral membrane glycoprotein. Upon exposure to the blood, tissue factor 
binds to factor VII to activate factor X. Both the intrinsic and extrinsic path- 
ways lead to the activation of factor X (a serine protease), which in turn 
converts prothrombin into thrombin, the key protease in clotting. Thrombin 
then amplifies the clotting process by activating enzymes and factors that 
lead to the generation of yet more thrombin, an example of positive feed- 
back. Note that the active forms of the clotting factors are designated with a 
subscript “a,” whereas factors that are activated by thrombin are designated 
with an asterisk. 


Prothrombin requires a vitamin K-dependent modification 
for activation 


Thrombin is synthesized as a zymogen called prothrombin. The inactive 
molecule comprises four major domains, with the serine protease domain at 
its carboxyl terminus (Figure 10.26). The first domain, called the gla 
domain, is rich in y carboxyglutamate residues (abbreviation gla), and the 
second and third domains are called kringle domains (named after a Danish 
pastry that they resemble). Vitamin K is required for the synthesis of y car- 
boxyglutamate, a strong chelator of Ca**. These three domains work in 
concert to keep prothrombin in an inactive form. Moreover, because it is 
rich in yy carboxyglutamate, the gla domain is able to bind Ca** 
(Figure 10.27). What is the effect of this binding? The binding of Ca?+ by 
prothrombin anchors the zymogen to phospholipid membranes derived 
from blood platelets after injury. This binding is crucial because it brings 
prothrombin into close proximity to two clotting proteins, factor X, and 
factor V,, (a stimulatory protein), that catalyze its conversion into thrombin. 
Activation is begun by proteolytic cleavage of the bond between arginine 
274 and threonine 275 to release a fragment containing the first three 
domains. Cleavage of the bond between arginine 323 and isoleucine 324 
yields active thrombin. 


Cleavage sites 


FIGURE 10.26 Modular structure of prothrombin. Cleavage of two peptide bonds yields 
thrombin. All the y-carboxyglutamate residues are in the gla domain. 


Fibrinogen is converted by thrombin into a fibrin clot 


The best-characterized part of the clotting process is the final step in the 
cascade: the conversion of fibrinogen into fibrin by thrombin. Fibrinogen 
is made up of three globular units connected by two rods (Figure 10.28). 
This 340-kDa protein consists of six chains: two each of Aa, BB, and y. 
The rod regions are triple-stranded a-helical coiled coils, a recurring 
motif in proteins (Section 2.3). Thrombin cleaves four arginine—glycine 
peptide bonds in the central globular region of fibrinogen (p. 217). On 
cleavage, an A peptide of 18 residues is released from each of the two Aa 
chains, as is a B peptide of 20 residues from each of the two BB chains. 
These A and B peptides are called fibrinopeptides. A fibrinogen molecule 
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FIGURE 10.28 Structure of a fibrinogen 
molecule. (A) A ribbon diagram. The two 
rod regions are a-helical coiled coils, 
connected to a globular region at each 
end. The structure of the central globular 
region has not been determined. (B) A 
schematic representation showing the 
Globular unit positions of the fibrinopeptides A and 

B. [Part A drawn from 1DEQ.pdb.] 


(B) 


devoid of these fibrinopeptides is called a fibrin monomer and has the sub- 
unit structure (@By),. 

Fibrin monomers spontaneously assemble into ordered 
fibrous arrays called fibrin. Electron micrographs and 
low-angle x-ray patterns show that fibrin has a periodic 
structure that repeats every 23 nm (Figure 10.29). Higher- 
resolution images reveal how the removal of the fibrino- 
peptides permits the fibrin monomers to come together to 
form fibrin. The homologous £ and y chains have globular 
domains at the carboxyl-terminal ends (Figure 10.30). 
These domains have binding “holes” that interact with eC up 

: ss “c along the fiber axis is half the length of a fibrinogen molecule. [From 
peptides. The B domain ” specific for ne BEnRES of the John L. Woodhead et al., “The Ultrastructure of Fibrinogen Caracas || 
form H,N*-Gly-His-Arg-, whereas the y domain binds Molecules, Fibers, and Clots,” J. Biol. Chem. 271(9):4946-4953, 1996, 
H,N*-Gly-Pro-Arg-. Exactly these sequences (some- Mar 1. © American Society for Biochemistry and Molecular Biology] 
times called “knobs”) are exposed at the amino-terminal 
ends of the B and a chains, respectively, on thrombin cleavage. The knobs 
of the a subunits fit into the holes on the y subunits of another monomer to 
form a protofibril. This protofibril is extended when the knobs of the B 
subunits fit into the holes of B subunits of other protofibrils. Thus, analo- 
gous to the activation of chymotrypsinogen, peptide-bond cleavage exposes 
new amino termini that can participate in specific interactions. The newly 
formed “soft clot” is stabilized by the formation of amide bonds between 
the side chains of lysine and glutamine residues in different monomers. 


FIGURE 10.29 Electron micrograph of fibrin. The 23-nm period 


= J Fibrinopeptides 
u 


r 
Gly-His-Arg sequences o. 
/ i Polymerization 
Thrombin @ 
@ 


V 


Gly-Pro-Arg sequences 


FIGURE 10.30 Formation of a fibrin clot. (1) Thrombin cleaves fibrinopeptides A and B 
from the central globule of fibrinogen. (2) Globular domains at the carboxyl-terminal ends 
of the B and y chains interact with “knobs” exposed at the amino-terminal ends of the B 
and y chains to form clots. 
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FIGURE 10.31 Structures of vitamin K 
and two antagonists, dicoumarol and 
warfarin. 
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This cross-linking reaction is catalyzed by transglutaminase (factor 
XIII), which itself is activated from the protransglutaminase form by 
thrombin. 


Vitamin K is required for the formation of -y-carboxyglutamate 


Vitamin K (Figure 10.31) has been known for many years to be essen- 

tial for the synthesis of prothrombin and several other clotting 
factors. Indeed, it is called vitamin K because a deficiency in this vitamin 
results in defective blood koagulation (Scandinavian spelling). After inges- 
tion, vitamin K is reduced to a dihydro derivative that is required by 
y-glutamyl carboxylase to convert the first 10 glutamate residues in the amino- 
terminal region of prothrombin into y-carboxyglutamate (Figure 10.32). 


Glutamate residue CO2 y-Carboxyglutamate residue 
O2 
CH; y-Glutamyl carboxylase CH; 
oO 
. Sy R 
OH <— x O 
ae Quinone Epoxide 
Vitamin K reductase reductase Vitamin K 
(hydroquinone) (epoxide) 


X = Proposed site of warfarin 
inhibition 


FIGURE 10.32 Synthesis of y-carboxyglutamate by y-glutamyl carboxylase. The 
formation of -y-carboxyglutamate requires the hydroquinone derivative of vitamin K, which is 
regenerated from the epoxide derivative by the sequential action of epoxide reductase and 
quinone reductase, both of which are inhibited by warfarin. 


Recall that y-carboxyglutamate, a strong chelator of Ca?*, is required for 
the activation of prothrombin (p. 304). Dicoumarol, which is found in 
spoiled sweet clover, causes a fatal hemorrhagic disease in cattle fed on this 
hay. Cows fed dicoumarol synthesize an abnormal prothrombin that does 
not bind Ca?*, in contrast with normal prothrombin. Dicoumarol was the 
first anticoagulant used to prevent thromboses in patients prone to clot for- 
mation. However, it is seldom used now because of poor absorption and 
gastrointestinal side effects. Warfarin, another vitamin K antagonist, is 
commonly administered as an anticoagulant. Warfarin inhibits the keto 
reductase and quinone reductase that are required to regenerate the dihydro 
derivative of vitamin K (Figure 10.32). Dicoumarol, warfarin, and their 
chemical derivatives serve as effective rat poisons. 


The clotting process must be precisely regulated 


There is a fine line between hemorrhage and thrombosis, the formation 
of blood clots in blood vessels. Clots must form rapidly yet remain 
confined to the area of injury. What are the mechanisms that normally 
limit clot formation to the site of injury? The lability of clotting factors 
contributes significantly to the control of clotting. Activated factors are 
short-lived because they are diluted by blood flow, removed by the liver, 
and degraded by proteases. For example, the stimulatory protein factors 
Vand VIII, are digested by protein C, a protease that is switched on by 
the action of thrombin. Thus, thrombin has a dual function: it catalyzes 
the formation of fibrin and it initiates the deactivation of the clotting 
cascade. 

Specific inhibitors of clotting factors are also critical in the termination 
of clotting. For instance, tissue factor pathway inhibitor (TFPI) inhibits the 
complex of TF—VII,—X, that activates thrombin. Another key inhibitor is 
antithrombin III, a member of the serpin family of protease inhibitors 
(p. 302) that forms an irreversible inhibitory complex with thrombin. 
Antithrombin III resembles a, -antitrypsin except that it inhibits thrombin 
much more strongly than it inhibits elastase (Figure 10.23). Antithrombin 
III also blocks other serine proteases in the clotting cascade—namely, fac- 
tors XII ie XL, IX. and Ks The inhibitory action of antithrombin III is 
enhanced by heparin, a negatively charged polysaccharide (Section 11.3) 
found in mast cells near the walls of blood vessels and on the surfaces of 
endothelial cells (Figure 10.33). Heparin acts as an anticoagulant by increas- 
ing the rate of formation of irreversible complexes between antithrombin III 
and the serine protease clotting factors. 


e The importance of the ratio of thrombin to antithrombin is 
illustrated in the case of a 14-year-old boy who died of a bleeding 
disorder because of a mutation in his @,-antitrypsin, which normally 
inhibits elastase. Methionine 358 in @,-antitrypsin’s binding pocket for 
elastase was replaced by arginine, resulting in a change in specificity 
from an elastase inhibitor to a thrombin inhibitor. Activity of @,-anti- 
trypsin normally increases markedly after injury to counteract excess 
elastase arising from stimulated neutrophils. The mutant a,-antitrypsin 
caused the patient’s thrombin activity to drop to such a low level that 
hemorrhage ensued. We see here a striking example of how a change of a 
single residue in a protein can dramatically alter specificity and an example 
of the critical importance of having the right amount of a protease 
inhibitor. 

Antithrombin limits the extent of clot formation, but what happens to 
the clots themselves? Clots are not permanent structures but are designed to 


An account of a hemorrhagic disposition 
existing in certain families 

“About seventy or eighty years ago, a woman 
by the name of Smith settled in the vicinity of 
Plymouth, New Hampshire, and transmitted 
the following idiosyncrasy to her 
descendants. It is one, she observed, to 
which her family is unfortunately subject and 
has been the source not only of great 
solicitude, but frequently the cause of death. 
If the least scratch is made on the skin of 
some of them, as mortal a hemorrhage will 
eventually ensue as if the largest wound is 
inflicted. . . . It is a surprising circumstance 
that the males only are subject to this strange 
affection, and that all of them are not liable to 
it... . Although the females are exempt, they 
are still capable of transmitting it to their male 
children.” 


John Otto (1803) 


FIGURE 10.33 Electron micrograph of a 
mast cell. Heparin and other molecules in 
the dense granules are released into the 
extracellular space when the cell is 
triggered to secrete. [Courtesy of Lynne 
Mercer.] 
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FIGURE 10.35 The effect of tissue-type 
plasminogen activator. Angiographic 
images demonstrate the effect of TPA 
administration. The top left image shows an 
occluded cerebral artery (arrow) prior to 
TPA injection. The middle image indicates 
the site of injection. The lower right image, 
made several hours after injection, reveals 
the restoration of blood flow to the cerebral 
artery. [Medical Body Scans/Science Source.] 
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FIGURE 10.36 Action of antihemophilic 
factor. Antihemophilic factor (Factor VIII) 
stimulates the activation of factor X by factor 
X,. Interestingly, the activity of factor VIII is 
markedly increased by limited proteolysis by 
thrombin. This positive feedback amplifies the 
clotting signal and accelerates clot formation 
after a threshold has been reached. 


Fibrin binding Kringle Kringle Serine protease 


FIGURE 10.34 Modular structure of tissue-type plasminogen activator (TPA). 


dissolve when the structural integrity of damaged areas 
is restored. Fibrin is degraded by plasmin, a serine pro- 
tease that hydrolyzes peptide bonds in the coiled-coil 
regions. Plasmin molecules can diffuse through aque- 
ous channels in the porous fibrin clot to cut the acces- 
sible connector rods. Plasmin is formed by the proteo- 
lytic activation of plasminogen, an inactive precursor 
that has a high affinity for the fibrin clots. This conver- 
sion is carried out by tissue-type plasminogen activator 
(TPA), a 72-kDa protein that has a domain structure 
closely related to that of prothrombin (Figure 10.34). 
However, a domain that targets TPA to fibrin clots 
replaces the membrane-targeting gla domain of pro- 
thrombin. The TPA bound to fibrin clots swiftly acti- 
vates adhering plasminogen. In contrast, TPA activates 
free plasminogen very slowly. The gene for TPA has 
been cloned and expressed in cultured mammalian 
cells. TPA administered at the onset of a heart attack or 
a stroke caused by a blood clot increases the likelihood 
of survival without physical or cognitive disabilities 
(Figure 10.35). 


Hemophilia revealed an early step in clotting 


c2 Some important breakthroughs in the elucidation of clotting 
pathways have come from studies of patients with bleeding disorders. 
Classic hemophilia, or hemophilia A, is the best-known clotting defect. This 
disorder is genetically transmitted as a sex-linked recessive characteristic. 
In classic hemophilia, factor VIII (antihemophilic factor) of the intrinsic 
pathway is missing or has markedly reduced activity. Although factor VIII is 
not itself a protease, it markedly stimulates the activation of factor X, the 
final protease of the intrinsic pathway, by factor IX,, a serine protease 
(Figure 10.36). Thus, activation of the intrinsic pathway is severely 
impaired in hemophilia. 

In the past, hemophiliacs were treated with transfusions of a concen- 
trated plasma fraction containing factor VIII. This therapy carried the risk 
of infection. Indeed, many hemophiliacs contracted hepatitis and, more 
recently, AIDS. A safer source of factor VIII was urgently needed. With the 
use of biochemical purification and recombinant DNA techniques, the gene 
for factor VIII was isolated and expressed in cells grown in culture. 
Recombinant factor VIII purified from these cells has largely replaced 
plasma concentrates in treating hemophilia. 


SUMMARY 


10.1 Aspartate Transcarbamoylase Is Allosterically Inhibited by the End 
Product of Its Pathway 
Allosteric proteins constitute an important class of proteins whose 
biological activity can be regulated. Specific regulatory molecules can 
modulate the activity of allosteric proteins by binding to distinct 
regulatory sites, separate from the functional sites. These proteins have 
multiple functional sites, which display cooperativity as evidenced by a 


sigmoidal dependence of function on substrate concentration. Aspartate 309 
transcarbamoylase (ATCase), one of the best-understood allosteric Summary 
enzymes, catalyzes the synthesis of N-carbamoylaspartate, the first 

intermediate in the synthesis of pyrimidines. ATCase is feedback 

inhibited by CTP, the final product of the pathway. ATP reverses this 

inhibition. ATCase consists of separable catalytic (c,) subunits (which 

bind the substrates) and regulatory (r,) subunits (which bind CTP and 

ATP). The inhibitory effect of CTP, the stimulatory action of ATP, and 

the cooperative binding of substrates are mediated by large changes in 

quaternary structure. On binding substrates, the c, subunits of the c,r, 

enzyme move apart and reorient themselves. This allosteric transition is 

highly concerted. All subunits of an ATCase molecule simultaneously 

interconvert from the T (low-affinity) to the R (high-affinity) state. 


10.2 Isozymes Provide a Means of Regulation Specific to Distinct Tissues 
and Developmental Stages 
Isozymes differ in structural characteristics but catalyze the same reac- 
tion. They provide a means of fine-tuning metabolism to meet the needs 
of a given tissue or developmental stage. The results of gene-duplication 
events provide the means for subtle regulation of enzyme function. 


10.3 Covalent Modification Is a Means of Regulating Enzyme Activity 
The covalent modification of proteins is a potent means of controlling the 
activity of enzymes and other proteins. Phosphorylation is a common type 
of reversible covalent modification. Signals can be highly amplified by 
phosphorylation because a single kinase can act on many target molecules. 
The regulatory actions of protein kinases are reversed by protein 
phosphatases, which catalyze the hydrolysis of attached phosphoryl groups. 
Cyclic AMP serves as an intracellular messenger in the transduction 
of many hormonal and sensory stimuli. Cyclic AMP switches on 
protein kinase A, a major multifunctional kinase, by binding to the 
regulatory subunit of the enzyme, thereby releasing the active catalytic 
subunits of PKA. In the absence of cAMP, the catalytic sites of PKA 
are occupied by pseudosubstrate sequences of the regulatory subunit. 


10.4 Many Enzymes Are Activated by Specific Proteolytic Cleavage 

The activation of an enzyme by the proteolytic cleavage of one or a few 
peptide bonds is a recurring control mechanism seen in processes as 
diverse as the activation of digestive enzymes and blood clotting. The 
inactive precursor is a zymogen (proenzyme). Trypsinogen is activated 
by enteropeptidase or trypsin, and trypsin then activates a host of other 
zymogens, leading to the digestion of foodstuffs. For instance, trypsin 
converts chymotrypsinogen, a zymogen, into active chymotrypsin by 
hydrolyzing a single peptide bond. 

A striking feature of the clotting process is that it is accomplished by a 
cascade of zymogen conversions, in which the activated form of one clot- 
ting factor catalyzes the activation of the next precursor. Many of the 
activated clotting factors are serine proteases. In the final step of clot for- 
mation, fibrinogen, a highly soluble molecule in the plasma, is converted 
by thrombin into fibrin by the hydrolysis of four arginine-glycine bonds. 
The resulting fibrin monomer spontaneously forms long, insoluble fibers 
called fibrin. Zymogen activation is also essential in the lysis of clots. 
Plasminogen is converted into plasmin, a serine protease that cleaves 
fibrin, by tissue-type plasminogen activator. Although zymogen activa- 
tion is irreversible, specific inhibitors of some proteases exert control. The 
irreversible protein inhibitor antithrombin II holds blood clotting in 
check in the clotting cascade. 
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KEY TERMS 


cooperativity (p. 285) 

feedback (end-product) inhibition (p. 287) 
allosteric (regulatory) site (p. 287) 
homotropic effect (p. 290) 

concerted model (p. 290) 

sequential model (p. 290) 

heterotropic effect (p. 292) 


PROBLEMS 


isozyme (isoenzyme) (isoform) (p. 292) 
covalent modification (p. 293) 

protein kinase (p. 294) 

consensus sequence (p. 295) 

protein phosphatase (p. 295) 

protein kinase A (PKA) (p. 297) 
pseudosubstrate sequence (p. 297) 


zymogen (proenzyme) (p. 299) 
enzymatic cascade (p. 303) 
hemostasis (p. 303) 

intrinsic pathway (p. 304) 
extrinsic pathway (p. 304) 


1. Context please. The allosteric properties of aspartate 
transcarbamoylase have been discussed in detail in this 
chapter. Whatisthefunction ofaspartate transcarbamoylase? 


2. Activity profile. A histidine residue in the active site of 
aspartate transcarbamoylase is thought to be important in 
stabilizing the transition state of the bound substrates. 
Predict the pH dependence of the catalytic rate, assuming 
that this interaction is essential and dominates the pH- 
activity profile of the enzyme. (See equations on p. 16.) 


3. Knowing when to say when. What is feedback inhibition? 
Why is it a useful property? 


4. Knowing when to get going. What is the biochemical 
rationale for ATP serving as a positive regulator of AT Case? 


5. No T. What would be the effect of a mutation in an 
allosteric enzyme that resulted in a T/R ratio of 0? 


6. Turned upside down. An allosteric enzyme that follows 
the concerted model has a T/R ratio of 300 in the absence 
of substrate. Suppose that a mutation reversed the ratio. 
How would this mutation affect the relation between the 
rate of the reaction and the substrate concentration? 


7. Partners. As shown in Figure 10.2, CTP inhibits 
ATCase; however, the inhibition is not complete. Can you 
suggest another molecule that might enhance the inhibition 
of ATCase? Hint: See Figure 25.2. 


8. RT equilibrium. Differentiate between homotropic and 
heterotropic effectors. 


9. Restoration project. If isolated regulatory subunits and 
catalytic subunits of ATCase are mixed, the native enzyme 
is reconstituted. What is the biological significance of the 
observation? 


10. Because it’s an enzyme. X-ray crystallographic studies 
of ATCase in the R form required the use of the bisubstrate 
analog PALA. Why was this analog, a competitive inhibitor, 
used instead of the actual substrates? 


11. Allosteric switching. A substrate binds 100 times as 
tightly to the R state of an allosteric enzyme as to its T state. 
Assume that the concerted (MWC) model applies to this 
enzyme. (See equations for the Concerted Model in the 
Appendix to Chapter 7.) 


(a) By what factor does the binding of one substrate 
molecule per enzyme molecule alter the ratio of the concen- 
trations of enzyme molecules in the R and T states? 


(b) Suppose that L, the ratio of [T] to [R] in the absence of 
substrate, is 10’ and that the enzyme contains four binding 
sites for substrate. What is the ratio of enzyme molecules 
in the R state to those in the T state in the presence of satu- 
rating amounts of substrate, assuming that the concerted 
model is obeyed? 


12. Allosteric transition. Consider an allosteric protein 
that obeys the concerted model. Suppose that the ratio of 
T to R formed in the absence of ligand is 10°, Ky = 2 
mM, and Kp = 5 pM. The protein contains four binding 
sites for ligand. What is the fraction of molecules in the 
R form when 0, 1, 2, 3, and 4 ligands are bound? (See 
equations for the Concerted Model in the Appendix to 
Chapter 7.) 


13. Negative cooperativity. You have isolated a dimeric 
enzyme that contains two identical active sites. The binding 
of substrate to one active site decreases the substrate affin- 
ity of the other active site. Can the concerted model account 
for this negative cooperativity? Hint: See Section 7.2. 


14. A new view of cooperativity. Draw a double-reciprocal 
plot for a typical Michaelis-Menten enzyme and an 
allosteric enzyme that have the same V,,, and K,,. Draw 
a double reciprocal plot for the same allosteric enzyme 
in the presence of an allosteric inhibitor and an allosteric 
stimulator. 


15. Paradoxical at first glance. Recall that phosphonacetyl- 
L-aspartate (PALA) isa potent inhibitor of ATCase because 


it mimics the two physiological substrates. However, in the 
presence of substrates, low concentrations of this unreactive 
bisubstrate analog increase the reaction velocity. On the 
addition of PALA, the reaction rate increases until an aver- 
age of three molecules of PALA are bound per molecule of 
enzyme. This maximal velocity is 17-fold greater than it is 
in the absence of PALA. The reaction rate then decreases to 
nearly zero on the addition of three more molecules of 
PALA per molecule of enzyme. Why do low concentrations 
of PALA activate ATCase? 


16. Regulation energetics. The phosphorylation and dephos- 
phorylation of proteins is a vital means of regulation. 
Protein kinases attach phosphoryl groups, whereas only a 
phosphatase will remove the phosphoryl group from the 
target protein. What is the energy cost of this means of 
covalent regulation? 


17. Vive la différence. What is an isozyme? 


18. Fine-tuning biochemistry. What is the advantage for an 
organism to have isozymic forms of an enzyme? 


19. Making matches. 


(a) ATCase 1. Protein phosphoryla- 
tion catalyst 

(b) T state 2. Required to modify 
glutamate 

(c) R state 3. Activates a particular 
kinase 

(d) Phosphorylation 4. Proenzyme 

(e) Kinase 5. Activates trypsin 

(f) Phosphatase 6. Common covalent 
modification 

(g) cAMP 7. Inhibited by CTP 

(h) Zymogen 8. Less-active state of an 
allosteric protein 

(i) Enteropeptidase 9. Initiates extrinsic 
pathway 

(j) Vitamin K 10. Forms fibrin 

(k) Thrombin 11. More-active state of 


an allosteric protein 


(1) Tissue factor 12. Removes phosphates 


20. Powering change. Phosphorylation is a common 
covalent modification of proteins in all forms of life. What 
energetic advantages accrue from the use of ATP as the 
phosphoryl donor? 


21. No going back. What is the key difference between 
regulation by covalent modification and specific proteolytic 
cleavage? 
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22. Zymogen activation. When very low concentrations of 
pepsinogen are added to acidic media, how does the half- 
time for activation depend on zymogen concentration? 


23. No protein shakes advised. Predict the physiological 
effects of a mutation that resulted in a deficiency of 
enteropeptidase. 


24. A revealing assay. Suppose that you have just examined 
a young boy with a bleeding disorder highly suggestive 
of classic hemophilia (factor VIII deficiency). Because of 
the late hour, the laboratory that carries out specialized 
coagulation assays is closed. However, you happen to 
have a sample of blood from a classic hemophiliac whom 
you admitted to the hospital an hour earlier. What is the 
simplest and most rapid test that you can perform to deter- 
mine whether your present patient also is deficient in factor 
VIII activity? 


25. Counterpoint. The synthesis of factor X, like that of 
prothrombin, requires vitamin K. Factor X also contains 
y-carboxyglutamate residues in its amino-terminal region. 
However, activated factor X, in contrast with thrombin, 
retains this region of the molecule. What is a likely func- 
tional consequence of this difference between the two 
activated species? 


26. A discerning inhibitor. Antithrombin III forms an irre- 
versible complex with thrombin but not with prothrombin. 
What is the most likely reason for this difference in 
reactivity? 


27. Drug design. A drug company has decided to use 
recombinant DNA methods to prepare a modified a ,-anti- 
trypsin that will be more resistant to oxidation than is the 
naturally occurring inhibitor. Which single amino acid 
substitution would you recommend? 


28. Blood must flow. Why is inappropriate blood-clot 
formation dangerous? 


29. Hemostasis. Thrombin functions in both coagulation 
and fibrinolysis. Explain. 


30. Dissolution row. What is tissue-type plasminogen 
activator and what is its role in preventing heart attacks? 


31. Joining together. What differentiates a soft clot from a 
mature clot? 


Data Interpretation Problems 


32. Distinguishing between models. The following graph 
shows the fraction of an allosteric enzyme in the R state (fp) 
and the fraction of active sites bound to substrate (Y) as a 
function of substrate concentration. Which model, the con- 
certed or sequential, best explains these results? 
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[Data from M. W. Kirschner and H. K. Schachman. Biochemistry 
12:2997-3004, 1966.] 


33. Reporting live from ATCase 1. ATCase underwent reac- 
tion with tetranitromethane to form a colored nitrotyrosine 
group (A,,,, = 430 nm) in each of its catalytic chains. The 
absorption by this reporter group depends on its immediate 
environment. An essential lysine residue at each catalytic site 
also was modified to block the binding of substrate. Catalytic 
trimers from this doubly modified enzyme were then com- 
bined with native trimers to form a hybrid enzyme. The 
absorption by the nitrotyrosine group was measured on addi- 
tion of the substrate analog succinate. What is the signifi- 
cance of the alteration in the absorbance at 430 nm? 


Succinate 


Absorbance change (%) 


| | | 
350 450 550 


Wavelength (nm) 


[Data from H. K. Schachman. J. Biol. Chem. 263:18583-18586, 1988.] 


34. Reporting live from ATCase 2. A different ATCase 
hybrid was constructed to test the effects of allosteric 
activators and inhibitors. Normal regulatory subunits 
were combined with nitrotyrosine-containing catalytic 
subunits. The addition of ATP in the absence of sub- 
strate increased the absorbance at 430 nm, the same 
change elicited by the addition of succinate (see the 
graph in Problem 33). Conversely, CTP in the absence of 
substrate decreased the absorbance at 430 nm. What is 
the significance of the changes in absorption of the 
reporter groups? 
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[Data from H. K. Schachman. J. Biol. Chem. 263:18583-18586, 1988.] 


35. Conviviality and PKA. Recent studies have suggested 
that protein kinase A may be important in establishing 
behaviors in many organisms, including humans. One study 
investigated the role of PKA in locust behavior. Certain spe- 
cies of locust live solitary lives until crowded, at which point 
they become gregarious—they prefer the crowded life. 
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[Data from S. R. Ott, et al., Proc. Natl. Acad. Sci. U. S. A. 
109(7):E381-7, 2012.] 


Locusts were grouped together for one hour, and then 
allowed to stay with the group or move away. Prior to 
crowding, some insects were injected with a PKA inhibitor, 
a cyclic GMP-dependent kinase inhibitor or no inhibitor, as 
indicated. The results are shown above. 


(a) What is the response of the control group to crowding? 


(b) What is the result if the insects are first treated with 
PKA inhibitor? PKG inhibitor? 


(c) What was the purpose of the experiment with the PKG 
inhibitor? 


(d) What do these results suggest about the role of PKA in 
the transition from a solitary to a gregarious life style? 


‘The above experiments were repeated on a different species of 
insect that is always gregarious. The results are shown below. 
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[Data from S. R. Ott, et al., Proc. Natl. Acad. Sci. U. S. A. 
109(7):E381-7, 2012.] 


(e) What do these results suggest about the role of PKA in 
insects that are always gregarious? 


Chapter Integration Problems 


36. Repeating heptads. Each of the three types of fibrin 
chains contains repeating heptapeptide units (abcdefg) in 
which residues a and d are hydrophobic. Propose a reason 
for this regularity. 
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37. Density matters. The sedimentation value of aspartate 
transcarbamoylase decreases when the enzyme switches 
to the R state. On the basis of the allosteric properties 
of the enzyme, explain why the sedimentation value 
decreases. 


38. Too tight a grip. Trypsin cleaves proteins on the car- 
boxyl side of lysine. Trypsin inhibitor has a lysine residue, 
and binds to trypsin, yet it is not a substrate. Explain. 


39. Apparently not the four-leaf variety. Cows that graze on 
spoiled sweet clover, which contains dicoumarol, die from 
hemorrhagic disease. The cause of death is defective pro- 
thrombin. However, the amino acid composition of the 
defective prothrombin is identical to that of normal pro- 
thrombin. What is the mechanism of action of dicoumarol? 
Why are the amino acid compositions of the defective and 
normal prothrombin the same? 


Mechanism Problems 


40. Aspartate transcarbamoylase. Write the mechanism (in 
detail) for the conversion of aspartate and carbamoy] phos- 
phate into N-carbamoylaspartate. Include a role for the 
histidine residue present in the active site. 


41. Protein kinases. Write a mechanism (in detail) for phos- 
phorylation of a serine residue by ATP catalyzed by protein 
kinase. What groups might you expect to find in the 
enzyme’s active site? 


Carbohydrates 


Carbohydrates are important fuel molecules, but they play many other biochemical roles, 
including protection against high-impact forces. The cartilage of a runner’s foot cushions the 
impact of each step she takes. A key component of cartilage are molecules called 
glycosaminoglycans, large polymers made up of many repeats of dimers such as the pair shown 
at the right. [Untitled x-ray/Nick Veasey/Getty Images.] 


or years, the study of carbohydrates was considered less exciting than 

many if not most topics of biochemistry. Carbohydrates were recog- 
nized as important fuels and structural components but were thought to be 
peripheral to most key activities of the cell. In essence, they were considered 
the underlying girders and fuel for a magnificent piece of biochemical 
architecture. This view has changed dramatically in the past few years. We 
have learned that cells of all organisms are coated in a dense and complex 
coat of carbohydrates. Secreted proteins are often extensively decorated 
with carbohydrates essential to a protein’s function. The extracellular 
matrix in higher eukaryotes—the environment in which the cells live—is 
rich in secreted carbohydrates central to cell survival and cell-to-cell com- 
munication. Carbohydrates, carbohydrate-containing proteins, and spe- 
cific carbohydrate-binding proteins are required for interactions that allow 
cells to form tissues, are the basis of human blood groups, and are used by 
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CHAPTER 11 


Carbohydrates 


a variety of pathogens to gain access to their hosts. Indeed, rather than 
being mere infrastructure components, carbohydrates supply details and 
enhancements to the biochemical architecture of the cell, helping to define 
the functionality and uniqueness of the cell. 

A key property of carbohydrates that allows their many functions is the 
tremendous structural diversity possible within this class of molecules. 
Carbohydrates are built from monosaccharides, which are small mole- 
cules—typically containing from three to nine carbon atoms that are bound 
to hydroxyl groups—that vary in size and in the stereochemical configura- 
tion at one or more carbon centers. These monosaccharides can be linked 
together to form a large variety of oligosaccharide structures. The sheer 
number of possible oligosaccharides makes this class of molecules informa- 
tion rich. This information, when attached to proteins, can augment the 
already immense diversity of proteins. 

The realization of the importance of carbohydrates to so many aspects of 
biochemistry has spawned a field of study called glycobiology. Glycobiology 
is the study of the synthesis and structure of carbohydrates and how carbo- 
hydrates are attached to and recognized by other molecules such as proteins. 
Along with a new field comes a new “omics” to join genomics and pro- 
teomics—glycomics. Glycomics is the study of the glycome, all of the carbo- 
hydrates and carbohydrate-associated molecules that cells produce. Like 
the proteome, the glycome is dynamic, depending on cellular and environ- 
mental conditions. Unraveling oligosaccharide structures and elucidating 
the effects of their attachment to other molecules constitutes a tremendous 
challenge in the field of biochemistry. 


11.1 Monosaccharides Are the Simplest Carbohydrates 


Carbohydrates are carbon-based molecules that are rich in hydroxyl 
groups. Indeed, the empirical formula for many carbohydrates is 
(CH,O),—literally, a carbon hydrate. Simple carbohydrates are called 
monosaccharides. These simple sugars serve not only as fuel molecules but 
also as fundamental constituents of living systems. For instance, DNA has 
a backbone consisting of alternating phosphoryl groups and deoxyribose, 
a cyclic five-carbon sugar. 

Monosaccharides are aldehydes or ketones that have two or more 
hydroxyl groups. The smallest monosaccharides, composed of three carbon 
atoms, are dihydroxyacetone and D- and L-glyceraldehyde. 
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Dihydroxyacetone is called a ketose because it contains a keto group (in red 
above), whereas glyceraldehyde is called an aldose because it contains an 
aldehyde group (also in red). They are referred to as trioses (tri- for three, 
referring to the three carbon atoms that they contain). Similarly, simple 
monosaccharides with four, five, six, and seven carbon atoms are called 
tetroses, pentoses, hexoses, and heptoses, respectively. Perhaps the monosac- 
charides of which we are most aware are the hexoses, such as glucose and 
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fructose. Glucose is an essential energy source for virtually all forms of life. 
Fructose is commonly used as a sweetener that is converted into glucose 
derivatives inside the cell. 

Carbohydrates can exist ina dazzling variety of isomeric forms (Figure 11.1). 
Dihydroxyacetone and glyceraldehyde are constitutional isomers because 
they have identical molecular formulas but differ in how the atoms are 
ordered. Stereoisomers are isomers that differ in spatial arrangement. Recall 
from the discussion of amino acids (p. 29) that stereoisomers are designated 
as having either D or L configuration. Glyceraldehyde has a single asym- 
metric carbon atom and, thus, there are two stereoisomers of this sugar: 
D-glyceraldehyde and L-glyceraldehyde. These molecules are a type of ste- 
reoisomer called enantiomers, which are mirror images of each other. Most 
vertebrate monosaccharides have the D configuration. According to con- 
vention, the D and L isomers are determined by the configuration of the 
asymmetric carbon atom farthest from the aldehyde or keto group. 
Dihydroxyacetone is the only monosaccharide without at least one asym- 
metric carbon atom. 

Monosaccharides made up of more than three carbon atoms have mul- 
tiple asymmetric carbons, and so they can exist not only as enantiomers but 
also as diastereoisomers, isomers that are not mirror images of each other. 
The number of possible stereoisomers equals 2”, where n is the number of 
asymmetric carbon atoms. Thus, a six-carbon aldose with 4 asymmetric 
carbon atoms can exist as 16 possible diastereoisomers, of which glucose is 
one such isomer. 

Figure 11.2 shows the common sugars that we will see most frequently 
in our study of biochemistry. D-Ribose, the carbohydrate component of 
RNA, is a five-carbon aldose, as is deoxyribose, the monosaccharide com- 
ponent of deoxynucleotides. D-Glucose, D-mannose, and D-galactose are 


FIGURE 11.1 lsomeric forms of 
carbohydrates. 
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FIGURE 11.2 Common monosaccharides. 


Aldoses contain an aldehyde (shown in 
blue), whereas ketoses, such as fructose, 
contain a ketose (shown in red). The 
asymmetric carbon atom farthest from the 
aldehyde or ketone (shown in green) 
designates the structures as being in the 
D configuration. 
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abundant six-carbon aldoses. Note that D-glucose and D-mannose differ in 
configuration only at C-2, the carbon atom in the second position. Sugars 
that are diastereoisomers differing in configuration at only a single asym- 
metric center are epimers. Thus, D-glucose and D-mannose are epimeric at 
C-2; D-glucose and D-galactose are epimeric at C-4. 

Note that ketoses have one less asymmetric center than aldoses with 
the same number of carbon atoms. D-Fructose is the most abundant 
ketohexose. 


Many common sugars exist in cyclic forms 


The predominant forms of ribose, glucose, fructose, and many other sugars 
in solution, as is the case inside the cell, are not open chains. Rather, the 
open-chain forms of these sugars cyclize into rings. The chemical basis for 
ring formation is that an aldehyde can react with an alcohol to form a 
hemiacetal. 
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For an aldohexose such as glucose, a single molecule provides both the 
aldehyde and the alcohol: the C-1 aldehyde in the open-chain form of glu- 
cose reacts with the C-5 hydroxyl group to form an intramolecular hemiace- 
tal (Figure 11.3). The resulting cyclic hemiacetal, a six-membered ring, is 
called pyranose because of its similarity to pyran. 

Similarly, a ketone can react with an alcohol to form a hemiketal. 
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The C-2 keto group in the open-chain form of a ketohexose, such as 
fructose, can form an intramolecular hemiketal by reacting with either the 
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C-6 hydroxyl group to form a six-membered cyclic hemiketal or the C-5 
hydroxyl group to form a five-membered cyclic hemiketal (Figure 11.4). 
The five-membered ring is called a furanose because of its similarity 
to furan. 
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The depictions of glucopyranose (glucose) and fructofuranose (fructose) 
shown in Figures 11.3 and 11.4 are Haworth projections. In such projections, 
the carbon atoms in the ring are not written out. The approximate plane of 
the ring is perpendicular to the plane of the paper, with the heavy line on the 
ring projecting toward the reader. 

We have seen that carbohydrates can contain many asymmetric carbon 
atoms. An additional asymmetric center 1s created when a cyclic hemiacetal 
is formed, generating yet another diastereoisomeric form of sugars called 
anomers. In glucose, C-1 (the carbonyl carbon atom in the open-chain 
form) becomes an asymmetric center. Thus, two ring structures can be 
formed: a-D-glucopyranose and B-D-glucopyranose (Figure 11.3). For D 
sugars drawn as Haworth projections in the standard orientation as shown 
in Figure 11.3, the designation a means that the hydroxyl group attached to 
C-1 is on the opposite side of the ring as C-6; B means that the hydroxyl group 
is on the same side of the ring as C-6. The C-1 carbon atom is called the 
anomeric carbon atom. An equilibrium mixture of glucose contains approxi- 
mately one-third a anomer, two-thirds B anomer, and <1% of the open- 
chain form. 

The furanose-ring form of fructose also has anomeric forms, in which a 
and £ refer to the hydroxyl groups attached to C-2, the anomeric carbon 
atom (Figure 11.4). Fructose forms both pyranose and furanose rings. The 
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FIGURE 11.3 Pyranose formation. The 
open-chain form of glucose cyclizes when 
the C-5 hydroxyl group attacks the oxygen 
atom of the C-1 aldehyde group to form an 
intramolecular hemiacetal. Two anomeric 
forms, designated a and B, can result. 


FIGURE 11.4 Furanose formation. The 
open-chain form of fructose cyclizes to a 
five-membered ring when the C-5 hydroxyl 
group attacks the C-2 ketone to form an 
intramolecular hemiketal. Two anomers are 
possible, but only the a anomer is shown. 
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FIGURE 11.6 Common monosaccharides 


in their ring forms. 
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pyranose form predominates in fructose that is free in solution, and the 
furanose form predominates in many fructose derivatives (Figure 11.5). 

8-D-Fructopyranose, found in honey, is one of the sweetest chemicals 
known. The B-D-fructofuranose form is not nearly as sweet. Heating con- 
verts B-fructopyranose into the B-fructofuranose form, reducing the sweet- 
ness of the solution. For this reason, corn syrup with a high concentration of 
fructose in the B-D-pyranose form is used as a sweetener in cold, but not 
hot, drinks. Figure 11.6 shows the common sugars discussed previously in 
their ring forms. 
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Pyranose and furanose rings can assume different conformations 


The six-membered pyranose ring is not planar because of the tetrahedral 
geometry of its saturated carbon atoms. Instead, pyranose rings adopt two 
classes of conformations, termed chair and boat because of the resem- 
blance to these objects (Figure 11.7). In the chair form, the substituents 
on the ring carbon atoms have two orientations: axial and equatorial. 
Axial bonds are nearly perpendicular to the average plane of the ring, 
whereas equatorial bonds are nearly parallel to this plane. Axial substitu- 
ents sterically hinder each other if they emerge on the same side of the 
ring (e.g., 1,3-diaxial groups). In contrast, equatorial substituents are less 


crowded. The chair form of B-D-glucopyranose predominates 
because all axial positions are occupied by hydrogen atoms. The 
bulkier —OH and —CH,OH groups emerge at the less-hindered 
periphery. The boat form of glucose is disfavored because it is 
quite sterically hindered. 

Furanose rings, like pyranose rings, are not planar. They can be 
puckered so that four atoms are nearly coplanar and the fifth is 
about 0.5 A away from this plane (Figure 11.8). This conformation 
is called an envelope form because the structure resembles an opened 
envelope with the back flap raised. In the ribose moiety of most 
biomolecules, either C-2 or C-3 is out of the plane on the same side 
as C-5. These conformations are called C-2-endo and C-3-endo, 
respectively. 


H CH2OH 
FIGURE 11.8 Envelope 
conformations of B-pD-ribose. 
The C-3-endo and C-2-endo forms 
of B-D-ribose are shown. The color 
indicates the four atoms that lie 
C-3-endo C-2-endo approximately in a plane. 
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Glucose is a reducing sugar 


Because the a and B isomers of glucose are in an equilibrium that passes 
through the open-chain form, glucose has some of the chemical properties 
of free aldehydes, such as the ability to react with oxidizing agents. For 
example, glucose can react with cupric ion (Cu?*), reducing it to cuprous 
ion (Cut), while being oxidized to gluconic acid. 
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cs Solutions of cupric ion (known as Fehling’s solution) provide a simple 

test for the presence of sugars such as glucose. Sugars that react are 
called reducing sugars; those that do not are called nonreducing sugars. 
Reducing sugars can often nonspecifically react with a free amino group to 
form a stable covalent bond. For instance, as a reducing sugar, glucose 
reacts with hemoglobin to form glycosylated hemoglobin (hemoglobin 
Ac). Monitoring changes in the amount of glycosylated hemoglobin is an 
especially useful means of assessing the effectiveness of treatments for dia- 
betes mellitus, a condition characterized by high levels of blood glucose 
(Section 27.3). Because the glycosylated hemoglobin remains in circulation, 
the amount of the modified hemoglobin corresponds to the long-term 
regulation—over several months—of glucose levels. In nondiabetic indi- 
viduals, less than 6% of the hemoglobin is glycosylated, whereas, in uncon- 
trolled diabetics, almost 10% of the hemoglobin is glycosylated. Although 
the glycosylation of hemoglobin has no effect on oxygen binding and is thus 


Chair form 


Boat form 


FIGURE 11.7 Chair and boat forms of 
{-p-glucose. The chair form is more stable 
because hydrogen atoms occupy the axial 
positions, resulting in less steric hindrance. 


Abbreviations: a, axial; 
e, equatorial. 


321 


322 benign, similar reducing reactions are often detrimental because the glyco- 

CHAPTER 11 Carbohydrates sylations alter the normal biochemical function of the modified proteins. 
These modifcations, known as advanced glycation end products (AGE), have 
been implicated in aging, arteriosclerosis, and diabetes, as well as other 
pathological conditions. 


Monosaccharides are joined to alcohols and amines through 
glycosidic bonds 


The biochemical properties of monosaccharides can by modified by reac- 
tion with other molecules. These modifications increase the biochemical 
versatility of carbohydrates, enabling them to serve as signal molecules or 
facilitating their metabolism. Three common reactants are alcohols, amines, 
and phosphates. A bond formed between the anomeric carbon atom of a 
carbohydrate and the oxygen atom of an alcohol is called a glycosidic bond— 
specifically, an O-glycosidic bond. O-Glycosidic bonds are prominent when 
carbohydrates are linked together to form long polymers and when they are 
attached to proteins (Figure 11.9). In addition, the anomeric carbon atom of 
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a sugar can be linked to the nitrogen atom of an amine to form an N-glycosidic 
bond, such as when nitrogenous bases are attached to ribose units to form 
nucleosides. Carbohydrates can also be modifed by the attachment of func- 
tional groups to carbons other than the anomeric carbon (Figure 11.10). 
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expressed on cell surfaces. metabolism. The addition of phosphoryl groups is a common modification 


of sugars. For instance, the first step in the breakdown of glucose to obtain 
energy is its conversion into glucose 6-phosphate. Several subsequent 


intermediates in this metabolic pathway, such as dihydroxyacetone phos- 
phate and glyceraldehyde 3-phosphate, are phosphorylated sugars. 
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Phosphorylation makes sugars anionic; the negative charge not only pre- 
vents these sugars from spontaneously leaving the cell by crossing lipid- 
bilayer membranes, but also prevents them from interacting with transport- 
ers of the unmodified sugar. Moreover, phosphorylation creates reactive 
intermediates that will more readily undergo metabolism. For example, a 
multiply phosphorylated derivative of ribose plays key roles in the biosyn- 
theses of purine and pyrimidine nucleotides (Chapter 25). 


11.2 Monosaccharides Are Linked to Form 
Complex Carbohydrates 


Because sugars contain hydroxyl groups, glycosidic bonds can join one 
monosaccharide to another. Oligosaccharides are built by the linkage of two 
or more monosaccharides by O-glycosidic bonds (Figure 11.11). In the 
disaccharide maltose, for example, two D-glucose residues are joined by a 
glycosidic linkage between the a-anomeric form of C-1 on one sugar and 
the hydroxyl oxygen atom on C-4 of the adjacent sugar. Such a linkage is 
called an a-1,4-glycosidic bond. Just as proteins have a directionality 
defined by the amino and carboxyl termini, oligosaccharides have a direc- 
tionality defined by their reducing and nonreducing ends. The carbohy- 
drate unit at the reducing end has a free anomeric carbon atom that has 
reducing activity because it can form the open-chain form, as discussed 
earlier (p. 321). By convention, this end of the oligosaccharide is still called 
the reducing end even when it is bound to another molecule such as a pro- 
tein and thus no longer has reducing properties. 

The fact that monosaccharides have multiple hydroxyl groups means 
that many different glycosidic linkages are possible. For example, consider 
three monosaccharides: glucose, mannose, and galactose. These molecules 
can be linked together in the laboratory to form more than 12,000 structures 
differing in the order of the monosaccharides and the hydroxyl groups par- 
ticipating in the glycosidic linkages. In this section, we will look at some of 
the most common oligosaccharides found in nature. 


Sucrose, lactose, and maltose are the common disaccharides 


A disaccharide consists of two sugars joined by an O-glycosidic bond. Three 
abundant disaccharides that we encounter frequently are sucrose, lactose, 
and maltose (Figure 11.12). Sucrose (common table sugar) is obtained com- 
mercially from sugar cane or sugar beets. The anomeric carbon atoms of a 
glucose unit and a fructose unit are joined in this disaccharide; the con- 
figuration of this glycosidic linkage is a for glucose and B for fructose. 
Sucrose can be cleaved into its component monosaccharides by the enzyme 
sucrase. Lactose, the disaccharide of milk, consists of galactose joined to 
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a-1,4-Glycosidic bond 
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FIGURE 11.11 Maltose, a disaccharide. 
Two molecules of glucose are linked by 
an a-1,4-glycosidic bond to form the 
disaccharide maltose. The angles in the 
bonds to the central oxygen atom do not 
denote carbon atoms. The angles are 
added only for ease of illustration. The 
glucose molecule on the right is capable of 
assuming the open-chain form, which can 
act as a reducing agent. The glucose 
molecule on the left cannot assume the 
open-chain form, because the C-1 carbon 
atom is bound to another molecule. 
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(a-p-Glucopyranosyl-(1 — 2)-B-p-fructofuranose 


FIGURE 11.12 Common disaccharides. 
Sucrose, lactose, and maltose are common 
dietary components. As in Figure 11.11, the 
angles in the bonds to the central oxygen 
atoms do not denote carbon atoms. 
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FIGURE 11.13 Branch point in glycogen. Two 
chains of glucose molecules joined by a-1,4- 
glycosidic bonds are linked by an a-1,6-glycosidic 
bond to create a branch point. Such an a-1,6- 
glycosidic bond forms at approximately every 10 
glucose units, making glycogen a highly branched 
molecule. 
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Maltose 
(a-p-Glucopyranosyl-(1— 4)-a-p-glucopyranose 


Lactose 
(B-p-Galactopyranosyl-(1— 4)-a-p-glucopyranose 


glucose by a B-1,4-glycosidic linkage. Lactose is hydrolyzed to these 
monosaccharides by lactase in human beings and by B-galactosidase in 
bacteria. In maltose, two glucose units are joined by an a-1,4-glycosidic 
linkage. Maltose comes from the hydrolysis of large polymeric oligosac- 
charides such as starch and glycogen and is in turn hydrolyzed to glucose 
by maltase. Sucrase, lactase, and maltase are located on the outer surfaces 
of epithelial cells lining the small intestine. The cleavage products of 
sucrose, lactose, and maltose can be further processed to provide energy in 
the form of ATP. 


Glycogen and starch are storage forms of glucose 


Glucose is an important energy source in virtually all life forms. However, 
free glucose molecules cannot be stored because in high concentrations, 
glucose will disturb the osmotic balance of the cell, potentially resulting in 
cell death. The solution is to store glucose as units in a large polymer, which 
is not osmotically active. 

Large polymeric oligosaccharides, formed by the linkage of mul- 
tiple monosaccharides, are called polysaccharides and play vital roles 
in energy storage and in maintaining the structural integrity of an 
organism. If all of the monosaccharide units in a polysaccharide are 
the same, the polymer is called a homopolymer. The most common 
homopolymer in animal cells is glycogen, the storage form of glu- 
cose. Glycogen is present in most of our tissues but is most abun- 
dant in muscle and liver. As will be considered in detail in Chapter 21, 
glycogen is a large, branched polymer of glucose residues. Most of 
the glucose units in glycogen are linked by a-1,4-glycosidic bonds. 
The branches are formed by a-1,6-glycosidic bonds, present about 
once in 10 units (Figure 11.13). 

The nutritional reservoir in plants is the homopolymer starch, of 
which there are two forms. Amylose, the unbranched type of starch, 
consists of glucose residues in a-1,4 linkage. Amylopectin, the 
branched form, has about 1 a-1,6 linkage per 30 a-1,4 linkages, in 
similar fashion to glycogen except for its lower degree of branching. 
More than half the carbohydrate ingested by human beings is starch 
found in wheat, potatoes, and rice, to name just a few sources. 
Amylopectin, amylose, and glycogen are rapidly hydrolyzed by a-amylase, 
an enzyme secreted by the salivary glands and the pancreas. 


Cellulose, a structural component of plants, is made of 
chains of glucose 


Cellulose, the other major polysaccharide of glucose found in plants, 
serves a structural rather than a nutritional role as an important compo- 
nent of the plant cell wall. Cellulose is among the most abundant organic 
compounds in the biosphere. Some 10!° kg of cellulose is synthesized and 
degraded on Earth each year, an amount 1000 times as great as the 
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Cellulose 
(B-1,4 linkages) 


Starch and glycogen 
(a-1,4 linkages) 


FIGURE 11.14 Glycosidic bonds determine polysaccharide 
structure. The B-1,4 linkages favor straight chains, which are 
optimal for structural purposes. The a-1,4 linkages favor bent 
structures, which are more suitable for storage. 


combined weight of the human race. Cellulose is an unbranched polymer 
of glucose residues joined by B-1,4 linkages, in contrast with the a-1,4 
linkage seen in starch and glycogen. This simple difference in stereo- 
chemistry yields two molecules with very different properties and bio- 
logical functions. The B configuration allows cellulose to form very long, 
straight chains. Fibrils are formed by parallel chains that interact with 
one another through hydrogen bonds, generating a rigid, supportive 
structure. The straight chains formed by B linkages are optimal for the 
construction of fibers having a high tensile strength. The a-1,4 linkages 
in glycogen and starch produce a very different molecular architecture: a 
hollow helix is formed instead of a straight chain (Figure 11.14). The hol- 
low helix formed by a linkages is well suited to the formation of a more- 
compact, accessible store of sugar. Although mammals lack cellulases 
and therefore cannot digest wood and vegetable fibers, cellulose and 
other plant fibers are still an important constituent of the mammalian 
diet as a component of dietary fiber. Soluble fiber such as pectin (polyga- 
lacturonic acid) slows the movement of food through the gastrointestinal 
tract, allowing improved digestion and the absorption of nutrients. 
Insoluble fibers, such as cellulose, increase the rate at which digestion 
products pass through the large intestine. This increase in rate can mini- 
mize exposure to toxins in the diet. 

We have considered only homopolymers of glucose. However, given the 
variety of different monosaccharides that can be put together in any number 
of arrangements, the number of possible polysaccharides is huge. We will 
consider some of these polysaccharides shortly. 


11.3 Carbohydrates Can Be Linked to Proteins 
to Form Glycoproteins 


A carbohydrate group can be covalently attached to a protein to form a 
glycoprotein. Such modifications are not rare, as 50% of the proteome con- 
sists of glycoproteins. We will examine three classes of glycoproteins. The 
first class is simply referred to as glycoproteins. In glycoproteins of this 
class, the protein constituent is the largest component by weight. This 
versatile class plays a variety of biochemical roles. Many glycoproteins 
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FIGURE 11.15 Glycosidic bonds 
between proteins and carbohydrates. A 
glycosidic bond links a carbohydrate to the 
side chain of asparagine (N-linked) or to the 
side chain of serine or threonine (O-linked). 
The glycosidic bonds are shown in red. 


are components of cell membranes, where they take part in processes such 
as cell adhesion and the binding of sperm to eggs. Other glycoproteins are 
formed by linking carbohydrates to soluble proteins. Many of the pro- 
teins secreted from cells are glycosylated, or modified by the attachment 
of carbohydrates, including most proteins present in the serum compo- 
nent of blood. 

The second class of glycoproteins comprises the proteoglycans. The 
protein component of proteoglycans is conjugated to a particular type of 
polysaccharide called a glycosaminoglycan. Carbohydrates make up a much 
larger percentage by weight of the proteoglycan compared with simple 
glycoproteins. Proteoglycans function as structural components and 
lubricants. 

Mucins, or mucoproteins, are, like proteoglycans, predominantly carbo- 
hydrate. N-Acetylgalactosamine is usually the carbohydrate moiety bound 
to the protein in mucins. N-Acetylgalactosamine is an example of an amino 
sugar, so named because an amino group replaces a hydroxyl group. 
Mucins, a key component of mucus, serve as lubricants. 

Glycosylation greatly increases the complexity of the proteome. A given 
protein with several potential glycosylation sites can have many different 
glycosylated forms (called glycoforms), each of which can be generated only 
in a specific cell type or developmental stage. 


Carbohydrates can be linked to proteins through asparagine (N-linked) 
or through serine or threonine (O-linked) residues 


Sugars in glycoproteins are attached either to the amide nitrogen atom in the 
side chain of asparagine (termed an N-linkage) or to the oxygen atom in the side 
chain of serine or threonine (termed an O-linkage), as shown in Figure 11.15. 
An asparagine residue can accept an oligosaccharide only if the residue is 
part of an Asn-X-Ser or Asn-X-Thr sequence, in which X can be any resi- 
due, except proline. However, not all potential sites are glycosylated. Which 
sites are glycosylated depends on other aspects of the protein structure and 
on the cell type in which the protein is expressed. All N-linked oligosac- 
charides have in common a pentasaccharide core consisting of three man- 
nose and two N-acetylglucosamine residues. Additional sugars are attached 
to this core to form the great variety of oligosaccharide patterns found in 
glycoproteins (Figure 11.16). 
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FIGURE 11.16 N-linked oligosaccharides. A pentasaccharide core (shaded gray) is common 
to all N-linked oligosaccharides and serves as the foundation for a wide variety of N-linked 
oligosaccharides, two of which are illustrated: (A) high- mannose type; (B) complex type. 


The glycoprotein erythropoietin is a vital hormone 


e Let us look at a glycoprotein present in the blood 

serum that has dramatically improved treatment 
for anemia, particularly that induced by cancer chemo- 
therapy. The glycoprotein hormone erythropoietin (EPO) 
is secreted by the kidneys and stimulates the production 
of red blood cells. EPO is composed of 165 amino acids 
and is N-glycosylated at three asparagine residues and 
O-glycosylated on a serine residue (Figure 11.17). The 
mature EPO is 40% carbohydrate by weight, and glyco- 
sylation enhances the stability of the protein in the blood. 
Unglycosylated protein has only about 10% of the bioac- 
tivity of the glycosylated form because the protein is rap- 
idly removed from the blood by the kidneys. The 
availability of recombinant human EPO has greatly aided 
the treatment of anemias. However, some endurance ath- 
letes have used recombinant human EPO to increase the 
red-blood-cell count and hence their oxygen-carrying 
capacity. Drug-testing laboratories are able to distinguish 
some forms of prohibited human recombinant EPO from 
natural EPO in athletes by detecting differences in their 
glycosylation patterns through the use of isoelectric focus- 


ing (p. 73). 


FIGURE 11.17 Oligosaccharides attached to erythropoietin. 
Erythropoietin has oligosaccharides linked to three asparagine 
residues and one serine residue. The structures shown are 
approximately to scale. See Figure 11.16 for the carbohydrate 
key. [Drawn from 1BUY.pdf.] 


Glycosylation functions in nutrient sensing 


An especially important glycosylation reaction is the 
covalent attachment of N-acetylglucosamine (GlcNAc) 
to serine or threonine residues of cellular proteins, a 
reaction catalyzed by GlcNAc transferase. The concentration of GlcNAc 
reflects the active metabolism of carbohydrates, amino acids and fats, 
indicating that nutrients are abundant (Figure 11.18). More than one 
thousand proteins are modified by GlcNAcylation, including transcrip- 
tion factors and components of signaling pathways. Interestingly, 
because the GlcNAcylation sites are also potential phosphorylation 
sites, O-GlcNAc transferase and protein kinases may be involved in 
cross talk to modulate one another’s signaling activity. Like phosphory- 
lation, GlcNAcylation is reversible, with GlcNAcase catalyzing the 
removal of the carbohydrate. Dysregulation of GlcNAc transferase has 
been linked to insulin resistance, diabetes, cancer and neurological 
pathologies. 


Proteoglycans, composed of polysaccharides and protein, 
have important structural roles 


As stated earlier, proteoglycans are proteins attached to glycosamino- 
glycans. The glycosaminoglycan makes up as much as 95% of the bio- 
molecule by weight, and so the proteoglycan resembles a polysaccharide 
more than a protein. Proteoglycans not only function as lubricants and 
structural components in connective tissue, but also mediate the adhe- 
sion of cells to the extracellular matrix, and bind factors that regulate cell 
proliferation. 


e The properties of proteoglycans are determined primarily by the 
glycosaminoglycan component. Many glycosaminoglycans are 
made of repeating units of disaccharides containing a derivative of an 


Glucose signals 
carbohydrate availability 
(Chapter 16) 


CH,OH 
H OOH 


H CH; 


yoo c a 
Nitrogen signals | | hy 
protein availability fe) Acetate signals 
(Chapter 23) fatty acid availability 
(Chapter 22) 


B-D-Acetylglucosamine 
(GIcNAc) 


FIGURE 11.18 Glycosylation as a 
nutrient sensor. N-acetylglucosamine is 
attached to proteins when nutrients are 
abundant. 
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FIGURE 11.20 Hurler disease. Formerly 
called gargoylism, Hurler disease is a 
mucopolysaccharidosis having symptoms 
hat include wide nostrils, a depressed 
nasal bridge, thick lips and earlobes, and 
irregular teeth. In Hurler disease, 
glycosaminoglycans cannot be degraded. 
The excess of these molecules are stored 
in the soft tissue of the facial regions, 
resulting in the characteristic facial 
eatures. [Courtesy National MPS Society, 
www.mpssociety.org.] 
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Hyaluronate 


FIGURE 11.19 Repeating units in glycosaminoglycans. Structural formulas for five 
repeating units of important glycosaminoglycans illustrate the variety of modifications and 
linkages that is possible. Amino groups are shown in blue and negatively charged groups in 
red. Hydrogen atoms have been omitted for clarity. The right-hand structure is a glucosamine 
derivative in each case. 


amino sugar, either glucosamine or galactosamine (Figure 11.19). At least 
one of the two sugars in the repeating unit has a negatively charged carbox- 
ylate or sulfate group. The major glycosaminoglycans in animals are chon- 
droitin sulfate, keratan sulfate, heparin, dermatan sulfate, and hyaluronate. 
Recall that heparin acts as an anticoagulant to assist the termination of 
blood clotting (p. 307). Mucopolysaccharidoses are a collection of diseases, 
such as Hurler disease, that result from the inability to degrade glycosami- 
noglycans (Figure 11.20). Although precise clinical features vary with the 
disease, all mucopolysaccharidoses result in skeletal deformities and 
reduced life expectancies. 


Proteoglycans are important components of cartilage 


Among the best-characterized members of this diverse class is the proteo- 
glycan in the extracellular matrix of cartilage. The proteoglycan aggrecan 
and the protein collagen are key components of cartilage. The triple helix 
of collagen (p. 45) provides structure and tensile strength, whereas aggre- 
can serves as a shock absorber. The protein component of aggrecan is a 
large molecule composed of 2397 amino acids. The protein has three 
globular domains, and the site of glycosaminoglycan attachment is the 
extended region between globular domains 2 and 3. This linear region 
contains highly repetitive amino acid sequences, which are sites for the 
attachment of keratan sulfate and chondroitin sulfate. Many molecules of 
aggrecan are in turn noncovalently bound through the first globular 
domain to a very long filament formed by linking together molecules of 
the glycosaminoglycan hyaluronate (Figure 11.21). Water is bound to the 
glycosaminoglycans, attracted by the many negative charges. Aggrecan 
can cushion compressive forces because the absorbed water enables it to 
spring back after having been deformed. When pressure is exerted, as 
when the foot hits the ground while walking, water is squeezed from the 
glycosaminoglycan, cushioning the impact. When the pressure 1s released, 
the water rebinds. Osteoarthritis, the most common form of arthritis, 
results when water is lost from proteoglycan with aging. Other forms of 
arthritis can result from the proteolytic degradation of aggrecan and col- 
lagen in the cartilage. 
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FIGURE 11.21 Structure of proteoglycan from cartilage. (A) Electron micrograph of a 
proteoglycan from cartilage (with false color added). Proteoglycan monomers emerge laterally 
at regular intervals from opposite sides of a central filament of hyaluronate. (B) Schematic 
representation. G = globular domain. [(A) Courtesy of Dr. Lawrence Rosenberg. From 

J. A. Buckwalter and L. Rosenberg. Collagen Relat. Res. 3:489-504, 1983.] 


In addition to being a key component of structural tissues, glycosami- 
noglycans are common throughout the biosphere. Chitin is a glycosami- 
noglycan found in the exoskeleton of insects, crustaceans, and arachnids 
and is, next to cellulose, the second most abundant polysaccharide in 
nature (Figure 11.22). Cephalopods such as squid use their razor sharp 
beaks, which are made of extensively crosslinked chitin, to disable and 
consume prey. 


Mucins are glycoprotein components of mucus 


A third class of glycoproteins is the mucins (mucoproteins). In mucins, the 
protein component is extensively glycosylated at serine or threonine resi- 
dues by N-acetylgalactosamine (Figure 11.10). Mucins are capable of form- 
ing large polymeric structures and are common in mucous secretions. These 
glycoproteins are synthesized by specialized cells in the tracheobronchial, 
gastrointestinal, and genitourinary tracts. Mucins are abundant in saliva 
where they function as lubricants. 

A model of a mucin is shown in Figure 11.23A. The defining feature of 
the mucins is a region of the protein backbone termed the variable number of 
tandem repeats (VNTR) region, which is rich in serine and threonine residues 
that are O-glycosylated. Indeed, the carbohydrate moiety can account for as 
much as 80% of the molecule by weight. A number of core carbohydrate 
structures are conjugated to the protein component of mucin. Figure 11.23B 
shows one such structure. 


a Mucins adhere to epithelial cells and act as a protective barrier; they 
also hydrate the underlying cells. In addition to protecting cells from 
environmental insults, such as stomach acid, inhaled chemicals in the lungs, 
and bacterial infections, mucins have roles in fertilization, the immune 
response, and cell adhesion. Mucins are overexpressed in bronchitis and 
cystic fibrosis, and the overexpression of mucins is characteristic of adeno- 
carcinomas—cancers of the glandular cells of epithelial origin. 
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FIGURE 11.22 Chitin, a glycosamino- 
glycan, is present in insect wings and 
the exoskeleton. Glycosaminoglycans are 
components of the exoskeletons of insects, 
crustaceans, and arachnids. [FLPA/Alamy.] 
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FIGURE 11.23 Mucin structure. (A) A 
schematic representation of a mucoprotein. 
The VNTR region is highly glycosylated, 
forcing the molecule into an extended 
conformation. The Cys-rich domains and 
the D domain facilitate the polymerization of 
many such molecules. (B) An example of 
an oligosaccharide that is bound to the 
VNTR region of the protein. See Figure 
11.16 for the carbohydrate key. [Information 
from A. Varki et al. (Eds.), Essentials of 
Glycobiology, 2d ed. (Cold Spring Harbor 
Press, 2009), pp. 117, 118.] 


FIGURE 11.24 Golgi complex and 
endoplasmic reticulum. The electron 
micrograph shows the Golgi complex and 
adjacent endoplasmic reticulum. The black 
dots on the cytoplasmic surface of the ER 
membrane are ribosomes. [Micrograph 
courtesy of Lynne Mercer.] 


Isoprene 


FIGURE 11.25 Golgi complex as sorting 
center. The Golgi complex is the sorting 
center in the targeting of proteins to 
lysosomes, secretory vesicles, and the 
plasma membrane. The cis face of the 
Golgi complex receives vesicles from the 
endoplasmic reticulum, and the trans face 
sends a different set of vesicles to target 
sites. Vesicles also transfer proteins from 
one compartment of the Golgi complex to 
another. [Courtesy of Dr. Marilyn Farquhar.] 
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Protein glycosylation takes place in the lumen of the endoplasmic 
reticulum and in the Golgi complex 


The major pathway for protein glycosylation takes place inside the lumen of 
the endoplasmic reticulum (ER) and in the Golgi complex, organelles that play 
central roles in protein trafficking (Figure 11.24). The protein is synthesized 
by ribosomes attached to the cytoplasmic face of the ER membrane, and the 
peptide chain is inserted into the lumen of the ER (Section 30.6). The N-linked 
glycosylation begins in the ER and continues in the Golgi complex, whereas 
the O-linked glycosylation takes place exclusively in the Golgi complex. 

A large oligosaccharide destined for attachment to the asparagine residue 
of a protein is assembled on dolichol phosphate, a specialized lipid molecule 
located in the ER membrane and containing about 20 isoprene (C,) units. 


H3C H3C H CHS é r¢) 
n=15-19 
Dolichol phosphate 


The terminal phosphate group of the dolichol phosphate is the site of attach- 
ment of the oligosaccharide. This activated (energy-rich) form of the oligosac- 
charide is subsequently transferred to a specific asparagine residue of the 
growing polypeptide chain by an enzyme located on the lumenal side of the ER. 
Proteins in the lumen of the ER and in the ER membrane are trans- 
ported to the Golgi complex, which is a stack of flattened membranous sacs. 
Carbohydrate units of glycoproteins are altered and elaborated in the Golgi 
complex. The O-linked sugar units are fashioned there, and the N-linked 
sugars, arriving from the ER as acomponent of a glycoprotein, are modified 
in many different ways. The Golgi complex is the major sorting center of the 
cell. Proteins proceed from the Golgi complex to lysosomes, secretory gran- 
ules, or the plasma membrane, according to signals encoded within their 
amino acid sequences and three-dimensional structures (Figure 11.25). 
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Specific enzymes are responsible for 
oligosaccharide assembly 


How are the complex carbohydrates formed, be they unconju- 
gated molecules such as glycogen or components of glycopro- 
teins? Complex carbohydrates are synthesized through the 
action of specific enzymes, glycosyltransferases, which catalyze 
the formation of glycosidic bonds. Given the diversity of known 
glycosidic linkages, many different enzymes are required. 
Indeed, glycosyltransferases account for 1% to 2% of gene prod- 
ucts in all organisms examined. 

While dolichol phosphate-linked oligosaccharides are sub- 
strates for some glycosyltransferases, the most common carbohy- 
drate donors for glycosyltransferases are activated sugar nucleo- 
tides, such as UDP-glucose (UDP is the abbreviation for uridine 
diphosphate) (Figure 11.26). The attachment of a nucleotide to 
enhance the energy content of a molecule is a common strategy in 
biosynthesis that we will see many times in our study of bio- 
chemistry. The acceptor substrates for glycosyltransferases are 
quite varied and include carbohydrates, serine, threonine, and 
asparagine residues of proteins, lipids, and even nucleic acids. 


Blood groups are based on protein glycosylation patterns 


2 The human ABO blood groups illustrate the effects of 

glycosyltransferases on the formation of glycoproteins. 
Each blood group is designated by the presence of one of the 
three different carbohydrates, termed A, B, or O, attached to 
glycoproteins and glycolipids on the surfaces of red blood cells 
(Figure 11.27). These structures have in common an oligosac- 
charide foundation called the O (or sometimes H) antigen. The 
A and B antigens differ from the O antigen by the addition of 
one extra monosaccharide, either N-acetylgalactosamine (for 
A) or galactose (for B) through an a-1,3 linkage to a galactose 
moiety of the O antigen. 

Specific glycosyltransferases add the extra monosaccharide 
to the O antigen. Each person inherits the gene for one glyco- 
syltransferase of this type from each parent. The type A trans- 
ferase specifically adds N-acetylgalactosamine, whereas the 
type B transferase adds galactose. These enzymes are identical 
in all but 4 of 354 positions. The O phenotype is the result of 
a mutation in the O transferase that results in the synthesis of 
an inactive enzyme. 

These structures have important implications for blood trans- 
fusions and other transplantation procedures. If an antigen not 
normally present in a person is introduced, the person’s immune 
system recognizes it as foreign. Red-blood-cell lysis occurs rap- 
idly, leading to a severe drop in blood pressure (hypotension), 
shock, kidney failure, and death from circulatory collapse. 
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FIGURE 11.26 General form of a glycosyltransferase 
reaction. The sugar to be added comes from a sugar 
nucleotide—in this case, UDP-glucose. The acceptor, 


designated X in this illustration, can be one of a variety of 


biomolecules, including other carbohydrates or proteins. 
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FIGURE 11.27 Structures of A, B, and O oligosaccharide 
antigens. The carbohydrate structures shown are depicted 


symbolically by employing a scheme (see the key in 
Figure 11.16) that is becoming widely used. 


Ne Why are different blood types present in the human population? 
Suppose that a pathogenic organism such as a parasite expresses on 
its cell surface a carbohydrate antigen similar to one of the blood-group 
antigens. This antigen may not be readily detected as foreign in a person 
whose blood type matches the parasite antigen, and the parasite will flour- 
ish. However, other people with different blood types will be protected. 
Hence, there will be selective pressure on human beings to vary blood type 
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FIGURE 11.28 Formation of a mannose 
6-phosphate marker. A glycoprotein 
destined for delivery to lysosomes 
acquires a phosphate marker in the Golgi 
compartment in a two-step process. First, 
GIcNAc phosphotransferase adds a 
phospho-N-acetylglucosamine unit to the 
6-OH group of a mannose, and then an 
N-acetylglucosaminidase removes the 
added sugar to generate a mannose 
6-phosphate residue in the core 
oligosaccharide. 


to prevent parasitic mimicry and a corresponding selective pressure on 
parasites to enhance mimicry. The constant “arms race” between patho- 
genic microorganisms and human beings drives the evolution of diversity of 
surface antigens within the human population. 


Errors in glycosylation can result in pathological conditions 


cz Although the role of carbohydrate attachment to proteins is not 

known in detail in most cases, data indicate that this glycosylation is 
important for the processing and stability of these proteins, as it is for EPO. 
Certain types of muscular dystrophy can be traced to improper glycosyl- 
ation of dystroglycan, a membrane protein that links the extracellular 
matrix with the cytoskeleton. Indeed, an entire family of severe inherited 
human diseases called congenital disorders of glycosylation has been identi- 
fied. These pathological conditions reveal the importance of proper modifi- 
cation of proteins by carbohydrates and their derivatives. 

An especially clear example of the role of glycosylation is provided by 
I-cell disease (also called mucolipidosis II), a lysosomal storage disease. 
Normally, a carbohydrate marker directs certain digestive enzymes from 
the Golgi complex to lysosomes. Lysosomes are organelles that degrade and 
recycle damaged cellular components or material brought into the cell by 
endocytosis. In patients with I-cell disease, lysosomes contain large inclu- 
sions of undigested glycosaminoglycans and glycolipids—hence the “T” in 
the name of the disease. These inclusions are present because the enzymes 
normally responsible for the degradation of glycosaminoglycans are miss- 
ing from affected lysosomes. Remarkably, the enzymes are present at very 
high levels in the blood and urine. Thus, active enzymes are synthesized, 
but, in the absence of appropriate glycosylation, they are exported instead 
of being sequestered in lysosomes. In other words, in I-cell disease, a whole 
series of enzymes are incorrectly addressed and delivered to the wrong location. 
Normally, these enzymes contain a mannose 6-phosphate residue as a com- 
ponent of an N-oligosaccharide that serves as the marker directing the 
enzymes from the Golgi complex to lysosomes. In I-cell disease, however, 
the attached mannose lacks a phosphate. I-cell patients are deficient in the 
N-acetylglucosamine phosphotransferase catalyzing the first step in the addi- 
tion of the phosphoryl! group; the consequence is the mistargeting of eight 
essential enzymes (Figure 11.28). I-cell disease causes the patient to suffer 
severe psychomotor retardation and skeletal deformities, similar to those in 
Hurler disease. Remarkably, mutations in the phosphotransferase have also 
been linked to stuttering. Why some mutations cause stuttering while other 
cause I-cell disease is a mystery. 


Oligosaccharides can be “sequenced” 


How is it possible to determine the structure of a glycoprotein—the oligo- 
saccharide structures and their points of attachment? Most approaches make 
use of enzymes that cleave oligosaccharides at specific types of linkages. 
The first step is to detach the oligosaccharide from the protein. For 
example, N-linked oligosaccharides can be released from proteins by an 
enzyme such as peptide N-glycosidase F, which cleaves the N-glycosidic 
bonds linking the oligosaccharide to the protein. The oligosaccharides can 
then be isolated and analyzed. Matrix-assisted laser desorption/ionization/ 
time-of-flight (MALDI-TOF) or other mass spectrometric techniques 
(Section 3.3) provide the mass of an oligosaccharide fragment. However, 
many possible oligosaccharide structures are consistent with a given mass. 
More-complete information can be obtained by cleaving the oligosaccha- 
ride with enzymes of varying specificities. For example, B-1,4-galactosidase 
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FIGURE 11.29 Mass spectrometric “sequencing” of oligosaccharides. Carbohydrate- 
cleaving enzymes were used to release and specifically cleave the oligosaccharide component 
of the glycoprotein fetuin from bovine serum. Parts A and B show the masses obtained with 
MALDI-TOF spectrometry as well as the corresponding structures of the oligosaccharide- 
digestion products: (A) digestion with peptide N-glycosidase F (to release the oligosaccharide 
from the protein) and neuraminidase; (B) digestion with peptide N-glycosidase F, neuraminidase, 
and B-1,4-galactosidase. Knowledge of the enzyme specificities and the masses of the 
products permits the characterization of the oligosaccharide. See Figure 11.16 for the 
carbohydrate key. [Data from A. Varki, R. D. Cummings, J. D. Esko, H. H. Freeze, G. W. Hart, and 

J. Marth (Eds.), Essentials of Glycobiology (Cold Spring Harbor Laboratory Press, 1999), p. 596.] 


cleaves B-glycosidic bonds exclusively at galactose residues. The products 
can again be analyzed by mass spectrometry (Figure 11.29). The repetition 
of this process with the use of an array of enzymes of different specificity 
will eventually reveal the structure of the oligosaccharide. 

Proteases applied to glycoproteins can reveal the points of oligosaccha- 
ride attachment. Cleavage by a specific protease yields a characteristic pat- 
tern of peptide fragments that can be analyzed chromatographically. 
Fragments attached to oligosaccharides can be picked out because their 
chromatographic properties will change on glycosidase treatment. Mass 
spectrometric analysis or direct peptide sequencing can reveal the identity 
of the peptide in question and, with additional effort, the exact site of oligo- 
saccharide attachment. 

While the sequencing of the human genome is complete, the character- 
ization of the much more complex proteome, including the biological roles 
of glycosylated proteins, presents a challenge to biochemistry. 


11.4 Lectins Are Specific Carbohydrate-Binding Proteins 


The diversity and complexity of the carbohydrate units and the variety 
of ways in which they can be joined in oligosaccharides and polysaccha- 
rides suggest that they are functionally important. Nature does not construct 
complex patterns when simple ones suffice. Why all this intricacy and diver- 
sity? It is now clear that these carbohydrate structures are the recognition 
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& FIGURE 11.30 Structure 
of a carbohydrate-binding 
domain of an animal 
C-type lectin. Notice that a 
calcium ion links a mannose 
residue to the lectin. Selected 
interactions are shown, 
with some hydrogen atoms 
omitted for clarity. [Drawn from 
2MSC. pdb.] 


sites for a special class of proteins. Such proteins, termed glycan-binding 
proteins, bind specific carbohydrate structures on neighboring cell surfaces. 
Originally discovered in plants, glycan-binding proteins are ubiquitous, and 
no living organisms have been found that lack these key proteins. We will 
focus on a particular class of glycan-binding proteins termed lectins (from 
Latin legere, “to select”). The interaction of lectins with their carbohydrate 
partners is another example of carbohydrates being information-rich mole- 
cules that guide many biological processes. The diverse carbohydrate struc- 
tures displayed on cell surfaces are well-suited to serving as sites of 
interaction between cells and their environments. Interestingly, the partners 
for lectin binding are often the carbohydrate moiety of glycoproteins. 


Lectins promote interactions between cells 


Cell—cell contact is a vital interaction in a host of biochemical functions, 
ranging from building a tissue from isolated cells to facilitating the trans- 
mission of information. The chief function of lectins is to facilitate cell-cell 
contact. A lectin usually contains two or more carbohydrate-binding sites. 
The lectins on the surface of one cell interact with arrays of carbohydrates 
displayed on the surface of another cell. Lectins and carbohydrates are 
linked by a number of weak noncovalent interactions that ensure specificity 
yet permit unlinking as needed. The weak interactions between one cell 
surface and another resemble the action of Velcro; each interaction is weak, 
but the composite is strong. 

We have already met a lectin obliquely. Recall that, in I-cell disease, 
lysosomal enzymes lack the appropriate mannose 6-phosphate, a molecule 
that directs the enzymes to the lysosome. Under normal circumstances, the 
mannose 6-phosphate receptor, a lectin, binds the enzymes in the Golgi appa- 
ratus and directs them to the lysosome. 


Lectins are organized into different classes 


Lectins can be divided into classes on the basis of their amino acid sequences 
and biochemical properties. One large class is the C type (for calcium- 
requiring) found in animals. These proteins each have a homologous 
domain of 120 amino acids that is responsible for carbohydrate binding. 
The structure of one such domain bound to a carbohydrate target is shown 
in Figure 11.30. 

A calcium ion on the protein acts as a bridge between the protein and 
the sugar through direct interactions with sugar OH groups. In addition, 
two glutamate residues in the protein bind to both the calcium ion and 
the sugar, and other protein side chains form hydrogen bonds with other 
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OH groups on the carbohydrate. The carbohydrate-binding specificity of a 
particular lectin is determined by the amino acid residues that bind the 
carbohydrate. C-type lectins function in a variety of cellular activities, 
including receptor-mediated endocytosis, a process by which soluble mol- 
ecules are bound to the cell surface and subsequently internalized (Section 
26.3), and cell-cell recognition. 

Proteins termed selectins are members of the C-type family. Selectins 
bind immune-system cells to sites of injury in the inflammatory response. 
The L, E, and P forms of selectins bind specifically to carbohydrates on 
lymph-node vessels, endothelium, or activated blood platelets, respectively. 
New therapeutic agents that control inflammation may emerge from a 
deeper understanding of how selectins bind and distinguish different carbo- 
hydrates. L-Selectin, originally thought to participate only in the immune 
response, is produced by embryos when they are ready to attach to the 
endometrium of the mother’s uterus. For a short period of time, the endo- 
metrial cells present an oligosaccharide on the cell surface. When the 
embryo attaches through lectins, the attachment activates signal pathways 
in the endometrium to make implantation of the embryo possible. 

Another large class of lectins comprises the L-lectins. These lectins are 
especially rich in the seeds of leguminous plants, and many of the initial 
biochemical characterizations of lectins were performed on this readily 
available lectin. Although the exact role of lectins in plants is unclear, they 
can serve as potent insecticides. Other L-type lectins, such as calnexin and 
calreticulin, are prominent chaperones in the eukaryotic endoplasmic retic- 
ulum. Recall that chaperones are proteins that facilitate the folding of other 
proteins (p. 51). 


Influenza virus binds to sialic acid residues 


cz Many pathogens gain entry into specific host cells by adhering to 

cell-surface carbohydrates. For example, influenza virus recognizes 
sialic acid residues linked to galactose residues that are present on cell- 
surface glycoproteins. The viral protein that binds to these sugars is called 
hemagglutinin (Figure 11.31A). 

After binding hemagglutinin, the virus is engulfed by the cell and begins 
to replicate. To exit the cell, a process essentially the reverse of viral entry 
occurs (Figure 11.31B). Viral assembly results in the budding of the viral 
particle from the cell. Upon complete assembly, the viral particle is still 
attached to sialic acid residues of the cell membrane by hemagglutinin on the 
surface of the new virions. Another viral protein, neuraminidase (sialidase), 
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FIGURE 11.31 Viral receptors. 

(A) Influenza virus targets cells by binding 
to sialic acid residues located at the termini 
of oligosaccharides present on cell-surface 
glycoproteins and glycolipids. These 
carbohydrates are bound by hemagglutinin, 
one of the major proteins expressed on 
the surface of the virus. (B) When viral 
replication is complete and the viral particle 
buds from the cell, the other major viral- 
surface protein, neuraminidase, cleaves 
oligosaccharide chains to release the 

viral particle. 
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cleaves the glycosidic bonds between the sialic acid residues and the rest of 
the cellular glycoprotein, freeing the virus to infect new cells, and thus 
spreading the infection throughout the respiratory tract. Inhibitors of this 
enzyme such as oseltamivir (Tamiflu) and zanamivir (Relenza) are impor- 
tant anti-influenza agents. 

Viral hemagglutinin’s carbohydrate-binding specificity may play an 
important role in species specificity of infection and ease of transmission. 
For instance, avian influenza H5N1 (bird flu) is especially lethal and is 
readily spread from bird to bird. Although human beings can be infected by 
this virus, infection is rare and human-to-human transmission 1s rarer still. 
The biochemical basis of these characteristics is that the avian-virus hemag- 
glutinin recognizes a different carbohydrate sequence from that recognized 
in human influenza. Although human beings have the sequence to which 
the avian virus binds, it is located deep in the lungs. Infection by the avian 
virus is thus difficult, and, when it does occur, the avian virus is not readily 
transmitted by sneezing or coughing. 

Plasmodium falciparum, the parasitic protozoan that causes malaria, also 
relies on glycan binding to infect and colonize its host. Glycan-binding 
proteins of the parasitic form initially injected by the mosquito bind to the 
glycosaminoglycan heparin sulfate on the liver, initiating the parasite’s 
entry into the cell. On exiting from the liver later in its life cycle, the parasite 
invades red blood cells by using another glycan-binding protein to bind to 
the carbohydrate moiety of glycophorin, a prominent membrane glycopro- 
tein in red blood cells. Developing means to disrupt the carbohydrate inter- 
actions between pathogens and host cells may prove to be clinically useful. 


SUMMARY 


11.1 Monosaccharides Are the Simplest Carbohydrates 

Carbohydrates are aldoses or ketoses that are rich in hydroxyl groups. 
An aldose is a carbohydrate with an aldehyde group (as in glyceralde- 
hyde and glucose), whereas a ketose contains a keto group (as in dihy- 
droxyacetone and fructose). A sugar belongs to the D series if the abso- 
lute configuration of its asymmetric carbon atom farthest from the 
aldehyde or keto group is the same as that of D-glyceraldehyde. Most 
naturally occurring sugars belong to the D series. The C-1 aldehyde in 
the open-chain form of glucose reacts with the C-5 hydroxyl group to 
form a six-membered pyranose ring. The C-2 keto group in the open- 
chain form of fructose reacts with the C-5 hydroxyl group to form a 
five-membered furanose ring. Pentoses such as ribose and deoxyribose 
also form furanose rings. An additional asymmetric center is formed at 
the anomeric carbon atom (C-1 in aldoses and C-2 in ketoses) in these 
cyclizations. The hydroxyl group attached to the anomeric carbon atom 
is on the opposite side of the ring from the CH,OH group attached to 
the chiral center in the « anomer, whereas it is on the same side of the 
ring as the CH,,OH group in the B anomer. Not all atoms in the ring lie 
in the same plane. Rather, pyranose rings usually adopt the chair con- 
formation, and furanose rings usually adopt the envelope conformation. 
Sugars are joined to alcohols and amines by glycosidic bonds from the 
anomeric carbon atom. For example, N-glycosidic bonds link sugars to 
purines and pyrimidines in nucleotides, RNA, and DNA. 


11.2 Monosaccharides Are Linked to Form Complex Carbohydrates 
Sugars are linked to one another in disaccharides and polysaccharides 
by O-glycosidic bonds. Sucrose, lactose, and maltose are the common 


disaccharides. Sucrose (common table sugar) consists of a-glucose and 
B-fructose joined by a glycosidic linkage between their anomeric car- 
bon atoms. Lactose (in milk) consists of galactose joined to glucose by 
a B-1,4 linkage. Maltose (from starch) consists of two glucoses joined 
by an a-1,4 linkage. Starch is a polymeric form of glucose in plants, 
and glycogen serves a similar role in animals. Most of the glucose units 
in starch and glycogen are in a-1,4 linkage. Cellulose, the major struc- 
tural polymer of plant cell walls, consists of glucose units joined by 
B-1,4 linkages. These B linkages give rise to long straight chains that 
form fibrils with high tensile strength. In contrast, the a linkages in 
starch and glycogen lead to open helices, in keeping with their roles as 
mobilizable energy stores. 


Carbohydrates Can Be Linked to Proteins to Form Glycoproteins 
Carbohydrates are commonly conjugated to proteins. If the protein 
component is predominant, the conjugate of protein and carbohydrate 
is called a glycoprotein. Most secreted proteins, such as the signal 
molecule erythropoietin, are glycoproteins. Glycoproteins are also 
prominent on the external surface of the plasma membrane. Proteins 
bearing covalently linked glycosaminoglycans are proteoglycans. 
Glycosaminoglycans are polymers of repeating disaccharides. One of 
the units in each repeat is a derivative of glucosamine or galactos- 
amine. These highly anionic carbohydrates have a high density of 
carboxylate or sulfate groups. Proteoglycans are found in the extracel- 
lular matrices of animals and are key components of cartilage. 
Mucoproteins, like proteoglycans, are predominantly carbohydrate 
by weight. The protein component is heavily O-glycosylated with 
N-acetylgalactosamine joining the oligosaccharide to the protein. 
Mucoproteins serve as lubricants. 

Glycosyltransferases link the oligosaccharide units on proteins 
either to the side-chain oxygen atom of a serine or threonine resi- 
due or to the side-chain amide nitrogen atom of an asparagine resi- 
due. Protein glycosylation takes place in the lumen of the endoplas- 
mic reticulum. The N-linked oligosaccharides are synthesized on 
dolichol phosphate and subsequently transferred to the protein 
acceptor. Additional sugars are attached in the Golgi complex to 
form diverse patterns. 


Lectins Are Specific Carbohydrate-Binding Proteins 

Carbohydrates on cell surfaces are recognized by proteins called lec- 
tins. In animals, the interplay of lectins and their sugar targets guides 
cell-cell contact. The viral protein hemagglutinin on the surface of the 
influenza virus recognizes sialic acid residues on the surfaces of cells 
invaded by the virus. 


KEY TERMS 


337 
Key Terms 


glycobiology (p. 316) diastereoisomer (p. 317) 


glycomics (p. 316) epimer (p. 318) 


monosaccharide (p. 316) hemiacetal (p. 318) 


ketose (p. 316) pyranose (p. 318) 
aldose (p. 316) hemiketal (p. 318) 


constitutional isomer (p. 317) furanose (p. 319) 


stereoisomer (p. 317) anomer (p. 319) 


enantiomer (p. 317) reducing sugar (p. 321) 


nonreducing sugar (p. 321) 

advanced glycation end product (AGE) 
(p. 322) 

glycosidic bond (p. 322) 

oligosaccharide (p. 323) 

disaccharide (p. 323) 

polysaccharide (p. 324) 

glycogen (p. 324) 
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starch (p. 324) 
cellulose (p. 324) 
glycoprotein (p. 325) 
proteoglycan (p. 326) 


glycoform (p. 326) 

mucin (mucoprotein) (p. 329) 
endoplasmic reticulum (p. 330) 
Golgi complex (p. 330) 


glycosyltransferase (p. 331) 
glycan-binding protein (p. 334) 
lectin (p. 334) 

selectin (p. 335) 


glycosaminoglycan (p. 326) 


PROBLEMS 


dolichol phosphate (p. 330) 


1. Word origin. Account for the origin of the term carbohy- 
drate. 


2. Diversity. How many different oligosaccharides can be 
made by linking one glucose, one mannose, and one galac- 
tose? Assume that each sugar is in its pyranose form. 
Compare this number with the number of tripeptides that 
can be made from three different amino acids. 


3. They go together like a horse and carriage. Match each 
term with its description. 


(a) Enantiomers 1. Has the molecular formula 
of (CH,O),, 
(b) Cellulose 2. Monosaccharides that differ 


at a single asymmetric 
carbon atom 


(c) Lectins 3. The storage form of glu- 
cose in animals 


(d) Glycosyltransferases 4. The storage form of 
glucose in plants 


(e) Epimers 5. Glycoprotein containing 
glycosaminoglycans 

(f) Starch 6. The most abundant organic 
molecule in the biosphere 

(g) Carbohydrates 7. N-Acetylgalactosamine is 
a key component of this 
glycoprotein 

(h) Proteoglycan 8. Carbohydrate-binding 
proteins 

(i) Mucoprotein 9. Enzymes that synthesize 
oligosaccharides 

(j) Glycogen 10. Stereoisomers that are 


mirror images of each other 


4. Couples. Indicate whether each of the following pairs of 
sugars consists of anomers, epimers, or an aldose—ketose pair: 
(a) D-glyceraldehyde and dihydroxyacetone 
(b) D-glucose and D-mannose 

(c) D-glucose and D-fructose 

( 


d) a-D-glucose and B-D-glucose 


(e) D-ribose and D-ribulose 
(f) D-galactose and D-glucose 


5. Carbons and carbonyls. To which classes of sugars do the 
monosaccharides shown here belong? 


CHO 
CHO ee 
H t OH H L OH CHO 
H—C—OH H t OH eo Ose FON 
CH20H or Linen bien 
p-Erythrose D-Ribose D-Glyceraldehyde Dihydroxyacetone 
oO Se _7-CH20H 
On HOH vr eer 
ere ae ee 
or Aten on 
bait taon ee 
b-Erythrulose D-Ribulose D-Fructose 


6. Chemical cousins. Although an aldose with 4 asymmetric 
carbon atoms is capable of forming 16 diastereoisomers, 
only 8 of the isomers are commonly observed, including 
glucose. They are listed below with their structural relation 
to glucose. Using the structure of glucose as a reference, 
draw the structures. 


(a) D-Allose: Epimeric at C-3 ‘cas 
(b) D-Altrose: Isomeric at C-2 and C-3 Ht — On 
(c) D-Mannose: Epimeric at C-2 HO——C-——H 
(d) D-Gulose: Isomeric at C-3 and C-4 H—C—OH 
(e) D-Idose: Isomeric at C-2, C-3 and C-4 H—C—OH 
(f) D-Galactose: Epimeric at C-4 CH20H 
(g) D-Talose: Isomeric at C-2 and C-4 D-Glucose 


7. An art project. Draw the structure of the disaccharide 
a-glycosyl-(1— 6)-galactose in the B anomeric form. 


8. Mutarotation. The specific rotations of the a and B ano- 
mers of D-glucose are +112 degrees and +18.7 degrees, 
respectively. Specific rotation, [a]p, is defined as the 
observed rotation of light of wavelength 589 nm (the D line 
of asodium lamp) passing through 10 cm ofa 1 g ml! solu- 
tion of a sample. When a crystalline sample of a-D-glucose 
is dissolved in water, the specific rotation decreases from 
112 degrees to an equilibrium value of 52.7 degrees. On the 
basis of this result, what are the proportions of the a and B 
anomers at equilibrium? Assume that the concentration of 
the open-chain form is negligible. 


9. Telltale marker. Glucose reacts slowly with hemoglobin 
and other proteins to form covalent compounds. Why is 
glucose reactive? What is the nature of the adduct formed? 


10. Periodate cleavage. Compounds containing hydroxyl 
groups on adjacent carbon atoms undergo carbon—carbon 
bond cleavage when treated with periodate ion (IO, -). How 
can this reaction be used to distinguish between pyrano- 
sides and furanosides? 


11. Sugar lineup. Identify the following four sugars. 


HOH2C 


OH 
HO. 

(c) HOH2C. Oo iH 
H | 


12. Cellular glue. A trisaccharide unit of a cell-surface gly- 
coprotein is postulated to play a critical role in mediating 
cell-cell adhesion in a particular tissue. Design a simple 
experiment to test this hypothesis. 


13. Mapping the molecule. Each of the hydroxyl groups of 
glucose can be methylated with reagents such as dimethyl 
sulfate under basic conditions. Explain how exhaustive 
methylation followed by the complete digestion of a known 
amount of glycogen would enable you to determine the 
number of branch points and reducing ends. 


14. Component parts. Raffinose is a trisaccharide and a 
minor constituent in sugar beets. 


(a) Is raffinose a reducing sugar? Explain. 
(b) What are the monosaccharides that compose raffinose? 


(c) B-Galactosidase is an enzyme that will remove galac- 
tose residues from an oligosaccharide. What are the prod- 
ucts of B- galactosidase treatment of raffinose? 
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Problems 
HOH2C 
H 
HO (6) 2 O 
o—C go =«FA CH,OH 
(e) 
Wr, 
CH20H 
Raffinose 
15. Anomeric differences. a-D- Mannose is CH,OH 


a sweet-tasting sugar. B-D-Mannose, on 
the other hand, tastes bitter. A pure solu- 
tion of a-D-mannose loses its sweet taste 
with time as it is converted into the B 
anomer. Draw the B anomer and explain 
how it is formed from the a anomer. 


a-D-Mannose 


16. A taste of honey. Fructose in its B-D-pyranose form 
accounts for the powerful sweetness of honey. The B-D- 
furanose form, although sweet, is not as sweet as the pyra- 
nose form. The furanose form is the more stable form. Draw 
the two forms and explain why it may not always be wise to 
cook with honey. 


17. Making ends meet. 


(a) Compare the number of reducing ends to nonreducing 
ends in a molecule of glycogen. 


(b) As we will see in Chapter 21, glycogen is an important 
fuel-storage form that is rapidly mobilized. At which end— 
the reducing or nonreducing—would you expect most 
metabolism to take place? 


18. A lost property. Glucose and fructose are reducing sug- 
ars. Sucrose, or table sugar, is a disaccharide consisting of 
both fructose and glucose. Is sucrose a reducing sugar? 
Explain. 


19. Meat and potatoes. Compare the structures of glycogen 
and starch. 


20. Straight or with a twist? Account for the different struc- 
tures of glycogen and cellulose. 


21. Sweet proteins. List the key classes of glycoproteins, 
their defining characteristics, and their biological functions. 


22. Life extender. What is the function of the carbohydrate 
moiety that is attached to EPO? 


23. Cushioning. What is the role of the glycosaminoglycan 
in the cushioning provided by cartilage? 


24. Undelivered mail. Not returned to sender. I-cell disease 
results when proteins normally destined to the lysosomes 
lack the appropriate carbohydrate-addressing molecule 
(p. 332). Suggest another possible means by which I-cell 
disease might arise. 
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25. Appropriate peg. Which amino acids are used for the 
attachment of carbohydrates to proteins? 


26. From one, many. What is meant by a glycoform? 
27. Ome. What is meant by the glycome? 


28. Exponential expansion? Compare the amount of infor- 
mation inherent in the genome, the proteome, and the 
glycome. 


29. Attachments. Suppose, one Sunday afternoon, you are 
relaxing by reading amino acid sequences of various pro- 
teins. Being a bit hungry, you are also thinking of sweet 
snacks. Combining your interests, you wonder whether you 
can detect N-glycosylation sites by simply looking at amino 
acid sequence. Your roomate, who is taking a biochemistry 
course, says “You sure can, to some degree at least, and 
here’s how.” What did your roomate say to explain? 


30. Locks and keys. What does the fact that all organisms 
contain lectins suggest about the role of carbohydrates? 


31. Carbohydrates—notjust for breakfast anymore. Differentiate 
between a glycoprotein and a lectin. 


32. Carbohydrates and proteomics. Suppose that a protein 
contains six potential N-linked glycosylation sites. How 
many possible proteins can be generated, depending on 
which of these sites is actually glycosylated? Do not include 
the effects of diversity within the carbohydrate added. 


Chapter Integration Problems 


33. Like a jigsaw puzzle. Why is it more difficult to deter- 
mine the structure of the oligosaccharides compared to 
amino acid sequences or nucleotide sequences? 


34. Stereospecificity. Sucrose, a major product of photosyn- 
thesis in green leaves, is synthesized by a battery of enzymes. 
The substrates for sucrose synthesis, D-glucose and D- 
fructose, are a mixture of a and B anomers as well as acyclic 
compounds in solution. Nonetheless, sucrose consists of 
a-D-glucose linked by its carbon-1 atom to the carbon-2 
atom of B-D-fructose. How can the specificity of sucrose be 
explained in light of the potential substrates? 


35. Specific recognition. How might the technique of affin- 
ity chromatography be used to purify lectins? 


Data Interpretation Problem 


36. Sore joints. A contributing factor to the development of 
arthritis is the inappropriate proteolytic destruction of the 
aggrecan component of cartilage by the proteolytic enzyme 
aggrecanase. The immune-system signal molecule interleu- 
kin 2 (IL-2) activates aggrecanase; in fact, IL-2 blockers are 
sometimes used to treat arthritis. Studies were undertaken 
to determine whether inhibitors of aggrecanase can coun- 
teract the effects of IL-2. Pieces of cartilage were incubated 
in media with various additions and the amount of aggrecan 
destruction was measured as a function of time. 
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[Data from M.A. Pratta et al., J. Biol. Chem. 
278:45539-45545, 20038, Fig. 7B.] 


(a) Aggrecan degradation was measured by the release of 
glycosaminoglycan. What is the rationale for this assay? 

(b) Why might glycosaminoglycan release not indicate 
aggrecan degradation? 

(c) What is the purpose of the control—cartilage incubated 
with no additions? 

(d) What is the effect of adding IL-2 to the system? 


(e) What is the response when an aggrecanase inhibitor is 
added in addition to IL-2? 


(f) Why is there some aggrecan destruction in the control 
with the passage of time? 


CHAPTER 


Lipids and Cell Membranes 


An HIV particle exits an infected cell by membrane budding. 
Cellular membranes are highly dynamic structures that 
spontaneously self-assemble. Driven by hydrophobic 
interactions, as shown in the diagram at right, the fatty acid 
tails of membrane lipids pack together (green), while the polar 
heads (red) remain exposed on the surfaces. [Micrographs 
from Eye of Science/Photo Researchers. ] 


OUTLINE 
he boundaries of all cells are defined by biological membranes (Figure 
12.1), dynamic structures in which proteins float in a sea of lipids. The 12.1 Fatty Acids Are Key Constituents 
lipid component prevents molecules generated inside the cell from leaking of Lipids 
out and unwanted molecules from diffusing in, while the protein compo- 108 Thee WS Thies Common Wess 
nents act as transport systems that allow the cell to take up specific molecules of Membrane Lipids 


and remove unwanted ones. Such transport systems confer on membranes 
the important property of selective permeability. We will consider these trans- 
port systems in greater detail in the next chapter. 

In addition to an external cell membrane (called the plasma membrane), 


12.3 Phospholipids and Glycolipids 
Readily Form Bimolecular Sheets 
in Aqueous Media 


eukaryotic cells also contain internal membranes that form the boundaries 12.4 Proteins Carry Out Most 

of organelles such as mitochondria, chloroplasts, peroxisomes, and lyso- Membrane Processes 
somes. Functional specialization in the course of evolution has been closely 426 (eis cadena ener 
linked to the formation of such compartments. Specific systems have Proteins Diffuse Rapidly in the 
evolved to allow the targeting of selected proteins into or through particular Plane af the Membrane 


internal membranes and, hence, into specific organelles. External and inter- 
nal membranes share essential properties; these features are the subject of 
this chapter. 

Biological membranes serve several additional functions indispensable for 
life, such as energy storage and information transduction, that are dictated by 
the proteins associated with them. In this chapter, we will examine the prop- 
erties of membrane proteins that enable them to exist in the hydrophobic 
environment of the membrane while connecting two hydrophilic environ- 
ments. In the next chapter, we will discuss the functions of these proteins. 


12.6 Eukaryotic Cells Contain 
Compartments Bounded by 
Internal Membranes 
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FIGURE 12.1 Electron micrograph of a 
plasma cell. This image has been colored 
to indicate the distinct boundary of the cell, 
formed by its plasma membrane. [Steve 
Gschmeissner/Photo Researchers.] 
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Many Common Features Underlie the Diversity of 
Biological Membranes 


Membranes are as diverse in structure as they are in function. However, 
they do have in common a number of important attributes: 


1. Membranes are sheetlike structures, only two molecules thick, that form 
closed boundaries between different compartments. The thickness of most 
membranes is between 60 A (6 nm) and 100 A (10 nm). 


2. Membranes consist mainly of lipids and proteins. The mass ratio of lipids 
to proteins ranges from 1:4 to 4:1. Membranes also contain carbohydrates 
that are linked to lipids and proteins. 


3. Membrane lipids are small molecules that have both hydrophilic and 
hydrophobic moieties. These lipids spontaneously form closed bimolecular 
sheets in aqueous media. These lipid bilayers are barriers to the flow of polar 
molecules. 


4. Specific proteins mediate distinctive functions of membranes. Proteins serve 
as pumps, channels, receptors, energy transducers, and enzymes. Membrane 
proteins are embedded in lipid bilayers, which create suitable environments 
for their action. 


5. Membranes are noncovalent assemblies. The constituent protein and lipid 
molecules are held together by many noncovalent interactions, which act 
cooperatively. 


6. Membranes are asymmetric. The two faces of biological membranes 
always differ from each other. 


7. Membranes are fluid structures. Lipid molecules diffuse rapidly in the 
plane of the membrane, as do proteins, unless they are anchored by specific 
interactions. In contrast, lipid molecules and proteins do not readily rotate 
across the membrane. Membranes can be regarded as two-dimensional solu- 
tions of oriented proteins and lipids. 


8. Most cell membranes are electrically polarized, such that the inside is 
negative [typically —60 millivolts (mV)]. Membrane potential plays a key 
role in transport, energy conversion, and excitability (Chapter 13). 


12.1 Fatty Acids Are Key Constituents of Lipids 


The hydrophobic properties of lipids are essential to their ability to form 
membranes. Most lipids owe their hydrophobic properties to one compo- 
nent, their fatty acids. 


Fatty acid names are based on their parent hydrocarbons 


Fatty acids are long hydrocarbon chains of various lengths and degrees of 
unsaturation that terminate with carboxylic acid groups. The systematic 
name for a fatty acid is derived from the name of its parent hydrocarbon by 
the substitution of oic for the final e. For example, the C,. saturated fatty 
acid is called octadecanoic acid because the parent hydrocarbon is octadec- 
ane. A C,¢ fatty acid with one double bond is called octadecenoic acid; with 
two double bonds, octadecadienoic acid; and with three double bonds, octa- 
decatrienoic acid. The notation 18:0 denotes a C,, fatty acid with no double 
bonds, whereas 18:2 signifies that there are two double bonds. The struc- 
tures of the ionized forms of two common fatty acids—palmitic acid (16:0) 
and oleic acid (18:1)—are shown in Figure 12.2. 


.O 
Te ge ge a ae ee ee a 


Palmitate 
(ionized form of palmitic acid) fe) 
i 
% - 
re) 
Oleate 


(ionized form of oleic acid) 


Fatty acid carbon atoms are numbered starting at the carboxyl termi- 
nus, as shown in the margin. Carbon atoms 2 and 3 are often referred to as 
a and B, respectively. The methyl carbon atom at the distal end of the chain 
is called the w-carbon atom. The position of a double bond is represented 
by the symbol A followed by a superscript number. For example, cis-A? 
means that there is a cis double bond between carbon atoms 9 and 10; 
trans-A* means that there is a trans double bond between carbon atoms 2 
and 3. Alternatively, the position of a double bond can be denoted by 
counting from the distal end, with the w-carbon atom (the methyl carbon) 
as number 1. An w-3 fatty acid, for example, has the structure shown in the 
margin. Fatty acids are ionized at physiological pH, and so it is appropriate 
to refer to them according to their carboxylate form: for example, palmitate 
or hexadecanoate. 


Fatty acids vary in chain length and degree of unsaturation 


Fatty acids in biological systems usually contain an even number of 
carbon atoms, typically between 14 and 24 (Table 12.1). The 16- and 
18-carbon fatty acids are most common. The dominance of fatty acid 
chains containing an even number of carbon atoms reflects the manner in 
which fatty acids are biosynthesized (Chapter 26). The hydrocarbon chain 
is almost invariably unbranched in animal fatty acids. The alkyl chain may 
be saturated or it may contain one or more double bonds. The configura- 
tion of the double bonds in most unsaturated fatty acids is cis. The double 


TABLE 12.1 Some naturally occurring fatty acids in animals 
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FIGURE 12.2 Structures of two fatty 
acids. Palmitate is a 16-carbon, saturated 
fatty acid, and oleate is an 18-carbon fatty 
acid with a single cis double bond. 
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An w-3 fatty acid 


Number of Number of 
carbons double bonds Common name Systematic name Formula 
12 0 Laurate n-Dodecanoate CH,(CH,), »>COO7 
14 0 Myristate n- Tetradecanoate CH,(CH,),,COO- 
16 0 Palmitate n-Hexadecanoate CH,(CH,),,COO7 
18 0 Stearate n-Octadecanoate CH,(CH,),,COO7 
20 0 Arachidate n-Eicosanoate CH,(CH,), ;COO7 
22 0 Behenate n-Docosanoate CH,(CH,),,)COO~ 
24 0 Lignocerate n- Tetracosanoate CH,(CH,),,COO- 
16 1 Palmitoleate cis-A?-Hexadecenoate CH,(CH,);CH=CH(CH,) ,COO- 
18 1 Oleate cis- A?-Octadecenoate CH,(CH,),CH=CH(CH,) ,COO7 
18 2 Linoleate cis, cis- A®, A!2-Octadecadienoate CH,(CH,),(CH=CHCH,), (CH),COO- 
18 3 Linolenate all-cis- A°, A!2, A!°-Octadecatrienoate CH,CH,(CH=CHCH,),(CH,),COO~ 
20 4 Arachidonate all-cis A°, A’, A!!, A!+-Eicosatetraenoate CH,(CH2),(CH=CHCH,),(CH,),COO7 
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FIGURE 12.3 Schematic structure of a 
phospholipid. 


Ry oO 
Pe ne oral 
Acyl groups C CH2 
with fatty acid | — 53 
s O—C-H 


hydrocarbon chain: 


bonds in polyunsaturated fatty acids are separated by at least one methy- 
lene group. 

The properties of fatty acids and of lipids derived from them are 
markedly dependent on chain length and degree of saturation. 
Unsaturated fatty acids have lower melting points than do saturated 
fatty acids of the same length. For example, the melting point of stearic 
acid is 69.6°C, whereas that of oleic acid (which contains one cis dou- 
ble bond) is 13.4°C. The melting points of polyunsaturated fatty acids 
of the Cj. series are even lower. Chain length also affects the melting 
point, as illustrated by the fact that the melting temperature of pal- 
mitic acid (C,,) is 6.5 degrees lower than that of stearic acid (C,,). 
Thus, short chain length and unsaturation enhance the fluidity of fatty 
acids and of their derivatives. 


12.2 There Are Three Common Types of Membrane Lipids 


By definition, lipids are water-insoluble biomolecules that are highly soluble in 
organic solvents such as chloroform. Lipids have a variety of biological roles: 
they serve as fuel molecules, highly concentrated energy stores, signal mol- 
ecules and messengers in signal-transduction pathways, and components of 
membranes. The first three roles of lipids will be considered in later chap- 
ters. Here, our focus is on lipids as membrane constituents. The three major 
kinds of membrane lipids are phospholipids, glycolipids, and cholesterol. We 
begin with lipids found in eukaryotes and bacteria. The lipids in archaea are 
distinct, although they have many features related to membrane formation 
in common with lipids of other organisms. 


Phospholipids are the major class of membrane lipids 


Phospholipids are abundant in all biological membranes. A phospholipid 
molecule is constructed from four components: one or more fatty acids, a 
platform to which the fatty acids are attached, a phosphate, and an alcohol 
attached to the phosphate (Figure 12.3). The fatty acid components provide 
a hydrophobic barrier, whereas the remainder of the molecule has hydro- 
philic properties that enable interaction with the aqueous environment. 
The platform on which phospholipids are built may be glycerol, a three- 
carbon alcohol, or sphingosine, a more complex alcohol. Phospholipids 
derived from glycerol are called phosphoglycerides. A phosphoglyceride 
consists of a glycerol backbone to which are attached two fatty acid chains 
and a phosphorylated alcohol. 
In phosphoglycerides, the hydroxyl groups at C-1 and C-2 of glycerol 
are esterified to the carboxyl groups of the two fatty acid chains. The C-3 
hydroxyl group of the glycerol backbone is esterified to phosphoric 
acid. When no further additions are made, the resulting compound 
is phosphatidate (diacylglycerol 3-phosphate), the simplest phospho- 
glyceride. Only small amounts of phosphatidate are present in 
membranes. However, the molecule is a key intermediate in the 


Rw 9/ E: ae biosynthesis of the other phosphoglycerides (Section 26.1). The 
j He = absolute configuration of the glycerol 3-phosphate moiety of mem- 
0 SS brane lipids is shown in Figure 12.4. 
Phosphatidate The major phosphoglycerides are derived from phosphatidate by 


(Diacylglycerol 3-phosphate) 


FIGURE 12.4 Structure of phosphatidate 
(diacylglycerol 3-phosphate). The absolute 
configuration of the center carbon (C-2) is shown. 


the formation of an ester bond between the phosphate group of phos- 
phatidate and the hydroxy] group of one of several alcohols. The com- 
mon alcohol moieties of phosphoglycerides are the amino acid serine, 
ethanolamine, choline, glycerol, and inositol. 
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The structural formulas of phosphatidylcholine and the other principal 
phosphoglycerides—namely, phosphatidylethanolamine, phosphatidyl- 
serine, phosphatidylinositol, and diphosphatidylglycerol—are given in 


Figure 12.5. 
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FIGURE 12.5 Some common phosphoglycerides found in membranes. 
Sphingomyelin is a phospholipid found in membranes that is not derived 
from glycerol. Instead, the backbone in sphingomyelin is sphingosine, an amino 
alcohol that contains a long, unsaturated hydrocarbon chain (Figure 12.6). In 
sphingomyelin, the amino group of the sphingosine backbone is linked to a 
fatty acid by an amide bond. In addition, the primary hydroxyl group of sphin- 
gosine is esterified to phosphorylcholine. 
Membrane lipids can include carbohydrate moieties 
The second major class of membrane lipids, glycolipids, are sugar-containing 
lipids. Like sphingomyelin, the glycolipids in animal cells are derived 
+H.N FIGURE 12.6 Structures of sphingosine 
: x a and sphingomyelin. The sphingosine 
Ee CL OH moiety of sphingomyelin is highlighted 
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FIGURE 12.7 An archaeon and its 
environment. Archaea can thrive in 
habitats as harsh as a volcanic vent. 
Here, the archaea form an orange mat 
surrounded by yellow sulfurous 
deposits. [Krafft-Explorer/Photo 
Researchers.] 


from sphingosine. The amino group of the sphingosine backbone is acyl- 
ated by a fatty acid, as in sphingomyelin. Glycolipids differ from sphingo- 
myelin in the identity of the unit that is linked to the primary hydroxyl 
group of the sphingosine backbone. In glycolipids, one or more sugars 
(rather than phosphorylcholine) are attached to this group. The simplest 
glycolipid, called a cerebroside, contains a single sugar residue, either glu- 
cose or galactose. 
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More-complex glycolipids, such as gangliosides, contain a branched chain of 
as many as seven sugar residues. Glycolipids are oriented in a completely 
asymmetric fashion with the sugar residues always on the extracellular side of 
the membrane. 


Cholesterol Is a Lipid Based on a Steroid Nucleus 


Cholesterol, the third major type of membrane lipid, has a structure that is 
quite different from that of phospholipids. It is a steroid, built from four 
linked hydrocarbon rings. 


H3C CH; 
CH; 
CH; 
CH; 


HO 
Cholesterol 


A hydrocarbon tail is linked to the steroid at one end, and a hydroxyl 
group is attached at the other end. In membranes, the orientation of the 
molecule is parallel to the fatty acid chains of the phospholipids, and the 
hydroxyl group interacts with the nearby phospholipid head groups. 
Cholesterol is absent from prokaryotes but is found to varying degrees in 
virtually all animal membranes. It constitutes almost 25% of the mem- 
brane lipids in certain nerve cells but is essentially absent from some intra- 
cellular membranes. 


Archaeal membranes are built from ether lipids with branched chains 


WY The membranes of archaea differ in composition from those of 
eukaryotes or bacteria in three important ways. Two of these differ- 
ences clearly relate to the hostile living conditions of many archaea (Figure 
12.7). First, the nonpolar chains are joined to a glycerol backbone by ether 
rather than ester linkages. The ether linkage is more resistant to hydroly- 
sis. Second, the alkyl chains are branched rather than linear. They are 
built up from repeats of a fully saturated five-carbon fragment. These 


branched, saturated hydrocarbons are more resistant to oxidation than the 
unbranched chains of eukaryotic and bacterial membrane lipids. The abil- 
ity of archaeal lipids to resist hydrolysis and oxidation may help these 
organisms to withstand the extreme conditions, such as high temperature, 
low pH, or high salt concentration, under which some of these archaea 
grow. Finally, the stereochemistry of the central glycerol is inverted com- 
pared with that shown in Figure 12.4. 
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Membrane lipid from the archaeon Methanococcus jannaschii 


A membrane lipid is an amphipathic molecule containing a 
hydrophilic and a hydrophobic moiety 


The repertoire of membrane lipids is extensive. However, these lipids pos- 
sess a critical common structural theme: membrane lipids are amphipathic 
molecules (amphiphilic molecules), that is, they contain both a hydrophilic 
and a hydrophobic moiety. 

Let us look at a model of a phosphoglyceride, such as phosphatidylcho- 
line. Its overall shape is roughly rectangular (Figure 12.8A). The two hydro- 
phobic fatty acid chains are approximately parallel to each other, whereas 
the hydrophilic phosphorylcholine moiety points in the opposite direction. 
Sphingomyelin has a similar conformation, as does the archaeal lipid 
depicted. Therefore, the following shorthand has been adopted to represent 
these membrane lipids: the hydrophilic unit, also called the polar head 
group, is represented by a circle, and the hydrocarbon tails are depicted by 
straight or wavy lines (Figure 12.8B). 


FIGURE 12.8 Representations of 
membrane lipids. (A) Space-filling 
models of a phosphoglyceride, 
sphingomyelin, and an archaeal lipid 
show their shapes and distribution of 
hydrophilic and hydrophobic moieties. 
(B) A shorthand depiction of a 
membrane lipid. 
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Archaeal lipid Shorthand depiction 
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FIGURE 12.9 Diagram of a section of a 
micelle. lonized fatty acids readily form such 
structures, but most phospholipids co not. 


FIGURE 12.10 Diagram of a section of a 
bilayer membrane. 


12.3 Phospholipids and Glycolipids Readily Form 
Bimolecular Sheets in Aqueous Media 


What properties enable phospholipids to form membranes? Membrane for- 
mation is a consequence of the amphipathic nature of the molecules. Their polar 
head groups favor contact with water, whereas their hydrocarbon tails inter- 
act with one another in preference to water. How can molecules with these 
preferences arrange themselves in aqueous solutions? One way is to form a 
globular structure called a micelle. The polar head groups form the outside 
surface of the micelle, which is surrounded by water, and the hydrocarbon 
tails are sequestered inside, interacting with one another (Figure 12.9). 

Alternatively, the strongly opposed preferences of the hydrophilic and 
hydrophobic moieties of membrane lipids can be satisfied by forming a lipid 
bilayer, composed of two lipid sheets (Figure 12.10). A lipid bilayer is also 
called a bimolecular sheet. The hydrophobic tails of each individual sheet inter- 
act with one another, forming a hydrophobic interior that acts as a permeabil- 
ity barrier. The hydrophilic head groups interact with the aqueous medium on 
each side of the bilayer. The two opposing sheets are called leaflets. 

The favored structure for most phospholipids and glycolipids in aqueous 
media is a bimolecular sheet rather than a micelle. The reason 1s that the two 
fatty acid chains of a phospholipid or a glycolipid are too bulky to fit into the 
interior of a micelle. In contrast, salts of fatty acids (such as sodium palmi- 
tate, a constituent of soap) readily form micelles because they contain only 
one chain. The formation of bilayers instead of micelles by phospholipids is of 
critical biological importance. A micelle is a limited structure, usually less 
than 200 A (20 nm) in diameter. In contrast, a bimolecular sheet can extend 
to macroscopic dimensions, as much as a millimeter (107 A, or 10° nm) or 
more. Phospholipids and related molecules are important membrane con- 
stituents because they readily form extensive bimolecular sheets. 

Lipid bilayers form spontaneously by a self-assembly process. In other 
words, the structure of a bimolecular sheet is inherent in the structure of the 
constituent lipid molecules. The growth of lipid bilayers from phospholipids 
is rapid and spontaneous in water. Hydrophobic interactions are the major driv- 
ing force for the formation of lipid bilayers. Recall that hydrophobic interactions 
also play a dominant role in the stacking of bases in nucleic acids and in the 
folding of proteins (Sections 1.3 and 2.4). Water molecules are released from 
the hydrocarbon tails of membrane lipids as these tails become sequestered in 
the nonpolar interior of the bilayer. Furthermore, van der Waals attractive 
forces between the hydrocarbon tails favor close packing of the tails. Finally, there 
are electrostatic and hydrogen-bonding attractions between the polar head groups 
and water molecules. Thus, lipid bilayers are stabilized by the full array of 
forces that mediate molecular interactions in biological systems. Because lipid 
bilayers are held together by many reinforcing, noncovalent interactions (pre- 
dominantly hydrophobic), they are cooperative structures. These hydrophobic 
interactions have three significant biological consequences: (1) lipid bilayers 
have an inherent tendency to be extensive; (2) lipid bilayers will tend to close 
on themselves so that there are no edges with exposed hydrocarbon chains, and 
so they form compartments; and (3) lipid bilayers are self-sealing because a 
hole in a bilayer is energetically unfavorable. 


Lipid vesicles can be formed from phospholipids 


The propensity of phospholipids to form membranes has been used to cre- 
ate an important experimental and clinical tool. Lipid vesicles, or liposomes, 
are aqueous compartments enclosed by a lipid bilayer (Figure 12.11). These 
structures can be used to study membrane permeability or to deliver chemicals 
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FIGURE 12.11 Liposome. A liposome, 
or lipid vesicle, is a small aqueous 
compartment surrounded by a lipid bilayer. 
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to cells. Liposomes are formed by suspending a suitable lipid, such as phos- 
phatidylcholine, in an aqueous medium, and then sonicating (i.e., agitating by 
high-frequency sound waves) to give a dispersion of closed vesicles that are 
quite uniform in size. Vesicles formed by this method are nearly spherical and 
have a diameter of about 500 A (50 nm). Larger vesicles (of the order of 1 um 
or 10* A in diameter) can be prepared by slowly evaporating the organic sol- 
vent from a suspension of phospholipid in a mixed-solvent system. 

Ions or molecules can be trapped in the aqueous compartments of lipid 
vesicles by forming the vesicles in the presence of these substances (Figure 
12.12). For example, 500-A-diameter vesicles formed in a 0.1 M glycine solu- 
tion will trap about 2000 molecules of glycine in each inner aqueous compart- 
ment. These glycine-containing vesicles can be separated from the surrounding 
solution of glycine by dialysis or by gel-filtration chromatography (Section 3.1). 
The permeability of the bilayer membrane to glycine can then be determined 
by measuring the rate of efflux of glycine from the inner compartment of the 
vesicle to the ambient solution. Liposomes can be formed with specific mem- 
brane proteins embedded in them by solubilizing the proteins in the presence 
of detergents and then adding them to the phospholipids from which liposomes 
will be formed. Protein—liposome complexes provide valuable experimental 
tools for examining a range of membrane-protein functions. 


2 Therapeutic applications for liposomes are currently under active inves- 
tigation. For example, liposomes containing drugs or DNA can be 
injected into patients. These liposomes fuse with the plasma membrane of 
many kinds of cells, introducing into the cells the molecules that they contain. 
Drug delivery with liposomes often lessens its toxicity. Less of the drug is dis- 
tributed to normal tissues because long-circulating liposomes concentrate in 
regions of increased blood circulation, such as solid tumors and sites of inflam- 
mation. Moreover, the selective fusion of lipid vesicles with particular kinds of 
cells is a promising means of controlling the delivery of drugs to target cells. 
Another well-defined synthetic membrane is a planar bilayer membrane. 
This structure can be formed across a 1-mm hole ina partition between two 
aqueous compartments by dipping a fine paintbrush into a membrane- 
forming solution, such as phosphatidylcholine in decane, and stroking the 
tip of the brush across the hole. The lipid film across the hole thins sponta- 
neously into a lipid bilayer. The electrical conduction properties of this 
macroscopic bilayer membrane are readily studied by inserting electrodes 
into each aqueous compartment (Figure 12.13). For example, the permea- 
bility of the membrane to ions is determined by measuring the current 
across the membrane as a function of the applied voltage. 


Lipid bilayers are highly impermeable to ions and most polar molecules 


Permeability studies of lipid vesicles and electrical-conductance measure- 
ments of planar bilayers have shown that lipid bilayer membranes have a very 
low permeability for ions and most polar molecules. Water is a conspicuous 
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FIGURE 12.12 Preparation of glycine- 
containing liposomes. Liposomes 
containing glycine are formed by the 
sonication of phospholipids in the presence 
of glycine. Free glycine is removed by gel 
filtration. 
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FIGURE 12.13 Experimental arrangement 
for the study of a planar bilayer 
membrane. A bilayer membrane is formed 
across a 1-mm hole in a septum that 
separates two aqueous compartments. This 
arrangement permits measurements of the 
permeability and electrical conductance of 
lipid bilayers. 
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FIGURE 12.15 SDS-acrylamide 

gel patterns of membrane proteins. 

(A) The plasma membrane of erythrocytes. 
(B) The photoreceptor membranes of retinal 
rod cells. (C) The sarcoplasmic reticulum 
membrane of muscle cells. [Courtesy of 

Dr. Theodore Steck (part A) and Dr. David 
MacLennan (part C).] 
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FIGURE 12.14 Permeability coefficients (P) of ions and molecules in a lipid 
bilayer. The ability of molecules to cross a lipid bilayer spans a wide range of values. 


exception to this generalization; it traverses such membranes relatively easily 
because of its low molecular weight, high concentration, and lack of a com- 
plete charge. The range of measured permeability coefficients is very wide 
(Figure 12.14). For example, Na* and K* traverse these membranes 10? 
times as slowly as does H,O. Tryptophan, a zwitterion at pH 7, crosses the 
membrane 10° times as slowly as does indole, a structurally related molecule 
that lacks ionic groups. In fact, the permeability of small molecules is correlated 
with their solubility in a nonpolar solvent relative to their solubility in water. This 
relation suggests that a small molecule might traverse a lipid bilayer mem- 
brane in the following way: first, it sheds its solvation shell of water; then, it is 
dissolved in the hydrocarbon core of the membrane; and, finally, it diffuses 
through this core to the other side of the membrane, where it becomes resol- 
vated by water. An ion such as Na* traverses membranes very slowly because 
the replacement of its coordination shell of polar water molecules by nonpolar 
interactions with the membrane interior is highly unfavorable energetically. 


12.4 Proteins Carry Out Most Membrane Processes 


We now turn to membrane proteins, which are responsible for most of the 
dynamic processes carried out by membranes. Membrane lipids form a 
permeability barrier and thereby establish compartments, whereas specific 
proteins mediate nearly all other membrane functions. In particular, proteins 
transport chemicals and information across a membrane. Membrane lipids 
create the appropriate environment for the action of such proteins. 

Membranes differ in their protein content. Myelin, a membrane that serves 
as an electrical insulator around certain nerve fibers, has a low content of pro- 
tein (18%). Relatively pure lipids are well suited for insulation. In contrast, the 
plasma membranes, or exterior membranes, of most other cells are much more 
metabolically active. They contain many pumps, channels, receptors, and 
enzymes. The protein content of these plasma membranes is typically 50%. 
Energy-transduction membranes, such as the internal membranes of mito- 
chondria and chloroplasts, have the highest content of protein, around 75%. 

The protein components of a membrane can be readily visualized by 
SDS-—polyacrylamide gel electrophoresis. As stated earlier (p. 73), the electro- 
phoretic mobility of many proteins in SDS-containing gels depends on the 
mass rather than on the net charge of the protein. The gel-electrophoresis 
patterns of three membranes—the plasma membrane of erythrocytes, the 
photoreceptor membrane of retinal rod cells, and the sarcoplasmic reticu- 
lum membrane of muscle—are shown in Figure 12.15. It is evident that 
each of these three membranes contains many proteins but has a distinct 
protein composition. In general, membranes performing different functions 
contain different repertoires of proteins. 


Proteins associate with the lipid bilayer ina 
variety of ways 


The ease with which a protein can be dissociated from a 
membrane indicates how intimately it is associated with the 
membrane. Some membrane proteins can be solubilized by 
relatively mild means, such as extraction by a solution of 
high ionic strength (e.g., 1 M NaCl). Other membrane pro- 
teins are bound much more tenaciously; they can be solubi- 
lized only by using a detergent or an organic solvent. 
Membrane proteins can be classified as being either periph- 
eral or integral on the basis of this difference in dissociability 


- 


(Figure 12.16). Integral membrane proteins interact exten- FIGURE 12.16 Integral and peripheral membrane proteins. 
sively with the hydrocarbon chains of membrane lipids, and Integral membrane proteins (a and 6) interact extensively with the 
they can be released only by agents that compete for these hydrocarbon region of the bilayer. Most known integral membrane 


nonpolar interactions. In fact, most integral membrane pro- 
teins span the lipid bilayer. In contrast, peripheral membrane 


proteins traverse the lipid bilayer. Peripheral membrane proteins 
interact with the polar head groups of the lipids (c) or bind to the 
surfaces of integral proteins (). Other proteins are tightly anchored 


proteins are bound to membranes primarily by electrostatic to the membrane by a covalently attached lipid molecule (€). 


and hydrogen-bond interactions with the head groups of 

lipids. These polar interactions can be disrupted by adding salts or by chang- 
ing the pH. Many peripheral membrane proteins are bound to the surfaces of 
integral proteins, on either the cytoplasmic or the extracellular side of the 
membrane. Others are anchored to the lipid bilayer by a covalently attached 
hydrophobic chain, such as a fatty acid. 


Proteins interact with membranes in a variety of ways 


Membrane proteins are more difficult to purify and crystallize than are 
water-soluble proteins. Nonetheless, researchers using x-ray crystallo- 
graphic or electron microscopic methods have determined the three- 
dimensional structures of more than 2000 such proteins at sufficiently 
high resolution to discern the molecular details. As noted in Chapter 2, 
membrane proteins differ from soluble proteins in the distribution of 
hydrophobic and hydrophilic groups. We will consider the structures of 
three membrane proteins in some detail. 


Proteins can span the membrane with alpha helices. The first membrane 
protein that we consider is the archaeal protein bacteriorhodopsin, shown in 
Figure 12.17. This protein uses light energy to transport protons from 
inside to outside the cell, generating a proton gradient used to form ATP. 


(A) (B) 


Cytoplasm 


FIGURE 12.17 Structure of 
bacteriorhodopsin. Notice that 
bacteriorhodopsin consists largely of 
membrane-spanning «a helices (represented 
by yellow cylinders). (A) View through the 
membrane bilayer. The interior of the 
membrane is green and the head groups 
are red. (B) View from the cytoplasmic side 
of the membrane. [Drawn from 1BRX.pdb.] 
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h& FIGURE 12.19 Structure of bacterial 
porin (from Rhodopseudomonas 
blastica). Notice that this membrane 
protein is built entirely of 8 strands. (A) Side 
view. (B) View from the periplasmic space. 
Only one monomer of the trimeric protein 
is shown. [Drawn from 1PRN.pdb.] 


FIGURE 12.20 Amino acid sequence of 
a porin. Some membrane proteins, such 
as porins, are built from 8 strands that tend 
to have hydrophobic and hydrophilic amino 
acids in adjacent positions. The secondary 
structure of porin from Rhodopseudomonas 
blastica is shown, with the diagonal lines 
indicating the direction of hydrogen 
bonding along the 8 sheet. Hydrophobic 
residues (F, |, L, M, V, W, and Y) are shown 
in yellow. These residues tend to lie on the 
outside of the structure, in contact with the 
hydrophobic core of the membrane. 
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FIGURE 12.18 Amino acid sequence of bacteriorhodopsin. The seven helical regions are 
highlighted in yellow and the charged residues in red. 


Bacteriorhodopsin is built almost entirely of a helices; seven closely packed 
a helices, nearly perpendicular to the plane of the cell membrane, span its 
45-A width. Examination of the primary structure of bacteriorhodopsin 
reveals that most of the amino acids in these membrane-spanning « helices 
are nonpolar and only a very few are charged (Figure 12.18). This distribu- 
tion of nonpolar amino acids is sensible because these residues are either in 
contact with the hydrocarbon core of the membrane or with one another. 
Membrane-spanning a helices are the most common structural motif in mem- 
brane proteins. As will be considered in Section 12.5, such regions can often 
be detected by examining amino acid sequence alone. 


A channel protein can be formed from beta strands. Porin, a protein 
from the outer membranes of bacteria such as E. coli and Rhodobacter 

capsulatus, represents a class of 
(6) membrane proteins with a com- 
pletely different type of structure. 
Structures of this type are built from 
B strands and contain essentially no 
a helices (Figure 12.19). 

The arrangement of B strands is 
quite simple: each strand is hydro- 
gen bonded to its neighbor in an 
antiparallel arrangement, forming a 
single B sheet. The B sheet curls up 
to form a hollow cylinder that, as its 
name suggests, forms a pore, or 
channel, in the membrane. The out- 
side surface of porin is appropriately 
nonpolar, given that it interacts with the hydrocarbon core of the mem- 
brane. In contrast, the inside of the channel is quite hydrophilic and is filled 
with water. This arrangement of nonpolar and polar surfaces is accom- 
plished by the alternation of hydrophobic and hydrophilic amino acids 
along each B strand (Figure 12.20). 


Embedding part of a protein in a membrane can link the protein to the 
membrane surface. The structure of the endoplasmic reticulum mem- 
brane-bound enzyme prostaglandin H, synthase-1 reveals a rather different 
role for a helices in protein—membrane associations. This enzyme catalyzes 
the conversion of arachidonic acid into prostaglandin H, in two steps: 
(1) a cyclooxygenase reaction and (2) a peroxidase reaction (Figure 12.21). 
Prostaglandin H.,, promotes inflammation and modulates gastric acid secre- 
tion. The enzyme that produces prostaglandin H, is a homodimer with a 
rather complicated structure consisting primarily of a helices. Unlike bac- 
teriorhodopsin, this protein is not largely embedded in the membrane. 
Instead, it lies along the outer surface of the membrane, firmly bound by a 
set of a helices with hydrophobic surfaces that extend from the bottom of 
the protein into the membrane (Figure 12.22). This linkage is sufficiently 
strong that only the action of detergents can release the protein from the 
membrane. Thus, this enzyme is classified as an integral membrane protein, 
although it does not span the membrane. 
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side chains 


k& FIGURE 12.22 Attachment of prostaglandin H, synthase-1 to the membrane. 
Notice that prostaglandin H, synthase-1 is held in the membrane by a set of a helices 
(orange) coated with hydrophobic side chains. One monomer of the dimeric enzyme 
is shown. [Drawn from 1PTH.pdb.] 


c 2 The localization of prostaglandin H, synthase-| in 
the membrane is crucial to its function. The substrate 
for this enzyme, arachidonic acid, is a hydrophobic mole- 
cule generated by the hydrolysis of membrane lipids. 
Arachidonic acid reaches the active site of the enzyme from 
the membrane without entering an aqueous environment by 
traveling through a hydrophobic channel in the protein 
(Figure 12.23). Indeed, nearly all of us have experienced the 
importance of this channel: drugs such as aspirin and ibu- 
profen block the channel and prevent prostaglandin synthe- 
sis by inhibiting the cyclooxygenase activity of the synthase. 
In particular, aspirin acts through the transfer of its acetyl 
group to a serine residue (Ser 530) that lies along the path to 
the active site (Figure 12.24). 
Two important features emerge from our examination 
of these three examples of membrane-protein structure. 
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FIGURE 12.21 Formation of 


prostaglandin H,. Prostaglandin H, 
synthase-1 catalyzes the formation of 
prostaglandin H, from arachidonic acid 
in two steps. 


Ser 530 


& FIGURE 12.23 Hydrophobic channel of prostaglandin 

H, synthase-1. A view of prostaglandin H, synthase-1 from the 
membrane shows the hydrophobic channel that leads to the active 
site. The membrane-anchoring helices are shown in orange. [Drawn 
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O OH First, the parts of the protein that interact with the hydrophobic parts of 
the membrane are coated with nonpolar amino acid side chains, whereas 


O CH those parts that interact with the aqueous environment are much more 
Tt hydrophilic. Second, the structures positioned within the membrane are 
O quite regular and, in particular, all backbone hydrogen-bond donors and 


acceptors participate in hydrogen bonds. Breaking a hydrogen bond within 
a membrane is quite unfavorable, because little or no water is present to com- 
pete for the polar groups. 


Aspirin 
(Acetylsalicyclic acid) 


. oH Some proteins associate with membranes through 


Seftssg . 
covalently attached hydrophobic groups 


O : ‘ ; : 
The membrane proteins considered thus far associate with the mem- 


FIGURE 12.24 Aspirin’s effects on brane through surfaces generated by hydrophobic amino acid side 

lee habinash besiege aaa chains. However, even otherwise soluble proteins can associate with 

residue in prostaglandin H, synthase-1. membranes if hydrophobic groups are attached to the proteins. Three 
such groups are shown in Figure 12.25: (1) a palmitoyl group attached to 
a specific cysteine residue by a thioester bond, (2) a farnesyl group 
attached to a cysteine residue at the carboxyl terminus, and (3) a glyco- 
lipid structure termed a glycosylphosphatidylinositol (GPI) anchor 
attached to the carboxyl terminus. These modifications are attached by 
enzyme systems that recognize specific signal sequences near the site of 
attachment. 
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FIGURE 12.25 Membrane 
anchors. Membrane anchors are 
hydrophobic groups that are 
covalently attached to proteins 

(in blue) and tether the proteins to 


the membrane. The green circles and HO B 4 

blue square correspond to mannose HO. ie) O 

and B-D-acetylglucosamine (GIcNAc), RO 

respectively. R groups represent OH 

points of additional modification. Glycosylphosphatidylinositol (GP1) anchor 


Transmembrane helices can be accurately predicted from 
amino acid sequences 


Many membrane proteins, like bacteriorhodopsin, employ « helices to span 
the hydrophobic part of a membrane. As noted earlier, most of the residues 
in these a helices are nonpolar and almost none of them are charged. Can we 
use this information to identify likely membrane-spanning regions from 
sequence data alone? One approach to identifying transmembrane helices is 
to ask whether a postulated helical segment is likely to be more stable in a 
hydrocarbon environment or in water. Specifically, we want to estimate the 
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free-energy change when a helical segment is transferred from the interior 
of a membrane to water. Free-energy changes for the transfer of individual 
amino acid residues from a hydrophobic to an aqueous environment are 
given in Table 12.2. For example, the transfer of a helix formed entirely of 
L-arginine residues, a positively charged amino acid, from the interior of a 
membrane to water would be highly favorable [—51.5 kJ mol~! (—12.3 kcal 
mol!) per arginine residue in the helix]. In contrast, the transfer of a helix 
formed entirely of L-phenylalanine, a hydrophobic amino acid, would be 
unfavorable [+15.5 kJ mol~! (+3.7 kcal mol!) per phenylalanine residue 
in the helix]. : 

The hydrocarbon core of a membrane is typically 30 A wide, a length that 
can be traversed by an a helix consisting of 20 residues. We can take the 
amino acid sequence of a protein and estimate the free-energy change that 
takes place when a hypothetical « helix formed of residues 1 through 20 is 
transferred from the membrane interior to water. The same calculation can 
be made for residues 2 through 21, 3 through 22, and so forth, until we 
reach the end of the sequence. The span of 20 residues chosen for this cal- 
culation is called a window. The free-energy change for each window is 
plotted against the first amino acid at the window to create a hydropathy 
plot. Empirically, a peak of +84 kJ mol~! (+20 kcal mol~!) or more in a 
hydropathy plot based on a window of 20 residues indicates that a polypep- 
tide segment could be a membrane-spanning a helix. For example, gly- 
cophorin, a protein found in the membranes of red blood cells, is predicted 
by this criterion to have one membrane-spanning helix, in agreement 
with experimental findings (Figure 12.26). Note, however, that a peak in 
the hydropathy plot does not prove that a segment is a transmembrane 
helix. Even soluble proteins may have highly nonpolar regions. Conversely, 
some membrane proteins contain membrane-spanning features (such as 
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FIGURE 12.26 Locating the membrane-spanning helix of glycophorin. (A) Amino acid 
sequence and transmembrane disposition of glycophorin A from the red-blood-cell 
membrane. Fifteen O-linked carbohydrate units are shown as diamond shapes, and an 
N-linked unit is shown as a lozenge shape. The hydrophobic residues (yellow) buried in the 
bilayer form a transmembrane « helix. The carboxyl-terminal part of the molecule, located on 
the cytoplasmic side of the membrane, is rich in negatively charged (red) and positively 
charged (blue) residues. (B) Hydropathy plot for glycophorin. The free energy for transferring a 
helix of 20 residues from the membrane to water is plotted as a function of the position of the 
first residue of the helix in the sequence of the protein. Peaks of greater than +84 kJ mol71 
(+20 kcal mol~') in hydropathy plots are indicative of potential transmembrane helices. 

[(A) Information from Dr. Vincent Marchesi; (B) data from D. M. Engelman, T. A. Steitz, and 

A. Goldman, Annu. Rev. Biophys. Biophys. Chem. 15:321-353, 1986. Copyright © 1986 by 
Annual Reviews, Inc. All rights reserved.] 


TABLE 12.2 Polarity scale for identify- 
ing transmembrane helices 


Transfer free 


Amino acid energy in kJ mol7! 

residue (kcal mol~') 
Phe 15.5 (3.7) 
Met 14.3 (3.4) 
lle 13.0 (3.1) 
Leu 11.8 (2.8) 
Val 10.9 (2.6) 
Cys 8.4 (2.0) 
Trp 8.0 (1.9) 
Ala 6.7 (1.6) 
The 5.0 (1.2) 
Gly 4.2 (1.0) 
Ser 2.5 (0.6) 
Pro —0.8 (—0.2) 
Tyr —2.9(—0.7) 
His =a) 
Gln —17.2 (—4.1) 
Asn —20.2 (—4.8) 
Glu —34.4 (—8.2) 
Lys —37.0 (—8.8) 
Asp —38.6 (—9.2) 
Arg Seo 


Source: Data from D. M. Engelman, T. A. Steitz, 
and A. Goldman. Annu. Rev. Biophys. Biophys. 
Chem. 15(1986):321-353. 

Note: The free energies are for the transfer of 

an amino acid residue in an a helix from the 
membrane interior (assumed to have a dielectric 
constant of 2) to water. 
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FIGURE 12.27 Hydropathy plot for porin. 
No strong peaks are observed for this 
intrinsic membrane protein, because it is 
constructed from membrane-spanning 8 
strands rather than « helices. 


FIGURE 12.28 Fluorescence recovery 
after photobleaching (FRAP) technique. 
(A) The cell surface fluoresces because of 
a labeled surface component. (B) The 
fluorescent molecules of a small part of the 
surface are bleached by an intense light 
pulse. (C) The fluorescence intensity 
recovers as bleached molecules diffuse out 
of the region and unbleached molecules 
diffuse into it. (D) The rate of recovery 
depends on the diffusion coefficient. 
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a set of cylinder-forming B strands) that escape detection by these plots 
(Figure 12.27). 


12.5 Lipids and Many Membrane Proteins Diffuse 
Rapidly in the Plane of the Membrane 


Biological membranes are not rigid, static structures. On the contrary, lipids 
and many membrane proteins are constantly in lateral motion, a process 
called lateral diffusion. The rapid lateral movement of membrane proteins has 
been visualized by means of fluorescence microscopy using the technique of 
fluorescence recovery after photobleaching (FRAP; Figure 12.28). First, a cell- 
surface component is specifically labeled with a fluorescent chromophore. 
A small region of the cell surface (~3 um?) is viewed through a fluorescence 
microscope. The fluorescent molecules in this region are then destroyed 
(bleached) by a very intense light pulse from a laser, as indicated by the pale 
spot in Figure 12.28B. The fluorescence of this region is subsequently moni- 
tored as a function of time by using a light level sufficiently low to prevent 
further bleaching. If the labeled component is mobile, bleached molecules 
leave and unbleached molecules enter the illuminated region, resulting in an 
increase in the fluorescence intensity. The rate of recovery of fluorescence 
depends on the lateral mobility of the fluorescence-labeled component, 
which can be expressed in terms of a diffusion coefficient, D. The average 
distance S traversed in time t depends on D according to the expression 


S = (4D) 


The diffusion coefficient of lipids in a variety of membranes is about 
1 wm? s~!. Thus, a phospholipid molecule diffuses an average distance of 
2 wm in 1s. This rate means that a lipid molecule can travel from one end of 
a bacterium to the other in a second. The magnitude of the observed diffusion 
coefficient indicates that the viscosity of the membrane is about 100 times 
that of water, rather like that of olive oil. 
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In contrast, proteins vary markedly in their lateral mobility. Some pro- 
teins are nearly as mobile as lipids, whereas others are virtually immobile. For 
example, the photoreceptor protein rhodopsin (Section 33.3), a very mobile 
protein, has a diffusion coefficient of 0.4 wm? s~!. The rapid movement of 
rhodopsin is essential for fast signaling. At the other extreme is fibronectin, 
a peripheral glycoprotein that interacts with the extracellular matrix. For 
fibronectin, D is less than 10~+ um? s~!. Fibronectin has a very low mobil- 
ity because it is anchored to actin filaments on the inside of the plasma 
membrane through integrin, a transmembrane protein that links the extra- 
cellular matrix to the cytoskeleton. 


The fluid mosaic model allows lateral movement but not 
rotation through the membrane 


On the basis of the mobility of proteins in membranes, in 1972 5. Jonathan 
Singer and Garth Nicolson proposed a fluid mosaic model to describe the 
overall organization of biological membranes. The essence of their model is 
that membranes are two-dimensional solutions of oriented lipids and globular 
proteins. The lipid bilayer has a dual role: it is both a solvent for integral mem- 
brane proteins and a permeability barrier. Membrane proteins are free to dif- 
fuse laterally in the lipid matrix unless restricted by special interactions. 
Although the lateral diffusion of membrane components can be rapid, the 
spontaneous rotation of lipids from one face of amembrane to the other is a very 
slow process. The transition of a molecule from one membrane surface to the 
other is called transverse diffusion or flip-flop (Figure 12.29). The flip-flop of 
phospholipid molecules in phosphatidylcholine vesicles has been directly mea- 
sured by electron spin resonance techniques, which show that a phospholipid 
molecule flip-flops once in several hours. Thus, a phospholipid molecule takes 
about 10° times as long to flip-flop across a membrane as it takes to diffuse a 
distance of 50 A in the lateral direction. The free-energy barriers to flip-flopping 
are even larger for protein molecules than for lipids because proteins have more- 
extensive polar regions. In fact, the flip-flop of a protein molecule has not been 
observed. Hence, membrane asymmetry can be preserved for long periods. 


Membrane fluidity is controlled by fatty acid composition and 
cholesterol content 


Many membrane processes, such as transport or signal transduction, 
depend on the fluidity of the membrane lipids, which in turn depends on 
the properties of fatty acid chains. Fatty acid chains in membrane bilayers 
can exist in an ordered, rigid state or in a relatively disordered, fluid state. 
The transition from the rigid to the fluid state takes place abruptly as the 
temperature is raised above T,,, the melting temperature (Figure 12.30). 
This transition temperature depends on the length of the fatty acid chains and 
on their degree of unsaturation (Table 12.3). The presence of saturated fatty 


TABLE 12.3 The melting temperature of phosphatidylcholine containing different 
pairs of identical fatty acid chains 


Fatty acid 

Number of Number of Common 

carbons double bonds name Systematic name T,, CC) 
22 0 Behenate n-Docosanote 75 
18 0 Stearate n-Octadecanoate 58 
16 0 Palmitate n-Hexadecanoate 41 
14 0 Myristate n- Tetradecanoate 24 
18 1 Oleate cis-A9-Octadecenoate —22 
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FIGURE 12.29 Lipid movement in 
membranes. Lateral diffusion of lipids is 
much more rapid than transverse diffusion 
(flip-flop). 
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FIGURE 12.30 The phase-transition, 

or melting, temperature (T,,) for a 
phospholipid membrane. As the 
temperature is raised, the phospholipid 
membrane changes from a packed, ordered 
state to a more random one. 


FIGURE 12.31 Packing of fatty acid 
chains in a membrane. The highly ordered 
packing of fatty acid chains is disrupted by 
the presence of cis double bonds. The 
space-filling models show the packing of 
(A) three molecules of stearate (C,,, 
saturated) and (B) a molecule of oleate 
(C,,, unsaturated) between two molecules 
of stearate. 


Cholesterol 


FIGURE 12.32 Cholesterol disrupts 

the tight packing of the fatty acid chains. 
[Information from on S. L. Wolfe, Molecular 
and Cellular Biology (Wadsworth, 1998).] 
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(A) 


acid residues favors the rigid state because their straight hydrocarbon chains 
interact very favorably with one another. On the other hand, a cis double 
bond produces a bend in the hydrocarbon chain. This bend interferes with a 
highly ordered packing of fatty acid chains, and so'T,,, is lowered (Figure 12.31). 
The length of the fatty acid chain also affects the transition temperature. 
Long hydrocarbon chains interact more strongly than do short ones. 
Specifically, each additional —CH,— group makes a favorable contribution 
of about —2 kJ mol~! (—0.5 kcal mol!) to the free energy of interaction of 
two adjacent hydrocarbon chains. 

Bacteria regulate the fluidity of their membranes by varying the number of 
double bonds and the length of their fatty acid chains. For example, the ratio 
of saturated to unsaturated fatty acid chains in the E. coli membrane 
decreases from 1.6 to 1.0 as the growth temperature is lowered from 42°C to 
27°C. This decrease in the proportion of saturated residues prevents the 
membrane from becoming too rigid at the lower temperature. 

In animals, cholesterol is the key regulator of membrane fluidity. Cholesterol 
contains a bulky steroid nucleus with a hydroxyl group at one end and a 
flexible hydrocarbon tail at the other end. Cholesterol inserts into bilayers 
with its long axis perpendicular to the plane of the membrane. The hydroxy] 
group of cholesterol forms a hydrogen bond with a carbonyl oxygen atom of 
a phospholipid head group, whereas the hydrocarbon tail of cholesterol is 
located in the nonpolar core of the bilayer. The different shape of cholesterol 
compared with that of phospholipids disrupts the regular interactions 
between fatty acid chains (Figure 12.32). 


Lipid rafts are highly dynamic complexes formed between 
cholesterol and specific lipids 


In addition to its nonspecific effects on membrane fluidity, cholesterol can 
form specific complexes with lipids that contain the sphingosine backbone, 
including sphingomyelin and certain glycolipids, and with GPI-anchored 
proteins. These complexes concentrate within small (10-200 nm) and 
highly dynamic regions within membranes. The resulting structures are 
often referred to as lipid rafts. One result of these interactions is the modera- 
tion of membrane fluidity, making membranes less fluid but at the same time 
less subject to phase transitions. The presence of lipid rafts thus represents 
a modification of the original fluid mosaic model for biological membranes. 
Although their small size and dynamic nature have made them very diffi- 
cult to study, it appears that lipid rafts may play a role in concentrating 
proteins that participate in signal transduction pathways and may also serve 
to regulate membrane curvature and budding. 


All biological membranes are asymmetric 


Membranes are structurally and functionally asymmetric. The outer and inner 
surfaces of all known biological membranes have different components and different 


enzymatic activities. A clear-cut example is the pump that regulates 
the concentration of Na* and K* ions in cells (Figure 12.33). This 
transport protein is located in the plasma membrane of nearly all 
cells in higher organisms. The Na*—K* pump is oriented so that it 
pumps Na* out of the cell and K* into it. Furthermore, ATP must 
be on the inside of the cell to drive the pump. Ouabain, a specific 
inhibitor of the pump, is effective only if it is located outside. We 
shall consider the mechanism of this important and fascinating 
family of pumps in Chapter 13. 

Membrane proteins have a unique orientation because, after 
synthesis, they are inserted into the membrane in an asymmetric 
manner. This absolute asymmetry is preserved because mem- 
brane proteins do not rotate from one side of the membrane to 
the other and because membranes are always synthesized by the 
growth of preexisting membranes. Lipids, too, are asymmetrically 
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FIGURE 12.33 Asymmetry of the Na*+-K* transport 
system in plasma membranes. The Nat-K* transport 


system pumps Na* out of the cell and Kt into the cell by 


distributed as a consequence of their mode of biosynthesis, but 
hydrolyzing ATP on the intracellular side of the membrane. 


this asymmetry is usually not absolute, except for glycolipids. In 
the red-blood-cell membrane, sphingomyelin and phosphatidyl- 
choline are preferentially located in the outer leaflet of the bilayer, whereas 
phosphatidylethanolamine and phosphatidylserine are located mainly in the 
inner leaflet. Large amounts of cholesterol are present in both leaflets. 


12.6 Eukaryotic Cells Contain Compartments 
Bounded by Internal Membranes 


Thus far, we have considered only the plasma membrane of cells. Some 
bacteria and archaea have only this single membrane, surrounded by a cell 
wall. Other bacteria, such as E. coli, have two membranes separated by a 
cell wall (made of proteins, peptides, and carbohydrates) lying between 
them (Figure 12.34). The inner membrane acts as the permeability barrier, 
and the outer membrane and the cell wall provide additional protection. 
The outer membrane is quite permeable to small molecules, owing to the 
presence of porins. The region between the two membranes containing the 
cell wall is called the periplasm. 


(A) (B) 
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FIGURE 12.34 Cell membranes of prokaryotes. A schematic view of the membrane of 
bacterial cells surrounded by (A) two membranes or (B) one membrane. 
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Eukaryotic cells, with the exception of plant cells, do not have cell 
walls, and their cell membranes consist of a single lipid bilayer. In plant 
cells, the cell wall is on the outside of the plasma membrane. Eukaryotic 
cells are distinguished from prokaryotic cells by the presence of mem- 

branes inside the cell that form internal compartments. 
For example, peroxisomes, organelles that play a major 
role in the oxidation of fatty acids for energy conversion, 


ye hipesome are defined by a single membrane. Mitochondria, the 
organelles in which ATP is synthesized, are surrounded 
by two membranes. As in the case for a bacterium, the 
outer membrane is quite permeable to small molecules, 
neice whereas the inner membrane 1s not. Indeed, considerable 
evidence now indicates that mitochondria evolved from 
Se bacteria by endosymbiosis (Section 18.1). The nucleus is 
Nuclear pore also surrounded by a double membrane, the nuclear enve- 
complex ; 
lope, that consists of a set of closed membranes that come 
DNA together at structures called nuclear pores (Figure 12.35). 
These pores regulate transport into and out of the 
FIGURE 12.35 Nuclear envelope. The nuclear envelope is a nucleus. ‘The nuclear envelope is linked to another mem- 
double membrane connected to another membrane system brane-defined structure, the endoplasmic reticulum, 
of eukaryotes, the endoplasmic reticulum. [Information from which plays a host of cellular roles, including drug 
Ef. Sonimer anal i taetace, Genome iol OG) 008. 1-10084, detoxification and the modification of proteins for secre- 


2002, reviews, Fig.1.] 


tion. Thus, a eukaryotic cell contains interacting com- 
partments, and transport into and out of these compartments is essential 
to many biochemical processes. 

Membranes must be able to separate or join together so that cells and 
compartments may take up, transport, and release molecules. Many cells 
take up molecules through the process of receptor-mediated endocytosis. 
Here, a protein or larger complex initially binds to a receptor on the cell 
surface. After the receptor is bound, specialized proteins act to cause the 
membrane in this region to invaginate. One of these specialized proteins is 
clathrin, which polymerizes into a lattice network around the growing 
membrane bud, often referred to as a clathrin-coated pit (Figure 12.36). The 
invaginated membrane eventually breaks off and fuses to form a vesicle. 
Various hormones, transport proteins, and antibodies employ receptor- 
mediated endocytosis to gain entry into a cell. A less-advantageous conse- 
quence is that this pathway is available to viruses and toxins as a means of 
invading cells. The reverse process—the fusion of a vesicle to a membrane— 
is a key step in the release of neurotransmitters from a neuron into the syn- 
aptic cleft (Figure 12.37). 


i Clathrin ~_— 
4 coat + 
Ls J 


FIGURE 12.36 Vesicle formation by receptor-mediated endocytosis. Receptor binding on 
the surface of the cell induces the membrane to invaginate, with the assistance of specialized 
intracellular proteins such as clathrin. The process results in the formation of a vesicle within 
the cell. [M. M. Perry and A. B. Gilbert, J. Cell Sci. 39:266, 1979.] 


Let us consider one example of receptor-mediated endocytosis. Iron is a 
critical element for the function and structure of many proteins, including 
hemoglobin and myoglobin (Chapter 7). However, free iron ions are highly 
toxic to cells, owing to their ability to catalyze the formation of free radicals. 
Hence, the transport of iron atoms from the digestive tract to the cells where 
they are most needed must be tightly controlled. In the bloodstream, iron is 
bound very tightly by the protein transferrin, which can bind two Fe** ions 
with a dissociation constant of 10~?3 M at neutral pH. Cells requiring iron 
express the transferrin receptor in their plasma membranes (Section 32.4). 
Formation of a complex between the transferrin receptor and iron-bound trans- 
ferrin initiates receptor-mediated endocytosis, internalizing these complexes 
within vesicles called endosomes (Figure 12.38). As the endosomes mature, 
proton pumps within the vesicle membrane lower the lumenal pH to about 5.5. 
Under these conditions, the affinity of iron ions for transferrin is reduced; these 
ions are released and are free to pass through channels in the endosomal mem- 
branes into the cytoplasm. The iron-free transferrin complex is recycled to the 
plasma membrane, where transferrin is released back into the bloodstream and 
the transferrin receptor can participate in another uptake cycle. 

Although budding and fusion appear deceptively simple, the structures 
of the intermediates in these processes and the detailed mechanisms remain 
ongoing areas of investigation. These processes must be highly specific in 
order to prevent incorrect membrane fusion events. SNARE (soluble 
N-ethylmaleimide-sensitive-factor attachment protein receptor) proteins 
from both membranes help draw appropriate lipid bilayers together by 
forming tightly coiled four-helical bundles (Figure 12.39). Once these 
membranes are in close apposition, the fusion process can proceed. SNARE 
proteins, encoded by gene families in all eukaryotic cells, largely determine 
the compartment with which a vesicle will fuse. The specificity of mem- 
brane fusion ensures the orderly trafficking of membrane vesicles and their 
cargos through eukaryotic cells. 
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FIGURE 12.38 The transferrin receptor cycle. lron-bound transferrin binds to the transferrin 
receptor (TfR) on the surface of cells. Receptor-mediated endocytosis occurs, leading to 

the formation of a vesicle called an endosome. As the lumen of the endosome is acidified 

by the action of proton pumps, iron is released from transferrin, passes through channels in 
the membrane, and is utilized by the cell. The complex between iron-free transferrin and the 
transferrin receptor is returned to the plasma membrane for another cycle. [Information from 

L. Zecca et al., Nat. Rev. Neurosci. 5:863-873, 2004, Fig.1.] 


FIGURE 12.37 Neurotransmitter release. 
Neurotransmitter-containing synaptic 
vesicles (indicated by the arrows) are 
arrayed near the plasma membrane of a 
nerve cell. Synaptic vesicles fuse with 

the plasma membrane, releasing the 
neurotransmitter into the synaptic cleft. 

[T. Reese/ Don Fawcett/Photo Researchers.] 
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FIGURE 12.39 SNARE complexes initiate membrane fusion. The SNARE protein synaptobrevin 
(yellow) from one membrane forms a tight four-helical bundle with the corresponding SNARE 
proteins syntaxin-1 (blue) and SNAP25 (red) from a second membrane. The complex brings 
the membranes close together, initiating the fusion event. [Drawn from 1SFC.pdb.] 


SUMMARY 


Biological membranes are sheetlike structures, typically from 60 to 100 A 
thick, that are composed of protein and lipid molecules held together by 
noncovalent interactions. Membranes are highly selective permeability bar- 
riers. They create closed compartments, which may be entire cells or organ- 
elles within a cell. Proteins in membranes regulate the molecular and ionic 
compositions of these compartments. Membranes also control the flow of 
information between cells. 


12.1 Fatty Acids Are Key Constituents of Lipids 
Fatty acids are hydrocarbon chains of various lengths and degrees of 
unsaturation that terminate with a carboxylic acid group. The fatty 
acid chains in membranes usually contain between 14 and 24 carbon 
atoms; they may be saturated or unsaturated. Short chain length and 
unsaturation enhance the fluidity of fatty acids and their derivatives 
by lowering the melting temperature. 


12.2 There Are Three Common Types of Membrane Lipids 

The major types of membrane lipids are phospholipids, glycolipids, 
and cholesterol. Phosphoglycerides, a type of phospholipid, consist of 
a glycerol backbone, two fatty acid chains, and a phosphorylated alco- 
hol. Phosphatidylcholine, phosphatidylserine, and phosphatidyletha- 
nolamine are major phosphoglycerides. Sphingomyelin, a different 
type of phospholipid, contains a sphingosine backbone instead of 
glycerol. Glycolipids are sugar-containing lipids derived from sphin- 
gosine. Cholesterol, which modulates membrane fluidity, is con- 
structed from a steroid nucleus. A common feature of these membrane 
lipids is that they are amphipathic molecules, having one hydrophobic 
and one hydrophilic end. 


12.3 Phospholipids and Glycolipids Readily Form Bimolecular 

Sheets in Aqueous Media 

Membrane lipids spontaneously form extensive bimolecular sheets in 
aqueous solutions. The driving force for membrane formation is the 
hydrophobic interactions among the fatty acid tails of membrane lip- 
ids. The hydrophilic head groups interact with the aqueous medium. 
Lipid bilayers are cooperative structures, held together by many weak 
bonds. These lipid bilayers are highly impermeable to ions and most 
polar molecules, yet they are quite fluid, which enables them to act as 
a solvent for membrane proteins. 


12.4 


12.5 


12.6 


Proteins Carry Out Most Membrane Processes 

Specific proteins mediate distinctive membrane functions such as trans- 
port, communication, and energy transduction. Many integral mem- 
brane proteins span the lipid bilayer, whereas others are only partly 
embedded in the membrane. Peripheral membrane proteins are bound 
to membrane surfaces by electrostatic and hydrogen-bond interactions. 
Membrane-spanning proteins have regular structures, including B 
strands, although the a helix is the most common membrane-spanning 
structure. Sequences of 20 consecutive nonpolar amino acids can be 
diagnostic of a membrane-spanning « helical region of a protein. 


Lipids and Many Membrane Proteins Diffuse Rapidly in the 

Plane of the Membrane 

Membranes are structurally and functionally asymmetric, as exempli- 
fied by the restriction of sugar residues to the external surface of mam- 
malian plasma membranes. Membranes are dynamic structures in 
which proteins and lipids diffuse rapidly in the plane of the membrane 
(lateral diffusion), unless restricted by special interactions. In contrast, 
the rotation of lipids from one face of a membrane to the other (trans- 
verse diffusion, or flip-flop) is usually very slow. Proteins do not rotate 
across bilayers; hence, membrane asymmetry can be preserved. The 
degree of fluidity of a membrane depends on the chain length of its lip- 
ids and on the extent to which their constituent fatty acids are unsatu- 
rated. In animals, cholesterol content also regulates membrane fluidity. 


Eukaryotic Cells Contain Compartments Bounded by Internal Membranes 
An extensive array of internal membranes in eukaryotes creates com- 
partments within a cell for distinct biochemical functions. For instance, 
a double membrane surrounds the nucleus (the location of most of the 
cell’s genetic material) and the mitochondria (the location of most ATP 
synthesis). A single membrane defines the other internal compart- 
ments, such as the endoplasmic reticulum. Receptor-mediated endo- 
cytosis enables the formation of intracellular vesicles when ligands 
bind to their corresponding receptor proteins in the plasma membrane. 
The reverse process—the fusion of a vesicle to a membrane—is a key 
step in the release of signaling molecules outside the cell. 
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Problems 


KEY TERMS 
fatty acid (p. 342) amphipathic (amphiphilic) molecule lipid raft (p. 358) 
phospholipid (p. 344) (p. 347) receptor-mediated endocytosis (p. 360) 
sphingosine (p. 344) lipid bilayer (p. 348) clathrin (p. 360) 
phosphoglyceride (p. 344) liposome (p. 348) transferrin (p. 361) 
sphingomyelin (p. 345) integral membrane protein (p. 351) transferrin receptor (p. 361) 
glycolipid (p. 345) peripheral membrane protein (p. 351) endosome (p. 361) 
cerebroside (p. 346) hydropathy plot (p. 355) SNARE (soluble N-ethylmaleimide- 
ganglioside (p. 346) lateral diffusion (p. 356) sensitive-factor attachment protein 
cholesterol (p. 346) fluid mosaic model (p. 357) receptor) proteins (p. 361) 
PROBLEMS 


1. Population density. How many phospholipid molecules 
are there in a 1-jm? region of a phospholipid bilayer mem- 
brane? Assume that a phospholipid molecule occupies 


70 A2 of the surface area. 


2. Through the looking-glass. Phospholipids form lipid 
bilayers in water. What structure might form if phospho- 
lipids were placed in an organic solvent? 
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3. Lipid diffusion. What is the average distance traversed by 
a membrane lipid in 1 ws, 1 ms, and 1 s? Assume a diffusion 
coefficient of 1078 cm? s~! 


sc. 
4. Protein diffusion. The diffusion coefficient, D, of a rigid 
spherical molecule is given by 


D=kT/6anr 


in which 7 is the viscosity of the solvent, r is the radius of the 
sphere, k is the Boltzman constant (1.38 x 107!° erg 
degree~!), and T is the absolute temperature. What is the 
diffusion coefficient at 37°C of a 100-kDa protein in a mem- 
brane that has an effective viscosity of 1 poise (1 poise = 1 
erg s_! cm *%)? What is the average distance traversed by 
this protein in 1 ws, 1 ms, and 1 s? Assume that this protein 


is an unhydrated, rigid sphere of density 1.35 gcm~*. 


5. Cold sensitivity. Some antibiotics act as carriers that bind 
an ion on one side of a membrane, diffuse through the 
membrane, and release the ion on the other side. The con- 
ductance of a lipid-bilayer membrane containing a carrier 
antibiotic decreased abruptly when the temperature was 
lowered from 40°C to 36°C. In contrast, there was little 
change in conductance of the same bilayer membrane when 
it contained a channel-forming antibiotic. Why? 


6. Melting point 1. Explain why oleic acid (18 carbons, one 
cis bond) has a lower melting point than stearic acid, which 
has the same number of carbon atoms but is saturated. How 
would you expect the melting point of trans-oleic acid to 
compare with that of cis-oleic acid? Why might most unsat- 
urated fatty acids in phospholipids be in the cis rather than 
the trans conformation? 


7. Melting point 2. Explain why the melting point of pal- 
mitic acid (C,,) is 6.5 degrees lower than that of stearic 


acid (C,.). 


8. A sound diet. Small mammalian hibernators can with- 
stand body temperatures of 0° to 5°C without injury. 
However, the body fats of most mammals have melting 
temperatures of approximately 25°C. Predict how the com- 
position of the body fat of hibernators might differ from 
that of their nonhibernating cousins. 


9. Flip-flop 1. The transverse diffusion of phospholipids in 
a bilayer membrane was investigated by using a fluores- 
cently labeled analog of phosphatidylserine called NBD-PS. 
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NBD-phosphatidylserine (NBD-PS) 


The fluorescence signal of NBD-PS is quenched when 
exposed to sodium dithionite, a reducing agent that is not 
membrane permeable. 


Lipid vesicles containing phosphatidylserine (98%) and 
NBD-PS (2%) were prepared by sonication and purified. 
Within a few minutes of the addition of sodium dithionite, the 
fluorescence signal of these vesicles decreased to ~45% of its 
initial value. Immediately adding a second addition of sodium 
dithionite yielded no change in the fluorescence signal. 
However, if the vesicles were allowed to incubate for 6.5 hours, 
a third addition of sodium dithionite decreased the remaining 
fluorescence signal by 50%. How would you interpret the fluo- 
rescence changes at each addition of sodium dithionite? 


10. Flip-flop 2. Although proteins rarely if ever flip-flop 
across a membrane, the distribution of membrane lipids 
between the membrane leaflets is not absolute except for gly- 
colipids. Why are glycosylated lipids less likely to flip-flop? 


11. Linkages. Platelet-activating factor (PAF) is a phos- 
pholipid that plays a role in allergic and inflammatory 
responses, as well as in toxic shock syndrome. The structure 
of PAF is shown here. How does it differ from the struc- 
tures of the phospholipids discussed in this chapter? 


CH3(CH2);5 —O 
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Platelet-activating factor (PAF) 


12. A question of competition. Would a homopolymer of 
alanine be more likely to form an a helix in water or in a 
hydrophobic medium? Explain. 


13. A false positive. Hydropathy plot analysis of your pro- 
tein of interest reveals a single, prominent hydrophobic 
peak. However, you later discover that this protein is solu- 
ble and not membrane associated. Explain how the hydrop- 
athy plot may have been misleading. 


14. Maintaining fluidity. A culture of bacteria growing at 
37°C was shifted to 25°C. How would you expect this shift 
to alter the fatty acid composition of the membrane phos- 
pholipids? Explain. 


15. Let me count the ways. Each intracellular 
fusion of a vesicle with a membrane requires a 
SNARE protein on the vesicle (called the 
v-SNARE) and a SNARE protein on the target 
membrane (called the t-SNARE). Assume that a 
genome encodes 21 members of the v-SNARE 
NH;* family and 7 members of the t-SNARE family. 
With the assumption of no specificity, how 
coo many potential v-SNARE-t-SNARE interac- 
tions could take place? 


Data Interpretation Problems 


16. Cholesterol effects. The red curve on the following 
graph shows the fluidity of the fatty acids of a phospholipid 
bilayer as a function of temperature. The blue curve shows 
the fluidity in the presence of cholesterol. 


No cholesterol 


+ Cholesterol 


Fluidity — 


Tm 
Temperature — 


(a) What is the effect of cholesterol? 
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18. Not all inhibitors are equal. Ibuprofen and indometh- 
acin are clinically important inhibitors of prostaglandin 
H, synthase-1. Cells expressing this enzyme were incu- 
bated under the following conditions, after which the 
activity of the enzyme was measured by adding radiola- 
beled arachidonic acid and detecting newly-produced 
prostaglandin H,: 


ao) 
(b) Why might this effect be biologi- 8 1° 
cally important? ES sot 
Cc 
. Cc 
17. Hydropathy plots. On the basis of a 8 col 
the following hydropathy plots for = s - 
three proteins (A—C), predict which Ee 
would be membrane proteins. What ra 20 
are the ambiguities with respect to a F 
using such plots to determine if a pro- a. (1) 
tein is a membrane protein? 
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(2) (3) (4) 


Ibuprofen Indomethacin 
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(1) (2) (3) (4) 


(1) 40 min without inhibitor (control) 
(2) 40 min with inhibitor 
(3) 40 min with inhibitor, after which the cells were 
resuspended in medium without inhibitor 
(4) 40 min with inhibitor, after which the cells were 
resuspended in medium without inhibitor and incubated 
for an additional 30 min. 
(a) Provide a hypothesis explaining the different results for 
these two inhibitors. 


(b) How would these results look if aspirin were tested in a 
similar fashion? 


Chapter Integration Problem 


19. The proper environment. An understanding of the struc- 
ture and function of membrane proteins has lagged behind 
that of other proteins. The primary reason is that mem- 
brane proteins are more difficult to purify and crystallize. 
Why might this be the case? 


Membrane Channels and Pumps 
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Dr. Mauricio Montal.] 


he lipid bilayer of biological membranes is intrinsically impermeable to 

ions and polar molecules, yet these species must be able to cross these 
membranes for normal cell function. Permeability is conferred by three 
classes of membrane proteins: pumps, carriers, and channels. Pumps use a 
source of free energy such as ATP hydrolysis or light absorption to drive the 
thermodynamically uphill transport of ions or molecules. Pump action is an 
example of active transport. Carriers mediate the transport of ions and small 
molecules across the membrane without consumption of ATP. Channels 
provide a membrane pore through which ions can flow very rapidly in a 
thermodynamically downhill direction. The action of channels illustrates 
passive transport, or facilitated diffusion. 

Pumps are energy transducers in that they convert one form of free 
energy into another. Two types of ATP-driven pumps, P-type ATPases and 
the ATP-binding cassette (ABC) transporters, undergo conformational 
changes on ATP binding and hydrolysis that cause a bound ion to be trans- 
ported across the membrane. The free energy of ATP hydrolysis is used to 
drive the movement of ions against their concentration gradients, a process 
referred to as primary active transport. In contrast, carriers utilize the gradi- 
ent of one ion to drive the transport of another against its gradient. An 
example of this process, termed secondary active transport, is mediated by 
the E. coli lactose transporter, a well-studied protein responsible for the 
uptake of a specific sugar from the environment of a bacterium. Many 
transporters of this class are present in the membranes of our cells. The 
expression of these transporters determines which metabolites a cell can 
import from the environment. Hence, adjusting the level of transporter 
expression is a primary means of controlling metabolism. 


ee a ae a 


The flow of ions through a single membrane channel (channels are shown in red in 
the illustration at the left) can be detected by the patch-clamp technique, which 
records current changes as the channel alternates between open and closed 
states. [(Left) Information from E. Neher and B. Sakmann. The patch clamp technique. 
Copyright © 1992 by Scientific American, Inc. All rights reserved. (Right) Courtesy of 
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CHAPTER 13 Membrane Channels 
and Pumps 


Pumps can establish persistent gradients of particular ions across mem- 
branes. Specific ion channels can allow these ions to flow rapidly across 
membranes down these gradients. These channels are among the most fas- 
cinating molecules in biochemistry in their ability to allow some ions to flow 
freely through a membrane while blocking the flow of even closely related 
species. The opening, or gating, of these channels can be controlled by the 
presence of certain ligands or a particular membrane voltage. Gated ion 
channels are central to the functioning of our nervous systems, acting as 
elaborately switched wires that allow the rapid flow of current. 

Finally, a different class of channel, the cell-to-cell channel, or gap junc- 
tion, allows the flow of metabolites or ions between cells. For example, gap 
junctions are responsible for synchronizing muscle-cell contraction in the 
beating heart. 


The expression of transporters largely defines the metabolic 
activities of a given cell type 


Each cell type expresses a specific set of transporters in its plasma mem- 
brane. This collection of expressed transporters is important because it 
largely determines the ionic composition inside cells and the compounds 
that can be taken up from the extracellular environment. In some sense, the 
specific array of transporters defines the cell’s characteristics because a cell 
can execute only those biochemical reactions for which it has taken up the 
necessary substrates. 

An example from glucose metabolism illustrates this point. As we will 
see in Chapter 16, tissues differ in their ability to employ different mole- 
cules as energy sources. Which tissues can utilize glucose is largely gov- 
erned by the expression of members of the GLUT family of homologous 
glucose transporters. For example, GLUTS3 is the primary glucose trans- 
porter expressed on the plasma membrane of neurons. This transporter 
binds glucose relatively tightly so that these cells have first call on glucose 
when it is present at relatively low concentrations. We will encounter many 
such examples of the critical role that transporter expression plays in the 
control and integration of metabolism. 


13.1 The Transport of Molecules Across a Membrane 
May Be Active or Passive 


We first consider some general principles of membrane transport. Two 
factors determine whether a molecule will cross a membrane: (1) the perme- 
ability of the molecule in a lipid bilayer and (2) the availability of an energy 
source. 


Many molecules require protein transporters to cross membranes 


As stated in Chapter 12, some molecules can pass through cell membranes 
because they dissolve in the lipid bilayer. Such molecules are called lipo- 
philic molecules. The steroid hormones provide a physiological example. 
These cholesterol relatives can pass through a membrane, but what deter- 
mines the direction in which they will move? Such molecules will pass 
through a membrane down their concentration gradient in a process called 
simple diffusion. In accord with the Second Law of Thermodynamics, mol- 
ecules spontaneously move from a region of higher concentration to one of 
lower concentration. 

Matters become more complicated when the molecule is highly polar. 
For example, sodium ions are present at 143 mM outside a typical cell and 


at 14 mM inside the cell. However, sodium does not freely enter the cell, 
because the charged ion cannot pass through the hydrophobic membrane 
interior. In some circumstances, as during a nerve impulse, sodium ions 
must enter the cell. How are they able to do so? Sodium ions pass through 
specific channels in the hydrophobic barrier formed by membrane pro- 
teins. This means of crossing the membrane is called facilitated diffusion 
because the diffusion across the membrane is facilitated by the channel. It 
is also called passive transport because the energy driving the ion move- 
ment originates from the ion gradient itself, without any contribution by 
the transport system. Channels, like enzymes, display substrate specificity 
in that they facilitate the transport of some ions, but not other, even closely 
related, ions. 

How is the sodium gradient established in the first place? In this case, 
sodium must move, or be pumped, against a concentration gradient. 
Because moving the ion from a low concentration to a higher concentration 
results in a decrease in entropy, it requires an input of free energy. Protein 
transporters embedded in the membrane are capable of using an energy 
source to move the molecule up a concentration gradient. Because an input 
of energy from another source is required, this means of crossing the mem- 
brane is called active transport. 


Free energy stored in concentration gradients can be quantified 


An unequal distribution of molecules is an energy-rich condition because 
free energy is minimized when all concentrations are equal. Consequently, 
to attain such an unequal distribution of molecules requires an input of free 
energy. How can we quantify the amount of energy required to generate a 
concentration gradient (Figure 13.1)? Consider an uncharged solute mole- 
cule. The free-energy change in transporting this species from side 1, where 
it is present at a concentration of c,, to side 2, where it is present at concen- 
tration C5, 1s 


AG = RT In(cy/c1) 


where R is the gas constant (8.315 X 107% kJ mol7! deg™!, or 1.987 x 
107% kcal mol~! deg~!) and T is the temperature in kelvins. For a charged 
species, the unequal distribution across the membrane generates an electri- 
cal potential that also must be considered because the ions will be repelled 
by the like charges. The sum of the concentration and electrical terms is 
called the electrochemical potential or membrane potential. The free-energy 
change is then given by 


AG = RT In(co/c;) + ZFAV 


in which Z is the electrical charge of the transported species, AV is the 
potential in volts across the membrane, and F is the Faraday constant 
(96.5 kJ V~! mol™!, or 23.1 kcal V~! mol~!). 

A transport process must be active when AG is positive, whereas it can 
be passive when AG is negative. For example, consider the transport of an 
uncharged molecule from c, = 1077 M to ye 107-1 M. 


AG = RT1n(10'/107°) 
= (8.315 X 10-7) X 298 X In(107) 
= +11.4kJ mol7!(+2.7 kcal mol7') 


At 25°C (298 K), AG is +11.4 kJ mol~! (+2.7 kcal mol~!), indicating that 
this transport process requires an input of free energy. 
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13.1 Active and Passive 
Transport Compared 
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FIGURE 13.1 Free energy and 
transport. The free-energy change in 
transporting (A) an uncharged solute from a 
compartment at concentration c, to one at 
Cy and (B) a singly charged species across 
a membrane to the side having the same 
charge as that of the transported ion. 
Notice that the free-energy change 
imposed by a membrane potential of 

59 mV is equivalent to that imposed by 

a concentration ratio of 10 for a singly 
charged ion at 25°C. 
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The extracellular fluid of animal cells has a salt concentration similar to that 
of seawater. However, cells must control their intracellular salt concentra- 
tions to facilitate specific processes, such as signal transduction and action 
potential propagation, and prevent unfavorable interactions with high con- 
centrations of ions such as Ca2*. For instance, most animal cells contain a 
high concentration of K* and a low concentration of Na* relative to the 
external medium. These ionic gradients are generated by a specific trans- 
port system, an enzyme that is called the Na*—K* pump or the Na*—K* 
ATPase. The hydrolysis of ATP by the pump provides the energy needed 
for the active transport of Na* out of the cell and K* into the cell, generat- 
ing the gradients. The pump is called the Nat-K* ATPase because the 
hydrolysis of ATP takes place only when Na* and K* are present. This 
ATPase, like all such enzymes, requires Mg?*. 

The change in free energy accompanying the transport of Na* and KT 
can be calculated. Suppose that the concentrations of Na* outside and inside 
the cell are 143 and 14 mM, respectively, and the corresponding values for 
K* are 4 and 157 mM. At amembrane potential of —50 mV and a tempera- 
ture of 37°C, we can use the equation on page 369 to determine that the free- 
energy change in transporting 3 mol of Na* out of the cell and 2 mol of KT 
into the cell is 3(5.99) + 2(9.46) = +36.9 kJ mol7! (+8.8 kcal mol7'). 
Under typical cellular conditions, the hydrolysis of a single ATP molecule 
per transport cycle provides sufficient free energy, about —50 kJ mol7! 
(—12 kcal mol~!) to drive the uphill transport of these ions. The active trans- 
port of Na* and K* is of great physiological significance. Indeed, more than 
a third of the ATP consumed by a resting animal is used to pump these ions. 
The Nat—K* gradient in animal cells controls cell volume, renders neurons 
and muscle cells electrically excitable, and drives the active transport of sug- 
ars and amino acids. 


2/4 The purification of other ion pumps has revealed a large family of 
evolutionarily related ion pumps including proteins from bacteria, 
archaea, and all eukaryotes. Each of these pumps is specific for a particular 
ion or set of ions. Two are of particular interest: the sarcoplasmic reticulum 


O 0 2 Ca?+ ATPase (or SERCA) transports Ca** out of the cytoplasm and into the 

| | sarcoplasmic reticulum of muscle cells, and the gastric Ht-K* ATPase is the 

oe AS0 enzyme responsible for pumping sufficient protons into the stomach to lower 

H 0 the pH to 1.0. These enzymes and the hundreds of known homologs, includ- 

ov a a ing the Na*-K* ATPase, are referred to as P-type ATPases because they 

H ] form a key phosphorylated intermediate. In the formation of this intermedi- 

0) ate, a phosphoryl group from ATP is linked to the side chain of a specific 
Phosphorylaspartate conserved aspartate residue in the ATPase to form phosphorylaspartate. 


P-type ATPases couple phosphorylation and conformational changes 
to pump calcium ions across membranes 


Membrane pumps function by mechanisms that are simple in principle but 
often complex in detail. Fundamentally, each pump protein can exist in two 
principal conformational states, one with ion-binding sites open to one side 
of the membrane and the other with ion-binding sites open to the other side 
(Figure 13.2). To pump ions in a single direction across a membrane, the 
free energy of ATP hydrolysis must be coupled to the interconversion 
between these conformational states. 


Energy input 
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(Sai tee 
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Conformation 1 Conformation 2 


We will consider the structural and mechanistic features of P-type 
ATPases by examining SERCA. The properties of this P-type ATPase 
have been established in great detail by relying on crystal structures of 
the pump in five different states. This enzyme, which constitutes 80% of 
the protein in the sarcoplasmic reticulum membrane, plays an important 
role in relaxation of contracted muscle. Muscle contraction is triggered 
by an abrupt rise in the cytoplasmic calcium ion level. Subsequent mus- 
cle relaxation depends on the rapid removal of Ca?* from the cytoplasm 
into the sarcoplasmic reticulum, a specialized compartment for Ca?t 
storage, by SERCA. This pump maintains a Ca?* concentration of 
approximately 0.1 wM in the cytoplasm compared with 1.5 mM in the 
sarcoplasmic reticulum. 

The first structure of SERCA to be determined had Ca?* bound, but 
no nucleotides present (Figure 13.3). SERCA is a single 110-kDa poly- 
peptide with a transmembrane domain consisting of 10 a helices. The 
transmembrane domain includes sites for binding two calcium ions. Each 
calcium ion is coordinated to seven oxygen atoms coming from a combina- 
tion of side-chain glutamate, aspartate, threonine, and asparagine resi- 
dues, backbone carbonyl groups, and water molecules. A large cytoplas- 
mic headpiece constitutes nearly half the molecular weight of the protein 
and consists of three distinct domains, each with a distinct function. One 
domain (N) binds the ATP nucleotide, another (P) accepts the phosphoryl 
group on a conserved aspartate residue, and the third (A) serves as an 
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FIGURE 13.2 Pump action. A simple 
scheme for the pumping of a molecule 
across a membrane. The pump 
interconverts to two conformational states, 
each with a binding site accessible to a 
different side of the membrane. 


FIGURE 13.3 Calcium-pump structure. 
The overall structure of the SERCA P-type 
ATPase. Notice the two calcium ions 
(green) that lie in the center of the 
transmembrane domain. A conserved 
aspartate residue (Asp 351) that binds a 
phosphoryl group lies in the P domain. 

The designation bb refers to backbone 
carbonyl groups. [Drawn from 1SU4.pdb.] 


Calcium-binding 
sites disrupted 


N and P domains have 
closed around the 

phosphorylaspartate 
analog 


k FIGURE 13.4 Conformational 
changes associated with calcium 
pumping. This structure was determined 
in the absence of bound calcium but with 
a phosphorylaspartate analog present in 
the P domain. Notice how different this 
structure is from the calcium-bound 
form shown in Figure 13.3: both the 
transmembrane part (yellow) and the 
A, P, and N domains have substantially 
rearranged. [Drawn from 1WPG.pdb.] 


FIGURE 13.5 Pumping calcium. 

Ca?t ATPase transports Ca?* through the 
membrane by a mechanism that includes 
(1) Ca2* binding from the cytoplasm, 

(2) ATP binding, (8) ATP cleavage with 

the transfer of a phosphoryl group to Asp 
351 on the enzyme, (4) ADP release and 
eversion of the enzyme to release Ca2* 
on the opposite side of the membrane, 
(5) hydrolysis of the phosphorylaspartate 
residue, and (6) eversion to prepare for the 
binding of Ca2* from the cytoplasm. 
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actuator, linking changes in the N and P domains to the transmembrane 
part of the enzyme. 

SERCA is a remarkably dynamic protein. For example, the structure 
of SERCA without bound Ca?*, but with a phosphorylaspartate analog 
present in the P domain, is shown in Figure 13.4. The N and P domains are 
now closed around the phosphorylaspartate analog, and the A domain has 
rotated substantially relative to its position in SERCA with Ca** bound and 
without the phosphoryl analog. Furthermore, the transmembrane part of the 
enzyme has rearranged significantly and the well-organized Ca?* -binding 
sites are disrupted. These sites are now accessible from the side of the mem- 
brane opposite the N, P, and A domains. 

The structural results can be combined with other studies to construct a 
detailed mechanism for Ca** pumping by SERCA (Figure 13.5): 


1. The catalytic cycle begins with the enzyme in its unphosphorylated state 
with two calcium ions bound. We will refer to the overall enzyme confor- 
mation in this state as E,; with Ca?* bound, it is Ey -(Ca?*),. In this confor- 
mation, SERCA can bind calcium ions only on the cytoplasmic side of the 
membrane. This conformation is shown in Figure 13.3. 


2. In the E, conformation, the enzyme can bind ATP. The N, P, and A 
domains undergo substantial rearrangement as they close around the bound 
ATP, but there is no substantial conformational change in the transmem- 
brane domain. The calcium ions are now trapped inside the enzyme. 


3. The phosphoryl group is then transferred from ATP to Asp 351. 


4. Upon ADP release, the enzyme again changes its overall conforma- 
tion, including the membrane domain this time. This new conformation 
is referred to as E, or E,-P in its phosphorylated form. The process of 
interconverting the E, and E, conformations is sometimes referred to 
as eversion. 


E,-(Ca**), 


E,-(Ca**)2(ATP) 


E,-P-(Ca?*).(ADP) 


Asp 351 YO 


In the E,-P conformation, the Ca?t -binding sites become disrupted and 
the calcium ions are released to the side of the membrane opposite that at 
which they entered; ion transport has been achieved. This conformation is 
shown in Figure 13.4. 


5. The phosphorylaspartate residue is hydrolyzed to release inorganic 
phosphate. 


6. With the release of phosphate, the interactions stabilizing the E, confor- 
mation are lost, and the enzyme everts to the E, conformation. 


The binding of two calcium ions from the cytoplasmic side of the mem- 
brane completes the cycle. 

This mechanism likely applies to other P-type ATPases. For example, 
Na*-K* ATPase is an a,f, tetramer. Its a subunit is homologous to 
SERCA and includes a key aspartate residue analogous to Asp 351. The B 
subunit does not directly take part in ion transport. A mechanism analogous 
to that shown in Figure 13.5 applies, with three Na* ions binding from the 
inside of the cell to the E, conformation and two K* ions binding from out- 
side the cell to the E, conformation. 


Digitalis specifically inhibits the Nat—Kt pump by blocking its 
dephosphorylation 


e Certain steroids derived from plants are potent inhibitors (K; ~ 10 nM) 

of the Nat—K* pump. Digitoxigenin and ouabain are members of 
this class of inhibitors, which are known as cardiotonic steroids because of 
their strong effects on the heart (Figure 13.6). These compounds inhibit 
the dephosphorylation of the E,-P form of the ATPase when applied on the 
extracellular face of the membrane. 

Digitalis is a mixture of cardiotonic steroids derived from the dried leaf 
of the foxglove plant (Digitalis purpurea). The compound increases the 
force of contraction of heart muscle and is consequently a choice drug in the 
treatment of congestive heart failure. Inhibition of the Nat—K* pump by 
digitalis leads to a higher level of Na* inside the cell. The diminished Na* 
gradient results in slower extrusion of Ca** by the sodium—calcium 
exchanger, an antiporter (Section 13.3). The subsequent increase in the 
intracellular level of Ca**+ enhances the ability of cardiac muscle to con- 
tract. It is interesting to note that digitalis was used effectively long before 
the discovery of the Na*—K* ATPase. In 1785, William Withering, a 
British physician, heard tales of an elderly woman, known as “the old 
woman of Shropshire,” who cured people of “dropsy” (which today would 
be recognized as congestive heart failure) with an extract of foxglove. 
Withering conducted the first scientific study of the effects of foxglove on 
congestive heart failure and documented its effectiveness. 
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Foxglove (Digitalis purpurea) is the source of 
digitalis, one of the most widely used 
drugs. [Roger Hall/Shutterstock.] 


FIGURE 13.6 Digitoxigenin. Cardiotonic 
steroids such as digitoxigenin inhibit the 
Na*-K* pump by blocking the 
dephosphorylation of E,-P. 
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FIGURE 13.7 Domain arrangement of 
ABC transporters. ABC transporters are 
a large family of homologous proteins 
composed of two transmembrane domains 
and two ATP-binding domains called ATP- 
binding cassettes (ABCs). (A) The 
multidrug-resistance protein is a single 
polypeptide chain containing all four 
domains, whereas (B) the bacterial lipid 
transporter MsbA consists of a dimer of 
two identical chains, containing one of 
each domain. 


P-type ATPases are evolutionarily conserved and play a 
wide range of roles 


Nie Analysis of the complete yeast genome revealed the presence of 16 pro- 

teins that clearly belong to the P-type ATPase family. More-detailed 
sequence analysis suggests that 2 of these proteins transport H* ions, 2 trans- 
port Ca**, 3 transport Na‘, and 2 transport metals such as Cu?". In addition, 
5 members of this family appear to participate in the transport of phospholip- 
ids with amino acid head groups. These 5 proteins help maintain membrane 
asymmetry by transporting lipids such as phosphatidylserine from the outer 
to the inner leaflet of the bilayer membrane. Such enzymes have been termed 
“flippases.” Remarkably, the human genome encodes 70 P-type ATPases. 
All members of this protein family employ the same fundamental mechanism: 
the free energy of ATP hydrolysis drives membrane transport by means of 
conformational changes, which are induced by the addition and removal of a 
phosphoryl group at an analogous aspartate site in each protein. 


Multidrug resistance highlights a family of membrane pumps with 
ATP-binding cassette domains 


e Studies of human disease revealed another large and important fam- 

ily of active-transport proteins, with structures and mechanisms 
quite different from those of the P-type ATPase family. These pumps were 
identified from studies on tumor cells in culture that developed resistance to 
drugs that had been initially quite toxic to the cells. Remarkably, the devel- 
opment of resistance to one drug had made the cells less sensitive to a range 
of other compounds. This phenomenon is known as multidrug resistance. In 
a significant discovery, the onset of multidrug resistance was found to cor- 
relate with the expression and activity of a membrane protein with an 
apparent molecular mass of 170 kDa. This protein acts as an ATP- 
dependent pump that extrudes a wide range of small molecules from cells 
that express it. The protein is called the multidrug-resistance (MDR) protein 
or P-glycoprotein (“glyco” because it includes a carbohydrate moiety). 
Thus, when cells are exposed to a drug, the MDR pumps the drug out of the 
cell before the drug can exert its effects. 


We Analysis of the amino acid sequences of MDR and homologous pro- 
teins revealed a common architecture (Figure 13.7A). Each protein 
comprises four domains: two membrane-spanning domains and two ATP- 
binding domains. The ATP-binding domains of these proteins are called 
ATP-binding cassettes (ABCs) and are homologous to domains in a large fam- 
ily of transport proteins in bacteria and archaea. Transporters that include 
these domains are called ABC transporters. With 79 members, the ABC 
transporters are the largest single family identified in the E. coli genome. The 
human genome includes more than 150 ABC transporter genes. 

The ABC proteins are members of the P-loop NTPase superfamily 
(Section 9.4). The three-dimensional structures of several members of the 
ABC transporter family have now been determined, including that of the 
bacterial lipid transporter MsbA. In contrast with the eukaryotic MDR 
protein, this protein is a dimer of 62-kDa chains: the amino-terminal half of 
each protein contains the membrane-spanning domain, and the carboxyl- 
terminal half contains the ATP-binding cassette (Figure 13.7B). Prokaryotic 
ABC proteins are often made up of multiple subunits, such as a dimer of 
identical chains, as above, or as a heterotetramer of two membrane-spanning 
domain subunits and two ATP-binding-cassette subunits. The consolida- 
tion of the enzymatic activities of several polypeptide chains in prokaryotes 
to a single chain in eukaryotes is a theme that we will see again. The two 
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ATP-binding cassettes are in contact, but they do not interact strongly in 
the absence of bound ATP (Figure 13.8). On the basis of this structure and 
others, as well as on other experiments, a mechanism for active transport by 
these proteins has been developed (Figure 13.9): 


1. The catalytic cycle begins with the transporter free of both ATP and 
substrate. While the distance between the ATP-binding cassettes in this 
form may vary with the individual transporter, the substrate binding region 
of the transporter faces inward. 


2. Substrate enters the central cavity of the transporter from inside the cell. 
Substrate binding induces conformational changes in the ATP-binding cas- 
settes that increase their affinity for ATP. 


3. ATP binds to the ATP-binding cassettes, changing their conformations 
so that the two domains interact strongly with one another. The close inter- 
action of the ABCs reorients the transmembrane helices such that the sub- 
strate binding site is now facing outside the cell (Figure 13.8, far right). 
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FIGURE 13.8 ABC transporter 
structure. Two structures of the bacterial 
lipid transporter MsbA, a representative 
ABC transporter. The nucleotide-free, 
inward-facing form is on the left and the 
ATP-bound, outward-facing form is shown 
in two views (rotated by 90 degrees) in 
the center and on the right. The two 
ATP-binding cassettes (blue) are related 
to the P-loop NTPases and, like them, 
contain P-loops (green). The a helix 
adjacent to the P-loop is shown in 

red. [Drawn from 3B5W and 3B60.pdb.] 


FIGURE 13.9 ABC transporter 
mechanism. The mechanism includes the 
following steps: (1) opening of the channel 
toward the inside of the cell, (2) substrate 
binding and conformational changes in the 
ATP-binding cassettes, (3) ATP binding and 
opening of the channel to the opposite 
face of the membrane, (4) release of the 
substrate to the outside of the cell, and 
(5) ATP hydrolysis to reset the transporter 
to its initial state. 
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FIGURE 13.10 Antiporters, 
symporters, and uniporters. 
Secondary transporters can 
transport two substrates in 
opposite directions (antiporters), 
two substrates in the same 
direction (symporters), or 

one substrate in either 

direction (uniporter). 


4. The outward facing conformation of the transporter has reduced affinity 
for the substrate, enabling the release of the substrate on the opposite face 
of the membrane. 


5. The hydrolysis of ATP and the release of ADP and inorganic phosphate 
reset the transporter for another cycle. 


Whereas eukaryotic ABC transporters generally act to export molecules 
from inside the cell, prokaryotic ABC transporters often act to import spe- 
cific molecules from outside the cell. A specific binding protein acts in con- 
cert with the bacterial ABC transporter, delivering the substrate to the 
transporter and stimulating ATP hydrolysis inside the cell. These binding 
proteins are present in the periplasm, the compartment between the two 
membranes that surround some bacterial cells (Figure 12.35A). 

Thus, ABC transporters use a substantially different mechanism from 
the P-type ATPases to couple the ATP hydrolysis reaction to conforma- 
tional changes. Nonetheless, the net result is the same: the transporters 
are converted from one conformation capable of binding substrate from 
one side of the membrane to another that releases the substrate on the 
other side. 


13.3 Lactose Permease Is an Archetype of Secondary 
Transporters That Use One Concentration Gradient 
to Power the Formation of Another 


Carriers are proteins that transport ions or molecules across the membrane 
without hydrolysis of ATP. The mechanism of carriers involves both large 
conformational changes and the interaction of the protein with only a few 
molecules per transport cycle, limiting the maximum rate at which trans- 
port can occur. Although carriers cannot mediate primary active transport, 
owing to their inability to hydrolyze ATP, they can couple the thermody- 
namically unfavorable flow of one species of ion or molecule up a concen- 
tration gradient to the favorable flow of a different species down a 
concentration gradient, a process referred to as secondary active transport. 
Carriers that move ions or molecules “uphill” by this means are termed 
secondary transporters or cotransporters. These proteins can be classified as 
either antiporters or symporters. Antiporters couple the downhill flow of 
one species to the uphill flow of another in the opposite direction across the 
membrane; symporters use the flow of one species to drive the flow of a 
different species in the same direction across the membrane. Uniporters, 
another class of carriers, are able to transport a specific species in either 
direction governed only by concentrations of that species on either side of 
the membrane (Figure 13.10). 
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Secondary transporters are ancient molecular machines, common today 
in bacteria and archaea as well as in eukaryotes. For example, approximately 
160 (of around 4000) proteins encoded by the E. coli genome are secondary 
transporters. Sequence comparison and hydropathy analysis suggest that 
members of the largest family have 12 transmembrane helices that appear to 
have arisen by duplication and fusion of a membrane protein containing 6 
transmembrane helices. Included in this family is the lactose permease of E. 
coli. This symporter uses the H* gradient across the E. coli membrane (out- 
side has higher H* concentration) generated by the oxidation of fuel mole- 
cules to drive the uptake of lactose and other sugars against a concentration 
gradient. This transporter has been extensively studied for many decades 
and is a useful archetype for this family. 

The structure of lactose permease has been determined (Figure 13.11). 
As expected from the sequence analysis, the protein consists of two halves, 
each of which comprises six membrane-spanning a helices. The two halves 
are well separated and are joined by a single stretch of polypeptide. In this 
structure, a sugar molecule lies in a pocket in the center of the protein and is 
accessible from a path that leads from the interior of the cell. On the basis of 
this structure and a wide range of other experiments, a mechanism for sym- 
porter action has been developed. This mechanism (Figure 13.12) has many 
features similar to those for P-type ATPases and ABC transporters: 


1. The cycle begins with the two halves oriented so that the opening to the 
binding pocket faces outside the cell, in a conformation different from that 
observed in the structures solved to date. A proton from outside the cell 
binds to a residue in the permease, quite possibly Glu 269. 


2. Inthe protonated form, the permease binds lactose from outside the cell. 


3. The structure everts to the form observed in the crystal structure 
(Figure 13.11). 


4. The permease releases lactose to the inside of the cell. 
5. The permease releases a proton to the inside of the cell. 


6. The permease everts to complete the cycle. 


The site of protonation likely changes in the course of this cycle. 

It is believed that this eversion mechanism applies to all classes of 
secondary transporters, which resemble the lactose permease in overall 
architecture. 


Lactose 
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kK FIGURE 13.11 Structure of lactose 
permease with a bound lactose 
analog. The amino-terminal half of the 
protein is shown in blue and the carboxyl- 
terminal half in red. (A) Side view. (B) Bottom 
view (from inside the cell). Notice that the 
structure consists of two halves that 
surround the sugar and are linked to one 
another by only a single stretch of 
polypeptide. [Drawn from 1PV7.pdb.] 


FIGURE 13.12 Lactose permease 
mechanism. The mechanism begins with 
the permease open to the outside of the 
cell (upper left). The permease binds a 
proton from the outside of the cell (1) and 
then binds its substrate (2). The permease 
everts (3) and then releases its substrate 
(4) and a proton (6) to the inside of the cell. 
It then everts (6) to complete the cycle. 
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FIGURE 13.13 Action potential. Signals 
are sent along neurons by the transient 
depolarization and repolarization of the 
membrane. 


13.4 Specific Channels Can Rapidly Transport lons 
Across Membranes 


Pumps and carriers can move ions across the membrane at rates approach- 
ing several thousand ions per second. Other membrane proteins, the 
passive-transport systems called ion channels, are capable of ion-transport 
rates that are more than 1000 times as fast. These rates of transport 
through ion channels are close to rates expected for ions diffusing freely 
through aqueous solution. Yet ion channels are not simply tubes that span 
membranes through which ions can rapidly flow. Instead, they are highly 
sophisticated molecular machines that respond to chemical and physical 
changes in their environments and undergo precisely timed conforma- 
tional changes. 


Action potentials are mediated by transient changes in Nat 
and Kt permeability 


One of the most important manifestations of ion-channel action is the 
nerve impulse, which is the fundamental means of communication in 
the nervous system. A nerve impulse is an electrical signal produced by the 
flow of ions across the plasma membrane of a neuron. The interior of a 
neuron, like that of most other cells, contains a high concentration of KT 
and a low concentration of Na*. These ionic gradients are generated by the 
Nat—K* ATPase. The cell membrane has an electrical potential deter- 
mined by the ratio of the internal to the external concentration of ions. In 
the resting state, the membrane potential is typically —60 mV. A nerve 
impulse, or action potential, is generated when the membrane potential is 
depolarized beyond a critical threshold value (e.g., from —60 to —40 mV). 
The membrane potential becomes positive within about a millisecond and 
attains a value of about +30 mV before turning negative again (repolariza- 
tion). This amplified depolarization is propagated along the nerve terminal 
(Figure 13.13). 

Ingenious experiments carried out by Alan Hodgkin and Andrew 
Huxley revealed that action potentials arise from large, transient changes 
in the permeability of the axon membrane to Na*™ and K* ions. Depolar- 
ization of the membrane beyond the threshold level leads to an increase 
in permeability to Na*. Sodium ions begin to flow into the cell because 
of the large electrochemical gradient across the plasma membrane. The 
entry of Na* further depolarizes the membrane, leading to a further 
increase in Na* permeability. This positive feedback yields the very rapid 
and large change in membrane potential described above and shown in 
Figure 13.13. 

The membrane spontaneously becomes less permeable to Nat and more 
permeable to K*. Consequently, K* flows outward, and so the membrane 
potential returns to a negative value. The resting level of —60 mV is restored 
in a few milliseconds as the K* conductance decreases to the value charac- 
teristic of the unstimulated state. The wave of depolarization followed by 
repolarization moves rapidly along a nerve cell. The propagation of these 
waves allows a touch at the tip of your toe to be detected in your brain in a 
few milliseconds. 

This model for the action potential postulated the existence of ion chan- 
nels specific for Na* and K*. These channels must open in response to 
changes in membrane potential and then close after having remained open 
for a brief period of time. This bold hypothesis predicted the existence of 
molecules with a well-defined set of properties long before tools existed for 
their direct detection and characterization. 
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Patch-clamp conductance measurements reveal the activities 
of single channels 


Direct evidence for the existence of these channels was provided by the 
patch-clamp technique, which was introduced by Erwin Neher and Bert 
Sakmann in 1976. This powerful technique enables the measurement of the 
ion conductance through a small patch of cell membrane. In this technique, 
a clean glass pipette with a tip diameter of about 1 jum is pressed against an 
intact cell to form a seal (Figure 13.14). Slight suction leads to the formation 
of a very tight seal so that the resistance between the inside of the pipette 
and the bathing solution is many gigaohms (1 gigaohm is equal to 10° 
ohms). Thus, a gigaohm seal (called a gigaseal) ensures that an electric cur- 
rent flowing through the pipette is identical with the current flowing 
through the membrane covered by the pipette. The gigaseal makes possible 
high-resolution current measurements while a known voltage is applied 
across the membrane. Remarkably, the flow of ions through a single chan- 
nel and transitions between the open and the closed states of a channel can 
be monitored with a time resolution of microseconds (Figure 13.15). 
Furthermore, the activity of a channel in its native membrane environment, 
even in an intact cell, can be directly observed. Patch-clamp methods pro- 
vided one of the first views of single biomolecules in action. Subsequently, 
other methods for observing single molecules were invented, opening new 
vistas on biochemistry at its most fundamental level. 


The structure of a potassium ion channel is an archetype for many 
ion-channel structures 


With the existence of ion channels firmly established by patch-clamp 
methods, scientists sought to identify the molecules that form ion channels. 
The Na* channel was first purified from the electric organ of electric eel, 
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FIGURE 13.14 Patch-clamp modes. 

The patch-clamp technique for monitoring 
channel activity is highly versatile. A high- 
resistance seal (gigaseal) is formed 
between the pipette and a small patch of 
plasma membrane. This configuration is 
called cell-attached mode. The breaking of 
the membrane patch by increased suction 
produces a low-resistance pathway 
between the pipette and the interior of the 
cell. The activity of the channels in the 
entire plasma membrane can be monitored 
in this whole-cell mode. To prepare a 
membrane in the excised-patch mode, the 
pipette is pulled away from the cell. A piece 
of plasma membrane with its cytoplasmic 
side now facing the medium is monitored 
by the patch pipette. 


FIGURE 13.15 Observing single 
channels. (A) The results of a patch-clamp 
experiment revealing the small amount of 
current, measured in picoamperes (pA, 
10-12 amperes) passing through a single 
ion channel. The downward spikes indicate 
transitions between closed and open 
states. (B) Closer inspection of one of the 
spikes in (A) reveals the length of time the 
channel is in the open state. 
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Tetrodotoxin 


FIGURE 13.16 Sequence relations of ion 
channels. Like colors indicate structurally 
similar regions of the sodium, calcium, and 
potassium channels. Each of these 
channels exhibits approximate fourfold 
symmetry, either within one chain (sodium, 
calcium channels) or by forming tetramers 
(potassium channels). 
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which is a rich source of the protein forming this channel. The channel was 
purified on the basis of its ability to bind tetrodotoxin, a neurotoxin from 
the puffer fish that binds to Na* channels very tightly (K, ~ 1 nM). The 
lethal dose of this poison for an adult human being is about 10 ng. 

The isolated Na* channel is a single 260-kDa chain. Cloning and 
sequencing of cDNAs encoding Na* channels revealed that the channel 
contains four internal repeats, each having a similar amino acid sequence, 
suggesting that gene duplication and divergence have produced the gene for 
this channel. Hydrophobicity profiles indicate that each repeat contains five 
hydrophobic segments (51, 52, 53, $5, and 86). Each repeat also contains a 
highly positively charged 54 segment; positively charged arginine or lysine 
residues are present at nearly every third residue. It was proposed that seg- 
ments 51 through S6 are membrane-spanning « helices, while the positively 
charged residues in 54 act as the voltage sensors of the channel. 

The purification of K* channels proved to be much more difficult 
because of their low abundance and the lack of known high-affinity ligands 
comparable to tetrodotoxin. The breakthrough came in studies of mutant 
fruit flies that shake violently when anesthetized with ether. The mapping 
and cloning of the gene, termed shaker, responsible for this defect revealed 
the amino acid sequence encoded by a K*-channel gene. The shaker gene 
encodes a 70-kDa protein that contains sequences corresponding to seg- 
ments 51 through 56 in one of the repeated units of the Na* channel. Thus, 
a K*-channel subunit is homologous to one of the repeated units of Nat 
channels. Consistent with this homology, four Shaker polypeptides come 
together to form a functional channel. More recently, bacterial K* channels 
were discovered that contain only the two membrane-spanning regions cor- 
responding to segments 55 and S6. This and other information suggested 
that 55 and S6, including the region between them, form the actual pore in 
the K* channel. Segments $1 through $4 contain the apparatus that opens 
the pore. The sequence relations between these ion channels are summa- 
rized in Figure 13.16. 
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In 1998, Roderick MacKinnon and coworkers determined the structure 
of a K* channel from the bacterium Streptomyces lividans by x-ray crystal- 
lography. This channel contains only the pore-forming segments 55 and S56. 
As expected, the K* channel is a tetramer of identical subunits, each of 
which includes two membrane-spanning a helices (Figure 13.17). The four 
subunits come together to form a pore in the shape of a cone that runs 
through the center of the structure. 


The structure of the potassium ion channel reveals the basis 
of ion specificity 


The structure presented in Figure 13.17 probably represents the K* channel 
in a closed form. Nonetheless, it suggests how the channel is able to exclude 
all but K* ions. Beginning from the inside of the cell, the pore starts with a 
diameter of approximately 10 A and then constricts to a smaller cavity with a 
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View down the pore Side view A single subunit 


k FIGURE 13.17 Structure of the potassium ion channel. The Kt channel, composed of 
four identical subunits, is cone shaped, with the larger opening facing the inside of the cell 
(center). A view down the pore, looking toward the outside of the cell, shows the relations of 
the individual subunits (left). One of the four identical subunits of the pore is illustrated at the 


right, with the pore-forming region shown in gray. [Drawn from 1K4C.pdb.] 


diameter of 8 A. Both the opening to the outside and the central 
cavity of the pore are filled with water, and a K* ion can fit in the 
pore without losing its shell of bound water molecules. 
Approximately two-thirds of the way through the membrane, the 
pore becomes more constricted (3-A diameter). At that point, 
any K* ions must give up their water molecules and interact 
directly with groups from the protein. The channel structure effec- 
tively reduces the thickness of the membrane from 34 A to 12 A by 
allowing the solvated ions to penetrate into the membrane before 
the ions must directly interact with the channel (Figure 13.18). 

For K* ions to relinquish their water molecules, other polar 
interactions must replace those with water. The restricted part 
of the pore is built from residues contributed by the two trans- 
membrane « helices. In particular, a five-amino-acid stretch 
within this region functions as the selectivity filter that deter- 
mines the preference for K* over other ions (Figure 13.19). The 
stretch has the sequence Thr-Val-Gly-Tyr-Gly (TVGYG), and 
is nearly completely conserved in all K* channels. The region of 
the strand containing the conserved sequence lies in an extended 
conformation and is oriented such that the peptide carbonyl 
groups are directed into the channel, in good position to interact 
with the potassium ions. 
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FIGURE 13.18 Path through a channel. A potassium ion 
entering the Kt channel can pass a distance of 22 A into 
the membrane while remaining solvated with water (blue). 
At this point, the pore diameter narrows to 3 A (yellow), 
and potassium ions must shed their water and interact 
with carbonyl groups (red) of the pore amino acids. 


FIGURE 13.19 Selectivity filter of the 
potassium ion channel. Potassium ions 
interact with the carbonyl groups of the 
TVGYG sequence of the selectivity filter, 
located at the 3-A-diameter pore of the K* 
channel. Only two of the four channel 
subunits are shown. 


TABLE 13.1 Properties of alkali cations Potassium ion channels are 100-fold more permeable to K* than to Na‘. 


hake ideation: fee How is this high degree of selectivity achieved? Ions having a radius larger 


radius energy in kJ mol! than 1.5 A cannot pass into the narrow diameter (3 A) of the selectivity filter 
Ton (A) (kcal mol™') of the K* channel. However, a bare Na* is small enough (Table 13.1) to 
Lit 0.60 —410 (—98) pass through the pore. Indeed, the ionic radius of Na* is substantially 
Nat 0.95 Ail (=D) smaller than that of K*. How then is Na* rejected? 
Kt 1.33 —230 (—55) The key point is that the free-energy costs of dehydrating these ions are 
Rbt 1.48 2S (=a) considerable [Na‘, 301 kJ mol! (72 kcal mol~!), and K*, 230 kJ mol~! (55 
Cst 1.69 —197 (—47) kcal mol~!)]. The channel pays the cost of dehydrating Kt by providing opti- 
mal compensating interactions with the carbonyl oxygen atoms lining the selec- 
tivity filter. Careful studies of the potassium channel, enabled by the deter- 
mination of its three-dimensional structure, have revealed that the interior 
of the pore is a highly dynamic, fluid environment. The favorable interac- 
tions between the carbonyl oxygen atoms, which carry a partial negative 
charge, with the cation are balanced by the repulsion of these oxygen atoms 
from one another. For this channel, the ideal balance is achieved with K*, 
but not with Na* (Figure 13.20). Hence, sodium ions are rejected because 
the higher energetic cost of dehydrating them would not be recovered. 
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FIGURE 13.20 Energetic basis of ion selectivity. The energy cost of dehydrating a 
potassium ion is compensated by favorable interactions with the selectivity filter. Because a 
sodium ion is too small to interact favorably with the selectivity filter, the free energy of 
desolvation cannot be compensated and the sodium ion does not pass through the channel. 


The K* channel structure enables a clearer understanding of the 

structure and function of Na* and Ca”* channels because of their 
homology to K* channels. Sequence comparisons and the results of muta- 
genesis experiments have implicated the region between segments $5 and S6 
in ion selectivity in the Ca2* channel. In Ca2* channels, one glutamate resi- 
due of this region in each of the four repeated units plays a major role in 
determining ion selectivity. Residues in the positions corresponding to the 
glutamate residues in Ca?* channels are major components of the selectiv- 
ity filter of the Na* channel. These residues—aspartate, glutamate, lysine, 
and alanine—are located in each of the internal repeats of the Na* channel, 
forming a region termed the DEKA locus. Thus, the potential fourfold 
symmetry of the channel is clearly broken in this region, which explains 
why Na* channels consist of a single large polypeptide chain rather than a 
noncovalent assembly of four identical subunits. The preference of the Nat 
channel for Na* over Kt depends on ionic radius; the diameter of the pore 
determined by these residues and others is sufficiently restricted that small 
ions suchas Na* and Li‘ can pass through the channel, but larger ions such 
as K* are significantly hindered. 
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The structure of the potassium ion channel explains its rapid 
rate of transport 


The tight binding sites required for ion selectivity should slow the progress 
of ions through a channel, yet ion channels achieve rapid rates of ion trans- 
port. How is this paradox resolved? A structural analysis of the K* channel 
at high resolution provides an appealing explanation. Four K*-binding 
sites crucial for rapid ion flow are present in the constricted region of the K* 
channel. Consider the process of ion conductance starting from inside the 
cell (Figure 13.21). A hydrated potassium ion proceeds into the channel and 
through the relatively unrestricted part of the channel. The ion then gives 
up its coordinated water molecules and binds to a site within the selectivity- 
filter region. The ion can move between the four sites within the selectivity 
filter because they have similar ion affinities. As each subsequent potassium 
ion moves into the selectivity filter, its positive charge will repel the potas- 
sium ion at the nearest site, causing it to shift to a site farther up the channel 
and in turn push upward any potassium ion already bound to a site farther 
up. Thus, each ion that binds anew favors the release of an ion from the 
other side of the channel. This multiple-binding-site mechanism solves the 
paradox of high ion selectivity and rapid flow. 


Voltage gating requires substantial conformational changes 
in specific ion-channel domains 


Some Na* and K* channels are gated by membrane potential; that is, they 
change conformation to a highly conducting form in response to changes in 
voltage across the membrane. As already noted, these voltage-gated chan- 
nels include segments 51 through S4 in addition to the pore itself formed by 
$5 and S6. The structure of a voltage-gated K* channel from Aeropyrum 
pernix has been determined by x-ray crystallography (Figure 13.22). The 
segments 51 through $4 form domains, termed “paddles,” that extend 
from the core of the channel. These paddles include the segment 54, the 
voltage sensor itself. Segment 54 forms an a helix lined with positively 
charged residues. In contrast with expectations, segments 51 through S4 
are not enclosed within the protein but, instead, are positioned to lie in the 
membrane itself. 


Repulsion 


FIGURE 13.21 Model for Kt-channel ion 
transport. The selectivity filter has four 
binding sites. Hydrated potassium ions can 
enter these sites, one at a time, losing their 
hydration shells. When two ions occupy 
adjacent sites, electrostatic repulsion forces 
them apart. Thus, as ions enter the channel 
from one side, other ions are pushed out 
the other side. 


383 


FIGURE 13.23 A model for voltage 
gating of ion channels. The voltage- 
sensing paddles lie in the “down” 


position below the closed channel (left). 


Membrane depolarization pulls these 
paddles through the membrane. The 
motion pulls the base of the channel 
apart, opening the channel (right). 
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hk FIGURE 13.22 Structure of a voltage-gated potassium channel. (A) A view looking 
down through the pore. (B) A side view. Notice that the positively charged S4 region (red) lies 
on the outside of the structure at the bottom of the pore. [Drawn from 1ORQ.pdb.] 


A model for voltage gating has been proposed by Roderick MacKinnon 
and coworkers on the basis of this structure and a range of other experi- 
ments (Figure 13.23). In the closed state, the paddles lie in a “down” posi- 
tion. On membrane depolarization, the cytoplasmic side of the membrane 
becomes more positively charged, and the paddles are pulled through the 
membrane into an “up” position. In this position, they pull the four sides of 
the base on the pore apart, increasing access to the selectivity filter and 
opening the channel. 
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A channel can be inactivated by occlusion of the pore: 
the ball-and-chain model 


The K* channel and the Na* channel undergo inactivation within milli- 
seconds of opening (Figure 13.24). A first clue to the mechanism of inacti- 
vation came from exposing the cytoplasmic side of either channel to 
trypsin; cleavage by trypsin produced trimmed channels that stayed persis- 
tently open after depolarization. Furthermore, a mutant Shaker channel 
lacking 42 amino acids near the amino terminus opened in response to 
depolarization but did not inactivate. Remarkably, inactivation was restored 
by adding a synthetic peptide corresponding to the first 20 residues of the 
native channel. 

These experiments strongly support the ball-and-chain model for chan- 
nel inactivation that had been proposed years earlier (Figure 13.25). 
According to this model, the first 20 residues of the K* channel form a 
cytoplasmic unit (the ball) that is attached to a flexible segment of the poly- 
peptide (the chain). When the channel is closed, the ball rotates freely in 
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FIGURE 13.24 Inactivation of the 
potassium ion channel. The amino- 
terminal region of the K* chain is critical 
for inactivation. (A) The wild-type Shaker 
(C) Kt channel displays rapid inactivation 
after opening. (B) A mutant channel lacking 


Mutant + peptide residues 6 through 46 does not inactivate. 
\ (C) Inactivation can be restored by adding a 
peptide consisting of residues 1 through 20 


| | | | at aconcentration of 100 uM. [Data from 
. ID a 60 _W.N. Zagotta, T. Hoshi, and R. W. Aldrich, 
Time after depolarization (ms) Science 250:568-571, 1990] 
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the aqueous solution. When the channel opens, the ball quickly finds a 
complementary site in the open pore and occludes it. Hence, the channel 
opens for only a brief interval before it undergoes inactivation by occlu- 
sion. Shortening the chain speeds inactivation because the ball finds its 
target more quickly. Conversely, lengthening the chain slows inactivation. 
Thus, the duration of the open state can be controlled by the length and 
flexibility of the tether. In some senses, the “ball” domains, which include 
substantial regions of positive charge, can be thought of as large, tethered 
cations that are pulled into the open channel but get stuck and block fur- 
ther ion conductance. 


Closed 


Inactivation 
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The acetylcholine receptor is an archetype for ligand-gated ion 
channels 


Nerve impulses are communicated across synapses by small, diffusible mol- 
ecules called neurotransmitters. One neurotransmitter is acetylcholine. The 
presynaptic membrane of a synapse is separated from the postsynaptic 
membrane by a gap of about 50 nm called the synaptic cleft. The arrival of a 
nerve impulse at the end of an axon leads to the synchronous export of the 
contents of some 300 membrane-bound compartments, or vesicles, of ace- 
tylcholine into the cleft (Figure 13.26). The binding of acetylcholine to the 
postsynaptic membrane markedly changes its ionic permeability, triggering 
an action potential. Acetylcholine opens a single kind of cation channel, 
called the acetylcholine receptor, which is almost equally permeable to Na* 
and to K*. 
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FIGURE 13.25 Ball-and-chain model for 
channel inactivation. The inactivation 
domain, or “ball” (gray), is tethered to the 
channel by a flexible “chain.” In the closed 
state, the ball is located in the cytoplasm. 
Depolarization opens the channel and 
creates a binding site for the positively 
charged ball in the mouth of the pore. 
Movement of the ball into this site 
inactivates the channel by occluding it. 
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Direction of The acetylcholine receptor is the best-understood ligand-gated channel. 
Pemelmpulee This type of channel is gated not by voltage but by the presence of specific 
| — Presynaptic ligands. The binding of acetylcholine to the channel is followed by its tran- 
membrane sient opening. The electric organ of Torpedo marmorata, an electric ray, is 
a choice source of acetylcholine receptors for study because its electro- 
plaxes (voltage-generating cells) are very rich in postsynaptic membranes 
eyiaode dete that respond to this neurotransmitter. The receptor is very densely packed 
@® —_ in these membranes (~20,000 wm). The acetylcholine receptor of the 
electric organ has been solubilized by adding a nonionic detergent to a 
; post-synaptic membrane preparation and purified by affinity chromatog- 
Postsynaptic ; ; ; 
membrane raphy on a column bearing covalently attached cobratoxin, a small protein 
toxin from snakes that has a high affinity for acetylcholine receptors. With 
the use of techniques presented in Chapter 3, the 268-kDa receptor was 
identified as a pentamer of four kinds of membrane-spanning subunits— 
a, B, y, and 6—arranged in the form of a ring that creates a pore through 
the membrane. 

The cloning and sequencing of the cDNAs for the four kinds of subunits 
(50-58 kDa) showed that they have clearly similar sequences; the genes for 
the a, B, y, and 6 subunits arose by duplication and divergence of acommon 
ancestral gene. Each subunit has a large extracellular domain, followed at 
the carboxyl end by four predominantly hydrophobic segments that span 
the bilayer membrane. Acetylcholine binds at the a—y and a—8 interfaces. 
Electron microscopic studies of purified acetylcholine receptors demon- 
strated that the structure has approximate fivefold symmetry, in harmony 
with the similarity of its five constituent subunits (Figure 13.27). 

What is the basis of channel opening? A comparison of the structures 
of the closed and open forms of the channel would be highly revealing, but 
such comparisons have been difficult to obtain. Cryoelectron micrographs 
indicate that the binding of acetylcholine to the extracellular domain 
causes a structural alteration that initiates rotations of the a-helical rods 
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FIGURE 13.26 Schematic representation 
of a synapse. 


The torpedo (Torpedo marmorata, also known 
as the electric ray) has an electric organ, rich in 


acetylcholine receptors, that can deliver a shock lining the membrane-spanning pore. The amino acid sequences of these 
of as much as 200 V for approximately 1 s. helices point to the presence of alternating ridges of small polar or neutral 
[Wes Gladu/Jacana/Photo Researchers. ] residues (serine, threonine, glycine) and large nonpolar ones (isoleucine, 
(A) ( 
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Kk FIGURE 13.27 Structure of the acetylcholine 
receptor. A model for the structure of the acetylcholine 
receptor deduced from high-resolution electron 
microscopic studies reveals that each subunit consists of a 
large extracellular domain consisting primarily of 8 strands, 
four membrane-spanning «a helices, and a final a helix 
inside the cell. (A) A side view shows the pentameric 
receptor with each subunit type in a different color. One 
copy of the a subunit is shown in isolation. (B) A view down 
the channel from outside the cell. [Drawn from 2BG9.pdb.] 
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leucine, phenylalanine). In the closed state, the large residues may occlude 
the channel by forming a tight hydrophobic ring (Figure 13.28). Indeed, 
each subunit has a bulky leucine residue at a critical position. The binding 
of acetylcholine could allosterically rotate the membrane-spanning helices 
so that the pore would be lined by small polar residues rather than by large 
hydrophobic ones. The wider, more polar pore would then be open to the 
passage of Na* and K* ions. 


Action potentials integrate the activities of several ion channels 
working in concert 


To see how ligand-gated and voltage-gated channels work together to gener- 
ate a sophisticated physiological response, we now revisit the action potential 
introduced at the beginning of this section. First, we need to introduce the 
concept of equilibrium potential. Suppose that a membrane separates two 
solutions that contain different concentrations of some cation XT, as well 
as an equivalent amount of anions to balance the charge in each solution 
(Figure 13.29). Let [X*]., be the concentration of X* on one side of the 
membrane (corresponding to the inside of a cell) and [X*],,,, be the concen- 
tration of X* on the other side (corresponding to the outside of a cell). 
Suppose that an ion channel opens that allows X* to move across the mem- 
brane. What will happen? It seems clear that X* will move through the 
channel from the side with the higher concentration to the side with the 
lower concentration. However, positive charges will start to accumulate on 
the side with the lower concentration, making it more difficult to move each 
additional positively charged ion. An equilibrium will be achieved when the 
driving force due to the concentration gradient is balanced by the electro- 
static force resisting the motion of an additional charge. In these circum- 
stances, the membrane potential is given by the Nernst equation: 


Veq —(RYT/2F) In( [Sind LA oue) 


where R is the gas constant and F is the Faraday constant (96.5 kJ V~! mol7!, 
or 23.1 kcal V~! mol~!) and z is the charge on the ion X (e.g., +1 for X*). 
The membrane potential at equilibrium is called the equilibrium poten- 
tial for a given ion at a given concentration ratio across a membrane. For 
sodium with [Na*],, = 14 mM and [Na*],,,, = 143 mM, the equilibrium 
potential is +62 mV at 37°C. Similarly, for potassium with [K*]., = 157 mM 
and [K*],.,, = 4 mM, the equilibrium potential is —98 mV. In the absence of 
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FIGURE 13.28 Opening the acetylcholine 
receptor. Cross sections from electron 
microscopic reconstructions of the 
acetylcholine receptor in (top) its closed 
form and (bottom) its open form. (The open 
form corresponds to the structure shown in 
Figure 13.27). The areas labeled M1, M2, 
M8, and M4 correspond to the four 
membrane-spanning «a helices of one 
subunit. The cross section of the open 
channel was generated by treating the 
receptor with acetylcholine and freezing 

the sample within 20 ms. Notice that 

the hole in the center of the channel is 
substantially larger in the open structure. 
The enlargement of the hole is due to the 
rotation of the M2 helices by approximately 
15 degrees along their long axes. [Courtesy 
of Nigel Unwin.] 


FIGURE 13.29 Equilibrium potential. The 
membrane potential reaches equilibrium 
when the driving force due to the 
concentration gradient is exactly balanced 
by the opposing force due to the repulsion 
of like charges. 
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FIGURE 13.30 Action-potential 
mechanism. (A) On the initiation of an 
action potential, the membrane potential 
moves from the resting potential upward 
toward the Nat equilibrium potential and 
then downward toward the K* equilibrium 
potential. (B) The currents through the Nat 
and Kt channels underlying the action 
potential. (C) The states of the Na* and K* 
channels during the action potential. 


388 


stimulation, the resting potential for a typical neuron is —60 mV. This value 
is close to the equilibrium potential for K* owing to the fact that a small 
number of K* channels are open. 

We are now prepared to consider what happens in the generation of an 
action potential (Figure 13.30). Initially, a neurotransmitter such as acetyl- 
choline is released into the synaptic cleft from a presynaptic membrane 
(Figure 13.26). The released acetylcholine binds to the acetylcholine receptor 
on the postsynaptic membrane, causing it to open within less than a millisec- 
ond. The acetylcholine receptor is a nonspecific cation channel. Sodium ions 
flow into the cell and potassium ions flow out of the cell. Without any further 
events, the membrane potential would move to a value corresponding to the 
average of the equilibrium potentials for Na* and K*, approximately —20 mV. 
However, as the membrane potential approaches —40 mV, the voltage- 
sensing paddles of Na* channels are pulled into the membrane, opening the 
Na‘ channels. With these channels open, sodium ions flow rapidly into the 
cell and the membrane potential rises rapidly toward the Na* equilibrium 
potential (Figure 13.30B, red curve). The voltage-sensing paddles of Kt 
channels also are pulled into the membrane by the changed membrane poten- 
tial, but more slowly than Na* channel paddles. Nonetheless, after approxi- 
mately 1 ms, many K* channels start to open. At the same time, inactivation 
“ball” domains plug the open Na* channels, decreasing the Na* current. 
The acetylcholine receptors that initiated these events are also inactivated on 
this time scale. With the Na* channels inactivated and only the K* channels 
open, the membrane potential drops rapidly toward the K* equilibrium 
potential (Figure 13.30B, blue curve). The open K* channels are susceptible 
to inactivation by their “ball” domains, and these K* currents, too, are 
blocked. With the membrane potential returned to close to its initial value, 
the inactivation domains are released and the channels return to their original 
closed states. These events propagate along the neuron as the depolarization 
of the membrane opens channels in nearby patches of membrane. 

How much current actually flows across the membrane over the course 
of an action potential? This question can be addressed from two comple- 
mentary directions. First, a typical nerve cell contains 100 Na* channels per 
square micrometer. At a membrane potential of +20 mV, each channel 
conducts 107 ions per second. Thus, in a period of 1 millisecond, approxi- 
mately 10° ions flow through each square micrometer of membrane surface. 
Assuming a cell volume of 104 um? and a surface area of 10* pm”, this rate 
of ion flow corresponds to an increase in the Na‘ concentration of less than 
1%. How can this be? A robust action potential is generated because the 
membrane potential is very sensitive to even a slight change in the distribu- 
tion of charge. This sensitivity makes the action potential a very efficient 
means of signaling over long distances and with rapid repetition rates. 


Disruption of ion channels by mutations or chemicals can be 
potentially life-threatening 


e The generation of an action potential requires the precise coordina- 

tion of gating events of a collection of ion channels. Perturbation of 
this timing can have devastating effects. For example, the rhythmic gen- 
eration of action potentials by the heart is absolutely essential to maintain 
delivery of oxygenated blood to the peripheral tissues. Long QT syndrome 
(LQTS) is a genetic disorder in which the recovery of the action potential 
from its peak potential to the resting equilibrium potential is delayed. 
The term “QT” refers to a specific feature of the cardiac electrical activ- 
ity pattern as measured by electrocardiography. LQTS can lead to brief 
losses of consciousness (syncope), disruption of normal cardiac rhythm 


(arrhythmia), and sudden death. The most common mutations identified 
in LQTS patients inactivate K* channels or prevent the proper trafficking 
of these channels to the plasma membrane. The resulting loss in potassium 
permeability slows the repolarization of the membrane and delays the 
induction of the subsequent cardiac contraction, rendering the cardiac tis- 
sue susceptible to arrhythmias. 

Prolongation of the cardiac action potential in this manner can also be 
induced by a number of therapeutic drugs. In particular, the K* channel 
hERG (for human ether-a-go-go-related gene, named for its ortholog in 
Drosophila melanogaster) is highly susceptible to interactions with certain 
drugs. The hydrophobic regions of these drugs can block hERG by binding 
to two nonconserved aromatic residues on the internal surface of the chan- 
nel cavity. In addition, this cavity is predicted to be wider than other KT 
channels because of the absence of a conserved Pro-X-Pro motif within the 
56 hydrophobic segment. Inhibition of hERG by these drugs can lead to an 
increased risk of cardiac arrhythmias and sudden death. Accordingly, a 
number of these agents, such as the antihistamine terfenadine, have been 
withdrawn from the market. Screening for the inhibition of hERG 1s now a 
critical safety hurdle for the pharmaceutical advancement of a molecule to 
an approved drug. 


13.5 Gap Junctions Allow lons and Small Molecules to 
Flow Between Communicating Cells 


The ion channels that we have considered thus far have narrow pores and 
are moderately to highly selective in the ions that they allow to pass 
through them. They are closed in the resting state and have short lifetimes 
in the open state, typically a millisecond, that enable them to transmit 
frequent neural signals. We turn now to a channel with a very different 
role. Gap junctions, also known as cell-to-cell channels, serve as passage- 
ways between the interiors of contiguous cells. Gap junctions are clustered 
in discrete regions of the plasma membranes of apposed cells. Electron 
micrographs of sheets of gap junctions show them tightly packed in a regu- 
lar hexagonal array (Figure 13.31). An approximately 20-A central hole, 
the lumen of the channel, is prominent in each gap junction. These chan- 
nels span the intervening space, or gap, between apposed cells (hence, the 
name “gap junction”). The width of the gap between the cytoplasms of the 
two cells is about 35 A. 

Small hydrophilic molecules as well as ions can pass through gap junc- 
tions. The pore size of the junctions was determined by microinjecting a 
series of fluorescent molecules into cells and observing their passage into 
adjoining cells. All polar molecules with a mass of less than about 1 kDa can 
readily pass through these cell-to-cell channels. Thus, inorganic ions and 
most metabolites (e.g., sugars, amino acids, and nucleotides) can flow between 
the interiors of cells joined by gap junctions. In contrast, proteins, nucleic 
acids, and polysaccharides are too large to traverse these channels. Gap junc- 
tions ave important for intercellular communication. Cells in some excitable 
tissues, such as heart muscle, are coupled by the rapid flow of ions through 
these junctions, which ensures a rapid and synchronous response to stimuli. 
Gap junctions are also essential for the nourishment of cells that are distant 
from blood vessels, as in lens and bone. Moreover, communicating channels 
are important in development and differentiation. For example, the quies- 
cent uterus transforms to a forcefully contracting organ at the onset of labor; 
the formation of functional gap junctions at that time creates a syncytium of 
muscle cells that contract in synchrony. 
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FIGURE 13.31 Gap junctions. This 
electron micrograph shows a sheet of 
isolated gap junctions. The cylindrical 
connexons form a hexagonal lattice having 
a unit-cell length of 85 A. The densely 
stained central hole has a diameter of 
about 20 A. [Don W. Fawcett/Photo 
Researchers. ] 
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FIGURE 13.32 Structure of a gap junction. (A) Six connexins join to form a connexon, or 
hemichannel, within the plasma membrane (yellow). A single connexin monomer is highlighted 
in red. The extracellular region of one connexon binds to the same region of a connexon from 
another cell (blue), forming a complete gap junction. (B) Schematic view of the gap junction, 
oriented in the same direction as in (A). (C) A bottom-up view looking through the pore of a 
gap junction. This perspective is visualized in Figure 13.31. [(A) and (C) Drawn from 2ZW3.pdb; 
(B) Information from Dr. Werner Loewenstein.] 


A cell-to-cell channel is made of 12 molecules of connexin, one of a fam- 
ily of transmembrane proteins with molecular masses ranging from 30 to 
42 kDa. Each connexin molecule contains four membrane-spanning helices 
(Figure 13.32A). Six connexin molecules are hexagonally arrayed to form a 
half-channel, called a connexon or hemichannel. Two connexons join end to 
end in the intercellular space to form a functional channel between the com- 
municating cells (Figure 13.32B). Each connexon adopts a funnel-shape: at 
the cytoplasmic face, the inner diameter of the channel is 35 A, while at its 
innermost point, the pore narrows to a diameter of 14 A (Figure 13.32C). 
Cell-to-cell channels differ from other membrane channels in three respects: 
(1) they traverse two membranes rather than one; (2) they connect cyto- 
plasm to cytoplasm, rather than to the extracellular space or the lumen of an 
organelle; and (3) the connexons forming a channel are synthesized by dif- 
ferent cells. Gap junctions form readily when cells are brought together. 
A cell-to-cell channel, once formed, tends to stay open for seconds to min- 
utes. They are closed by high concentrations of calcium ion and by low pH. 
The closing of gap junctions by Ca** and H™ serves to seal normal cells from 
injured or dying neighbors. Gap junctions are also controlled by membrane 
potential and by hormone-induced phosphorylation. 

The human genome encodes 21 distinct connexins. Different members 
of this family are expressed in different tissues. For example, connexin 26 is 
expressed in key tissues in the ear. Mutations in this connexin are associated 
with hereditary deafness. The mechanistic basis for this deafness appears to 
be insufficient transport of ions or second-messenger molecules, such as 
inositol trisphosphate, between sensory cells. 


13.6 Specific Channels Increase the Permeability of 
Some Membranes to Water 


One more important class of channels does not take part in ion transport at 
all. Instead, these channels increase the rate at which water flows through 
membranes. As noted in Section 12.3, membranes are reasonably 


permeable to water. Why, then, are water-specific channels required? In 
certain tissues, in some circumstances, rapid water transport through mem- 
branes is necessary. In the kidney, for example, water must be rapidly reab- 
sorbed into the bloodstream after filtration. Similarly, in the secretion of 
saliva and tears, water must flow quickly through membranes. These obser- 
vations suggested the existence of specific water channels, but initially the 
channels could not be identified. 

The channels (now called aquaporins) were discovered serendipitously. 
Peter Agre noticed a protein present at high levels in red-blood-cell mem- 
branes that had been missed because the protein does not stain well with 
Coomassie blue. In addition to red blood cells, this protein was found in 
large quantities in tissues such as the kidney and the cornea, precisely the 
tissues thought to contain water channels. On the basis of this observation, 
further studies were designed, revealing that this 24-kDa membrane pro- 
tein is, indeed, a water channel. 

The structure of aquaporin has been determined (Figure 13.33). The 
protein consists of six membrane-spanning a helices. Two loops containing 
hydrophilic residues line the actual channel. Water molecules pass through 
in single file at a rate of 10° molecules per second. Importantly, specific 
positively charged residues toward the center of the channel prevent the 
transport of protons through aquaporin. Thus, aquaporin channels will not 
disrupt proton gradients, which play fundamental roles in energy transduc- 
tion, as we will see in Chapter 18. Remarkably, the aquaporins are channels 
that have evolved specifically to conduct uncharged substrates. 


Hydrophilic 
residues 


SUMMARY 


13.1 The Transport of Molecules Across a Membrane May Be 

Active or Passive 

For a net movement of molecules across a membrane, two features 
are required: (1) the molecule must be able to cross a hydrophobic 
barrier and (2) an energy source must power the movement. Lipophilic 
molecules can pass through a membrane’s hydrophobic interior by 
simple diffusion. These molecules will move down their concentration 
gradients. Polar or charged molecules require proteins to form pas- 
sages through the hydrophobic barrier. Passive transport or facilitated 
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k FIGURE 13.33 Structure of 
aquaporin. The structure of aquaporin 
viewed from the side (left) and from the top 
(right). Notice the hydrophilic residues 
(shown as space-filling models) that line the 
water channel. [Drawn from 1J4N.pdb.] 
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13.2 


13.3 


13.4 


diffusion takes place when an ion or polar molecule moves down its 
concentration gradient. If a molecule moves against a concentration 
gradient, an external energy source is required; this movement is 
referred to as active transport and results in the generation of concen- 
tration gradients. The electrochemical potential measures the com- 
bined ability of a concentration gradient and an uneven distribution of 
charge to drive species across a membrane. 


Two Families of Membrane Proteins Use ATP Hydrolysis to Pump lons 
and Molecules Across Membranes 

Active transport is often carried out at the expense of ATP hydrolysis. 
P-type ATPases pump ions against a concentration gradient and 
become transiently phosphorylated on an aspartic acid residue in the 
process of transport. P-type ATPases, which include the sarcoplasmic 
reticulum Ca** ATPase and the Nat—Kt ATPase, are integral mem- 
brane proteins with conserved structures and catalytic mechanisms. 
Membrane proteins containing ATP-binding cassette domains are 
another family of ATP-dependent pumps. Each pump includes four 
major domains: two domains span the membrane and two others 
contain ABC P-loop ATPase structures. These pumps are not phos- 
phorylated during pumping; rather, they use the energy of ATP bind- 
ing and hydrolysis to drive conformational changes that result in the 
transport of specific substrates across membranes. The multidrug- 
resistance proteins confer resistance on cancer cells by pumping che- 
motherapeutic drugs out of a cancer cell before the drugs can exert 
their effects. 


Lactose Permease is an Archetype of Secondary Transporters That 
Use One Concentration Gradient to Power the Formation of Another 
Carriers are proteins that transport ions or molecules across the mem- 
brane without hydrolysis of ATP. They can be classified as uniporters, 
antiporters, and symporters. Uniporters transport a substrate in either 
direction, determined by the concentration gradient. Antiporters and 
symporters can mediate secondary active transport by coupling the 
uphill flow of one substrate to the downhill flow of another. Antiporters 
couple the downhill flow of one substrate in one direction to the uphill 
flow of another in the opposite direction. Symporters move both sub- 
strates in the same direction. Studies of the lactose permease from 
E. coli have been a source of insight into both the structures and the 
mechanisms of secondary transporters. 


Specific Channels Can Rapidly Transport lons Across Membranes 

Ion channels allow the rapid movement of ions across the hydrophobic 
barrier of the membrane. The activity of individual ion-channel mol- 
ecules can be observed by using patch-clamp techniques. Many ion 
channels have a common structural framework. In regard to Kt chan- 
nels, hydrated potassium ions must transiently lose their coordinated 
water molecules as they move to the narrowest part of the channel, 
termed the selectivity filter. In the selectivity filter, peptide carbonyl 
groups coordinate the ions. Rapid ion flow through the selectivity 
filter is facilitated by ion—ion repulsion, with one ion pushing the next 
ion through the channel. Some ion channels are voltage gated: changes 
in membrane potential induce conformational changes that open these 
channels. Many channels spontaneously inactivate after having been 
open for a short period of time. In some cases, inactivation is due to the 
binding of a domain of the channel termed the “ball” in the mouth of 
the channel to block it. Other channels, typified by the acetylcholine 


13.5 


13.6 


receptor, are opened or closed by the binding of ligands. Ligand-gated 
and voltage-gated channels work in concert to generate action poten- 
tials. Inherited mutations or drugs that interfere with the ion channels 
that produce the action potential can result in potentially life threaten- 
ing conditions. 


Gap Junctions Allow lons and Small Molecules to Flow Between 
Communicating Cells 

In contrast with many channels, which connect the cell interior with 
the environment, gap junctions, or cell-to-cell channels, serve to con- 
nect the interiors of contiguous groups of cells. A cell-to-cell channel 
is composed of 12 molecules of connexin, which associate to form two 
6-membered connexons. 


Specific Channels Increase the Permeability of Some 

Membranes to Water 

Some tissues contain proteins that increase the permeability of 
membranes to water. Each water-channel-forming protein, termed an 
aquaporin, consists of six membrane-spanning « helices and a central 
channel lined with hydrophilic residues that allow water molecules to 
pass in single file. Aquaporins do not transport protons. 


KEY TERMS 
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pump (p. 367) 

carrier (p. 367) 

channel (p. 367) 

active transport (p. 367) 


facilitated diffusion (passive transport) 
(p. 367) 


ATP-driven pump (p. 367) 
primary active transport (p. 367) 
secondary active transport (p. 367) 
simple diffusion (p. 368) 


electrochemical potential (membrane 
potential) (p. 369) 

Na*—K* pump (Na*t-K* ATPase) 
(p. 370) 


sarcoplasmic reticulum Ca?* ATPase 


(SERCA) (p. 370) 
gastric H+-K* ATPase (p. 370) 
P-type ATPase (p. 370) 


PROBLEMS 


eversion (p. 372) 

cardiotonic steroid (p. 373) 

digitalis (p. 373) 

multidrug resistance (p. 374) 

multidrug-resistance (MDR) protein 
(P-glycoprotein) (p. 374) 

ATP-binding cassette (ABC) domain 
(p. 374) 

ABC transporter (p. 374) 


secondary transporter (cotransporter) 
(p. 376) 


antiporter (p. 376) 
symporter (p. 376) 
uniporter (p. 376) 
lactose permease (p. 377) 
ion channel (p. 378) 
nerve impulse (p. 378) 
action potential (p. 378) 


patch-clamp technique (p. 379) 
gigaseal (p. 379) 

selectivity filter (p. 381) 
voltage-gated channel (p. 383) 
ball-and-chain model (p. 384) 
neurotransmitter (p. 385) 
acetylcholine (p. 385) 

synaptic cleft (p. 385) 
acetylcholine receptor (p. 385) 
ligand-gated channel (p. 386) 
equilibrium potential (p. 387) 
Nernst equation (p. 387) 

long QT syndrome (LQTS) (p. 388) 


gap junction (cell-to-cell channels) 
(p. 389) 


connexin (p. 390) 
connexon (hemichannel) (p. 390) 


aquaporin (p. 391) 


1. A helping hand. Differentiate between simple diffusion 
and facilitated diffusion. 


2. Powering movement. What are the two forms of energy 
that can power active transport? 


3. Carriers. Name the three types of carrier proteins. 
Which of these can mediate secondary active transport? 


4. The price of extrusion. What is the free-energy cost at 
25°C of pumping Ca?* out of a cell when the cytoplasmic 
concentration is 0.4 wM, the extracellular concentration is 
1.5 mM, and the membrane potential is —60 mV? 


5. Equilibrium potentials. For a typical mammalian cell, 
the intracellular and extracellular concentrations of the 
chloride ion (Cl~) are 4 mM and 150 mM, respectively. 
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For the calcium ion (Ca?*), the intracellular and extracel- 
lular concentrations are 0.2 w~M and 1.8 mM, respec- 
tively. Calculate the equilibrium potentials at 37°C for 
these two ions. 


6. How sweet it is. Some animal cells take up glucose 
by a symporter powered by the simultaneous entry of 
Nat. The entry of Nat provides a free-energy input of 
10.8 kJ mol~! (2.6 kcal mol~!) under typical cellular 
conditions (external [Nat] = 143 mM, internal [Nat] = 
14 mM, and membrane potential = —50 mV). How largea 
concentration gradient of glucose at 37°C can be generated 
by this free-energy input? 


7. Variations on a theme. Write a detailed mechanism 
for transport by the Na*-K* ATPase based on analogy 
with the mechanism of the Ca** ATPase shown in 
Figure 13.5. 


8. Pumping protons. Design an experiment to show that the 
action of lactose permease can be reversed in vitro to pump 
protons. 


9. Opening channels. Differentiate between ligand-gated 
and voltage-gated channels. 


10. Different directions. The Kt channel and the Nat 
channel have similar structures and are arranged in the 
same orientation in the cell membrane. Yet the Nat chan- 
nel allows sodium ions to flow into the cell and the K* 
channel allows potassium ions to flow out of the cell. 
Explain. 


11. Differing mechanisms. Distinguish the mechanisms by 
which uniporters and channels transport ions or molecules 
across the membrane. 


12. Short circuit. Carbonyl cyanide 4-(trifluoromethoxy) 
phenylhydrazone (FCCP) is a proton ionophore: it enables 
protons to pass freely through membranes. Treatment of 
E. coli with FCCP prevents the accumulation of lactose in 
these cells. Explain. 


13. Working together. The human genome contains more 
than 20 connexin-encoding genes. Several of these genes are 
expressed in high levels in the heart. Why are connexins so 
highly expressed in cardiac tissue? 


14. Structure—activity relations. On the basis of the struc- 
ture of tetrodotoxin, propose a mechanism by which the 
toxin inhibits Nat flow through the Na‘ channel. 


15. Hot stuff. When SERCA is incubated with [y-??7P]ATP 
(a form of ATP in which the terminal phosphate is 
labeled with radioactive 32P) and calcium at 0°C for 20 
seconds and analyzed by gel electrophoresis, a radioac- 
tive band is observed at the molecular weight corre- 
sponding to full-length SERCA. Why is a labeled band 


observed? Would you expect a similar band if you were 


performing a similar assay, with a suitable substrate, for 
the MDR protein? 


16. A dangerous snail. Cone snails are carnivores that inject 
a powerful set of toxins into their prey, leading to rapid 
paralysis. Many of these toxins are found to bind to specific 
ion-channel proteins. Why are such molecules so toxic? 
How might such toxins be useful for biochemical studies? 


17. Pause for effect. Immediately after the repolarization 
phase of an action potential, the neuronal membrane is 
temporarily unable to respond to the stimulation of a sec- 
ond action potential, a phenomenon referred to as the 
refractory period. What is the mechanistic basis for the 
refractory period? 


18. Only a few. Why do only a small number of sodium 
ions need to flow through the Na* channel to change the 
membrane potential significantly? 


19. More than one mechanism. How might a mutation in a 
cardiac voltage-dependent sodium channel cause long QT 
syndrome? 


20. Mechanosensitive channels. Many species contain ion 
channels that respond to mechanical stimuli. On the basis 
of the properties of other ion channels, would you expect 
the flow of ions through a single open mechanosensitive 
channel to increase in response to an appropriate stimulus? 
Why or why not? 


21. Concerted opening. Suppose that a channel obeys the 
concerted allosteric model (MWC model, Section 7.2). The 
binding of ligand to the R state (the open form) is 20 times 
as tight as that to the T state (the closed form). In the 
absence of ligand, the ratio of closed to open channels is 10°. 
If the channel is a tetramer, what is the fraction of open 
channels when 1, 2, 3, and 4 ligands are bound? 


22. Respiratory paralysis. The neurotransmitter acetylcho- 
line is degraded by a specific enzyme that is inactivated by 
Tabun, sarin, and parathion. On the basis of the structures 
below, propose a possible basis for their lethal actions. 


O 
Be 
“oo en 
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23. Ligand-induced channel opening. The ratio of open to 
closed forms of the acetylcholine receptor channel contain- 
ing zero, one, and two bound acetylcholine molecules is 5 X 
10-6, 1.2 X 107%, and 14, respectively. 


(a) By what factor is the open-to-closed ratio increased by 
the binding of the first acetylcholine molecule? The second 
acetylcholine molecule? 


(b) What are the corresponding free-energy contributions 
to channel opening at 25°C? 


(c) Can the allosteric transition be accounted for by the 
MWC concerted model (Section 7.2)? 


24. Frog poison. Batrachotoxin (BTX) is a steroidal alkaloid 
from the skin of Phyllobates terribilis, a poisonous Colombian 
frog (the source of the poison used on blowgun darts). In 
the presence of BTX, Na* channels in an excised patch stay 
persistently open when the membrane is depolarized. They 
close when the membrane is repolarized. Which transition 


is blocked by BTX? 


25. Valium target. y-Aminobutyric acid (GABA) opens 
channels that are specific for chloride ions. The GABA, 
receptor channel is pharmacologically important because it 
is the target of Valium, which is used to diminish anxiety. 


(a) The extracellular concentration of Cl~ is 123 mM 
and the intracellular concentration is 4 mM. In which 
direction does Cl” flow through an open channel when 
the membrane potential is in the —60 mV to +30 mV 
range? 


(b) What is the effect of Cl” -channel opening on the excit- 
ability of a neuron? 


(c) The hydropathy profile of the GABA, receptor resem- 
bles that of the acetylcholine receptor. Predict the number 
of subunits in this Cl~ channel. 


26. Understanding SERCA. To study the mechanism of 
SERCA, you prepare membrane vesicles containing this 
protein oriented such that its ATP binding site is on the 
outer surface of the vesicle. To measure pump activity, you 
use an assay that detects the formation of inorganic phos- 
phate in the medium. When you add calcium and ATP to 
the medium, you observe phosphate production for only a 
short period of time. Only after the addition of calcimycin, 
a molecule that makes membranes selectively permeable 
to calcium, do you observe sustained phosphate produc- 
tion. Explain. 


Chapter Integration Problem 


27. Speed and efficiency matter. Acetylcholine is rapidly 
destroyed by the enzyme acetylcholinesterase. This enzyme, 
which has a turnover number of 25,000 per second, has 
attained catalytic perfection witha k,,,/ Ky, of2 X 10°M~'s71. 
Why is the efficiency of this enzyme physiologically crucial? 
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Mechanism Problem 


28. Remembrance of mechanisms past. Acetylcholinesterase 
converts acetylcholine into acetate and choline. Like serine 
proteases, acetylcholinesterase is inhibited by DIPF. Propose 
a catalytic mechanism for acetylcholine digestion by acetyl- 
cholinesterase. Show the reaction as chemical structures. 


Data Interpretation Problems 


29. Tarantula toxin. Acid sensing is associated with pain, 
tasting, and other biological activities (Chapter 33). Acid 
sensing is carried out by a ligand-gated channel that per- 
mits Na* influx in response to H*. This family of acid- 
sensitive ion channels (ASICs) includes a number of mem- 
bers. Psalmotoxin 1 (PcTX1), a venom from the tarantula, 
inhibits some members of this family. The following elec- 
trophysiological recordings of cells containing several 
members of the ASIC family were made in the presence of 
the toxin at a concentration of 10 nM. The channels were 
opened by changing the pH from 7.4 to the indicated val- 
ues. The PcTX1 was present for a short time (indicated by 
the black bar above the recordings below), after which time 
it was rapidly washed from the system. 
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(A) Electrophysiological recordings of cells exposed to tarantula 
toxin. (B) Plot of peak current of a cell containing the ASIC1a protein 
versus the toxin concentration. [Data from P. Escoubas et al., J. Biol. 
Chem. 275:25116-25121, 2000.] 


(a) Which member of the ASIC family—ASIC1a, ASIC1b, 
ASIC2a, or ASIC3—is most sensitive to the toxin? 


(b) Is the effect of the toxin reversible? Explain. 


(c) What concentration of PceTX1 yields 50% inhibition of 
the sensitive channel? 


30. Channel problems 1. A number of pathological condi- 
tions result from mutations in the acetylcholine receptor 
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channel. One such mutation in the B subunit, BV266M, 
causes muscle weakness and rapid fatigue. An investigation 
of the acetylcholine-generated currents through the acetyl- 
choline receptor channel for both a control and a patient 
yielded the following results. 


Control 
Closed channel 


Open channel 


Patient 


ts ee eee 


What is the effect of the mutation on channel function? 
Suggest some possible biochemical explanations for the effect. 


Closed channel 


Open channel 


31. Channel problems 2. The acetylcholine receptor channel 
can also undergo mutation leading to fast-channel syn- 
drome (FCS), with clinical manifestations similar to those 
of slow-channel syndrome (Problem 30). What would the 


recordings of ion movement look like in this syndrome? 
Suggest a biochemical explanation. 


32. Transport differences. The rate of transport of two mol- 
ecules, indole and glucose, across a cell membrane is shown 
below. What are the differences between the transport 
mechanisms of the two molecules? Suppose that ouabain 
inhibited the transport of glucose. What would this inhibi- 
tion suggest about the mechanism of transport? 


Glucose 


Rate of transport ——> 


20 40 60 
Solute concentration (mM) 


Signal- Transduction Pathways 


"OFF" 
position 


Signal-transduction circuits in biological systems have molecular on-off 
switches that, like those in a computer chip (above), transmit information 
when “on.” Common among these circuits are those including G proteins 
(right), which transmit a signal when bound to GTP and are silent when 
bound to GDP. [(Left) Astrid & Hanns-Frieder Michler/Science Source.] 


A cell is highly responsive to specific chemicals in its environment: it may 
adjust its metabolism or alter gene-expression patterns on sensing the 
presence of these molecules. In multicellular organisms, these chemical sig- 
nals are crucial to coordinating physiological responses (Figure 14.1). Three 
examples of molecular signals that stimulate a physiological response are 
epinephrine (sometimes called adrenaline), insulin, and epidermal growth 
factor (EGF). When a mammal is threatened, its adrenal glands release the 
hormone epinephrine, which stimulates the mobilization of energy stores 
and leads to improved cardiac function. After a full meal, the B cells in the 
pancreas release insulin, which stimulates a host of physiological responses, 
including the uptake of glucose from the bloodstream and its storage as gly- 
cogen. The release of EGF in response to a wound stimulates specific cells to 
grow and divide. In all these cases, the cell receives information that a certain 
molecule within its environment is present above some threshold concentra- 
tion. The chain of events that converts the message “this molecule is present” 
into the ultimate physiological response is called signal transduction. 
Signal-transduction pathways often comprise many components and 
branches. They can thus be immensely complicated and confusing. However, 
the logic of signal transduction can be simplified by examining the com- 
mon strategies and classes of molecules that recur in these pathways. 
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14.1 Heterotrimeric G Proteins 
Transmit Signals and Reset 
Themselves 


14.2 Insulin Signaling: Phosphorylation 
Cascades Are Central to Many 
Signal-Transduction Processes 


14.3 EGF Signaling: Signal- 
Transduction Systems Are Poised 
to Respond 


14.4 Many Elements Recur with 
Variation in Different Signal- 
Transduction Pathways 


14.5 Defects in Signal-Transduction 
Pathways Can Lead to Cancer 
and Other Diseases 
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FIGURE 14.2 Principles of signal 
transduction. An environmental signal is 
first received by interaction with a cellular 
component, most often a cell-surface 
receptor. The information that the signal 
has arrived is then converted into other 
chemical forms, or transduced. Typically, 
the transduction process comprises many 
steps. The signal is often amplified before 
evoking a response. Feedback pathways 
regulate the entire signaling process. 


Epinephrine Insulin Epidermal growth factor (EGF) 
+ + + 
B-Adrenergic receptor Insulin receptor EGF receptor 
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Energy-store Increased glucose 
mobilization uptake 


Expression of 
growth-promoting genes 


FIGURE 14.1 Three signal-transduction pathways. The binding of signaling molecules to 
their receptors initiates pathways that lead to important physiological responses. 


These principles are introduced here because signal-transduction pathways 
affect essentially all of the metabolic pathways that we will be exploring 
throughout the rest of the book. 


Signal transduction depends on molecular circuits 


Signal-transduction pathways follow a broadly similar course that can be 
viewed as a molecular circuit (Figure 14.2). All such circuits contain certain 
key steps: 


1. Release of the Primary Messenger. A stimulus such as a wound or 
digested meal triggers the release of the signal molecule, also called the pri- 
mary messenger. 


2. Reception of the Primary Messenger. Most signal molecules do not enter 
cells. Instead, proteins in the cell membrane act as receptors that bind the sig- 
nal molecules and transfer the information that the molecule has bound from 
the external environment to the cell’s interior. Receptors span the cell mem- 
brane and thus have both extracellular and intracellular components. A bind- 
ing site on the extracellular side specifically recognizes the signal molecule 
(often referred to as the ligand). Such binding sites are analogous to enzyme 
active sites except that no catalysis takes place within them. The interaction of 
the ligand and the receptor alters the tertiary or quaternary structure of the 
receptor so as to induce a structural change on the intracellular side. 


3. Delivery of the Message Inside the Cell by the Second Messenger. Other 
small molecules, called second messengers, are used to relay information 
from receptor—ligand complexes. Second messengers are intracellular mol- 
ecules that change in concentration in response to environmental signals 
and mediate the next step in the molecular information circuit. Some par- 
ticularly important second messengers are cyclic AMP (cAMP) and cyclic 
GMP (cGMP), calcium ion, inositol 1,4,5-trisphosphate (IP,), and diacyl- 
glycerol (DAG; Figure 14.3). 

The use of second messengers has several consequences. First, the signal 
may be amplified significantly: only a small number of receptor molecules 
may be activated by the direct binding of signal molecules, but each acti- 
vated receptor molecule can lead to the generation of many second messen- 
gers. Thus, a low concentration of signal in the environment, even as little as a 
single molecule, can yield a large intracellular signal and response. Second, 
these messengers are often free to diffuse to other cellular compartments 
where they can influence processes throughout the cell. Third, the use of 
common second messengers in multiple signaling pathways creates both 
opportunities and potential problems. Input from several signaling pathways, 
often called cross talk, may alter the concentration of a common second mes- 
senger. Cross talk permits more finely tuned regulation of cell activity than 
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would the action of individual independent pathways. However, inappro- 
priate cross talk can result in the misinterpretation of changes in second- 
messenger concentration. 


4. Activation of Effectors That Directly Alter the Physiological Response. 
The ultimate effect of the signal pathway is to activate (or inhibit) the 
pumps, channels, enzymes, and transcription factors that directly control 
metabolic pathways, gene expression, and the permeability of membranes 
to specific ions. 


5. Termination of the Signal. After a cell has completed its response to a 
signal, the signaling process must be terminated or the cell loses its respon- 
siveness to new signals. Moreover, signaling processes that fail to terminate 
properly can have highly undesirable consequences. As we will see, many 
cancers are associated with signal-transduction processes that are not prop- 
erly terminated, especially processes that control cell growth. 


In this chapter, we will examine components of the three signal-trans- 
duction pathways shown in Figure 14.1. In doing so, we will see several 
classes of adaptor domains present in signal-transduction proteins. These 
domains usually recognize specific classes of molecules and help transfer 
information from one protein to another. The components described in the 
context of these three pathways recur in many other signal-transduction 
pathways; bear in mind that the specific examples are representative of 
many such pathways. 


14.1 Heterotrimeric G Proteins Transmit Signals and 
Reset Themselves 


Epinephrine is a hormone secreted by the adrenal glands of mammals in 
response to internal and external stressors. It exerts a wide range of effects— 
referred to as the fight-or-flight response—to help organisms anticipate the 
need for rapid muscular activity, including acceleration of heart rate, dila- 
tion of the smooth muscle of the airways, and initiation of the breakdown of 
glycogen (Section 21.3) and fatty acids (Section 22.2). Epinephrine signal- 
ing begins with ligand binding to a protein called the B-adrenergic receptor 
(B-AR). The B-AR is a member of the largest class of cell-surface receptors, 
called the seven-transmembrane-helix (7TM) receptors. Members of this 
family are responsible for transmitting information initiated by signals as 
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FIGURE 14.3 Common second 
messengers. Second messengers are 
intracellular molecules that change in 
concentration in response to environmental 
signals. That change in concentration 
conveys information inside the cell. 


Epinephrine 


TABLE 14.1 Biological functions 
mediated by 7TM 
receptors 


Hormone action 
Hormone secretion 
Neurotransmission 
Chemotaxis 

Exocytosis 

Control of blood pressure 
Embryogenesis 

Cell growth and differentiation 
Development 

Smell 

Taste 

Vision 

Viral infection 


Information from J. S. Gutkind, J. Biol. Chem. 
273:1839-1842, 1998. 


FIGURE 14.4 The 7TM receptor. Schematic 
representation of a 7TM receptor showing its 
passage through the membrane seven times. 


FIGURE 14.5 Structures of (A) 
rhodopsin and the B,-adrenergic 
receptor. Three-dimensional 
structure of rhodopsin (A) and the 
B,-adrenergic receptor (8,-AR). 
(B). Notice the resemblance in the 
overall architecture of both 
receptors and the similar locations 
of the rhodopsin ligand 11-cis- 
retinal and the B,-AR blocker 
carazolol. [Drawn from 1F88.pdb 
and 2RH1.pdb.] 
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diverse as hormones, neurotransmitters, odorants, tastants, and even pho- 
tons (Table 14.1). More than 20,000 such receptors are now known, includ- 
ing nearly 800 encoded in the human genome. Furthermore, about one-third 
of the marketed therapeutic drugs target receptors of this class. As the name 
indicates, these receptors contain seven helices that span the membrane 
bilayer (Figure 14.4). 

The first member of the 7TM receptor family to have its three-dimensional 
structure determined was rhodopsin (Figure 14.5A), a protein in the retina of 
the eye that senses the presence of photons and initiates the signaling cascade 
responsible for visual sensation. A single lysine residue within rhodopsin is 
covalently modified by a form of vitamin A, 11-cis-retinal. This modification 
is located near the extracellular side of the receptor, within the region sur- 
rounded by the seven transmembrane helices. As will be considered in 
greater detail in Section 33.3, exposure to light induces the isomerization of 
11-cis-retinal to its all-trans form, producing a structural change in the recep- 
tor that results in the initiation of an action potential that is ultimately inter- 
preted by the brain as visual stimulus. 

In 2007, the first three-dimensional structure of the B, subtype of the 
human adrenergic receptor (B,-AR) bound to an inhibitor was solved by x-ray 
crystallography. This inhibitor, carazolol, competes with epinephrine for bind- 
ing to the B,-AR, much in the same way that competitive inhibitors act at 
enzyme active sites (Section 8.5). The structure of the B,-AR revealed consider- 
able similarities with that of rhodopsin, particularly with respect to the locations 
of 11-cis-retinal in rhodopsin and the binding site for carazolol (Figure 14.5B). 


Ligand binding to 7TM receptors leads to the activation of 
heterotrimeric G proteins 


What is the next step in the pathway? The conformational change in the 
receptor’s cytoplasmic domain activates a protein called a G protein, named 
for the fact that it binds guanyl nucleotides. The activated G protein stimu- 
lates the activity of adenylate cyclase, an enzyme that catalyzes the conver- 
sion of ATP into cAMP. The G protein and adenylate cyclase remain 
attached to the membrane, whereas cAMP, a second messenger, can travel 
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FIGURE 14.6 Activation of protein kinase A by a G-protein pathway. Hormone binding to 
a 7TM receptor initiates a signal-transduction pathway that acts through a G protein and 
cAMP to activate protein kinase A. 


throughout the cell carrying the signal originally brought by the binding of 
epinephrine. Figure 14.6 provides a broad overview of these steps. 

Let us consider the role of the G protein in this signaling pathway in 
greater detail. In its unactivated state, the G protein is bound to GDP. In 
this form, the G protein exists as a heterotrimer consisting of a, B, and yy 
subunits; the a subunit (referred to as G,) binds the nucleotide 
(Figure 14.7). The a subunit is a member of the P-loop NTPase family 
(Section 9.4), and the P-loop participates in nucleotide binding. The a and 
yy subunits are usually anchored to the membrane by covalently attached 
fatty acids. The role of the hormone-bound receptor is to catalyze the 
exchange of GTP for bound GDP. The interaction between the hormone— 
B,-AR complex and the heterotrimeric G protein was illustrated in 
molecular detail when the crystal structure of this complex was deter- 
mined in 2011. In this structure, a synthetic agonist, or small molecule that 
activates a receptor, was used to induce the active conformation of the 
B,-AR. Agonist binding results in the movement of two transmembrane 
helices, yielding an extensive interaction surface for the G, subunit of the 
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FIGURE 14.7 A heterotrimeric 

G protein. (A) A ribbon diagram shows the 
relation between the three subunits. In this 
complex, the a subunit (gray and purple) is 
bound to GDP. Notice that GDP is bound in 
a pocket close to the surface at which the a 
subunit interacts with the By dimer. (B) A 
schematic representation of the heterotrimeric 
G protein. [Drawn from 1GOT.pdb.] 
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FIGURE 14.8 The complex between the 
activated B,-AR and a heterotrimeric 

G protein. (A) When the B,-AR (green) binds 
a receptor agonist, the cytoplasmic face of 
the receptor forms an interaction surface 
with the G, subunit of a heterotrimeric 

G protein. (B) The interaction with the 
activated receptor leads to a substantial 
conformational change in the G, protein, in 
which the GTP binding site is opened, 
enabling nucleotide exchange. In this figure, 
G, in its GTP form is shown in red and in 
its receptor-bound form is shown in 

blue. [Drawn from 3SN6.pdb and 1AZT.pdb.] 
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hk FIGURE 14.9 Adenylate cyclase activation. (A) Adenylate cyclase is a 
membrane protein with two large intracellular domains that contain the catalytic 
apparatus. (B) The structure of a complex between G, in its GTP form bound to a 
catalytic fragment from adenylate cyclase. Notice that the surface of G, that had 
been bound to the By dimer now binds adenylate cyclase. [Drawn from 1AZS.pdb.] 
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heterotrimer (Figure 14.8A). When bound to the receptor, the nucleotide- 
binding site of G, opens substantially, enabling the displacement of GDP 
by GTP (Figure 14.8B). 

On GTP binding, the a subunit simultaneously dissociates from the By 
dimer (Geg,), transmitting the signal that the receptor has bound its ligand. 
A single hormone—receptor complex can stimulate nucleotide exchange in many 
G-protein heterotrimers. Thus, hundreds of G, molecules are converted 
from their GDP form into their GTP form for each bound molecule of hor- 
mone, giving an amplified response. Because they signal through G pro- 
teins, 7TM receptors are often called G-protein-coupled receptors (GPCRs). 


Activated G proteins transmit signals by binding to other proteins 


In the GTP form, the surface of G, that had been bound to Ge, has changed 
its conformation from the GDP form so that it no longer has a high affinity for 
Gg, This surface is now exposed for binding to other proteins. In the B-AR 
pathway, the new binding partner is adenylate cyclase, the enzyme that con- 
verts ATP into cAMP. This enzyme is a membrane protein that contains 
12 membrane-spanning helices; two large cytoplasmic domains form the 
catalytic part of the enzyme (Figure 14.9). The interaction of G, with adenyl- 
ate cyclase favors a more catalytically active conformation of the enzyme, thus 
stimulating cAMP production. Indeed, the G, subunit that participates in the 
B-AR pathway is called G,, (“s” stands for stimulatory). The net result 1s that 
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the binding of epinephrine to the receptor on the cell surface increases the rate of 
cAMP production inside the cell. The generation of cAMP by adenylate cyclase 
provides a second level of amplification because each activated adenylate 
cyclase can convert many molecules of ATP into cAMP. 


Cyclic AMP stimulates the phosphorylation of many target proteins 
by activating protein kinase A 


The increased concentration of cAMP can affect a wide range of cellular pro- 
cesses. In the muscle, cAMP stimulates the production of ATP for muscle con- 
traction. In other cell types, cAMP enhances the degradation of storage fuels, 
increases the secretion of acid by the gastric mucosa, leads to the dispersion of 
melanin pigment granules, diminishes the aggregation of blood platelets, and 
induces the opening of chloride channels. How does cAMP influence so many 
cellular processes? Most effects of cAMP in eukaryotic cells are mediated by the 
activation of a single protein kinase. This key enzyme is protein kinase A (PKA). 

As described earlier, PKA consists of two regulatory (R) chains and two 
catalytic (C) chains (R,C,; see Figure 10.16). In the absence of cAMP, the 
R,C, complex is catalytically inactive. The binding of cAMP to the regula- 
tory chains releases the catalytic chains, which are catalytically active on their 
own. Activated PKA then phosphorylates specific serine and threonine resi- 
dues in many targets to alter their activity. For instance, PKA phosphorylates 
two enzymes that lead to the breakdown of glycogen, the polymeric store of 
glucose, and the inhibition of further glycogen synthesis (Section 21.3). PKA 
stimulates the expression of specific genes by phosphorylating a transcriptional 
activator called the cAMP response element binding (CREB) protein. This 
activity of PKA illustrates that signal-transduction pathways can extend into 
the nucleus to alter gene expression. 

The signal-transduction pathway initiated by epinephrine is summa- 
rized in Figure 14.10. 


G proteins spontaneously reset themselves through GTP hydrolysis 


How is the signal initiated by epinephrine switched off? G, subunits have 
intrinsic GTPase activity, which is used to hydrolyze bound GTP to GDP 
and P.. This hydrolysis reaction 1s slow, however, requiring from seconds to 
minutes. Thus, the GTP form of G, is able to activate downstream compo- 
nents of the signal-transduction pathway before it is deactivated by GTP 
hydrolysis. In essence, the bound GTP acts as a built-in clock that spontane- 
ously resets the G., subunit after a short time period. After GTP hydrolysis 
and the release of P., the GDP-bound form of G, then reassociates with Ge, 
to re-form the inactive heterotrimeric protein (Figure 14.11). 

The hormone-bound activated receptor must be reset as well to prevent the 
continuous activation of G proteins. This resetting is accomplished by two 
processes (Figure 14.12). First, the hormone dissociates, returning the recep- 
tor to its initial, unactivated state. The likelihood that the receptor remains in 
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FIGURE 14.10 Epinephrine signaling 
pathway. The binding of epinephrine to the 
B-adrenergic receptor initiates the signal- 
transduction pathway. The process in each 
step is indicated (in black) at the left of 
each arrow. Steps that have the potential 
for signal amplification are indicated at the 
right in green. 


FIGURE 14.11 Resetting G,. On 
hydrolysis of the bound GTP by the intrinsic 
GTPase activity of G,, G, reassociates with 
the By dimer to form the heterotrimeric G 
protein, thereby terminating the activation 
of adenylate cyclase. 
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FIGURE 14.12 Signal termination. Signal transduction by the 7TM receptor is halted (1) by 
dissociation of the signal molecule from the receptor and (2) by phosphorylation of the 
cytoplasmic C-terminal tail of the receptor and the subsequent binding of B-arrestin. 


its unbound state depends on the extracellular concentration of hormone. 
Second, the signaling cascade initiated by the hormone-receptor complex acti- 
vates a kinase that phosphorylates serine and threonine residues in the 
carboxyl-terminal tail of the receptor. These phosphorylation events result 
in the deactivation of the receptor. In the example under consideration, 
B-adrenergic-receptor kinase (also called G-protein receptor kinase 2, or GRK2) 
phosphorylates the carboxyl-terminal tail of the hormone-receptor complex but 
not the unoccupied receptor. Finally, the molecule B-arrestin binds to the phos- 
phorylated receptor and further diminishes its ability to activate G proteins. 


Some 7TM receptors activate the phosphoinositide cascade 


We now turn to another common second-messenger cascade, also employ- 
ing a 7ITM receptor, that is used by many hormones to evoke a variety of 
responses. The phosphoinositide cascade, like the cAMP cascade, converts 
extracellular signals into intracellular ones. The intracellular second mes- 
sengers formed by activation of this pathway arise from the cleavage of 
phosphatidylinositol 4, 5-bisphosphate (PIP,), a phospholipid present in cell 
membranes. An example of a signaling pathway based on the phos- 
phoinositide cascade is the one triggered by the receptor for angiotensin II, 
a peptide hormone that controls blood pressure. 

Each type of 7TM receptor signals through a distinct G protein. 
Whereas the B-adrenergic receptor activates the G protein G,,., the angio- 
tensin II receptor activates a G protein called ca . In its GTP form, G, 
binds to and activates the B isoform of the enzyme phospholipase C. This 
enzyme catalyzes the cleavage of PIP, into the two second messengers— 
inositol 1,4,5-trisphosphate (IP.,) and dinclalyceral (DAG; Figure 14.13). 

IP, is soluble and diffises away from the membrane. This second 
messenger causes the rapid release of Ca** from the intracellular stores in the 
endoplasmic reticulum (ER), which accumulates a reservoir of Ca?* through 
the action of transporters such as Ca2* ATPase (Section 13.2). On binding 
IP, specific IP;-gated Ca**—channel proteins in the ER membrane open to 
allow calcium ions to flow from the ER into the cytoplasm. Calcium ion is 
itself a signaling molecule: It can bind proteins, including a ubiquitous sig- 
naling protein called calmodulin and enzymes such as protein kinase C. 
By such means, the elevated level of cytoplasmic Ca?* triggers processes 
such as smooth-muscle contraction, glycogen breakdown, and vesicle release. 
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Phosphatidylinositol 4,5-bisphosphate (PIP) 


Diacylglycerol (DAG) 


2- 
oH OPO3 


2-9,PO OH 


Inositol 1,4,5-trisphosphate (IP3) 


FIGURE 14.13 Phospholipase C reaction. Phospholipase C cleaves the membrane lipid 
phosphatidylinositol 4,5-bisphosphate into two second messengers: diacylglycerol, which remains 
in the membrane, and inositol 1,4,5-trisphosphate, which diffuses away from the membrane. 


DAG remains in the plasma membrane. There, it activates protein kinase 
C (PKC), a protein kinase that phosphorylates serine and threonine resi- 
dues in many target proteins. To bind DAG, the specialized DAG-binding 


domains of this kinase require bound calcium. Note that 
DAG and IP, work in tandem: IP; increases the Ca?* 
sencentration, and ‘Ca? facilitates the DAG-mediated 
activation of protein kinase C. The phosphoinositide cas- 
cade is summarized in Figure 14.14. Both IP; and DAG 
act transiently because they are converted into other spe- 
cies by phosphorylation or other processes. 


Calcium ion is a widely used second messenger 


Calcium ion participates in many signaling processes in 
addition to the phosphoinositide cascade. Several properties 
of this ion account for its widespread use as an intracellular 
messenger. First, fleeting changes in Ca** concentration 
are readily detected. At steady state, intracellular levels of 
Ca?* must be kept low to prevent the precipitation of car- 
boxylated and phosphorylated compounds, which form 
poorly soluble salts with Ca?*. Transport systems extrude 
Ca’* from the cytoplasm, maintaining the cytoplasmic 
concentration of Ca** at approximately 100 nM, several 
orders of magnitude lower than that of the extracellular 
medium (Section 13.2). Given this low steady-state level, 
transient increases in Ca** concentration produced by sig- 
naling events can be readily sensed. 

A second property of Ca?* that makes it a highly suit- 
able intracellular messenger is that it can bind tightly to 
proteins and induce substantial structural rearrangements. 
Calcium ions bind well to negatively charged oxygen 
atoms (from the side chains of glutamate and aspartate) 
and uncharged oxygen atoms (main-chain carbonyl 
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FIGURE 14.14 Phosphoinositide cascade. The cleavage of 
phosphatidylinositol 4,5-bisphosphate (PIP.) into diacylglycerol 
(DAG) and inositol 1,4,5-trisphosphate (IP) results in the release 
of calcium ions (owing to the opening of the IP, receptor ion 
channels) and the activation of protein kinase C (owing to the 
binding of protein kinase C to free DAG in the membrane). Calcium 
ions bind to protein kinase C and help facilitate its activation. 
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FIGURE 14.15 Calcium-binding site. In 
one common mode of binding, calcium is 
coordinated to six oxygen atoms of a 
protein and one (top) of water. 
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groups and side-chain oxygen atoms from glutamine and asparagine; 
Figure 14.15). The capacity of Ca?? to be coordinated to multiple ligands— 
from six to eight oxygen atoms—enables it to cross-link different segments of a 
protein and induce significant conformational changes. 

Our understanding of the role of Ca?* in cellular processes has been greatly 
enhanced by our ability to detect changes in Ca’* concentrations inside cells 
and even monitor these changes in real time. This ability depends on the use of 
specially designed dyes such as Fura-2 that bind Ca** and change their fluo- 
rescent properties on Ca** binding. Fura-2 binds Ca** through appropriately 
positioned oxygen atoms (shown in red) within its structure. 
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When such a dye is introduced into cells, changes in available Ca?* concen- 
tration can be monitored with microscopes capable of detecting changes in 
fluorescence (Figure 14.16). Probes for sensing other second messengers 
such as cAMP also have been developed. These molecular-imaging agents 
are greatly enhancing our understanding of signal-transduction processes. 


FIGURE 14.16 Calcium imaging. (A) The (A) 
fluorescence spectra of the calcium-binding dye 
Fura-2 can be used to measure available calcium 

ion concentrations in solution and in cells. (B) A 
series of images show Ca?* spreading across an 
egg cell following fertilization by sperm. These 
images were obtained through the use of Fura-2. 
The images are false colored: orange represents high 
Ca?* concentrations, and green represents low Ca?* 
concentrations. [(A) Information from S. J. Lippard and 
J. M. Berg, Principles of Bioinorganic Chemistry 
(University Science Books, 1994), p. 193; (B) data from 
Carroll M. et al., J. Cell Sci. 116: 4997-5004, 2003.] 
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Calcium ion often activates the regulatory protein calmodulin 


Calmodulin (CaM), a17-kDa protein with four Ca?* -binding sites, serves as 
a calcium sensor in nearly all eukaryotic cells. At cytoplasmic concentrations 
above about 500 nM, Ca?* binds to and activates calmodulin. Calmodulin is 
a member of the EF-hand protein family. The EF hand is a Ca?* -binding 
motif that consists of a helix, a loop, and a second helix. This motif, origi- 
nally discovered in the protein parvalbumin, was named the EF hand 
because the two key helices designated E and F in parvalbumin are posi- 
tioned like the forefinger and thumb of the right hand (Figure 14.17). These 
two helices and the intervening loop form the Ca**-binding motif. Seven 
oxygen atoms are coordinated to each Ca?*, six from the protein and one 
from a bound water molecule. Calmodulin is made up of four EF-hand 
motifs, each of which can bind a single Ca* ion. 

The binding of Ca** to calmodulin induces substantial conformational 
changes in its EF hands, exposing hydrophobic surfaces that can be used to 
bind other proteins. Using its two sets of two EF hands, calmodulin clamps 
down around specific regions of target proteins, usually exposed a helices with 
appropriately positioned hydrophobic and charged groups (Figure 14.18). 
The Ca?*-calmodulin complex stimulates a wide variety of enzymes, pumps, 
and other target proteins by inducing structural rearrangements in these bind- 
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® FIGURE 14.17 EF hand. Formed by 
a helix-loop-helix unit, an EF hand is a 


ing partners. An especially noteworthy set of targets are several calmodulin- binding site for Ca2+ in many calcium- 
dependent protein kinases (CaM kinases) that phosphorylate many different sensing proteins. Here, the E helix is 
proteins and regulate fuel metabolism, ionic permeability, neurotransmitter yellow, the F helix is blue, and calcium 
synthesis, and neurotransmitter release. We see here a recurring theme in sig- is represented by the green sphere. Note 


nal-transduction pathways: the concentration of a second messenger is 


that the calcium ion is bound in a loop 
connecting two nearly perpendicular 


increased (in this case, Ca”*); the signal is sensed by a second-messenger- Kelicés: (orawmtrorACL Lipdes) 


binding protein (in this case, calmodulin); and the second-messenger- binding 
protein acts to generate changes in enzymes (in this case, calmodulin-depen- 
dent kinases) that control effectors. 


(A) 


CaM 
— target 
peptide CaM 
kinase 
y 
CaM kinase | Calmodulin (apo) 


h& FIGURE 14.18 Calmodulin binds to « helices. (A) An « helix (ourple) in CaM kinase | is 
a target for calmodulin. (B) On Ca?2* binding to the apo, or calcium-free, form of calmodulin 
(1), the two halves of calmodulin clamp down around the target helix (2), binding it through 
hydrophobic and ionic interactions. In CaM kinase |, this interaction allows the enzyme to 
adopt an active conformation. [Drawn from 1A06, 1CFD, 1CLL, and 1CM1.pdb.] 


14.2 Insulin Signaling: Phosphorylation Cascades Are 
Central to Many Signal-Transduction Processes 


The signaling pathways that we have examined so far have activated a pro- 
tein kinase as a downstream component of the pathway. We now turn to a 
class of signal-transduction pathways that are initiated by receptors that 


FIGURE 14.19 Insulin structure. 
Notice that insulin consists of two chains 
(shown in blue and yellow) linked by two 
interchain disulfide bonds. The a chain 
(blue) also has an intrachain disulfide 
bond. [Drawn from 1B2F.pdb.] 


Insulin-binding site 


B subunit 


FIGURE 14.20 The insulin receptor. The 
receptor consists of two units, each of 
which consists of an a subunit and a 

8 subunit linked by a disulfide bond. Two a 
subunits, which lie outside the cell, come 
together to form a binding site for insulin. 
Each B subunit lies primarily inside the cell 
and includes a protein kinase domain. 
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include protein kinases as part of their structures. The activation of these pro- 
tein kinases sets in motion other processes that ultimately modify the effec- 
tors of these pathways. 

An example is the signal-transduction pathway initiated by insulin, the 
hormone released in response to increased blood-glucose levels after a meal. 
In all of its detail, this multifaceted pathway is quite complex. Hence, we 
will focus solely on the major branch, which leads to the mobilization of 
glucose transporters to the cell surface. These transporters allow the cell to 
take up the glucose that is plentiful in the blood stream after a meal. 


The insulin receptor is a dimer that closes around 
a bound insulin molecule 


Insulin is a peptide hormone that consists of two chains that are linked by 
three disulfide bonds (Figure 14.19). Its receptor has a quite different struc- 
ture from that of the B-AR. The insulin receptor is a dimer of two identical 
units. Each unit consists of one a chain and one B chain linked to one 
another by a single disulfide bond (Figure 14.20). Each a subunit lies com- 
pletely outside the cell, whereas each B subunit lies primarily inside the cell, 
spanning the membrane with a single transmembrane segment. The two a 
subunits move together to form a binding site for a single insulin molecule, 
a surprising occurrence because two different surfaces on the insulin mole- 
cule must interact with the two identical insulin-receptor chains. The mov- 
ing together of the dimeric units in the presence of an insulin molecule sets 
the signaling pathway in motion. The closing up of an oligomeric receptor or 
the oligomerization of monomeric receptors around a bound ligand is a strategy 
used by many receptors to initiate a signal, particularly by those containing a 
protein kinase. 

Each B subunit consists primarily of a protein kinase domain, homolo- 
gous to protein kinase A. However, this kinase differs from protein kinase 
A in two important ways. First, the insulin-receptor kinase is a tyrosine 
kinase; that is, it catalyzes the transfer of a phosphoryl group from ATP to 
the hydroxyl group of tyrosine, rather than serine or threonine. 
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Because this tyrosine kinase is a component of the receptor itself, the 
insulin receptor is referred to as a receptor tyrosine kinase. Second, the 
insulin receptor kinase is in an inactive conformation when the domain is 
not covalently modified. The kinase is rendered inactive by the position of 
an unstructured loop (called the activation loop) that lies in the center of 
the structure. 


Insulin binding results in the cross-phosphorylation 
and activation of the insulin receptor 


When the two a subunits move together to surround an insulin molecule, 
the two protein kinase domains on the inside of the cell also are drawn 
together. It is important to note that as they come together, the flexible 
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FIGURE 14.21 Activation of the insulin receptor by phosphorylation. The activation 
loop is shown in red in this model of the protein kinase domain of the B subunit of the insulin 
receptor. The unphosphorylated structure on the left is not catalytically active. Notice that, 
when three tyrosine residues in the activation loop are phosphorylated, the activation loop 
swings across the structure and the kinase structure adopts a more compact conformation. 
This conformation is catalytically active. [Drawn from 1IRK.pdb and 1IR3.pdb.] 


activation loop of one kinase subunit is able to fit into the active site of the 
other kinase subunit within the dimer. With the two B subunits forced 
together, the kinase domains catalyze the addition of phosphoryl groups 
from ATP to tyrosine residues in the activation loops. When these tyro- 
sine residues are phosphorylated, a striking conformational change takes 


place (Figure 14.21). The rearrange- 
ment of the activation loop converts 
the kinase into an active conforma- 
tion. Thus, insulin binding on the 
outside of the cell results in the activa- 
tion of a membrane-associated kinase 
within the cell. 


The activated insulin-receptor 
kinase initiates a kinase cascade 


On phosphorylation, the insulin- 
receptor tyrosine kinase is activated. 
Because the two units of the receptor 
are held in close proximity to one 
another, additional sites within the 
receptor also are phosphorylated. These 
phosphorylated sites act as docking 
sites for other substrates, including a 
class of molecules referred to as insulin- 
receptor substrates (IRS; Figure 14.22). 
IRS-1 and IRS-2 are two homologous 
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FIGURE 14.22 Insulin signaling. The 
binding of insulin results in the cross- 
phosphorylation and activation of the insulin 
receptor. Phosphorylated sites on the 
receptor act as binding sites for insulin- 
receptor substrates such as IRS-1. The 
lipid kinase phosphoinositide 3-kinase 
binds to phosphorylated sites on IRS-1 
through its regulatory domain, then 
converts PIP, into PIP. Binding to PIP, 
activates PIP,-dependent protein kinase 
(PDK1), which phosphorylates and 
activates kinases such as Akt1. Activated 
Akt1 can then diffuse throughout the cell to 
continue the signal-transduction pathway. 
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FIGURE 14.23 The modular structure of insulin-receptor substrates IRS-1 and IRS-2. 
This schematic view represents the amino acid sequence common to IRS-1 and IRS-2. 
Each protein contains a pleckstrin homology domain (which binds phosphoinositide lipids), a 
phosphotyrosine-binding domain, and four sequences that approximate Tyr-X-X-Met (YXXM). 
The four sequences are phosphorylated by the insulin-receptor tyrosine kinase. 


proteins with a common modular structure (Figure 14.23). The amino- 
terminal part includes a pleckstrin homology domain, which binds phos- 
phoinositide, and a phosphotyrosine-binding domain. These domains 
act together to anchor the IRS protein to the insulin receptor and the 
associated membrane. Each IRS protein contains four sequences that 
approximate the form Tyr-X-X-Met. These sequences are also sub- 
strates for the activated insulin-receptor kinase. When the tyrosine 
residues within these sequences are phosphorylated to become phospho- 
tyrosine residues, IRS molecules can act as adaptor proteins: they are not 
enzymes but serve to tether the downstream components of this signal- 
ing pathway to the membrane. 

Phosphotyrosine residues, such as those in the IRS proteins, are recog- 
nized most often by Src homology 2 (SH2) domains (Figure 14.24). These 
domains, present in many signal-transduction proteins, bind to stretches of 
polypeptide that contain phosphotyrosine residues. Each specific 5H2 
domain shows a binding preference for phosphotyrosine in a particular 
sequence context. Which proteins contain SH2 domains that bind to phos- 
photyrosine-containing sequences in the IRS proteins? The most important 
of them are in a class of lipid kinases, called phosphoinositide 3-kinases 
(PI3Ks), that add a phosphoryl group to the 3-position of inositol in phos- 


Phosphotyrosine 


SH2 domain 


h FIGURE 14.24 Structure of the SH2 phatidylinositol 4,5-bisphosphate (PIP,; Figure 14.25). These enzymes are 
domain. The domain is shown bound to a heterooligomers that consist of 110-kDa catalytic subunits and 85-kDa 
phosphotyrosine-containing peptide. Notice regulatory subunits. Through 5H2 domains in the regulatory subunits, 


at the top that the negatively charged 
phosphotyrosine residue interacts with two 
arginine residues that are conserved in 


these enzymes bind to the IRS proteins and are drawn to the membrane 
where they can phosphorylate PIP, to form phosphatidylinositol 3,4,5- 


essentially all SH2 domains. [Drawn from trisphosphate (PIP). PIP, in turn, activates a protein kinase, PDK1, by 


1SPS.pdb]] 
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virtue of a pleckstrin homology domain present in this kinase that is specific 
for PIP, (Figure 14.22). The activated PDK1 phosphorylates and activates 
Akt, another protein kinase. Akt is not membrane anchored and moves 
through the cell to phosphorylate targets that include components that con- 
trol the trafficking of the glucose receptor GLUT‘ to the cell surface as well 
as enzymes that stimulate glycogen synthesis (Section 21.4). 


ATP ADP 
RO -enfies 


Phosphatidylinositide 


2 
7 OH OPO; 
OH 
R'O 3-kinase 


Phosphatidylinositol 4,5-bisphosphate (PIP,) Phosphatidylinositol 3,4,5-trisphosphate (PIP,) 


FIGURE 14.25 Action of a lipid kinase in insulin signaling. Phosphorylated IRS-1 and IRS-2 
activate the enzyme phosphatidylinositide 3-kinase, an enzyme that converts PIP, into PIP.. 


The cascade initiated by the binding of insulin to the insulin receptor is 
summarized in Figure 14.26. The signal is amplified at several stages along 
this pathway. Because the activated insulin receptor itself is a protein kinase, 
each activated receptor can phosphorylate multiple IRS molecules. Activated 
enzymes further amplify the signal in at least two of the subsequent steps. 
Thus, a small increase in the concentration of circulating insulin can pro- 
duce a robust intracellular response. Note that although the insulin pathway 
described here may seem complicated, it is substantially less elaborate than 
the full signaling network initiated by insulin. 


Insulin signaling is terminated by the action of phosphatases 


We have seen that the activated G protein promotes its own inactivation by 
the release of a phosphoryl group from GTP. In contrast, proteins phos- 
phorylated on serine, threonine, or tyrosine residues are extremely stable 
kinetically. Specific enzymes, called protein phosphatases, are required to 
hydrolyze these phosphorylated proteins and return them to their initial 
states. Similarly, lipid phosphatases are required to remove phosphoryl 
groups from inositol lipids that had been activated by lipid kinases. In insu- 
lin signaling, three classes of enzymes are of particular importance in shut- 
ting off the signaling pathway: (1) protein tyrosine phosphatases that 
remove phosphoryl! groups from tyrosine residues on the insulin receptor 
and the IRS adaptor proteins, (2) lipid phosphatases that hydrolyze PIP, to 
PIP.,, and (3) protein serine phosphatases that remove phosphoryl groups 
from activated protein kinases such as Akt. Many of these phosphatases are 
activated or recruited as part of the response to insulin. Thus, the binding of 
the initial signal sets the stage for the eventual termination of the response. 


14.3 EGF Signaling: Signal-Transduction Pathways Are 
Poised to Respond 


Our consideration of the signal-transduction cascades initiated by epineph- 
rine and insulin included examples of how components of signal-transduction 
pathways are poised for action, ready to be activated by minor modifica- 
tions. For example, G-protein subunits require only the binding of GTP in 
exchange for GDP to transmit a signal. This exchange reaction is thermody- 
namically favorable, but it is quite slow in the absence of an appropriate 
activated 7TM receptor. Similarly, the tyrosine kinase domains of the 
dimeric insulin receptor are ready for phosphorylation and activation but 
require insulin bound between two a subunits to draw the activation loop of 
one tyrosine kinase into the active site of a partner tyrosine kinase to initiate 
the signaling cascade. 

Next, we examine a signal-transduction pathway that reveals another 
clear example of how these signaling cascades are poised to respond. This 
pathway is activated by the signal molecule epidermal growth factor (EGF). 
Like that of the insulin receptor, the initiator of this pathway is a receptor 
tyrosine kinase. Both the extracellular and the intracellular domains of this 
receptor are ready for action, held in check only by a specific structure that 
prevents receptors from coming together. Furthermore, in the EGF path- 
way, we will encounter several additional classes of signaling components 
that participate in many other signaling networks. 


EGF binding results in the dimerization of the EGF receptor 


Epidermal growth factor is a 6-kDa polypeptide that stimulates the growth 
of epidermal and epithelial cells (Figure 14.27). The EGF receptor (EGFR), 
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FIGURE 14.26 Insulin signaling 
pathway. Key steps in the signal- 
transduction pathway initiated by the 
binding of insulin to the insulin receptor. 


Epidermal growth factor (EGF) 


FIGURE 14.27 Structure of 
epidermal growth factor. Notice that three 
intrachain disulfide bonds stabilize the 
compact three-dimensional structure of the 
growth factor. [Drawn from 1EGF.pdb.] 
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FIGURE 14.28 Modular structure of the EGF receptor. This schematic view of the amino 
acid sequence of the EGF receptor shows the EGF-binding domain that lies outside the cell, a 
single transmembrane helix-forming region, the intracellular tyrosine kinase domain, and the 
tyrosine-rich domain at the carboxyl terminus. 


like the insulin receptor, is a dimer of two identical subunits. Each subunit 
contains an intracellular protein tyrosine kinase domain that participates in 
cross-phosphorylation reactions (Figure 14.28). Unlike those of the insulin 
receptor, however, these units exist as monomers until they bind EGF. 
Moreover, each EGF receptor monomer binds a single molecule of EGF in 
its extracellular domain (Figure 14.29). Thus the dimer binds two ligand 
molecules, in contrast with the insulin-receptor dimer, which binds only one 
ligand. Note that each EGF molecule lies far away from the dimer interface. 
This interface includes a so-called dimerization arm from each monomer that 
reaches out and inserts into a binding pocket on the other monomer. 
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® FIGURE 14.29 EGF receptor dimerization. The structure of the extracellular region of 
the EGF receptor is shown bound to EGF. Notice that the structure is dimeric with one EGF 
molecule bound to each receptor molecule and that the dimerization is mediated by a 
dimerization arm that extends from each receptor molecule. [Drawn from 1IVO.pdb.] 


Although this structure nicely reveals the interactions that support the 
formation of a receptor dimer favoring cross-phosphorylation, it raises 
another question: Why doesn’t the receptor dimerize and signal in the 
absence of EGF? This question has been addressed by examining the struc- 
ture of the EGF receptor in the absence of bound ligand (Figure 14.30). 
This structure is, indeed, monomeric, and each monomer is in a conforma- 
tion that is quite different from that observed in the ligand-bound dimer. 
In particular, the dimerization arm binds to a domain within the same 
monomer that holds the receptor in a closed configuration. In essence, the 


receptor is poised in a spring-loaded conformation held in position by the 
contact between the interaction loop and another part of the structure, 
ready to bind ligand and change into a conformation active for dimerization 
and signaling. 

This observation suggests that a receptor that exists in the extended 
conformation even in the absence of bound ligand would be constitutively 
active. Remarkably, such a receptor exists. This receptor, HER2, is approx- 
imately 50% identical in amino acid sequence with the EGF receptor and 
has the same domain structure. HER2 does not bind any known ligand, yet 
crystallographic studies reveal that it adopts an extended structure very 
similar to that observed for the ligand-bound EGF receptor. Under normal 
conditions, HER2 forms heterodimers with the EGF receptor and other 
members of the EGF receptor family and participates in cross-phosphory- 
lation reactions with these receptors. HER2 is overexpressed in some can- 
cers, presumably contributing to tumor growth by forming homodimers 
that signal even in the absence of ligand. We will return to HER2 when we 
consider approaches to cancer treatment based on knowledge of signaling 
pathways (Section 14.5). 


The EGF receptor undergoes phosphorylation of its 
carboxyl-terminal tail 


Like the insulin receptor, the EGF receptor undergoes cross- 
phosphorylation of one unit by another unit within a dimer. However, 
unlike that of the insulin receptor, the site of this phosphorylation is not 
within the activation loop of the kinase, but rather in a region that lies on 
the C-terminal side of the kinase domain. As many as five tyrosine resi- 
dues in this region are phosphorylated. The dimerization of the EGF 
receptor brings the C-terminal region on one receptor into the active site 
of its partner’s kinase. The kinase itself is in an active conformation 
without phosphorylation, revealing again how this signaling system is 
poised to respond. 


EGF signaling leads to the activation of Ras, a small G protein 


The phosphotyrosines on the EGF receptors act as docking sites for SH2 
domains on other proteins. The intracellular signaling cascade begins with 
the binding of Grb2, a key adaptor protein that contains one SH2 domain 
and two Src homology 3 (SH3) domains. On phosphorylation of the recep- 
tor, the SH2 domain of Grb2 binds to the phos- 

photyrosine residues of the receptor tyrosine EGF 
kinase. Through its two SH3 domains, Grb2 then gages 
binds polyproline-rich polypeptides within a pro- 
tein called Sos. Sos, in turn, binds to Ras and 
activates it. A very prominent signal-transduction 
component, Ras is a member of a class of pro- 
teins called the small G proteins. Like the G 
proteins described in Section 14.1, the small G 
proteins contain bound GDP in their unacti- 
vated forms. Sos opens up the nucleotide-binding 
pocket of Ras, allowing GDP to escape and GTP 
to enter in its place. Because of its effect on Ras, 
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\ FIGURE 14.30 Structure of the 
unactivated EGF receptor. The 


extracellular domain of the EGF receptor is 


shown in the absence of bound EGF. 


Notice that the dimerization arm is bound 


to a part of the receptor that makes it 


unavailable for interaction with the other 


receptor. [Drawn from 1NQL.pdb.] 
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Sos 1s referred to as a guanine-nucleotide-exchange FIGURE 14.31 Ras activation mechanism. The dimerization of the EGF 
factor (GEF). Thus, the binding of EGF to its __ receptor due to EGF binding leads to: (1) the phosphorylation of the 
receptor leads to the conversion of Ras into its C-terminal tails of the receptor, (2) the subsequent recruitment of Grb2 and 
GTP form through the intermediacy of Grb2 Sos, and (3) the exchange of GTP for GDP in Ras. This signal-transduction 


and Sos (Figure 14.31). 


pathway results in the conversion of Ras into its activated GTP-bound form. 
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FIGURE 14.32 EGF signaling pathway. 
The key steps in the pathway initiated by 
EGF binding to the EGF receptor. A kinase 
cascade leads to the phosphorylation of 
transcription factors and concomitant 
changes in gene expression. 
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Activated Ras initiates a protein kinase cascade 


Ras changes conformation when it is transformed from its GDP into its 
GTP form. In the GTP form, Ras binds other proteins, including a protein 
kinase termed Raf. When bound to Ras, Raf undergoes a conformational 
change that activates the Raf protein kinase domain. Both Ras and Raf are 
anchored to the membrane through covalently bound lipid modifications. 
Activated Raf then phosphorylates other proteins, including protein kinases 
termed MEKs. In turn, MEKs activate kinases called extracellular signal- 
regulated kinases (ERKs). ERKs then phosphorylate numerous substrates, 
including transcription factors in the nucleus as well as other protein 
kinases. The complete flow of information from the arrival of EGF at the 
cell surface to changes in gene expression is summarized in Figure 14.32. 


Ww Small G proteins, or small GTPases, constitute a large superfamily 
of proteins—grouped into subfamilies called Ras, Rho, Arf, Rab, 
and Ran—that play a major role in a host of cell functions including growth, 
differentiation, cell motility, cytokinesis (the separation of two cells during 
division), and the transport of materials throughout the cell (Table 14.2). 
As with the heterotrimeric G proteins, the small G proteins cycle between 
an active GTP-bound form and an inactive GDP-bound form. They differ 
from the heterotrimeric G proteins in being smaller (20-25 kDa versus 
30-35 kDa) and monomeric. Nonetheless, the two families are related 
by divergent evolution, and small G proteins have many key mechanistic 
and structural motifs in common with the G, subunit of the heterotrimeric 
G proteins. 


EGF signaling is terminated by protein phosphatases and the 
intrinsic GTPase activity of Ras 


Because so many components of the EGF signal-transduction pathway are 
activated by phosphorylation, we can expect protein phosphatases to play 
key roles in the termination of EGF signaling. Indeed, crucial phosphatases 
remove phosphoryl groups from tyrosine residues on the EGF receptor and 
from serine, threonine, and tyrosine residues in the protein kinases that 
participate in the signaling cascade. The signaling process itself sets in 
motion the events that activate many of these phosphatases. Consequently, 
signal activation also initiates signal termination. 

Like the G proteins activated by 7T’M receptors, Ras possesses intrinsic 
GTPase activity. Thus, the activated GTP form of Ras spontaneously con- 
verts into the inactive GDP form. The rate of conversion can be accelerated 
in the presence of GTPase-activating proteins (GAPs), which interact with 
small G proteins in the GTP form and facilitate GTP hydrolysis. Thus, the 
lifetime of activated Ras is regulated by accessory proteins in the cell. The 
GTPase activity of Ras is crucial for shutting off signals leading to cell 
growth, and so it is not surprising that mutations in Ras are found in many 
types of cancer, as discussed in Section 14.5. 


TABLE 14.2 Ras superfamily of GTPases 


Subfamily Function 

Ras Regulates cell growth through serine-threonine protein kinases 

Rho Reorganizes cytoskeleton through serine-threonine protein kinases 

Arf Activates the ADP-ribosyltransferase of the cholera toxin A subunit; regulates 
vesicular trafficking pathways; activates phospholipase D 

Rab Plays a key role in secretory and endocytotic pathways 

Ran Functions in the transport of RNA and protein into and out of the nucleus 


14.4 Many Elements Recur with Variation in Different 
Signal-Transduction Pathways 


We can begin to make sense of the complexity of signal-transduction path- 
ways by taking note of several common themes that have appeared consis- 
tently in the pathways described in this chapter and underlie many 
additional signaling pathways not considered herein. 


1. Protein kinases are central to many signal-transduction pathways. Protein 
kinases are central to all three signal-transduction pathways described in 
this chapter. In the epinephrine-initiated pathway, cAMP-dependent pro- 
tein kinase (PKA) lies at the end of the pathway, transducing information 
represented by an increase in cAMP concentration into covalent modifica- 
tions that alter the activity of key metabolic enzymes. In the insulin- and 
EGF- initiated pathways, the receptors themselves are protein kinases and 
several additional protein kinases participate downstream in the pathways. 
Signal amplification due to protein kinase cascades is a feature common to 
all three pathways. Although not presented in this chapter, protein kinases 
often phosphorylate multiple substrates and are thus able to generate a 
diversity of responses. 


2. Second messengers participate in many signal-transduction pathways. 
We have encountered several second messengers, including cAMP, 
Ca’*, IP3, and the lipid DAG. Because second messengers are generated 
by enzymes or by the opening of ion channels, their concentrations 
can be tremendously amplified compared with the signals that lead to 
their generation. Specialized proteins sense the concentrations of these 
second messengers and continue the flow of information along signal- 
transduction pathways. 

The second messengers that we have seen recur in many additional 
signal-transduction pathways. For example, in a consideration of the sen- 
sory systems in Chapter 33, we will see how Ca’*t-based signaling and 
cyclic nucleotide-based signaling play key roles in vision and olfaction. 


3. Specialized domains that mediate specific interactions are present in many 
signaling proteins. The “wiring” of many signal-transduction pathways is 
based on particular protein domains that mediate the interactions between 
protein components of a particular signaling cascade. We have encountered 
several of them, including pleckstrin homology domains, which facilitate 
protein interactions with the lipid PIP,; SH2 domains, which mediate inter- 
actions with polypeptides containing phosphorylated tyrosine residues; and 
5H3 domains, which interact with peptide sequences that contain multiple 
proline residues. Many other such domain families exist. In many cases, 
individual members of each domain family have unique features that allow 
them to bind to their targets only within a particular sequence context, mak- 
ing them specific for a given signaling pathway and avoiding unwanted 
cross-talk. Signal-transduction pathways have evolved in large part by the 
incorporation of DNA fragments encoding these domains into genes encoding 
pathway components. 

The presence of these domains is tremendously helpful to scientists try- 
ing to unravel signal-transduction pathways. When a protein in a signal- 
transduction pathway is identified, its amino acid sequence can be analyzed 
for the presence of these specialized domains by the methods described in 
Chapter 6. If one or more domains of known function is found, it is often 
possible to develop clear hypotheses about potential binding partners and 
signal-transduction mechanisms. 
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14.5 Defects in Signal-Transduction Pathways Can Lead 
to Cancer and Other Diseases 


e In light of their complexity, it comes as no surprise that signal- 

transduction pathways occasionally fail, leading to disease states. 
Cancer, a set of diseases characterized by uncontrolled or inappropriate cell 
growth, is strongly associated with defects in signal-transduction proteins. 
Indeed, the study of cancer, particularly cancers caused by certain viruses, 
has contributed greatly to our understanding of signal-transduction pro- 
teins and pathways. 

For example, Rous sarcoma virus 1s a retrovirus that causes sarcoma (a 
cancer of tissues of mesodermal origin such as muscle or connective tis- 
sue) in chickens. In addition to the genes necessary for viral replication, 
this virus carries a gene termed v-src. The v-src gene is an oncogene; it 
leads to the generation of cancerlike characteristics in susceptible cell 
types. The protein encoded by the v-src gene, v-Src, is a protein tyrosine 
kinase that includes SH2 and SH3 domains. The v-Src protein is similar 
in amino acid sequence to a protein normally found in chicken-muscle 
cells referred to as c-Src (for cellular Src; Figure 14.33A). The c-src gene 
does not induce cell transformation and is termed a proto-oncogene, refer- 
ring to the fact that this gene, when mutated, can be converted into an 
oncogene. The protein that it encodes is a signal-transduction protein that 
regulates cell growth. 

Why is the biological activity of the v-Sre protein so different from that 
of c-Src? c-Src contains a key tyrosine residue near its C-terminal end that, 
when phosphorylated, is bound intramolecularly by the upstream SH2 

domain (Figure 14.33B). This interaction maintains the kinase 

domain in an inactive conformation. However, in v-Src, the 
® C-terminal 19 amino acids of c-Src are replaced by a completely dif- 
y ferent stretch of 11 amino acids that lacks this critical tyrosine resi- 
due. Thus, v-Src is always active and can promote unregulated cell 
growth. Since the discovery of Src, many other mutated protein 
kinases have been identified as oncogenes. 

The gene encoding Ras, a component of the EGF -initiated path- 
way, is one of the genes most commonly mutated in human tumors. 
Mammalian cells contain three 21-kDa Ras proteins (H-, K-, and 
N-Ras), each of which cycles between inactive GDP and active GTP 


Protein forms. The most common mutations in tumors lead to a loss of 
kinase the ability to hydrolyze GTP. Thus, the Ras protein is trapped in the 


“on” position and continues to stimulate cell growth, even in the 
absence of a continuing signal. 

Other genes can contribute to cancer development only when 
both copies of the gene normally present in a cell are deleted or oth- 
erwise damaged. Such genes are called tumor-suppressor genes. For 


Phosphotyrosine 


kK FIGURE 14.33 Src structure. (A) Cellular Src 
includes an SH3 domain, an SH2 domain, a protein 
kinase domain, and a carboxyl-terminal tail that 
includes a key tyrosine residue. (B) Structure of c-Src 
in an inactivated form with the key tyrosine residue 
phosphorylated. Notice how the three domains work 
together to keep the enzyme in an inactive 
conformation: the phosphotyrosine residue is bound 
in the SH2 domain and the linker between the SH2 
domain and the protein kinase domain is bound by 
the SH8 domain. [Drawn from 2PTK.pdb.] 


example, genes for some of the phosphatases that participate in the 
termination of EGF signaling are tumor suppressors. Without any 
functional phosphatase present, EGF signaling persists once initi- 
ated, stimulating inappropriate cell growth. 


Monoclonal antibodies can be used to inhibit signal-transduction 
pathways activated in tumors 


ca Mutated or overexpressed receptor tyrosine kinases are fre- 

quently observed in tumors. For instance, the epidermal- 
growth-factor receptor (EGFR) is overexpressed in some human 
epithelial cancers, including breast, ovarian, and colorectal cancer. 


Because some small amount of the receptor can dimerize and activate the 
signaling pathway even without binding to EGF, overexpression of the 
receptor increases the likelihood that a “grow and divide” signal will be 
inappropriately sent to the cell. This understanding of cancer-related signal- 
transduction pathways has led to a therapeutic approach that targets the 
EGER. The strategy is to produce monoclonal antibodies to the extracellular 
domains of the offending receptors. One such antibody, cetuximab (Erbitux), 
has effectively targeted the EGFR in colorectal cancers. Cetuximab inhibits 
the EGFR by competing with EGF for the binding site on the receptor. 
Because the antibody sterically blocks the change in conformation that 
exposes the dimerization arm, the antibody itself cannot induce dimeriza- 
tion. The result is that the EGFR-controlled pathway is not initiated. 

Cetuximab is not the only monoclonal antibody that has been developed 
to target a receptor tyrosine kinase. Trastuzumab (Herceptin) inhibits 
another EGFR family member, HER2, that is overexpressed in approxi- 
mately 30% of breast cancers. Recall that this protein can signal even in the 
absence of ligand, so it is especially likely that overexpression will stimulate 
cell proliferation. Breast-cancer patients are now being screened for HER2 
overexpression and treated with Herceptin as appropriate. Thus, this cancer 
treatment is tailored to the genetic characteristics of the tumor. 


Protein kinase inhibitors can be effective anticancer drugs 


oy The widespread occurrence of overactive protein kinases in cancer 

cells suggests that molecules that inhibit these enzymes might act as 
antitumor agents. For example, more than 90% of patients with chronic 
myelogenous leukemia (CML) show a specific chromosomal defect in can- 
cer cells (Figure 14.34). The translocation of genetic material between chro- 
mosomes 9 and 22 causes the c-abl gene, which encodes a tyrosine kinase of 
the Src family, to be inserted into the bcr gene on chromosome 22. The result 
is the production of a fusion protein called Bcr-Abl that consists primarily 
of sequences for the c-Abl kinase. However, the bcr-abl gene is not regulated 
appropriately; it is expressed at higher levels than that of the gene encoding 
the normal c-Abl kinase, stimulating a growth-promoting pathway. Because 
of this overexpression, leukemia cells express a unique target for chemo- 
therapy. A specific inhibitor of the Bcr-Abl kinase, Gleevec (STI-571, 
imatinib mesylate), has proved to bea highly effective treatment for patients 
suffering from CML. This approach to cancer chemotherapy is fundamen- 
tally distinct from most approaches, which target all rapidly growing cells, 
including normal ones. Because Gleevec targets tumor cells specifically, 
side effects caused by the impairment of normal dividing cells can be mini- 
mized. Thus, our understanding of signal-transduction pathways is leading to 
conceptually new disease treatment strategies. 


Cholera and whooping cough are the result of altered G-protein activity 


cz Although defects in signal-transduction pathways have been most 

extensively studied in the context of cancer, such defects are important 
in many other diseases. Cholera and whooping cough are two pathologies of 
the G-protein-dependent signal pathways. Let us first consider the mecha- 
nism of action of the cholera toxin, secreted by the intestinal bacterium Vibrio 
cholerae. Cholera is a potentially life-threatening, acute diarrheal disease 
transmitted through contaminated water and food. It causes the voluminous 
secretion of electrolytes and fluids from the intestines of infected persons. 
The cholera toxin, also called choleragen, is a protein composed of two func- 
tional units—a B subunit that binds to G,,, gangliosides (Section 26.1) of 
the intestinal epithelium and a catalytic A subunit that enters the cell. 
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FIGURE 14.34 Formation of the bcr-ab/ 
gene by translocation. In chronic 
myelogenous leukemia, parts of 
chromosomes 9 and 22 are reciprocally 
exchanged, causing the bcr and ab/ genes 
to fuse. The protein kinase encoded by the 
bcr-abl gene is expressed at higher levels 
in tumor cells than is the c-ab/ gene in 
normal cells. 
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The A subunit catalyzes the covalent modification of a G,, protein: the a 
subunit is modified by the attachment of an ADP-ribose to an arginine 
residue. This modification stabilizes the GTP-bound form of G,., trap- 
ping the molecule in its active conformation. The active G protein, in turn, 
continuously activates protein kinase A. PKA opens a chloride channel and 
inhibits sodium absorption by the Na*—H* exchanger by phosphorylating 
both the channel and the exchanger. The net result of the phosphorylation 
is an excessive loss of NaCl and the loss of large amounts of water into the 
intestine. Patients suffering from cholera may pass as much as twice their 
body weight in fluid in 4 to 6 days. Treatment consists of rehydration with 
a glucose—electrolyte solution. 

Whereas cholera is a result of a G protein trapped in the active confor- 
mation, causing the signal-transduction pathway to be persistently stimu- 
lated, pertussis is a result of the opposite situation. Pertussis toxin is 
secreted by Bordetella pertussis, the bacterium responsible for whooping 
cough. Like choleragen, pertussis toxin adds an ADP-ribose moiety to a 
G,, subunit. However, in this case, the ADP-ribose group 1s added toa G,; 
protein, a G, subunit that inhibits adenylate cyclase, closes Ca”* chan- 
nels, and opens K* channels. The effect of this modification is to prevent 
binding of the heterotrimeric G, protein to its receptor, trapping it in the 
“off” conformation. The pulmonary symptoms have not yet been traced to 
a particular target of the G,, protein. 


SUMMARY 


In humans and other multicellular organisms, specific signal mole- 
cules are released from cells in one organ and are sensed by cells in 
other organs throughout the body. The message initiated by an extra- 
cellular ligand is converted into specific changes in metabolism or 
gene expression by means of often complex networks referred to as 
signal-transduction pathways. These pathways amplify the initial sig- 
nal and lead to changes in the properties of specific effector molecules. 


14.1 Heterotrimeric G Proteins Transmit Signals and Reset Themselves 
Epinephrine binds to a cell-surface protein called the B-adrenergic 
receptor. This receptor is a member of the seven-transmembrane- 
helix receptor family, so named because each receptor has seven a 
helices that span the cell membrane. When epinephrine binds to the 
B-adrenergic receptor on the outside of the cell, the receptor under- 
goes a conformational change that is sensed inside the cell by a signal- 
ing protein termed a heterotrimeric G protein. The a subunit of the G 
protein exchanges a bound GDP molecule for GTP and concomi- 
tantly releases the heterodimer consisting of the B and y subunits. The 
a subunit in the GTP form then binds to adenylate cyclase and acti- 
vates it, leading to an increase in the concentration of the second mes- 
senger cyclic AMP. This increase in cyclic AMP concentration, in 
turn, activates protein kinase A. Other 71M receptors also signal 
through heterotrimeric G proteins, although these pathways often 
include enzymes other than adenylate cyclase. One prominent 
pathway, the phosphoinositide pathway, leads to the activation of 
phospholipase C, which cleaves a membrane lipid to produce two sec- 
ondary messengers, diacylglycerol and inositol 1,4,5-trisphosphate. 
An increased IP, concentration leads to the release of calcium ion, 
another important second messenger, into the cell. G-protein signal- 
ing is terminated by the hydrolysis of the bound GTP to GDP. 


14.2 


14.3 


14.4 


14.5 


Insulin Signaling: Phosphorylation Cascades Are Central to Many 
Signal-Transduction Processes 

Protein kinases are key components in many signal-transduction 
pathways, including some for which the protein kinase is an integral 
component of the initial receptor. An example of such a receptor is the 
membrane tyrosine kinase bound by insulin. Insulin binding causes 
one subunit within the dimeric receptor to phosphorylate specific 
tyrosine residues in the other subunit. The resulting conformational 
changes dramatically increase the kinase activity of the receptor. The 
activated receptor kinase initiates a signaling cascade that includes 
both lipid kinases and protein kinases. This cascade eventually leads 
to the mobilization of glucose transporters to the cell surface, increas- 
ing glucose uptake. Insulin signaling is terminated through the action 
of phosphatases. 


EGF Signaling: Signal-Transduction Systems Are Poised to Respond 
Only minor modifications are necessary to transform many signal- 
transduction proteins from their inactive into their active forms. 
Epidermal growth factor also signals through a receptor tyrosine 
kinase. EGF binding induces a conformational change that allows 
receptor dimerization and cross-phosphorylation. The phosphoryl- 
ated receptor binds adaptor proteins that mediate the activation of 
Ras, asmall G protein. Activated Ras initiates a protein kinase cascade 
that eventually leads to the phosphorylation of transcription factors 
and changes in gene expression. EGF signaling is terminated by the 
action of phosphatases and the hydrolysis of GTP by Ras. 


Many Elements Recur with Variation in Different Signal- 

Transduction Pathways 

Protein kinases are components of many signal-transduction path- 
ways, both as components of receptors and in other roles. Second mes- 
sengers, including cyclic nucleotides, calcium, and lipid derivatives, 
are common in many signaling pathways. The changes in the concen- 
trations of second messengers are often much larger than the changes 
associated with the initial signal owing to amplification along the 
pathway. Small domains that recognize phosphotyrosine residues or 
specific lipids are present in many signaling proteins and are essential 
to determining the specificity of interactions. 


Defects in Signal-Transduction Pathways Can Lead to Cancer 

and Other Diseases 

Genes encoding components of signal-transduction pathways that 
control cell growth are often mutated in cancer. Some genes can be 
mutated to forms called oncogenes that are active regardless of appro- 
priate signals. Monoclonal antibodies directed against cell-surface 
receptors that participate in signaling have been developed for use in 
cancer treatment. Our understanding of the molecular basis of cancer 
is leading to the development of anticancer drugs directed against spe- 
cific targets, such as the specific kinase inhibitor Gleevec. 
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Key Terms 


KEY TERMS 
primary messenger (p. 398) B-adrenergic receptor (B-AR) (p. 399) G protein (p. 400) 
ligand (p. 398) seven-transmembrane-helix (7TM) agonist (p. 401) 
second messenger (p. 398) receptor (p. 399) G-protein-coupled receptor 


cross talk (p. 398) rhodopsin (p. 400) 


(GPCR) (p. 402) 
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adenylate cyclase (p. 402) insulin (p. 408) 

protein kinase A (PKA) (p. 403) 

B-adrenergic receptor kinase (p. 404) 

phosphoinositide cascade (p. 404) 

phosphatidylinositol 4,5-bisphosphate 
(PIP,) (p. 404) 

phospholipase C (p. 404) 

protein kinase C (PKC) (p. 405) 

calmodulin (CaM) (p. 407) 

EF hand (p. 407) 


calmodulin-dependent protein kinase 
(CaM kinase) (p. 407) 


PROBLEMS 


insulin receptor (p. 408) 

tyrosine kinase (p. 408) 

receptor tyrosine kinase (p. 408) 
insulin-receptor substrate (IRS) (p. 409) 
pleckstrin homology domain (p. 410) 
adaptor protein (p. 410) 

Src homology 2 (SH2) domain (p. 410) 
epidermal growth factor (EGF) (p. 411) 
EGF receptor (EGFR) (p. 411) 
dimerization arm (p. 412) 

Src homology 3 (SH3) domain (p. 413) 


Ras (p. 413) 
small G protein (p. 413) 
guanine-nucleotide-exchange factor 


(GEF) (p. 413) 


extracellular signal-regulated kinase 
(ERK) (p. 414) 


GTPase-activating protein 


(GAP) (p. 414) 
oncogene (p. 416) 
proto-oncogene (p. 416) 
tumor-suppressor gene (p. 416) 


1. Active mutants. Some protein kinases are inactive unless 
they are phosphorylated on key serine or threonine resi- 
dues. In some cases, active enzymes can be generated by 
mutating these serine or threonine residues to aspartate. 
Explain. 


2. In the pocket. SH2 domains bind phosphotyrosine resi- 
dues in deep pockets on their surfaces. Would you expect 
SH2 domains to bind phosphoserine and phosphothreonine 
with high affinity? Why or why not? 


3. On-off. Why is the GTPase activity of G proteins cru- 
cial to the proper functioning of a cell? Why have G proteins 
not evolved to catalyze GTP hydrolysis more efficiently? 


4. Viva la différence. Why is the fact that a monomeric 
hormone binds simultaneously to two identical receptor 
molecules, thus promoting the formation of a dimer of the 
receptor, considered remarkable? 


5. Antibodies mimicking hormones. Antibodies have two 
identical antigen-binding sites. Remarkably, antibodies to 
the extracellular parts of growth-factor receptors often lead 
to the same cellular effects as does exposure to growth fac- 
tors. Explain this observation. 


6. Facile exchange. A mutated form of the a subunit of the 
heterotrimeric G protein has been identified; this form 
readily exchanges nucleotides even in the absence of an 
activated receptor. What would be the effect on a signaling 
pathway containing the mutated a subunit? 


7. Making connections. Suppose that you were investigating 
a newly discovered growth-factor signal-transduction path- 
way. You found that, if you added GTPYS, a nonhydrolyz- 
able analog of GTP, the duration of the hormonal response 
increased. What can you conclude? 


8. Diffusion rates. Usually, rates of diffusion vary 
inversely with molecular weights; so smaller molecules 


diffuse faster than do larger ones. In cells, however, calcium 
ion diffuses more slowly than does cAMP. Propose a pos- 
sible explanation. 


9. Negativity abounds. Fura-2 is not effective for the study 
of calcium levels in intact, living cells. On the basis of how 
Fura-2 is depicted on p. 406, why is it ineffective? 


10. Awash with glucose. Glucose is mobilized for ATP gen- 
eration in muscle in response to epinephrine, which acti- 
vates G,,. Cyclic AMP phosphodiesterase is an enzyme 
that converts cAMP into AMP. How would inhibitors of 
cAMP phosphodiesterase affect glucose mobilization in 
muscle? 


11. Getting it started. The insulin receptor, on dimeriza- 
tion, cross-phosphorylates the activation loop of the other 
receptor molecule, leading to activation of the kinase. 
Propose how this phosphorylation event can take place if 
the kinase starts in an inactive conformation. 


12. Many defects. Considerable effort has been directed 
toward determining the genes in which sequence variation 
contributes to the development of type 2 diabetes. 
Approximately 800 genes have been implicated. Propose an 
explanation for this observation. 


13. Growth-factor signaling. Human growth hormone 
binds to a cell-surface membrane protein that is not a recep- 
tor tyrosine kinase. The intracellular domain of the receptor 
can bind other proteins inside the cell. Furthermore, studies 
indicate that the receptor is monomeric in the absence of 
hormone but dimerizes on hormone binding. Propose a 
possible mechanism for growth-hormone signaling. 


14. Receptor truncation. You prepare a cell line that overex- 
presses a mutant form of EGFR in which the entire intra- 
cellular region of the receptor has been deleted. Predict the 
effect of overexpression of this construct on EGF signaling 
in this cell line. 


15. Hybrid. Suppose that, through genetic manipulations, a 
chimeric receptor is produced that consists of the extracel- 
lular domain of the insulin receptor and the transmembrane 
and intracellular domains of the EGF receptor. Cells 
expressing this receptor are exposed to insulin, and the level 
of phosphorylation of the chimeric receptor is examined. 
What would you expect to observe and why? What would 
you expect to observe if these cells were exposed to EGF? 


16. Total amplification. Suppose that each B-adrenergic 
receptor bound to epinephrine converts 100 molecules of 
G,,, into their GTP forms and that each molecule of acti- 
vated adenylate cyclase produces 1000 molecules of cAMP 
per second. With the assumption of a full response, how 
many molecules of cAMP will be produced in 1 s after the 
formation of a single complex between epinephrine and the 
B-adrenergic receptor? 


Chapter Integration Problems 


17. Nerve-growth-factor pathway. Nerve-growth factor 
(NGF) binds to a protein tyrosine kinase receptor. The 
amount of diacylglycerol in the plasma membrane increases 
in cells expressing this receptor when treated with NGF. 
Propose a simple signaling pathway and identify the iso- 
form of any participating enzymes. Would you expect the 
concentrations of any other common second messengers to 
increase on NGF treatment? 


18. Redundancy. Because of the high degree of genetic vari- 
ability in tumors, typically no single anticancer therapy is 
universally effective for all patients, even within a given 
tumor type. Hence, it is often desirable to inhibit a particu- 
lar pathway at more than one point in the signaling cascade. 
In addition to the EGFR-directed monoclonal antibody 
cetuximab, propose alternative strategies for targeting the 
EGF signaling pathway for antitumor drug development. 


Mechanism Problems 


19. Distant relatives. The structure of adenylate cyclase is 
similar to the structures of some types of DNA polymer- 
ases, suggesting that these enzymes derived from a common 
ancestor. Compare the reactions catalyzed by these two 
enzymes. In what ways are they similar? 


20. Kinase inhibitors as drugs. Functional and structural 
analysis indicates that Gleevec is an ATP-competitive 
inhibitor of the Bcr-Abl kinase. In fact, many kinase inhib- 
itors under investigation or currently marketed as drugs are 
ATP competitive. Can you suggest a potential drawback of 
drugs that utilize this particular mechanism of action? 


Data Interpretation Problems 


21. Establishing specificity. You wish to determine the 
hormone-binding specificity of a newly identified mem- 
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brane receptor. Three different hormones, X, Y, and Z, were 
mixed with the receptor in separate experiments, and the 
percentage of binding capacity of the receptor was deter- 
mined as a function of hormone concentration, as shown in 
graph A. 
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(a) What concentrations of each hormone yield 50% maxi- 
mal binding? 


(b) Which hormone shows the highest binding affinity for 
the receptor? 


You next wish to determine whether the hormone-receptor 
complex stimulates the adenylate cyclase cascade. To do so, 
you measure adenylate cyclase activity as a function of hor- 
mone concentration, as shown in graph B. 
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(c) What is the relation between the binding affinity of the 
hormone-receptor complex and the ability of the hormone 
to enhance adenylate cyclase activity? What can you con- 
clude about the mechanism of action of the hormone— 
receptor complex? 


(d) Suggest experiments that would determine whether a 
G,, protein is a component of the signal-transduction 
pathway. 
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22. Binding issues. A scientist wishes to determine the num- 
ber of receptors specific for a ligand X, which he has in both 
radioactive and nonradioactive form. In one experiment, he 
adds increasing amounts of radioactive X and measures 
how much of it is bound to the cells. The result is shown as 
total activity in the following graph. Next, he performs the 
same experiment, except that he includes a several hun- 
dredfold excess of nonradioactive X. This result is shown as 
nonspecific binding. The difference between the two curves 
is the specific binding. 


Total binding 


Specific binding 


[Ligand bound] —~ 


Nonspecific binding 


[Ligand] —~ 


(a) Why is the total binding not an accurate representation 
of the number of receptors on the cell surface? 


(b) What is the purpose of performing the experiment in 
the presence of excess nonradioactive ligand? 


(c) What is the significance of the fact that specific binding 
attains a plateau? 


23. Counting receptors. With the use of experiments such as 
those described in Problems 21 and 22, the number of 
receptors in the cell membrane can be calculated. Suppose 
that the specific activity of the ligand is 10!* cpm per 
millimole and that the maximal specific binding is 10* cpm 
per milligram of membrane protein. There are 10!° cells per 
milligram of membrane protein. Assume that one ligand 
binds per receptor. Calculate the number of receptor mole- 
cules present per cell. 


Metabolism: Basic Concepts 
and Design 


WUC 
A, 
Te 


An infinite number of books can be written with only a limited number of 
letters, 26 in the case of English. Likewise, the complex biochemistry of a 
cell—intermediary metabolism—is constructed from a limited number of 
recurring motifs, reactions, and molecules. One role of intermediary metabolism 
is the conversion of environmental energy into the cellular energy currency, 
ATP, represented by the model at the right. [(Left) MARKA/Alamy.] 


he concepts of conformation and dynamics developed in Part I— 

especially those dealing with the specificity and catalytic power of 
enzymes, the regulation of their catalytic activity, and the transport of 
molecules and ions across membranes—enable us to now ask questions 
fundamental to biochemistry: 


1. How does a cell extract energy and reducing power from its environment? 


2. How does a cell synthesize the building blocks of its macromolecules and 
then the macromolecules themselves? 


These processes are carried out by a highly integrated network of chemical 
reactions collectively known as metabolism or intermediary metabolism. 
More than a thousand chemical reactions take place in even as simple an 
organism as Escherichia coli. The array of reactions may seem overwhelming 
at first glance. However, closer scrutiny reveals that metabolism has a coher- 
ent design containing many common motifs. These motifs include the use of an 
energy currency and the repeated appearance of a limited number of acti- 
vated intermediates. In fact, a group of about 100 molecules play central 
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FIGURE 15.1 Glucose metabolism. 
Glucose is metabolized to pyruvate in 10 
linked reactions. Under anaerobic conditions, 
pyruvate is metabolized to lactate and, 
under aerobic conditions, to acetyl CoA. The 
glucose-derived carbons of acetyl CoA are 
subsequently oxidized to CO,. 


roles in all forms of life. Furthermore, although the number of reactions in 
metabolism is large, the number of kinds of reactions is small and the mech- 
anisms of these reactions are usually quite simple. Metabolic pathways are 
also regulated in common ways. The purpose of this chapter is to introduce 
some general principles of metabolism to provide a foundation for the more 
detailed studies to follow. These principles are: 


1. Fuels are degraded and large molecules are constructed step by step ina 
series of linked reactions called metabolic pathways. 


2. An energy currency common to all life forms, adenosine triphosphate 
(ATP), links energy-releasing pathways with energy-requiring pathways. 


3. The oxidation of carbon fuels powers the formation of ATP. 


4, Although there are many metabolic pathways, a limited number of types 
of reactions and particular intermediates are common to many pathways. 


5. Metabolic pathways are highly regulated. 


15.1 Metabolism Is Composed of Many Coupled, 
Interconnecting Reactions 


Living organisms require a continual input of free energy for three major 
purposes: (1) the performance of mechanical work in muscle contraction 
and cellular movements, (2) the active transport of molecules and ions, and 
(3) the synthesis of macromolecules and other biomolecules from simple 
precursors. The free energy used in these processes, which maintain an 
organism in a state that is far from equilibrium, is derived from the environ- 
ment. Photosynthetic organisms, or phototrophs, obtain this energy by trap- 
ping sunlight, whereas chemotrophs, which include animals, obtain energy 
through the oxidation of foodstuffs generated by phototrophs. 


Metabolism consists of energy-yielding and energy-requiring reactions 


Metabolism is essentially a sequence of chemical reactions that begins with a 
particular molecule and results in the formation of some other molecule or 
molecules in a carefully defined fashion (Figure 15.1). There are many such 
defined pathways in the cell (Figure 15.2), and we will examine a few of 
them in some detail later. These pathways are interdependent, and their 
activity is coordinated by exquisitely sensitive means of communication in 
which allosteric enzymes are predominant (Section 10.1). We considered 
the principles of this communication in Chapter 14. 

We can divide metabolic pathways into two broad classes: (1) those that 
convert energy from fuels into biologically useful forms, and (2) those that 
require inputs of energy to proceed. Although this division is often impre- 
cise, it is nonetheless a useful distinction in an examination of metabolism. 
Those reactions that transform fuels into cellular energy are called catabolic 
reactions or, more generally, catabolism. 


atabolism 


Fuel (carbohydrates, fats) Ss CO, + H,O + useful energy 
Those reactions that require energy—such as the synthesis of glucose, fats, 
or DNA—are called anabolic reactions or anabolism. The useful forms of 
energy that are produced in catabolism are employed in anabolism to gen- 
erate complex structures from simple ones, or energy-rich states from 
energy-poor ones. 
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Some pathways can be either anabolic or catabolic, depending on the energy 
conditions in the cell. These pathways are referred to as amphibolic 
pathways. 

An important general principle of metabolism is that biosynthetic and 
degradative pathways are almost always distinct. This separation is necessary 
for energetic reasons, as will be evident in subsequent chapters. It also facili- 
tates the control of metabolism. 


A thermodynamically unfavorable reaction can be driven by a 
favorable reaction 


How are specific pathways constructed from individual reactions? A path- 
way must satisfy minimally two criteria: (1) the individual reactions must be 
specific, and (2) the entire set of reactions that constitute the pathway must 
be thermodynamically favored. A reaction that is specific will yield only one 
particular product or set of products from its reactants. As discussed in 
Chapter 8, enzymes provide this specificity. The thermodynamics of 
metabolism is most readily approached in relation to free energy, which was 
discussed in chapters 1 and 8. A reaction can occur spontaneously only if 
AG, the change in free energy, is negative. Recall that AG for the formation 
of products C and D from substrates A and B is given by 


[C][D] 
[A][B] 
Thus, the AG of a reaction depends on the nature of the reactants and prod- 


ucts (expressed by the AG®’ term, the standard free-energy change) and on 
their concentrations (expressed by the second term). 


AG = AG*’ + RTIn 
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FIGURE 15.2 Metabolic pathways. Each 
node represents a specific metabolite. 
[From the Kyoto Encyclopedia of Genes and 
Genomes (www.genome.ad.jp/kegg).] 
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An important thermodynamic fact is that the overall free-energy change 
for a chemically coupled series of reactions is equal to the sum of the free-energy 
changes of the individual steps. Consider the following reactions: 


—— B+C AG?’ = +21 kJ mol! (+5 kcal mol‘) 
BD AG? = —34kJ mol! (—8 kcal mol!) 
— C+ D AG? = —13kJ mol! (—3 kcal mol!) 


Under standard conditions, A cannot be spontaneously converted into B 
and C, because AG®’ is positive. However, the conversion of B into D 
under standard conditions is thermodynamically feasible. Because free- 
energy changes are additive, the conversion of A into C and D has a AG*’ 
of —13 kJ mol~! (—3 kcal mol~!), which means that it can occur spontane- 
ously under standard conditions. Thus, a thermodynamically unfavorable 
reaction can be driven by a thermodynamically favorable reaction to which it 
is coupled. In this example, the reactions are coupled by the shared chemi- 
cal intermediate B. Metabolic pathways are formed by the coupling of 
enzyme-catalyzed reactions such that the overall free energy of the path- 
way is negative. 


15.2 ATP Is the Universal Currency of Free Energy in 
Biological Systems 


Just as commerce is facilitated by the use of a common currency, the 
commerce of the cell—metabolism—is facilitated by the use of a com- 
mon energy currency, ddenosine triphosphate (ATP). Part of the free 
energy derived from the oxidation of foodstuffs and from light is trans- 
formed into this highly accessible molecule, which acts as the free-energy 
donor in most energy-requiring processes such as motion, active trans- 
port, and biosynthesis. Indeed, most of catabolism consists of reactions 
that extract energy from fuels such as carbohydrates and fats and convert 
it into ATP. 


ATP hydrolysis is exergonic 


ATP isa nucleotide consisting of adenine, a ribose, and a triphosphate unit 
(Figure 15.3). The active form of ATP is usually a complex of ATP with 
Mg?? or Mn?*. In considering the role of ATP as an energy carrier, we can 
focus on its triphosphate moiety. ATP is an energy-rich molecule because its 
triphosphate unit contains two phosphoanhydride bonds. A large amount of 
free energy is liberated when ATP is hydrolyzed to adenosine diphosphate 
(ADP) and orthophosphate (P.) or when ATP is hydrolyzed to adenosine 


monophosphate (AMP) and pyrophosphate (PP.). 


ATP + H,O == ADP + P; 
AG*’ = —30.5kJ mol”! (—7.3 kcal mol!) 


ATP + H,O = — AMP + PP; 
AG®’ = —45.6kJ mol! (—10.9 kcal mol!) 


The precise AG for these reactions depends on the ionic strength of the 
medium and on the concentrations of Mg?* and other metal ions (problems 
23 and 34). Under typical cellular concentrations, the AG for these hydro- 
lyses is approximately —50 kJ mol~! (—12 kcal mol~!). 
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FIGURE 15.3 Structures of ATP, ADP, and AMP. These adenylates 

consist of adenine (blue), a ribose (black), and a tri-, di-, or 
HO OH monophosphate unit (red). The innermost phosphorus atom of ATP is 

Adenosine monophosphate (AMP) designated P,, the middle one Pa and the outermost one a 


The free energy liberated in the hydrolysis of ATP is harnessed to 
drive reactions that require an input of free energy, such as muscle con- 
traction. In turn, ATP is formed from ADP and P- when fuel molecules 
are oxidized in chemotrophs or when light is trapped by phototrophs. 
This ATP—ADP cycle is the fundamental mode of energy exchange in bio- 
logical systems. 

Some biosynthetic reactions are driven by the hydrolysis of other 
nucleoside triphosphates—namely, guanosine triphosphate (GTP), uri- 
dine triphosphate (UTP), and cytidine triphosphate (CTP). The diphos- 
phate forms of these nucleotides are denoted by GDP, UDP, and CDP, 
and the monophosphate forms are denoted by GMP, UMP, and CMP. 
Enzymes catalyze the transfer of the terminal phosphoryl group from 
one nucleotide to another. The phosphorylation of nucleoside mono- 
phosphates is catalyzed by a family of nucleoside monophosphate kinases, 
as discussed in Section 9.4. The phosphorylation of nucleoside diphos- 
phates is catalyzed by nucleoside diphosphate kinase, an enzyme with 
broad specificity. 


Nucleoside monophosphate 
kinase 


NMP + ATP == NDP + ADP 


Nucleoside 
monophosphate 


Nucleoside diphosphate 


kinase 


NDP + ATP == NTP + ADP 


Nucleoside 
diphosphate 


It is intriguing to note that although all of the nucleotide triphosphates 
are energetically equivalent, ATP is nonetheless the primary cellular energy 
carrier. In addition, two important electron carriers, NAD* and FAD, as 
well the acyl group carrier, coenzyme A, are derivatives of ATP. The vole of 
ATP in energy metabolism is paramount. 


ATP hydrolysis drives metabolism by shifting the equilibrium 
of coupled reactions 


An otherwise unfavorable reaction can be made possible by coupling to ATP 
hydrolysis. Consider a reaction that is thermodynamically unfavorable 
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without an input of free energy, a situation common to most biosynthetic 
reactions. Suppose that the standard free energy of the conversion of com- 
pound A into compound B is +16.7 kJ mol~! (+4.0 kcal mol~!): 


A == B AG” = +16.7kJ mol”! (+4 kcal mol‘) 


The equilibrium constant K/_ of this reaction at 25°C is related to AG*’ (in 
units of kilojoules per mole) by 

=e e "Sis ei 
Thus, net conversion of A into B cannot take place when the molar ratio of 
B to A is equal to or greater than 1.15 X 107%. However, A can be converted 
into B under these conditions if the reaction is coupled to the hydrolysis of 
ATP. Under standard conditions, the AG®’ of hydrolysis is approximately 
—30.5 kJ mol! (—7.3 kcal mol~!). The new overall reaction is 


A+ ATP + H20 =— B+ ADP + P; 

AG?’ = —13.8kJ mol” !(—3.3 kcal mol‘) 
Its free-energy change of —13.8 kJ mol~! (—3.3 kcal mol~') is the sum 
of the value of AG®°’ for the conversion of A into B [+16.7 kJ mol! 
(+4.0 kcal mol~!)] and the value of AG°’ for the hydrolysis of ATP 
[—30.5 kJ mol~! (—7.3 kcal mol~')]. At pH 7, the equilibrium constant 
of this coupled reaction is 
[Bleq , [ADP ]eqlPileg 
[Aleq [ATP ]eq 


Klg = = 9'3:8/247 _ 9 67 yx 102 

At equilibrium, the ratio of [B] to [A] is given by 
[B] [ATP ]eq 
[A] “ADP JeqlPile 


eq 


eq q 


which means that the hydrolysis of ATP enables A to be converted into B 
until the [B]/[A] ratio reaches a value of 2.67 X 102. This equilibrium 
ratio is strikingly different from the value of 1.15 X 10~? for the reaction 
A-—>B in the absence of ATP hydrolysis. In other words, coupling the 
hydrolysis of ATP with the conversion of A into B under standard condi- 
tions has changed the equilibrium ratio of B to A by a factor of about 10°. 
If we were to use the AG of hydrolysis of ATP under cellular conditions 
[—50.2 kJ mol~! (—12 kcal mol~!)] in our calculations instead of AG’, 
the change in the equilibrium ratio would be even more dramatic, on the 
order of 108. 

We see here the thermodynamic essence of ATP’s action as an energy- 
coupling agent. Cells maintain ATP levels by using oxidizable substrates or 
light as sources of free energy for synthesizing the molecule. In the cell, the 
hydrolysis of an ATP molecule in a coupled reaction then changes the equi- 
librium ratio of products to reactants by a very large factor, of the order of 
108. More generally, the hydrolysis of n ATP molecules changes the equi- 
librium ratio of a coupled reaction (or sequence of reactions) by a factor of 
108". For example, the hydrolysis of three ATP molecules in a coupled reac- 
tion changes the equilibrium ratio by a factor of 1074. Thus, a thermody- 
namically unfavorable reaction sequence can be converted into a favorable one 
by coupling it to the hydrolysis of a sufficient number of ATP molecules in a new 
reaction. It should also be emphasized that A and B in the preceding coupled 
reaction may be interpreted very generally, not only as different chemical 
species. For example, A and B may represent activated and unactivated 
conformations of a protein that is activated by phosphorylation with ATP. 


Through such changes in protein conformation, molecular motors such as 
myosin, kinesin, and dynein convert the chemical energy of ATP into 
mechanical energy (Chapter 34). Indeed, this conversion is the basis of 
muscle contraction. 

Alternatively, A and B may refer to the concentrations of an ion or mol- 
ecule on the outside and inside of a cell, as in the active transport of a nutri- 
ent. The active transport of Na* and K* across membranes is driven by the 
phosphorylation of the sodium—potassium pump by ATP and its subse- 
quent dephosphorylation (Section 13.2). 


The high phosphoryl potential of ATP results from structural 
differences between ATP and its hydrolysis products 


What makes ATP an efficient phosphoryl-group donor? Let us compare 
the standard free energy of hydrolysis of ATP with that of a phosphate ester, 
such as glycerol 3-phosphate: 


ATP + H,O == ADP + P; 
AG°’ = —30.5kJ mol! (—7.3 kcal mol!) 


Glycerol 3-phosphate + H,O0 == glycerol + P; 
AG°’ = —9.2kJ mol! (—2.2 kcal mol!) 


The magnitude of AG®°’ for the hydrolysis of glycerol 3-phosphate is 
much smaller than that of ATP, which means that ATP has a stronger 
tendency to transfer its terminal phosphoryl group to water than does 
glycerol 3-phosphate. In other words, ATP has a higher phosphoryl- 
transfer potential (phosphoryl-group-transfer potential) than does glycerol 
3-phosphate. 

The high phosphoryl-transfer potential of ATP can be explained by 
features of the ATP structure. Because AG®’ depends on the difference in 
free energies of the products and reactants, we need to examine the struc- 
tures of both ATP and its hydrolysis products, ADP and P., to answer this 
question. Four factors are important: resonance stabilization, electrostatic 
repulsion, increase in entropy, and stabilization due to hydration. 


1. Resonance Stabilization. Orthophosphate (P.), one of the products of 
ATP hydrolysis, has greater resonance stabilization than do any of the 
phosphoryl groups of ATP. Orthophosphate has a number of resonance 
forms of similar energy (Figure 15.4), whereas the y phosphoryl group 
of ATP has a smaller number. Forms like that shown in Figure 15.5 are 
unfavorable because a positively charged oxygen atom is adjacent to a 
positively charged phosphorus atom, an electrostatically unfavorable 
juxtaposition. 


if i (j . 
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FIGURE 15.4 Resonance structures of orthophosphate. 


2. Electrostatic Repulsion. At pH 7, the triphosphate unit of ATP carries 
about four negative charges. These charges repel one another because they 
are in close proximity. The repulsion between them is reduced when ATP 
is hydrolyzed. 
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FIGURE 15.5 Improbable resonance 
structure. The structure contributes little to 
the terminal part of ATP, because two 
positive charges are placed adjacent to 
each other. 
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FIGURE 15.6 Compounds with high 
phosphoryl-transfer potential. The 
role of ATP as the cellular energy 
currency is illustrated by its relation to 
other phosphorylated compounds. ATP 
has a phosphoryl-transfer potential that 
is intermediate among the biologically 
important phosphorylated molecules. 
High-phosphoryl-transfer-potential 
compounds (1,3-BPG, PEP, and 
creatine phosphate) derived from the 
metabolism of fuel molecules are used 
to power ATP synthesis. In turn, ATP 
donates a phosphoryl group to other 
biomolecules to facilitate their 
metabolism. [Data from D. L. Nelson 

and M. M. Cox, Lehninger Principles 

of Biochemistry, 5th ed. (W. H. Freeman 
and Company, 2009), Fig. 13-19.] 


3. Increase in Entropy. The entropy of the products of ATP hydrolysis is 
greater, in that there are now two molecules instead of a single ATP 
molecule. We disregard the molecule of water used to hydrolyze the 
ATP; given the high concentration (55.5 M), there is effectively no 
change in the concentration of water during the reaction. 


4. Stabilization Due to Hydration. Water binds to ADP and P., stabilizing 
these molecules, and rendering the reverse reaction, the synthesis of ATP, 
more unfavorable. 


ATP is often called a high-energy phosphate compound, and its phos- 
phoanhydride bonds are referred to as high-energy bonds. Indeed, a “squig- 
gle” (~P) is often used to indicate such a bond. Nonetheless, there is nothing 
special about the bonds themselves. They are high-energy bonds in the sense 
that much free energy is released when they are hydrolyzed, for the reasons 
listed above. 


Phosphoryl-transfer potential is an important form of cellular 
energy transformation 


The standard free energies of hydrolysis provide a convenient means 
of comparing the phosphoryl-transfer potential of phosphorylated 
compounds. Such comparisons reveal that ATP is not the only compound 
with a high phosphoryl-transfer potential. In fact, some compounds 
in biological systems have a higher phosphoryl-transfer potential 
than that of ATP. These compounds include phosphoenolpyruvate 
(PEP), 1,3-bisphosphoglycerate (1,3-BPG), and creatine phosphate 
(Figure 15.6). Thus, PEP can transfer its phosphoryl group to ADP to 
form ATP. Indeed, this transfer is one of the ways in which ATP is gen- 
erated in the breakdown of sugars (Chapter 16). It is significant that 
ATP has a phosphoryl-transfer potential that is intermediate among the 
biologically important phosphorylated molecules (Table 15.1). This 
intermediate position enables ATP to function efficiently as a carrier of 
phosphoryl groups. 
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TABLE 15.1 Standard free energies of hydrolysis of some phosphorylated 


compounds 
Compound kJ mol7! kcal mol~! 
Phosphoenolpyruvate —61.9 —14.8 
1,3-Bisphosphoglycerate —49.4 —11.8 
Creatine phosphate —43.1 —10.3 
ATP (to ADP) =80.8 = 743 
Glucose 1-phosphate —20.9 — 5.0 
Pyrophosphate =119).3) — 4.6 
Glucose 6-phosphate —13.8 = 3.3 
Glycerol 3-phosphate = 92 = 22 


The amount of ATP in muscle suffices to sustain contractile activity for 
less than a second. Creatine phosphate in vertebrate muscle serves as a res- 
ervoir of high-potential phosphoryl groups that can be readily transferred to 
ADP. Indeed, we use creatine phosphate to regenerate ATP from ADP 
every time that we exercise strenuously. This reaction is catalyzed by cre- 
atine kinase. 


Creatine 
kinase 


Creatine phosphate + ADP == ATP + creatine 


At pH 7, the standard free energy of hydrolysis of creatine phosphate is 
—43.1 kJ mol! (—10.3 kcal mol~!), compared with —30.5 kJ mol~! (—7.3 
kcal mol~!) for ATP. Hence, the standard free-energy change in forming 
ATP from creatine phosphate is — 12.6 kJ mol~! (—3.0 kcal mol~!), which 
corresponds to an equilibrium constant of 162. 


[ATP ][creatine] 


= g AG" 247 — 126/247 — 469 
[ADP] [creatine phosphate ] : . 


Keg = 


In resting muscle, typical concentrations of these metabolites are [ATP] = 
4 mM, [ADP] = 0.013 mM, [creatine phosphate] = 25 mM, and [cre- 
atine] = 13 mM. Because of its abundance and high phosphoryl-transfer 
potential relative to that of ATP, creatine phosphate is a highly effective 
phosphoryl buffer. Indeed, creatine phosphate is the major source of 
phosphoryl groups for ATP regeneration for a runner during the first 
4 seconds of a 100-meter sprint. The fact that creatine phosphate can 
replenish ATP pools is the basis of the use of creatine as a dietary supple- 
ment by athletes in sports requiring short bursts of intense activity. After 
the creatine phosphate pool is depleted, ATP must be generated through 
metabolism (Figure 15.7). 


Aerobic metabolism 
(Chapters 17 and 18) 


pe ATP 


Creatine phosphate 


Anaerobic 
metabolism 
(Chapter 16) 


Energy —> 


Minutes —> Hours —> 


Seconds —> 
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FIGURE 15.7 Sources of ATP during 
exercise. In the initial seconds, exercise is 
powered by existing high-phosphoryl- 
transfer compounds (ATP and creatine 
phosphate). Subsequently, the ATP must 
be regenerated by metabolic pathways. 
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Active transport 
_ Biosyntheses 
| Signal amplification 


Motion 


ATP ADP 


FIGURE 15.8 ATP-ADP cycle. This cycle 
is the fundamental mode of energy 
exchange in biological systems. 


Most energy 


15.3 The Oxidation of Carbon Fuels Is an Important 
Source of Cellular Energy 


ATP serves as the principal immediate donor of free energy in biological sys- 
tems rather than as a long-term storage form of free energy. In a typical cell, 
an ATP molecule is consumed within a minute of its formation. Although 
the total quantity of ATP in the body is limited to approximately 100 g, the 
turnover of this small quantity of ATP is very high. For example, a resting 
human being consumes about 40 kg (88 pounds) of ATP in 24 hours. 
During strenuous exertion, the rate of utilization of ATP may be as high as 
0.5 kg/minute. For a 2-hour run, 60 kg (132 pounds) of ATP is utilized. 
Clearly, having mechanisms for regenerating ATP is vital. Motion, active 
transport, signal amplification, and biosynthesis can take place only if ATP 
is continually regenerated from ADP (Figure 15.8). The generation of ATP 
is one of the primary roles of catabolism. The carbon in fuel molecules—such 
as glucose and fats—is oxidized to CO,. The resulting electrons are cap- 
tured and used to regenerate ATP from ADP and P.. 

In aerobic organisms, the ultimate electron acceptor in the oxidation of 
carbon is O, and the oxidation product is CO,. Consequently, the more 
reduced a carbon is to begin with, the more free energy is released by its ox1- 
dation. Figure 15.9 shows the AG®’ of oxidation for one-carbon compounds. 


Least energy 
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Fuel molecules are more complex (Figure 15.10) than the single-carbon 
compounds depicted in Figure 15.9. Nevertheless, oxidation of these fuels 
takes place one carbon at a time. The carbon-oxidation energy is used in 
some cases to create a compound with high phosphoryl-transfer potential 
and in other cases to create an ion gradient. In either case, the end point is 
the formation of ATP. 

CH20OH 
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FIGURE 15.10 Prominent fuels. Fats oO” New New New New New New New cu 
ie HO OH 3 

are a more efficient fuel source than Mi Hp Ho Ho Ho Ho Ho Hp 
carbohydrates such as glucose because OH 
the carbon in fats is more reduced. Glucose Saturated fatty acid 


Compounds with high phosphoryl-transfer potential can couple 
carbon oxidation to ATP synthesis 


How is the energy released from the oxidation of a carbon compound con- 
verted into ATP? As an example, consider glyceraldehyde 3-phosphate 
(shown in the margin), which is a metabolite of glucose formed in the 


oxidation of that sugar. The C-1 carbon (shown in red) is at the aldehyde- OS 
oxidation level and is not in its most oxidized state. Oxidation of the alde- | 
hyde to an acid will release energy. H—C—OH 


CH,0PO37- 
16) oO OH 
SH Sco ee Poe 


ZH 


Oxidation 
H—C—OH > H—C—OH 


CH,OPO3?- CH2OPO37~ 
Glyceraldehyde 3-phosphate 3-Phosphoglyceric acid 


However, the oxidation does not take place directly. Instead, the carbon 
oxidation generates an acyl phosphate, 1,3-bisphosphoglycerate. The elec- 
trons released are captured by NAD™, which we will consider shortly. 


On cH OS 


H—C—OH + NAD* + HPO,?- > H—C—OH + NADH + H* 


CH,0PO32- CH20PO32- 


Glyceraldehyde 3-phosphate 1,3-Bisphosphoglycerate 
(GAP) (1,3-BPG) 


For reasons similar to those discussed for ATP, 1,3-bisphosphoglycerate has 
a high phosphoryl-transfer potential that is, in fact, greater than that of ATP. 
Thus, the hydrolysis of 1,3-BPG can be coupled to the synthesis of ATP. 


2- 
Sc _OPOs O 


H—C—OH + ADP ——> H—C—OH + ATP 


CHOPO37- CH20PO37 
1,3-Bisphosphoglycerate 3-Phosphoglyceric acid 


The energy of oxidation is initially trapped as a high- 
phosphoryl-transfer-potential compound and then used to ( Gradient created 
form ATP. The oxidation energy of a carbon atom is Oxidation of fuels 
transformed into phosphoryl-transfer potential, first as Pump? Plenons Our 
1,3-bisphosphoglycerate and ultimately as ATP. We will 

consider these reactions in mechanistic detail in Chapter 16. ase s 


H* H* 


lon gradients across membranes provide an important Carbon ~—————V 
form of cellular energy that can be coupled to fuels + O, 
ATP synthesis 


As described in Chapter 13, electrochemical potential is ADP ATP 
an effective means of storing free energy. Indeed, the elec- +P; “7 _~+4H,0 
trochemical potential of ion gradients across membranes, ah 
produced by the oxidation of fuel molecules or by photo- —_ _- 
synthesis, ultimately powers the synthesis of most of the 
ATP in cells. In general, ion gradients are versatile means 
of coupling thermodynamically unfavorable reactions to Ht Ht 


CO, + H,0 


ated by the oxidation of carbon fuels account for more 


than 90% of ATP generation (Figure 15.11). This process FIGURE 15.11 hie igo chorstinaenalanie ine soba aes 
power the formation of proton gradients by the action of specific 


proton pumps (yellow cylinders). These proton gradients can in 


is called oxidative phosphorylation (Chapter 18). ATP 


@) Gradient used 
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favorable ones. Indeed, in animals, proton gradients gener- forms pas 


hydrolysis can then be used to form ion gradients of differ- turn drive the synthesis of ATP when the protons flow through an 


ent types and functions. The electrochemical potential ofa ATP-synthesizing enzyme (red complex). 
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Na* gradient, for example, can be tapped to pump Ca? out of cells or to 
transport nutrients such as sugars and amino acids into cells. 


Phosphates play a prominent role in biochemical processes 


We have seen in Section 10.3, Chapter 14, and in this chapter the promi- 
nence of phosphoryl group transfer from ATP to acceptor molecules. How 
is it that phosphate came to play such a prominent role in biology? Phosphate 
and its esters have several characteristics that render it useful for biochemical 
systems. First, phosphate esters have the important property of being ther- 
modynamically unstable while being kinetically stable. Phosphate esters are 
thus molecules whose energy release can be manipulated by enzymes. The 
stability of phosphate esters is due to the negative charges that make them 
resistant to hydrolysis in the absence of enzymes. This accounts for the 
presence of phosphate in the backbone of DNA. Furthermore, because 
phosphate esters are so kinetically stable, they make ideal regulatory mole- 
cules, added to proteins by kinases and removed only by phosphatases. 
Phosphates are also frequently added to metabolites that might otherwise 
diffuse through the cell membrane. Furthermore, even when transporters 
exist for unphosphorylated forms of a metabolite, the addition of a phos- 
phate changes the geometry and polarity of the molecules so that they no 
longer fit in the binding sites of the transporters. 

No other ions have the chemical characteristics of phosphate. Citrate is 
not sufficiently charged to prevent hydrolysis. Arsenate forms esters that 
are unstable and susceptible to spontaneous hydrolysis. Indeed, arsenate is 
poisonous to cells because it can replace phosphate in reactions required for 
ATP synthesis, generating unstable compounds and preventing ATP syn- 
thesis. Silicate is more abundant than phosphate, but silicate salts are virtu- 
ally insoluble, and in fact, are used for biomineralization. Only phosphate 
has the chemical properties to meet the needs of living systems. 


Energy from foodstuffs is extracted in three stages 


Let us take an overall view of the processes of energy conversion 
in higher organisms before considering them in detail in subse- 


LIPIDS POLYSACCHARIDES PROTEINS 


Fatty acids and Glucose and Amino acids 


quent chapters. Hans Krebs described three stages in the genera- 

tion of energy from the oxidation of foodstuffs (Figure 15.12). 
In the first stage, large molecules in food are broken down into 

smaller units in the process of digestion. Proteins are hydrolyzed 


Stage | 


glycerol other sugars J { { 
[| || —— 


to their 20 different amino acids, polysaccharides are hydro- 


lyzed to simple sugars such as glucose, and lipids are hydrolyzed 
Stage Il 


Oxidative 
phosphorylation 


H20 


to glycerol and fatty acids. The degradation products are then 
absorbed by the cells of the intestine and distributed through- 
out the body. This stage is strictly a preparation stage; no useful 
energy is captured in this phase. 

In the second stage, these numerous small molecules are degraded 
to a few simple units that play a central role in metabolism. In fact, 
most of them—sugars, fatty acids, glycerol, and several amino 
acids—are converted into the acetyl unit of acetyl CoA. Some 
ATP is generated in this stage, but the amount is small compared 
with that obtained in the third stage. 

In the third stage, ATP is produced from the complete oxidation 
of the acetyl unit of acetyl CoA. The third stage consists of the 
citric acid cycle and oxidative phosphorylation, which are the final 


Stage Ill 


common pathways in the oxidation of fuel molecules. Acetyl CoA 


FIGURE 15.12 Stages of catabolism. The extraction of brings acetyl units into the citric acid cycle [also called the tricar- 


energy from fuels can be divided into three stages. 


boxylic acid (TCA) cycle or Krebs cycle], where they are 


completely oxidized to CO,. Four pairs of electrons are transferred (three to 
NAD* and one to FAD) for each acetyl group that is oxidized. Then, a proton 
gradient is generated as electrons flow from the reduced forms of these carri- 
ers to O,,, and this gradient is used to synthesize ATP (Chapters 17 and 18). 


15.4 Metabolic Pathways Contain Many Recurring Motifs 


At first glance, metabolism appears intimidating because of the sheer num- 
ber of reactants and reactions. Nevertheless, there are unifying themes that 
make the comprehension of this complexity more manageable. These unify- 
ing themes include common metabolites, reactions, and regulatory schemes 
that stem from a common evolutionary heritage. 


Activated carriers exemplify the modular design and economy 
of metabolism 


We have seen that phosphory] transfer can be used to drive otherwise end- 
ergonic reactions, alter the energy of conformation of a protein, or serve asa 
signal to alter the activity of a protein. The phosphoryl-group donor in all of 
these reactions is ATP. In other words, ATP is an activated carrier of phos- 
phoryl groups because phosphoryl transfer from ATP is an exergonic process. 

The use of activated carriers is a recurring motif in biochemistry, and we 
will consider several such carriers here. Many such activated carriers func- 
tion as coenzymes (Section 8.1): 


1. Activated Carriers of Electrons for Fuel Oxidation. In aerobic organisms, 
the ultimate electron acceptor in the oxidation of fuel molecules is O,. 
However, electrons are not transferred directly to O,. Instead, fuel mole- 
cules transfer electrons to special carriers, which are either pyridine nucleo- 
tides or flavins. The reduced forms of these carriers then transfer their high- 
potential electrons to O,. 

Nicotinamide adenine dinucleotide is a major electron carrier in the ox1- 
dation of fuel molecules (Figure 15.13). The reactive part of NAD* is its 
nicotinamide ring, a pyridine derivative synthesized from the vitamin nia- 
cin. In the oxidation of a substrate, the nicotinamide ring of NAD* accepts a 
hydrogen ion and two electrons, which are equivalent to a hydride ion (H:~). 
The reduced form of this carrier is called NADH. In the oxidized form, the 
nitrogen atom carries a positive charge, as indicated by NAD*. NAD* is 
the electron acceptor in many reactions of the type 


OH I 
+ + 
ee + NAD wa + NADH + H 
H 


In this dehydrogenation, one hydrogen atom of the substrate is directly 
transferred to NAD*, whereas the other appears in the solvent as a proton. 
Both electrons lost by the substrate are transferred to the nicotinamide ring. 
The other major electron carrier in the oxidation of fuel molecules is the 
coenzyme flavin adenine dinucleotide (Figure 15.14). The abbreviations for 
the oxidized and reduced forms of this carrier are FAD and FADH,, 
respectively. FAD is the electron acceptor in reactions of the type 


Xe R R’ 


+ FADH2 


c=——C 
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H H H H 
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Reactive site 
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FIGURE 15.13 Structures of the oxidized 
forms of nicotinamide-derived electron 
carriers. Nicotinamide adenine dinucleotide 
(NAD*) and nicotinamide adenine 
dinucleotide phosphate (NADP*) are 
prominent carriers of high-energy electrons. 
In NADt, R = H; in NADP*, R = PO3-. 
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form of flavin adenine dinucleotide eo eNO eek 
(FAD). This electron carrier consists of a (9) 6 H 
flavin mononucleotide (FMN) unit (shown in 

blue) and an AMP unit (shown in black). HO OH 


The reactive part of FAD is its isoalloxazine ring, a derivative of the vitamin 
riboflavin (Figure 15.15). FAD, like NAD*, can accept two electrons. In 
doing so, FAD, unlike NAD*, takes up two protons. These carriers of high- 
potential electrons as well as flavin mononucleotide (FMN), an electron 
carrier similar to FAD but lacking the adenine nucleotide, will be consid- 
ered further in Chapter 18. 
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isoalloxazine ring component of Oxidized form Reduced form 
FAD and FADH,. (FAD) (FADH2) 


2. An Activated Carrier of Electrons for Reductive Biosynthesis. High- 
potential electrons are required in most biosyntheses because the precursors 
are more oxidized than the products. Hence, reducing power is needed in 
addition to ATP. For example, in the biosynthesis of fatty acids, a keto 
group is reduced to a methylene group in several steps. This sequence of 
reactions requires an input of four electrons. 


The electron donor in most reductive biosyntheses is NADPH, the reduced 
form of nicotinamide adenine dinucleotide phosphate (NADP*; Figure 
15.13). NADPH differs from NADH in that the 2’-hydroxyl group of its 
adenosine moiety is esterified with phosphate. NADPH carries electrons in 
the same way as NADH. However, NADPH is used almost exclusively for 
reductive biosyntheses, whereas NADH is used primarily for the generation of 
ATP. The extra phosphoryl group on NADPH 1s a tag that enables enzymes 
to distinguish between high-potential electrons to be used in anabolism and 
those to be used in catabolism. 
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3. An Activated Carrier of Two-Carbon Fragments. Coenzyme A, another 
central molecule in metabolism, is a carrier of acyl groups derived from the 
vitamin pantothenate (Figure 15.16). Acyl groups are important constitu- 
ents both in catabolism, as in the oxidation of fatty acids, and in anabolism, 
as in the synthesis of membrane lipids. The terminal sulfhydryl group in 
CoA is the reactive site. Acyl groups are linked to CoA by thioester bonds. 
The resulting derivative is called an acyl CoA. An acyl group often linked to 
CoA is the acetyl unit; this derivative is called acetyl CoA. The AG®’ for the 
hydrolysis of acetyl CoA has a large negative value: 


Acetyl CoA + H2O == acetate + CoA + H* 
AG?’ = —31.4kJ mol” !(—7.5 kcal mol‘) 


A thioester is thermodynamically more unstable than an oxygen ester 
because the electrons of the C=O bond cannot form resonance structures 
with the C—S bond that are as stable as those that they can form with the 
C—O bond. Consequently, acetyl CoA has a high acetyl-group-transfer 
potential because transfer of the acetyl group is exergonic. Acetyl CoA carries 
an activated acetyl group, just as ATP carries an activated phosphoryl group. 

The use of activated carriers illustrates two key aspects of metabolism. 
First, NADH, NADPH, and FADH, react slowly with O, in the absence 
of a catalyst. Likewise, ATP and acetyl CoA are hydrolyzed slowly (over 
many hours or even days) in the absence of a catalyst. These molecules are 
kinetically quite stable in the face of a large thermodynamic driving force for 
reaction with O, (in regard to the electron carriers) and H,O (for ATP and 
acetyl CoA). The kinetic stability of these molecules in the absence of specific 
catalysts is essential for their biological function because it enables enzymes to 
control the flow of free energy and reducing power. 

Second, most interchanges of activated groups in metabolism are accomplished 
by a rather small set of carriers (Table 15.2). The existence of a recurring set of 


TABLE 15.2 Some activated carriers in metabolism 


Carrier molecule in activated form Group carried Vitamin precursor 

ATP Phosphoryl 

NADH and NADPH Electrons Nicotinate (niacin) 
(vitamin B;) 

FADH, Electrons Riboflavin (vitamin B,) 

FMNH, Electrons Riboflavin (vitamin B,) 

Coenzyme A Acyl Pantothenate (vitamin B,) 

Lipoamide Acyl 

Thiamine pyrophosphate Aldehyde Thiamine (vitamin B,) 

Biotin co, Biotin (vitamin B.) 

Tetrahydrofolate One-carbon units Folate (vitamin B,) 

S-Adenosylmethionine Methyl 

Uridine diphosphate glucose Glucose 

Cytidine diphosphate diacylglycerol Phosphatidate 

Nucleoside triphosphates Nucleotides 


Note: Many of the activated carriers are coenzymes that are derived from water-soluble vitamins. 
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FIGURE 15.16 Structure of coenzyme 
A (CoA-SH). 
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Oxygen esters are stabilized by resonance 
structures not available to thioesters. 
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F Typical reaction Consequences of 
t dD ; : a 
Gancepre auc Design Vitamin Coenzyme type deficiency 
Thiamine (B,) Thiamine Aldehyde transfer Beriberi (weight loss, 
pyrophosphate heart problems, 
neurological dysfunction) 
Riboflavin (B,) Flavin adenine Oxidation—-reduction Cheliosis and angular 
dinucleotide (FAD) stomatitis (lesions of the 
mouth), dermatitis 
Pyridoxine (B,) Pyridoxal phosphate Group transfer toor Depression, confusion, 
from amino acids convulsions 
Nicotinic acid Nicotinamide adenine Oxidation-reduction Pellagra (dermatitis, 
(niacin) (B;) dinucleotide (NAD*) depression, diarrhea) 
Pantothenic acid (B,) Coenzyme A Acyl-group transfer Hypertension 
Biotin (B,) Biotin—lysine ATP-dependent Rash about the 
adducts (biocytin) carboxylation and eyebrows, muscle 
carboxyl-group pain, fatigue (rare) 
transfer 
Folic acid (By) Tetrahydrofolate Transfer of one- Anemia, neural-tube 


carbon components; defects in development 
thymine synthesis 


B 5’-Deoxyadenosyl Transfer of methyl | Anemia, pernicious 
12 ve y) y. 12 : 
cobalamin groups; anemia, methylmalonic 
intramolecular acidosis 
rearrangements 


activated carriers in all organisms is one of the unifying motifs of biochemistry. 
Furthermore, it illustrates the modular design of metabolism. A small set of 
molecules carries out a very wide range of tasks. Metabolism is readily com- 
prehended because of the economy and elegance of its underlying design. 


Many activated carriers are derived from vitamins 


Almost all the activated carriers that act as coenzymes are derived from 
vitamins. Vitamins are organic molecules that are needed in small amounts 
in the diets of some higher animals. Table 15.3 lists the vitamins that act as 
coenzymes and Figure 15.17 shows the structures of some of them. This 
series of vitamins is known as the vitamin B group. In all cases, the vitamin 
must be modified before it can serve its function. We have already touched 
on the roles of niacin, riboflavin, and pantothenate. We will see these three 
and the other B vitamins many times in our study of biochemistry. 


e Vitamins serve the same roles in nearly all forms of life, but higher 

animals lost the capacity to synthesize them in the course of evolution. 
For instance, whereas E. coli can thrive on glucose and organic salts, human 
beings require at least 12 vitamins in their diet. The biosynthetic pathways for 


CH20OH 
H 
O. No HOH2C Z OH 
re) O HzC CH; SN CH; 
H 
Vitamin B; ra Vitamin Bz Vitamin Bg 
(Pantothenate) H—+—OH (Niacin) (Pyridoxine) 
H——-OH 
FIGURE 15.17 Structures of some 
of the B vitamins. These vitamins H———OH 
are often referred to as water-soluble CH20H 
vitamins because of the ease with Vitamin B> 


which they dissolve in water. (Riboflavin) 


TABLE 15.4 Noncoenzyme vitamins 
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Vitamin Function Deficiency 
A Roles in vision, growth, Night blindness, cornea 
reproduction damage, damage to respiratory 
and gastrointestinal tract 
C (ascorbic acid) Antioxidant Scurvy (swollen and bleeding 
gums, subdermal hemorrhaging) 
D Regulation of calcium Rickets (children): skeletal deformities, 
and phosphate metabolism impaired growth 
Osteomalacia (adults): soft, bending 
bones 
E Antioxidant Lesions in muscles and nerves (rare) 
K Blood coagulation Subdermal hemorrhaging 


vitamins can be complex; thus, it is biologically more efficient to ingest vita- 
mins than to synthesize the enzymes required to construct them from simple 
molecules. This efficiency comes at the cost of dependence on other organ- 
isms for chemicals essential for life. Indeed, vitamin deficiency can generate 
diseases in all organisms requiring these molecules (Tables 15.3 and 15.4). 
Not all vitamins function as coenzymes. Vitamins designated by the letters 
A, C, D, E, and K (Figure 15.18 and Table 15.4) have a diverse array of func- 
tions. Vitamin A (retinol) is the precursor of retinal, the light-sensitive group in 
rhodopsin and other visual pigments (Section 32.3), and retinoic acid, an 
important signaling molecule. A deficiency of this vitamin leads to night blind- 
ness. In addition, young animals require vitamin A for growth. Vitamin C, or 
ascorbate, acts as an antioxidant. A deficiency in vitamin C results in the for- 
mation of unstable collagen molecules and is the cause of scurvy, a disease 
characterized by skin lesions and blood-vessel fragility (Section 27.6). A metab- 
olite of vitamin D is a hormone that regulates the metabolism of calcium and 
phosphorus. A deficiency in vitamin D impairs bone formation in growing 
animals. Vitamin E (a-tocopherol) deficiency causes a variety of neuromuscular 
pathologies. This vitamin inactivates reactive oxygen species such as hydroxyl 
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HO” OH These vitamins are often called 
the fat-soluble vitamins because 


1,25-Dihydroxyvitamin D; (Calcitriol) of their hydrophobic nature. 
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TABLE 15.5 Types of chemical reactions in metabolism 


Type of reaction Description 

Oxidation—-reduction Electron transfer 

Ligation requiring ATP cleavage Formation of covalent bonds (i.e., carbon-carbon bonds) 
Isomerization Rearrangement of atoms to form isomers 

Group transfer Transfer of a functional group from one molecule to another 
Hydrolytic Cleavage of bonds by the addition of water 


Carbon bond cleavage by means ‘Two substrates yielding one product or vice versa. When 
other than hydrolysis or oxidation | _H,O or CO, are a product, a double bond is formed. 


radicals before they can oxidize unsaturated membrane lipids, damaging cell 
structures. Vitamin K is required for normal blood clotting (Section 10.4). 


Key reactions are reiterated throughout metabolism 


Just as there is an economy of design in the use of activated carriers, so is 
there an economy of design in biochemical reactions. The thousands of 
metabolic reactions, bewildering at first in their variety, can be subdivided 
into just six types (Table 15.5). Specific reactions of each type appear 
repeatedly, reducing the number of reactions that a student needs to learn. 


1. Oxidation—reduction reactions are essential components of many path- 
ways. Useful energy is often derived from the oxidation of carbon com- 
pounds. Consider the following two reactions: 


O oO 
- | H - | H 
-C c O C c O 
Oo “ Nc 4 FAD === 07 ad Nc + FADHD (1) 
2 i; poe 
fe) 2 0) 
Succinate Fumarate 
O oO 
me 2 6 - , 2 6 
oN + NADY a OC NADH + HY 2) 
Ho 4 I L 
Malate Oxaloacetate 


These two oxidation—reduction reactions are components of the citric acid 
cycle (Chapter 17), which completely oxidizes the activated two-carbon 
fragment of acetyl CoA to two molecules of CO,. In reaction 1, FADH, 
carries the electrons, whereas, in reaction 2, electrons are carried by NADH. 


2. Ligation reactions form bonds by using free energy from ATP cleavage. 
Reaction 3 illustrates the ATP-dependent formation of a carbon-carbon 
bond, necessary to combine smaller molecules to form larger ones. 
Oxaloacetate is formed from pyruvate and CO,. 


it 
C .O 
HCO ae + CO) + ATP + HO ——= 
0) 0 7 
Pyruvate a | | 
PS Cc .O 
Oo NCO NCH + ADP + P+ Ht 3) 
H2 |! — 


fe) 


Oxaloacetate 


The oxaloacetate can be used in the citric acid cycle, or converted into glu- 
cose or amino acids such as aspartic acid. 


3. Isomerization reactions rearrange particular atoms within a molecule. 
Their role is often to prepare the molecule for subsequent reactions such as 
the oxidation—reduction reactions described in point 1. 


HQ CcoOo- H COoOo- 
ne ae ged OO ee «a 
H> /\ H> 4%, 
H H H OH 
Citrate Isocitrate 


Reaction 4 is, again, acomponent of the citric acid cycle. This isomerization 
prepares the molecule for subsequent oxidation and decarboxylation reac- 
tions by moving the hydroxyl group of citrate from a tertiary to a secondary 
position. 


4. Group-transfer reactions play a variety of roles. Reaction 5 is repre- 
sentative of such a reaction. A phosphoryl group is transferred from the 
activated phosphoryl-group carrier, ATP, to glucose, the initial step in 
glycolysis, akey pathway for extracting energy from glucose (Chapter 16). 
This reaction traps glucose in the cell so that further catabolism can 
take place. 


O fe) fe) 
CH,OH | | i 
O ony? <P adenine 
0 / ~o/ ~o fe) 
OH + oO O (@) —— 
HO OH 
OH 
HO OH 
Glucose ATP 
a - _ 
ye) ~ O 0 
ger j | 
° [\\ P i 
0G OD a7 ee adenine 
J oi ~o O 
OH + O (@) (5) 
HO OH 
OH 
HO OH 
Glucose 6-phosphate ADP 
(G-6P) 


As stated earlier, group-transfer reactions are used to synthesize ATP. We 
also saw examples of their use in signaling pathways (Chapter 14). 


5. Hydrolytic reactions cleave bonds by the addition of water. Hydrolysis 
is a common means of degrading large molecules, either to facilitate 
further metabolism or to reuse some of the components for biosynthetic 
purposes. Proteins are digested by hydrolytic cleavage (chapters 9 
and 10). Reaction 6 illustrates the hydrolysis of a peptide to yield two 
smaller peptides. 
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6. Carbon bonds can be cleaved by means other than hydrolysis or oxidation, 
with two substrates yielding one product or vice versa. When CO, or H,O is 
released, a double bond is formed. The enzymes that catalyze these types of 
reaction are classified as lyases. An important example, illustrated in reaction 
7, is the conversion of the six-carbon molecule fructose 1,6-bisphosphate 
into two three-carbon fragments: dihydroxyacetone phosphate and glyceral- 
dehyde 3-phosphate. 


2- 
O — _7CH20PO3 
Oo CH,0PO37- H 0 
HO—C—H ial vy 

H—C—0OH HO—cC—H + Tl —OH (7) 
es | —OH i CH,0PO32- 

CH,OPO32- 

Fructose 1,6-bisphosphate Dihydroxyacetone phosphate Glyceraldehyde 3-phosphate 
(F-1,6-BP) (DHAP) (GAP) 


This reaction is a critical step in glycolysis (Chapter 16). Dehydrations to form 
double bonds, such as the formation of phosphoenolpyruvate (Figure 15.6) 
from 2-phosphoglycerate (reaction 8), are important reactions of this type. 


O..~.0 O.. ~ .0 
Sc* ~ 
iS 8) 
H—C—OPO3? H Cc + HO ( 
Ce “gage 
CH,0H | 
H 
2-Phosphoglycerate Phosphoenolpyruvate 
(PEP) 


The dehydration sets up the next step in the pathway, a group-transfer reac- 
tion that uses the high phosphoryl-transfer potential of the product PEP to 
form ATP from ADP. 

These six fundamental reaction types are the basis of metabolism. 
Remember that all six types can proceed in either direction, depending on 
the standard free energy for the specific reaction and the concentrations of 
the reactants and products inside the cell. An effective way to learn is to look 
for commonalities in the diverse metabolic pathways that we will be examin- 
ing. There is a chemical logic that, when exposed, renders the complexity of 
the chemistry of living systems more manageable and reveals its elegance. 


Metabolic processes are regulated in three principal ways 


It is evident that the complex network of metabolic reactions must be rigor- 
ously regulated. The levels of available nutrients must be monitored and the 
activity of metabolic pathways must be altered and integrated to create 
homeostasis, a stable biochemical environment. At the same time, metabolic 
control must be flexible, able to adjust metabolic activity to the constantly 
changing external environments of cells. Figure 15.19 illustrates the nutrient 
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pools and their connections that must be monitored and regulated. 
Metabolism is regulated through control of (1) the amounts of enzymes, 
(2) their catalytic activities, and (3) the accessibility of substrates. 


Controlling the amounts of enzymes. The amount of a particular enzyme 
depends on both its rate of synthesis and its rate of degradation. The level of 
many enzymes is adjusted by a change in the rate of transcription of the 
genes encoding them (chapters 29 and 31). In E. coli, for example, the pres- 
ence of lactose induces within minutes a more than 50-fold increase in the 
rate of synthesis of B-galactosidase, the enzyme required for the breakdown 
of this disaccharide. 


Controlling catalytic activity. The catalytic activity of enzymes is controlled 
in several ways. Allosteric control is especially important. For example, the 
first reaction in many biosynthetic pathways is allosterically inhibited by the 
ultimate product of the pathway. The inhibition of aspartate transcarbamo- 
ylase by cytidine triphosphate (Section 10.1) is a well-understood example of 
feedback inhibition. This type of control can be almost instantaneous. Another 
recurring mechanism is reversible covalent modification (Section 10.3). For 
example, glycogen phosphorylase, the enzyme catalyzing the breakdown of 
glycogen, a storage form of sugar, is activated by the phosphorylation of a 
particular serine residue when glucose is scarce (Section 21.1). 
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FIGURE 15.19 Homeostasis. Maintaining 
a constant cellular environment requires 
complex metabolic regulation that 
coordinates the use of nutrient pools. 
[Information from D. U. Silverthorn, Human 
Physiology: An Integrated Approach, 3rd ed. 
(Pearson, 2004), Figure 22-2.] 
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FIGURE 15.20 Energy charge regulates 
metabolism. When the energy charge is 
high, ATP inhibits the relative rates of a 
typical ATP-generating (catabolic) pathway 
and stimulates the typical ATP-utilizing 
(anabolic) pathway. 


1 


Hormones coordinate metabolic relations between different tissues, often 
by regulating the reversible modification of key enzymes. For instance, the 
hormone epinephrine triggers a signal-transduction cascade in muscle, 
resulting in the phosphorylation and activation of key enzymes and leading 
to the rapid degradation of glycogen to glucose, which is then used to supply 
ATP for muscle contraction. As described in Chapter 14, many hormones 
act through intracellular messengers, such as cyclic AMP and calcium ion, 
that coordinate the activities of many target proteins. 

Many reactions in metabolism are controlled by the energy status of the 
cell. One index of the energy status is the energy charge, which is propor- 
tional to the mole fraction of ATP plus half the mole fraction of ADP, given 
that ATP contains two anhydride bonds, whereas ADP contains one. 
Hence, the energy charge is defined as 


[ATP] + %[ADP] 
[ATP] + [ADP] + [AMP] 


Energy charge = 


The energy charge can have a value ranging from 0 (all AMP) to 1 (all ATP). 
ATP-generating (catabolic) pathways are inhibited by a high energy charge, 
whereas ATP-utilizing (anabolic) pathways are stimulated by a high energy 
charge. In plots of the reaction rates of such pathways versus the energy 
charge, the curves are steep near an energy charge of 0.9, where they usually 
intersect (Figure 15.20). It is evident that the control of these pathways has 
evolved to maintain the energy charge within rather narrow limits. In other 
words, the energy charge, like the pH of a cell, is buffered. The energy charge 
of most cells ranges from 0.90 to 0.95, but can fall to less than 0.7 in muscle 
during high-intensity exercise. An alternative index of the energy status is 
the phosphorylation potential, which is defined as 


[ATP] 


Phosphorylati ial = ———_—__ 
osphorylation potentia [ADP] + [P] 


The phosphorylation potential, in contrast with the energy charge, depends 
on the concentration of P. and is directly related to the free-energy storage 
available from ATP. 


Controlling the accessibility of substrates. Controlling the availability of 
substrates is another means of regulating metabolism in all organisms. For 
instance, glucose breakdown can take place in many cells only if insulin is 
present to promote the entry of glucose into the cell. In eukaryotes, metabolic 
regulation and flexibility are enhanced by compartmentalization. The trans- 
fer of substrates from one compartment of a cell to another can serve as a 
control point. For example, fatty acid oxidation takes place in mitochondria, 
whereasfatty acid synthesis takes placeinthecytoplasm. Compartmentalization 
segregates opposed reactions. 


Aspects of metabolism may have evolved from an RNA world 


wy How did the complex pathways that constitute metabolism evolve? 
The current thinking is that RNA was an early biomolecule that 
dominated metabolism, serving both as a catalyst and an information stor- 
age molecule. This hypothetical time is called the RNA world. 

Why do activated carriers such as ATP, NADH, FADH,, and coenzyme 
A contain adenosine diphosphate units? A possible explanation is that these 
molecules evolved from the early RNA catalysts. Non-RNA units such as the 
isoalloxazine ring may have been recruited to serve as efficient carriers of acti- 
vated electrons and chemical units, a function not readily performed by RNA 
itself. We can picture the adenine ring of FADH, binding to a uracil unit in a 


niche of an RNA enzyme (ribozyme) by base-pairing, whereas the isoalloxa- 
zine ring protrudes and functions as an electron carrier. When the more versa- 
tile proteins replaced RNA as the major catalysts, the ribonucleotide coen- 
zymes stayed essentially unchanged because they were already well suited to 
their metabolic roles. The nicotinamide unit of NADH, for example, can 
readily transfer electrons irrespective of whether the adenine unit interacts 
with a base in an RNA enzyme or with amino acid residues in a protein 
enzyme. With the advent of protein enzymes, these important cofactors 
evolved as free molecules without losing the adenosine diphosphate vestige of 
their RNA-world ancestry. That molecules and motifs of metabolism are com- 
mon to all forms of life testifies to their common origin and to the retention of 
functioning modules through billions of years of evolution. Our understand- 
ing of metabolism, like that of other biological processes, is enriched by inquiry 
into how these beautifully integrated patterns of reactions came into being. 


SUMMARY 


All cells transform energy. They extract energy from their environment and 
use this energy to convert simple molecules into cellular components. 


15.1 Metabolism Is Composed of Many Coupled, Interconnecting Reactions 
The process of energy transduction takes place through metabolism, a 
highly integrated network of chemical reactions. Metabolism can be 
subdivided into catabolism (reactions employed to extract energy 
from fuels) and anabolism (reactions that use this energy for biosyn- 
thesis). The most valuable thermodynamic concept for understanding 
bioenergetics is free energy. A reaction can occur spontaneously only 
if the change in free energy (AG) is negative. A thermodynamically 
unfavorable reaction can be driven by a thermodynamically favorable 
one, which is the hydrolysis of ATP in many cases. 


15.2 ATP Is the Universal Currency of Free Energy in Biological Systems 
The energy derived from catabolism is transformed into adenosine 
triphosphate. ATP hydrolysis is exergonic and the energy released can 
be used to power cellular processes, including motion, active transport, 
and biosynthesis. Under cellular conditions, the hydrolysis of ATP 
shifts the equilibrium of a coupled reaction by a factor of 10°. ATP, the 
universal currency of energy in biological systems, is an energy-rich 
molecule because it contains two phosphoanhydride bonds. 


15.3 The Oxidation of Carbon Fuels Is an Important Source of Cellular Energy 
ATP formation is coupled to the oxidation of carbon fuels, either 
directly or through the formation of ion gradients. Photosynthetic 
organisms can use light to generate such gradients. ATP is consumed 
to drive endergonic reactions and in signal-transduction processes. 
The extraction of energy from foodstuffs by aerobic organisms com- 
prises three stages. In the first stage, large molecules are broken down 
into smaller ones, such as amino acids, sugars, and fatty acids. In the 
second stage, these small molecules are degraded to a few simple units, 
such as acetyl CoA, that have widespread roles in metabolism. The 
third stage of metabolism is the citric acid cycle and oxidative phos- 
phorylation, in which ATP is generated as electrons flow to O,, the 
ultimate electron acceptor, and fuels are completely oxidized to CO,. 


15.4 Metabolic Pathways Contain Many Recurring Motifs 
Metabolism is characterized by common motifs. A small number of 
recurring activated carriers, such as ATP, NADH, and acetyl CoA, 
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transfer activated groups in many metabolic pathways. NADPH, 
which carries two electrons at a high potential, provides reducing 
power for reductive biosynthesis of cell components. Many activated 
carriers are derived from vitamins, small organic molecules required in 
the diets of many higher organisms. Moreover, key reaction types are 
used repeatedly in metabolic pathways. 

Metabolism is regulated in a variety of ways. The amounts of some 
critical enzymes are controlled by regulation of the rate of synthesis 
and degradation. In addition, the catalytic activities of many enzymes 
are regulated by allosteric interactions and by covalent modification. 
The movement of many substrates into cells and subcellular compart- 
ments also is controlled. The energy charge, which depends on the 
relative amounts of ATP, ADP, and AMP, plays a role in metabolic 
regulation. A high energy charge inhibits ATP-generating (catabolic) 
pathways, whereas it stimulates ATP-utilizing (anabolic) pathways. 


KEY TERMS 


metabolism or intermediary metabolism 


phototroph (p. 424) 
chemotroph (p. 424) 
catabolism (p. 424) 
anabolism (p. 424) 
amphibolic pathway (p. 425) 


vitamin (p. 438) 


PROBLEMS 


adenosine triphosphate (ATP) (p. 426) 
(p. 423) phosphoryl-transfer potential (p. 429) 
activated carrier (p. 435) 


oxidation—reduction reaction (p. 440) 
ligation reaction (p. 440) 


isomerization reaction (p. 441) 


group-transfer reaction (p. 441) 
hydrolytic reaction (p. 441) 


cleavage of carbon bonds by means other 
than hydrolysis or oxidation (p. 442) 


lyase (p. 442) 
energy charge (p. 444) 
phosphorylation potential (p. 444) 


1. Complex patterns. What is meant by intermediary 
metabolism? 


2. Opposites. Differentiate between anabolism and 
catabolism. 


3. Graffiti. While walking to biochemistry class with a 
friend, you see the following graffiti spray painted on the 
wall of the science building: ‘““When a system is in equilib- 
rium, the Gibbs free energy is maximum.” You are dis- 
gusted, not only at the vandalism, but at the ignorance of 
the vandal. Your friend asks you to explain. 


4. Why bother to eat? What are the three primary uses for 
cellular energy? 


5. Match ’em. 

1. Cellular energy currency____—s a. NAD* 

2. Anabolic electron carrier____—s— bb. Coenzyme A 

3. Phototroph _____ c. Precursor to 

coenzymes 

4. Catabolic electron carrier d. Yields energy 
reaction 

5. Oxidation-reduction e. Requires energy 
reaction 

6. Activated carrier of two f. ATP 


carbon fragments 
7. Vitamin g. Transfers electrons 


h. NADPt 

i. Converts light energy 
to chemical energy 

j. Used in anabolism 
and catabolism 


8. Anabolism 
9. Amphibolic reaction 


10. Catabolism 


6. Charges. In vivo, ATP is usually bound to magnesium or 
manganese ions. Why is this the case? 


7. Energy to burn. What factors account for the high 
phosphoryl-transfer potential of nucleoside triphosphates? 


8. Back in time. Account for the fact that ATP, and not 
another nucleoside triphosphate, is the cellular energy 
currency. 


9. Currency issues. Why does it make good sense to have a sin- 
gle nucleotide, ATP, function as the cellular energy currency? 


10. Environmental conditions. The standard free energy of 
hydrolysis for ATP is —30.5 kJ mol~! (—7.3 kcal mol~!). 


ATP + H,O0 == ADP + P; 


What conditions might be changed to alter the free energy 
of hydrolysis? 


11. Brute force? Metabolic pathways frequently contain 
reactions with positive standard free energy values, yet the 
reactions still take place. How is this possible? 


12. Energy flow. What is the direction of each of the follow- 
ing reactions when the reactants are initially present in 
equimolar amounts? Use the data given in Table 15.1. 


(a) ATP + creatine == creatine phosphate + ADP 

(b) ATP + glycerol = — glycerol 3-phosphate + ADP 
(c) ATP + pyruvate = — phosphoenolpyruvate + ADP 
d) ATP + glucose = — glucose 6-phosphate + ADP 


13. A proper inference. What information do the AG®’ data 
given in Table 15.1 provide about the relative rates of 
hydrolysis of pyrophosphate and acetyl phosphate? 


14. A potent donor. Consider the following reaction: 
ATP + pyruvate = = phosphoenolpyruvate + ADP 


(a) Calculate AG°’ and K’ eq at 25°C for this reaction by 
using the data given in Table 15.1. 


(b) What is the equilibrium ratio of pyruvate to phospho- 
enolpyruvate if the ratio of ATP to ADP is 10? 


15. Isomeric equilibrium. Using the information in Table 15.1, 
calculate AG®’ for the isomerization of glucose 6-phosphate to 
glucose 1-phosphate. What is the equilibrium ratio of glucose 
6-phosphate to glucose 1-phosphate at 25°C? 


16. Activated acetate. The formation of acetyl CoA from 
acetate is an ATP-driven reaction: 


Acetate + ATP + CoA = — acetyl CoA + AMP + PP; 


(a) Calculate AG®’ for this reaction by using data given in 
this chapter. 


(b) The PP. formed in the preceding reaction is rapidly 
hydrolyzed in vivo because of the ubiquity of inorganic 
pyrophosphatase. The AG* for the hydrolysis of PP. is 
—19.2 kJ mol~! (—4.6 kcal mol~!). Calculate the AG® for 
the overall reaction, including pyrophosphate hydrolysis. 
What effect does the hydrolysis of PP. have on the forma- 
tion of acetyl CoA? 


17. Acid strength. The pK of an acid is a measure of its 
proton-group-transfer potential. 

(a) Derive a relation between AG®’ and pK. 

(b) What is the AG” for the ionization of acetic acid, which 
has a pK of 4.8? 


18. Raison d’étre. The muscles of some invertebrates are 
rich in arginine phosphate (phosphoarginine). Propose a 
function for this amino acid derivative. 


Arginine phosphate 
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19. Recurring motif. What is the structural feature common 


to ATP, FAD, NAD?, and CoA? 


20. Ergogenic help or hindrance? Creatine is a popular, but 
untested, dietary supplement. 


(a) What is the biochemical rationale for the use of creatine? 


(b) What type of exercise would most benefit from creatine 
supplementation? 


21. Standard conditions versus real life 1. The enzyme aldolase 
catalyzes the following reaction in the glycolytic pathway: 


. Aldolase 
Fructose 1, 6-bisphosphate === 


dihydroxyacetone phosphate + 
glyceraldehyde 3-phosphate 


The AG®’ for the reaction is +23.8 kJ mol~! (+5.7 kcal 
mol!), whereas the AG in the cell is —1.3 kJ mol7! 
(—0.3 kcal mol~!). Calculate the ratio of reactants to prod- 
ucts under equilibrium and intracellular conditions. Using 
your results, explain how the reaction can be endergonic 
under standard conditions and exergonic under intracellu- 
lar conditions. 


22. Standard conditions versus real life 2. On page 428, 
we showed that a reaction, A == B, witha AG®’ = +16.7 
kJ mol~! (+4.0 kcal mol!) has an Ki, of 1.15 X 1073, 
The Aa is increased to 2.67 X 10? if the reaction is 
coupled to ATP hydrolysis under standard conditions. 
The ATP-generating system of cells maintains the 
[ATP]/[ADP][P.] ratio at a high level, typically of the order 
of 500 M~!. Calculate the ratio of B/A under cellular 


conditions. 


23. Not all alike. The concentrations of ATP, ADP, and 
P. differ with cell type. Consequently, the release of free 
energy with the hydrolysis of ATP will vary with cell 
type. Using the following table, calculate the AG for the 
hydrolysis of ATP in liver, muscle, and brain cells. In 
which cell type is the free energy of ATP hydrolysis 
most negative? 


ATP (mM) ADP (mM) P; (mM) 
Liver 3.5 1.8 5.0 
Muscle 8.0 0.9 8.0 


Brain 2.6 0.7 2.7 


24. Oxidation issues. Examine the pairs of molecules and 
identify the more-reduced molecule in each pair. 


H~| | 
CHz CH; 
(a) Ethanol Acetaldehyde 
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25. Running downhill. Glycolysis is a series of 10 linked 
reactions that convert one molecule of glucose into two mol- 
ecules of pyruvate with the concomitant synthesis of two 
molecules of ATP (Chapter 16). The AG®’ for this set of 
reactions is —35.6 kJ mol~! (—8.5 kcal mol~!), whereas the 
AG is —90 kJ mol! (—22 kcal mol~!). Explain why the 
free-energy release is so much greater under intracellular 
conditions than under standard conditions. 


26. Outsourcing. Outsourcing, a common business practice, 
is contracting with another business to perform a particular 
function. Higher organisms were the original outsourcers, 
frequently depending on lower organisms to perform key 
biochemical functions. Give an example from this chapter 
of biochemical outsourcing. 


27. Breakdown products. Digestion is the first stage in the 
extraction of energy from food, but no useful energy is 


acquired during this stage. Why is digestion considered a 
stage in energy extraction? 


28. High-energy electrons. What are the activated electron 
carriers for catabolism? For anabolism? 


29. Less reverberation. Thioesters, common in biochemis- 
try, are more unstable (energy-rich) than oxygen esters. 
Explain why this is the case. 


30. Classifying reactions. What are the six common types of 
reactions seen in biochemistry? 


31. Staying in control. What are the three principal means 
of controlling metabolic reactions? 


Chapter Integration Problems 


32. Kinetic versus thermodynamic. The reaction of NADH 
with oxygen to produce NAD* and H,O is very exergonic, 
yet the reaction of NADH and oxygen takes place very 
slowly. Why does a thermodynamically favorable reaction 
not occur rapidly? 


33. Activated sulfate. Fibrinogen contains tyrosine-O- 
sulfate. Propose an activated form of sulfate that could react 
in vivo with the aromatic hydroxyl] group of a tyrosine resi- 
due in a protein to form tyrosine-O-sulfate. 


Data Interpretation Problem 


34. Opposites attract. The following graph shows how the 
AG for the hydrolysis of ATP varies as a function of the 
Mg?™ concentration (pMg = —log[Mg?*]). 


36 — + 8.6 
iz + 8.4 
35 -_ 
ap is 7482 7 
ie x 
oo: so 2 
EOP — 
2 BP + 78 g 
(e) 776 ~ 
q 32 - Oo 
] - 474 4 
31 are 
pl to 
123 45 67 


(a) How does decreasing [Mg?*] affect the AG of hydroly- 
sis for ATP? 


(b) Explain this effect. 


CHAPTER 


Glycolysis and Gluconeogenesis 


Usain Bolt sprints to a win in the 200-meter finals at the 
Olympics in London in 2012. Glucose metabolism can 
generate the ATP to power muscle contraction. During a 
sprint, when the ATP needs outpace oxygen delivery, as 
would be the case for Bolt, glucose is metabolized to 
lactate. When oxygen delivery is adequate, glucose is 
metabolized more efficiently to carbon dioxide and 
water. [Christophe Karaba/epa/Corbis.] 


Glycolysis A. Low O> 
Glucose > Pyruvate (last seconds of a sprint) 
B. Normal 
(long slow run) 
ATP 
Mitochondrion ATP 
— Cytoplasm 
Muscle fiber 


OUTLINE 
he first metabolic pathway that we encounter is glycolysis, an ancient 
pathway employed by a host of organisms. Glycolysis is the sequence of 16.1 Glycolysis Is an Energy-Conversion 
reactions that metabolizes one molecule of glucose to two molecules of pyruvate Pathway in Many Organisms 


with the concomitant net production of two molecules of ATP. This process is 
anaerobic (i.e., it does not require O.,) because it evolved before substantial 
amounts of oxygen accumulated in the atmosphere. Pyruvate can be further 
processed anaerobically to lactate (lactic acid fermentation) or ethanol (alco- 
holic fermentation). Under aerobic conditions, pyruvate can be completely 


16.2 The Glycolytic Pathway Is Tightly 
Controlled 


16.3 Glucose Can Be Synthesized 
from Noncarbohydrate Precursors 


oxidized to CO,, generating much more ATP, as will be described in 16.4 Gluconeogenesis and Glycolysis 
chapters 17 and 18. Figure 16.1 shows some possible fates of pyruvate pro- Are Reciprocally Regulated 
duced by glycolysis. 


Because glucose is such a precious fuel, metabolic products, such as 
pyruvate and lactate, are salvaged to synthesize glucose in the process of 
gluconeogenesis. Although glycolysis and gluconeogenesis have some 
enzymes in common, the two pathways are not simply the reverse of each 
other. In particular, the highly exergonic, irreversible steps of glycolysis are 
bypassed in gluconeogenesis. The two pathways are reciprocally regulated Glycolysis 
so that glycolysis and gluconeogenesis do not take place simultaneously in Derived irom the Greer stom give, “Sweet,” 


pee and the word lysis, “dissolution.” 
the same cell to a significant extent. ee eee 
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Gluconeogenesis 


Enzyme 

A term coined by Friedrich Wilhelm Kihne in 
1878 to designate catalytically active 
substances that had formerly been called 
ferments. Derived from the Greek words en, 
“in,” and zyme, “leaven.” 


FERMENTATION COMPLETE 
OXIDATION 


Glycolysis 


FIGURE 16.1 Some fates of glucose. 


Our understanding of glucose metabolism, especially glycolysis, has a 
rich history. Indeed, the development of biochemistry and the delineation of 
glycolysis went hand in hand. A key discovery was made by Hans and 
Eduard Buchner in 1897, quite by accident. The Buchners were interested 
in manufacturing cell-free extracts of yeast for possible therapeutic use. 
These extracts had to be preserved without the use of antiseptics such as 
phenol, and so they decided to try sucrose, a commonly used preservative in 
kitchen chemistry. They obtained a startling result: sucrose was rapidly 
fermented into alcohol by the yeast juice. The significance of this finding 
was immense. The Buchners demonstrated for the first time that fermentation 
could take place outside living cells. The accepted view of their day, asserted 
by Louis Pasteur in 1860, was that fermentation is inextricably tied to living 
cells. The chance discovery by the Buchners refuted this dogma and opened 
the door to modern biochemistry. The Buchners’ discovery inspired the 
search for the biochemicals that catalyze the conversion of sucrose into alco- 
hol. The study of metabolism became the study of chemistry. 

Studies of muscle extracts then showed that many of the reactions of lac- 
tic acid fermentation were the same as those of alcoholic fermentation. This 
exciting discovery revealed an underlying unity in biochemistry. The complete 
glycolytic pathway was elucidated by 1940. Glycolysis is also known as the 
Embden—Meyerhof pathway, after two pioneers of research on glycolysis. 


Glucose is generated from dietary carbohydrates 


We typically consume in our diets a generous amount of starch and a smaller 
amount of glycogen. These complex carbohydrates must be converted into 
simpler carbohydrates for absorption by the intestine and transport in the 
blood. Starch and glycogen are digested primarily by the pancreatic enzyme 
a-amylase and to a lesser extent by salivary w-amylase. Amylase cleaves the 
a-1,4 bonds of starch and glycogen, but not the a-1,6 bonds. The products 
are the di- and trisaccharides maltose and maltotriose. The material not 
digestible because of the a-1,6 bonds is called the limit dextrin. 

Maltase cleaves maltose into two glucose molecules, whereas a-glucosidase 
digests maltotriose and any other oligosaccharides that may have escaped 
digestion by the amylase. a-Dextrinase further digests the limit dextrin. 
Maltase and a-glucosidase are located on the surface of the intestinal 
cells, as is sucrase, an enzyme that degrades the sucrose contributed by 
vegetables to fructose and glucose. The enzyme lactase is responsible for 
degrading the milk sugar lactose into glucose and galactose. The mono- 
saccharides are transported into the cells lining the intestine and then into 
the bloodstream. 


Glucose is an important fuel for most organisms 


aa Glucose is a common and important fuel. In mammals, glucose is the 

only fuel that the brain uses under nonstarvation conditions and the 
only fuel that red blood cells can use at all. Indeed, almost all organisms use 
glucose, and most that do process it in a similar fashion. Recall from 
Chapter 11 that there are many carbohydrates. Why is glucose instead of 
some other monosaccharide such a prominent fuel? We can speculate on the 
reasons. First, glucose is one of several monosaccharides formed from form- 
aldehyde under prebiotic conditions, and so it may have been available as a 
fuel source for primitive biochemical systems. Second, glucose has a low 
tendency, relative to other monosaccharides, to nonenzymatically glycosyl- 
ate proteins. In their open-chain forms, monosaccharides contain carbonyl 
groups that can react with the amino groups of proteins to form Schiff bases, 
which rearrange to form a more stable amino-ketone linkage (p. 321). Such 
nonspecifically modified proteins often do not function effectively. Glucose 
has a strong tendency to exist in the ring conformation and, consequently, 
relatively little tendency to modify proteins. Recall that all the hydroxyl 
groups in the ring conformation of B-glucose are equatorial, contributing to 
the sugar’s high relative stability (p. 320). 


16.1 Glycolysis Is an Energy-Conversion Pathway 
in Many Organisms 


We now begin our consideration of the glycolytic pathway. This pathway is 
common to virtually all cells, both prokaryotic and eukaryotic. In eukaryotic 
cells, glycolysis takes place in the cytoplasm. This pathway can be thought of as 
comprising two stages (Figure 16.2). Stage 1 is the trapping and preparation 
phase. No ATP is generated in this stage. In stage 1, glucose is converted into 
fructose 1,6-bisphosphate in three steps: a phosphorylation, an isomerization, 
and a second phosphorylation reaction. The strategy of these initial steps in gly- 
colysis is to trap the glucose in the cell and form a compound that can be readily 
cleaved into phosphorylated three-carbon units. Stage 1 is completed with the 
cleavage of the fructose 1,6-bisphosphate into two three-carbon fragments. 
These resulting three-carbon units are readily interconvertible. In stage 2, ATP 
is harvested when the three-carbon fragments are oxidized to pyruvate. 


Hexokinase traps glucose in the cell and begins glycolysis 


Glucose enters cells through specific transport proteins (p. 473) and has one 
principal fate: it is phosphorylated by ATP to form glucose 6-phosphate. This 
step is notable for several reasons. Glucose 6-phosphate cannot pass 
through the membrane because of the negative charges on the phosphoryl 
groups, and it is not a substrate for glucose transporters. Also, the addition 
of the phosphoryl group facilitates the eventual metabolism of glucose into 
three-carbon molecules with high-phosphoryl-transfer-potential. The 
transfer of the phosphoryl group from ATP to the hydroxyl group on car- 
bon 6 of glucose is catalyzed by hexokinase. 


CH20H CH,OPO;2- 
O 0 
Hexokinase 
OH + AP OH + ADP + Ht 
HO OH HO OH 
OH OH 
Glucose Glucose 6-phosphate 


(G-6P) 
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( => NADH 
=> ATP 
Vv 
Vv 
ay 
PEP 
> ATP 
\_ Pyruvate 


First stage of glycolysis. The first stage of 
glycolysis begins with the phosphorylation of 
glucose by hexokinase and ends with the 
isomerization of dihydroxyacetone phosphate 
to glyceraldehyde 3-phosphate. 
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FIGURE 16.2 Stages of glycolysis. The glycolytic pathway can be divided into two stages: 
(1) glucose is trapped, destabilized, and cleaved into two interconvertible three-carbon 
molecules generated by cleavage of six-carbon fructose; and (2) ATP is generated. 


Phosphory] transfer is a fundamental reaction in biochemistry. Kinases are 
enzymes that catalyze the transfer of a phosphoryl group from ATP to an 
acceptor. Hexokinase, then, catalyzes the transfer of a phosphoryl group from 
ATP to a variety of six-carbon sugars (hexoses), such as glucose and mannose. 
Hexokinase, like adenylate kinase (Section 9.4) and all other kinases, requires 
Mg?* (or another divalent metal ion such as Mn?*) for activity. The divalent 
metal ion forms a complex with ATP. 

X-ray crystallographic studies of yeast hexokinase revealed that the 
binding of glucose induces a large conformational change in the enzyme. 
Hexokinase consists of two lobes, which move toward each other when 
glucose is bound (Figure 16.3). On glucose binding, one lobe rotates 12 
degrees with respect to the other, resulting in movements of the polypep- 
tide backbone of as much as 8 A. The cleft between the lobes closes, and 
the bound glucose becomes surrounded by protein, except for the 
hydroxyl group of carbon 6, which will accept the phosphoryl group 
from ATP. The closing of the cleft in hexokinase is a striking example of 
the role of induced fit in enzyme action (Section 8.3). 

The glucose-induced structural changes are significant in two respects. 
First, the environment around the glucose becomes more nonpolar, which 
favors reaction between the hydrophilic hydroxyl group of glucose and the ter- 
minal phosphoryl group of ATP. Second, the conformational changes enable 
the kinase to discriminate against HO as a substrate. The closing of the cleft 
keeps water molecules away from the active site. If hexokinase were rigid, a 
molecule of H,O occupying the binding site for the —CH,OH of glucose could 
attack the y phosphoryl group of ATP, forming ADP and P.. In other words, a 
rigid kinase would likely also be an ATPase. It is interesting to note that other 
kinases taking part in glycolysis—phosphofructokinase, phosphoglycerate 
kinase, and pyruvate kinase—also contain clefts between lobes that close when 
substrate is bound, although the structures of these enzymes are different in 
other regards. Substrate-induced cleft closing is a general feature of kinases. Recall 
that protein kinase A also undergoes similar structural changes (p. 298). 


Fructose 1,6-bisphosphate is generated from glucose 6-phosphate 


The next step in glycolysis is the isomerization of glucose 6-phosphate to 
fructose 6-phosphate. Recall that the open-chain form of glucose has an 
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FIGURE 16.3 Induced fit in hexokinase. 
The two lobes of hexokinase are separated 
in the absence of glucose (left). The 
conformation of hexokinase changes 
markedly on binding glucose (right). Notice 
that two lobes of the enzyme come 
together, creating the necessary 
environment for catalysis. [After RSCB 
Protein Data Bank; drawn from yhx and 

thkg by Adam Steinberg.] 
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H 


HO 


Glucose 6-phosphate 


CH ,OPO32- 
QOH 
H 
OH H 
OH 
H OH 


(G-6P) 


2-03POH,C 


Fructose 6-phosphate 


aldehyde group at carbon 1, whereas the open-chain form of fructose has a 
keto group at carbon 2. Thus, the isomerization of glucose 6-phosphate to 
fructose 6-phosphate is a conversion of an aldose into a ketose. The reaction 
catalyzed by phosphoglucose isomerase takes several steps because both glu- 
cose 6-phosphate and fructose 6-phosphate are present primarily in the 
cyclic forms. The enzyme must first open the six-membered ring of glucose 
6-phosphate, catalyze the isomerization, and then promote the formation of 
the five-membered ring of fructose 6-phosphate. 


ee 
H—C—OH a7 ae 
HO—-C—H roel eee 2-Q3POH,C 5 CH 20H 
——— H-c—on OCH CO” k HHO i 
H—C—OH H——C—OH HO 4 
CH,OPO32- ee 


Fructose 6-phosphate 
(open-chain form) 


Fructose 6-phosphate 
(F-6P) 


Glucose 6-phosphate 
(open-chain form) 


A second phosphorylation reaction follows the isomerization step. 
Fructose 6-phosphate is phosphorylated at the expense of ATP to fructose 
1, 6-bisphosphate (F-1,6-BP). The prefix bis- in bisphosphate means that two 
separate monophosphoryl groups are present, whereas the prefix di- in 
diphosphate (as in adenosine diphosphate) means that two phosphoryl 
groups are present and are connected by an anhydride bond. 
2-03POH,C 


CH2OH CH20PO;2- 


Phosphofructokinase 


— 


+ ATP + ADP + Ht 


OH OH 


OH OH 


Fructose 1,6-bisphosphate 
(F-1, 6-BP) 


(F-6P) 
This reaction is catalyzed by phosphofructokinase (PFK), an allosteric 
enzyme that sets the pace of glycolysis. As we will learn, this enzyme plays 
a central role in the metabolism of many molecules in all parts of the body. 

What is the biochemical rationale for the isomerization of glucose 
6-phosphate to fructose 6-phosphate and its subsequent phosphorylation to 
form fructose 1,6-bisphosphate? Had the aldol cleavage taken place in the 
aldose glucose, a two-carbon and a four-carbon fragment would have 
resulted. Two different metabolic pathways, one to process the two-carbon 
fragment and one for the four-carbon fragment, would have been required 
to extract energy. Phosphorylation of the fructose 6-phosphate to fructose 
1,6-bisphosphate prevents the reformation of glucose 6-phosphate. As 
shown below, aldol cleavage of fructose 1,6-bisphosphate yields two phos- 
phorylated interconvertible three-carbon fragments that will be oxidized in 
the later steps of glycolysis to capture energy in the form of ATP. 


The six-carbon sugar is cleaved into two three-carbon fragments 

ee The newly formed fructose 1,6-bisphosphate is cleaved into glyceral- 
dehyde 3-phosphate (GAP) and dihydroxyacetone phosphate (DHAP), 

completing stage 1 of glycolysis. The products of the remaining steps in 

glycolysis consist of three-carbon units rather than six-carbon units. 


7 Dihydroxyacetone 
O CH2OPO32- phosphate 
re” —_ (DHAP) 
HO—C—H " 
| Aldolase 
A te = 7 
H Oo 
SF 
a Cc 
i 7 si | Glyceraldehyde 
- — 3-phosphate 
CH,0PO3* A= 0 0H (GAP) 
Fructose a 
1,6-bisphosphate CH20PO;3? 
(F-1, 6-BP) 


This reaction, which is readily reversible, is catalyzed by aldolase. This enzyme 
derives its name from the nature of the reverse reaction, an aldol condensation. 
Glyceraldehyde 3-phosphate is on the direct pathway of glycolysis, 
whereas dihydroxyacetone phosphate is not. Unless a means exists to con- 
vert dihydroxyacetone phosphate into glyceraldehyde 3-phosphate, a three- 
carbon fragment useful for generating ATP will be lost. These compounds 
are isomers that can be readily interconverted: dihydroxyacetone phosphate 
is a ketose, whereas glyceraldehyde 3-phosphate is an aldose. The isomeri- 
zation of these three-carbon phosphorylated sugars is catalyzed by triose 
phosphate isomerase (TPI, sometimes abbreviated TIM; Figure 16.4). 


x} H 2° 
SC—OH Triose phosphate 
rs isomerase | 
OC — H—C—OH 
CH,OPO32- CH OPO;2- 
Dihydroxyacetone Glyceraldehyde 
phosphate 3-phosphate 


This reaction is rapid and reversible. At equilibrium, 96% of the triose 
phosphate is dihydroxyacetone phosphate. However, the reaction proceeds 
readily from dihydroxyacetone phosphate to glyceraldehyde 3-phosphate 
because the subsequent reactions of glycolysis remove this product. Triose 
phosphate isomerase deficiency, a rare condition, is the only glycolytic 
enzymopathy that is lethal. This deficiency is characterized by severe 
hemolytic anemia and neurodegeneration. 


Mechanism: Triose phosphate isomerase salvages a 
three-carbon fragment 


Much is known about the catalytic mechanism of triose phos- 
phate isomerase. TPI catalyzes the transfer of a hydrogen 
atom from carbon 1 to carbon 2, an intramolecular oxidation— 
reduction. This isomerization of a ketose into an aldose pro- 
ceeds through an enediol intermediate (Figure 16.5). 

X-ray crystallographic and other studies showed that gluta- 
mate 165 plays the role ofa general acid—base catalyst: it abstracts 
a proton (H*) from carbon 1 and then donates it to carbon 2. 
However, the carboxylate group of glutamate 165 by itself is not 
basic enough to pull a proton away from a carbon atom adjacent 
to a carbonyl group. Histidine 95 assists catalysis by donating a 
proton to stabilize the negative charge that develops on the C-2 
carbonyl group. 

Two features of this enzyme are noteworthy. First, TPI dis- 
plays great catalytic prowess. It accelerates isomerization by a 
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hk FIGURE 16.4 Structure of triose 
phosphate isomerase. This enzyme 
consists of a central core of eight parallel B 
strands (orange) surrounded by eight a 
helices (blue). This structural motif, called 
an a8 barrel, is also found in the glycolytic 
enzymes aldolase, enolase, and pyruvate 
kinase. Notice that histidine 95 and 
glutamate 165, essential components of 
the active site of triose phosphate 
isomerase, are located in the barrel. A loop 
(red) closes off the active site on substrate 
binding. [Drawn from 2YPI.pdb.] 
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FIGURE 16.5 Catalytic mechanism 

of triose phosphate isomerase. (1) 
Glutamate 165 acts as a general base by 
abstracting a proton (Ht) from carbon 1. 
Histidine 95, acting as a general acid, 
donates a proton to the oxygen atom 
bonded to carbon 2, forming the enediol 
intermediate. (2) Glutamic acid, now acting 
as a general acid, donates a proton to C-2 
while histidine removes a proton from the 
OH group of C-1. (8) The product is formed, 
and glutamate and histidine are returned to 
their ionized and neutral forms, respectively. 
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factor of 10'° compared with the rate obtained with a simple base catalyst such 


as acetate ion. Indeed, the k..4/Kyy4 ratio for the isomerization of glyceralde- 
hyde 3-phosphate is 2 < 108 M~! s~1, which is close to the diffusion- 
controlled limit. In other words, catalysis takes place every time that enzyme 
and substrate meet. The diffusion-controlled encounter of substrate and 
enzyme is thus the rate-limiting step in catalysis. TPI is an example of a 
kinetically perfect enzyme (Section 8.4). Second, TPI suppresses an undesired 
side reaction, the decomposition of the enediol intermediate into methyl 
glyoxal and orthophosphate. 


HO OH P 
i 0 O 
aes VS 
is =n Z > is c 
H C—OPO32- H CH; 
Ho 
Enediol intermediate Methyl glyoxal 


In solution, this physiologically useless reaction is 100 times as fast as isom- 
erization. Moreover, methyl] glyoxal is a highly reactive compound that can 
modify the structure and function of a variety of biomolecules, including 
proteins and DNA. The reaction of methyl] glyoxal with a biomolecule is an 
example of deleterious reactions called advanced glycation end products, 
discussed earlier (AGEs, Section 11.1). Hence, TPI must prevent the ene- 
diol from leaving the enzyme. This labile intermediate is trapped in the 
active site by the movement of a loop of 10 residues (Figure 16.4). This loop 
serves as a lid on the active site, shutting it when the enediol is present and 
reopening it when isomerization is completed. We see here a striking example 
of one means of preventing an undesirable alternative reaction: the active site 
is kept closed until the desirable reaction takes place. 

Thus, two molecules of glyceraldehyde 3-phosphate are formed from one 
molecule of fructose 1,6-bisphosphate by the sequential action of aldolase 
and triose phosphate isomerase. The economy of metabolism is evident in 
this reaction sequence. The isomerase funnels dihydroxyacetone phosphate 
into the main glycolytic pathway; a separate set of reactions is not needed. 


The oxidation of an aldehyde to an acid powers the formation of a 
compound with high phosphoryl-transfer potential 


The preceding steps in glycolysis have transformed one molecule of glucose 
into two molecules of glyceraldehyde 3-phosphate, but no energy has yet 
been extracted. On the contrary, thus far, two molecules of ATP have been 
invested. We come now to the second stage of glycolysis, a series of steps 
that harvest some of the energy contained in glyceraldehyde 3-phosphate as 
ATP. The initial reaction in this sequence is the conversion of glyceraldehyde 
3-phosphate into 1, 3-bisphosphoglycerate (1,3-BPG), a reaction catalyzed by 
glyceraldehyde 3-phosphate dehydrogenase. 


Hy 2 Glyceraldehyde *0,PO_ ma) 
C 3-phosphate C 
| dehydrogenase | 
— + NADt + P) == H——C——OH + NADH + Ht 
CH20PO32- CH 0PO;32- 
Glyceraldehyde 1,3-Bisphosphoglycerate 
3-phosphate (1,3-BPG) 
(GAP) 


1,3-Bisphosphoglycerate is an acyl phosphate, which is a mixed anhydride 
of phosphoric acid and a carboxylic acid. Such compounds have a high 
phosphoryl-transfer potential; one of its phosphoryl groups is transferred to 
ADP in the next step in glycolysis. 

The reaction catalyzed by glyceraldehyde 3-phosphate dehydrogenase 
can be viewed as the sum of two processes: the oxidation of the aldehyde to 
a carboxylic acid by NAD* and the joining of the carboxylic acid and ortho- 
phosphate to form the acyl-phosphate product. 


O% UH SS _OH 
| Oxidation | 
a ek + NAD+ + H,O ——=— ae + NADH + Ht 
CH20PO32- CH2O0PO;2- 
OW OH Acyl-phosphate OW P03 
ie formation C 
| (dehydration) | 


H—C—\OH + P|) — == H—_C—_OH + H20 


CHOPO;2- CH OPO;2- 


The first reaction is thermodynamically quite favorable, with a standard 
free-energy change, AG°’, of approximately —50 kJ mol~! (—12 kcal 
mol~!), whereas the second reaction is quite unfavorable, with a standard 
free-energy change of the same magnitude but the opposite sign. If these 
two reactions simply took place in succession, the second reaction would 
have a very large activation energy and thus not take place at a biologically 
significant rate. These two processes must be coupled so that the favorable 
aldehyde oxidation can be used to drive the formation of the acyl phosphate. 
How are these reactions coupled? The key is an intermediate, formed as a 
result of the aldehyde oxidation, that is linked to the enzyme by a thioester 
bond. Thioesters are high-energy compounds found in many biochemical 
pathways (Section 15.4). This intermediate reacts with orthophosphate to 
form the high-energy compound 1,3-bisphosphoglycerate. 

The thioester intermediate is higher in free energy than the free carboxylic 
acid is. The favorable oxidation and unfavorable phosphorylation reactions 
are coupled by the thioester intermediate, which preserves much of the free 


457 
16.1 Glycolysis 


Glucose 


ATP ‘ 


» 4 


ATP 4 
Vv 


F-1,6-BP 


Second stage of glycolysis. The oxidation of 
three-carbon fragments yields ATP. 


458 


CHAPTER 16 Glycolysis and 
Gluconeogenesis 


kK FIGURE 16.7 Structure of 
glyceraldehyde 3-phosphate 
dehydrogenase. Notice that the active site 
includes a cysteine residue and a histidine 


residue adjacent to a bound NAD* molecule. 


The sulfur atom of cysteine will link with the 
substrate to form a transitory thioester 
intermediate. [Drawn from 1GAD.pdb.] 
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FIGURE 16.6 Free-energy profiles for glyceraldehyde oxidation followed by acyl- 
phosphate formation. (A) A hypothetical case with no coupling between the two processes. 
The second step must have a large activation barrier, making the reaction very slow. (B) The 
actual case with the two reactions coupled through a thioester intermediate. 


energy released in the oxidation reaction. We see here the use of a covalent 
enzyme-bound intermediate as a mechanism of energy coupling. A free-energy 
profile of the glyceraldehyde 3-phosphate dehydrogenase reaction, com- 
pared with a hypothetical process in which the reaction proceeds without 
this intermediate, reveals how this intermediate allows a favorable process 
to drive an unfavorable one (Figure 16.6). 


Mechanism: Phosphorylation is coupled to the oxidation of 
glyceraldehyde 3-phosphate by a thioester intermediate 


The active site of glyceraldehyde 3-phosphate dehydrogenase includes a 
reactive cysteine residue, as well as NAD* and a crucial histidine 
(Figure 16.7). Let us consider in detail how these components cooperate in 
the reaction mechanism (Figure 16.8). In step 1, the aldehyde substrate 
reacts with the sulfhydryl group of cysteine 149 on the enzyme to form a 
hemithioacetal. Step 2 is the transfer of a hydride ion to a molecule of NADT 
that is tightly bound to the enzyme and is adjacent to the cysteine residue. 
This reaction is favored by the deprotonation of the hemithioacetal by 
histidine 176. The products of this reaction are the reduced coenzyme 
NADH and a thioester intermediate. This thioester intermediate has a free 
energy close to that of the reactants (Figure 16.6). In step 3, the NADH 
formed from the aldehyde oxidation leaves the enzyme and is replaced by 
a second molecule of NAD*. This step is important because the positive 
charge on NAD? polarizes the thioester intermediate to facilitate the 
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FIGURE 16.8 Catalytic mechanism of glyceraldehyde 
3-phosphate dehydrogenase. The reaction proceeds through a 
thioester intermediate, which allows the oxidation of glyceraldehyde 
to be coupled to the phosphorylation of 3-phosphoglycerate. 

(1) Cysteine reacts with the aldehyde group of the substrate, forming 
a hemithioacetal. (2) An oxidation takes place with the transfer of a 
hydride ion to NADt, forming a thioester. This reaction is facilitated 
by the transfer of a proton to histidine. (3) The reduced NADH is 
exchanged for an NAD* molecule. (4) Orthophosphate attacks the 
thioester, forming the product 1,3-BPG. 


attack by orthophosphate. In step 4, orthophosphate attacks the thioester 
to form 1,3-BPG and free the cysteine residue. This example illustrates the 
essence of energy transformations and of metabolism itself: energy released 
by carbon oxidation is converted into high phosphoryl-transfer potential. 


ATP is formed by phosphoryl transfer from 1,3-bisphosphoglycerate 


1,3-Bisphosphoglycerate is an energy-rich molecule with a greater 
phosphoryl-transfer potential than that of ATP (Section 15.2). Thus, 1,3- 
BPGcan be used to power the synthesis of ATP from ADP. Phosphoglycerate 
kinase catalyzes the transfer of the phosphoryl group from the acyl phos- 
phate of 1,3-bisphosphoglycerate to ADP. ATP and 3-phosphoglycerate 
are the products. 


2- es 
o _OPOs OS 
Phosphoglycerate | 
kinase 
H—C—OH + ADP + H* H—C—OH + ATP 
CH20PO32- CH,OPO;2- 
1,3-Bisphosphoglycerate 3-Phosphoglycerate 
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3-Phosphoglycerate 


The formation of ATP in this manner is referred to as substrate-level phos- 
phorylation because the phosphate donor, 1,3-BPG, is a substrate with high 
phosphoryl-transfer potential. We will contrast this manner of ATP forma- 
tion with the formation of ATP from ionic gradients in chapters 18 and 19. 
Thus, the outcomes of the reactions catalyzed by glyceraldehyde 
3-phosphate dehydrogenase and phosphoglycerate kinase are as follows: 


1. Glyceraldehyde 3-phosphate, an aldehyde, is oxidized to 3-phospho- 
glycerate, a carboxylic acid. 


2. NAD* is concomitantly reduced to NADH. 


3. ATP is formed from P; and ADP at the expense of carbon-oxidation 
energy. 


In essence, the energy released during the oxidation of glyceraldehyde 
3-phosphate to 3-phosphoglycerate is temporarily trapped as 1,3- 
bisphosphoglycerate. This energy powers the transfer of a phosphoryl 
group from 1,3-bisphosphoglycerate to ADP to yield ATP. Keep in mind 
that, because of the actions of aldolase and triose phosphate isomerase, two 
molecules of glyceraldehyde 3-phosphate were formed and hence two mol- 
ecules of ATP were generated. These ATP molecules make up for the two 
molecules of ATP consumed in the first stage of glycolysis. 


Additional ATP is generated with the formation of pyruvate 


In the remaining steps of glycolysis, 3-phosphoglycerate is converted into 
pyruvate, and a second molecule of ATP is formed from ADP. 


ADP 
0.7 0 = 0 + Ht ATP 9 
“C’ HO LC OPO;2- -¢ 0 
——————T_ H— C—0P0,27 = | | 
Phosphoglycerate Enolase C Pyruvate H 
mutase H—C—OH HO Su kinase CHs 
H 
2-Phosphoglycerate Phosphenolpyruvate Pyruvate 


The first reaction is a rearrangement. The position of the phosphoryl group 
shifts in the conversion of 3-phosphoglycerate into 2-phosphoglycerate, a reac- 
tion catalyzed by phosphoglycerate mutase. In general, a mutase is an enzyme 
that catalyzes the intramolecular shift of a chemical group, such as a phos- 
phoryl group. The phosphoglycerate mutase reaction has an interesting 
mechanism: the phosphoryl group is not simply moved from one carbon 
atom to another. This enzyme requires catalytic amounts of 2,3-bisphos- 
phoglycerate (2,3-BPG) to maintain an active-site histidine residue in a 
phosphorylated form. This phosphoryl group 1s transferred to 3-phospho- 
glycerate to reform 2,3-bisphosphoglycerate. 


Enz-His-phosphate + 3-phosphoglycerate === 
Enz-His + 2,3-bisphosphoglycerate 


The mutase then functions as a phosphatase: it converts 2,3-bisphospho- 
glycerate into 2-phosphoglycerate. The mutase retains the phosphoryl 
group to regenerate the modified histidine. 


Enz-His + 2,3-bisphosphoglycerate === 
Enz-His-phosphate + 2-phosphoglycerate 


The sum of these reactions yields the mutase reaction: 
3-Phosphoglycerate === 2-phosphoglycerate 


In the next reaction, the dehydration of 2-phosphoglycerate introduces 
a double bond, creating an enol. Enolase catalyzes this formation of the enol 
phosphate phosphoenolpyruvate (PEP). This dehydration markedly elevates 
the transfer potential of the phosphoryl group. An enol phosphate has a 
high phosphoryl-transfer potential, whereas the phosphate ester of an 
ordinary alcohol, such as 2-phosphoglycerate, has a low one. The AG®’ of 
the hydrolysis of a phosphate ester of an ordinary alcohol is —13 kJ mol7! 
(—3 kcal mol~!), whereas that of phosphoenolpyruvate is —62 kJ mol~! 
(—15 kcal mol~'). 

Why does phosphoenolpyruvate have such a high phosphory1-transfer 
potential? The phosphoryl group traps the molecule in its unstable enol 
form. When the phosphoryl group has been donated to ATP, the enol 
undergoes a conversion into the more stable ketone—namely, pyruvate. 


ADP 
= ° oe AIP - 9 9 
C OPO3*- aC OH —_ 
O Now oO New oo ee 
po Aye 
H~ H HH Hs 
Phosphenolpyruvate Pyruvate Pyruvate 


(enol form) 


Thus, the high phosphoryl-transfer potential of phosphoenolpyruvate arises 
primarily from the large driving force of the subsequent enol—ketone conversion. 
Hence, pyruvate is formed, and ATP is generated concomitantly. The vir- 
tually irreversible transfer of a phosphoryl group from phosphoenolpyru- 
vate to ADP is catalyzed by pyruvate kinase. What is the energy source for 
the formation of phosphoenolpyruvate? The answer to this question 
becomes clear when we compare the structures of 2-phosphoglycerate and 
pyruvate. The formation of pyruvate from 2-phosphoglycerate is, in 
essence, an internal oxidation—reduction reaction; carbon 3 takes electrons 
from carbon 2 in the conversion of 2-phosphoglycerate into pyruvate. 
Compared with 2-phosphoglycerate, C-3 is more reduced in pyruvate, 
whereas C-2 is more oxidized. Once again, carbon oxidation powers the 
synthesis of a compound with high phosphoryl-transfer potential, phos- 
phoenolpyruvate here and 1,3-bisphosphoglycerate earlier, which allows 
the synthesis of ATP. 

Because the molecules of ATP used in forming fructose 1,6-bisphosphate 
have already been regenerated, the two molecules of ATP generated from 
phosphoenolpyruvate are “profit.” 


Two ATP molecules are formed in the conversion of 
glucose into pyruvate 


The net reaction in the transformation of glucose into pyruvate is 


Glucose + 2 P; + 2 ADP + 2 NAD* > 
2 pyruvate + 2 ATP + 2NADH +2H"* +250 


Thus, two molecules of ATP are generated in the conversion of glucose into 
two molecules of pyruvate. The reactions of glycolysis are summarized in 
Table 16.1. 

The energy released in the anaerobic conversion of glucose into two 
molecules of pyruvate is about —90 kJ mol~!(—22 kcal mol7!). We shall 
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TABLE 16.1 Reactions of glycolysis 


Step Reaction 


Glucose + ATP —> glucose 6-phosphate + ADP + H* 


1 
2. Glucose 6-phosphate fructose 6-phosphate 

3. Fructose 6-phosphate + ATP —> fructose 1,6-bisphosphate + ADP + H* 
4 


Fructose 1,6-bisphosphate ——= 
dihydroxyacetone phosphate + glyceraldehyde 3-phosphate 


5 Dihydroxyacetone phosphate glyceraldehyde 3-phosphate 


6  Glyceraldehyde 3-phosphate + P, + NAD* === 
1,3-bisphosphoglycerate + NADH + H* 


7 1,3-Bisphosphoglycerate + ADP === 3-phosphoglycerate + ATP 
3-Phosphoglycerate 


2-phosphoglycerate 


2-Phosphoglycerate phosphoenolpyruvate + H,O 
10 Phosphoenolpyruvate + ADP + H* —> pyruvate + ATP 
Note: AG, the actual free-energy change, has been calculated from AG*’ and known concentrations of 
reactants under typical physiological conditions. Glycolysis can proceed only if the AG values of all reactions 


are negative. The small positive AG values of three of the above reactions indicate that the concentrations 
of metabolites in vivo in cells undergoing glycolysis are not precisely known. 


see in chapters 17 and 18 that much more energy can be released from glu- 
cose in the presence of oxygen. 


NAD* is regenerated from the metabolism of pyruvate 


The conversion of glucose into two molecules of pyruvate has resulted in 
the net synthesis of ATP. However, an energy-converting pathway that 
stops at pyruvate will not proceed for long, because redox balance has 
not been maintained. As we have seen, the activity of glyceraldehyde 
3-phosphate dehydrogenase, in addition to generating a compound with 
high phosphoryl-transfer potential, reduces NAD* to NADH. In the 
cell, there are limited amounts of NAD™, which 1s derived from the vita- 
min niacin (B,), a dietary requirement for human beings. Consequently, 
NAD? must be regenerated for glycolysis to proceed. Thus, the final 
process in the pathway is the regeneration of NAD* through the metab- 
olism of pyruvate. 

The sequence of reactions from glucose to pyruvate is similar in most 
organisms and most types of cells. In contrast, the fate of pyruvate is 
variable. Three reactions of pyruvate are of primary importance: conver- 
sion into ethanol, lactate, or carbon dioxide (Figure 16.9). The first two 
reactions are fermentations that take place in the absence of oxygen. 
A fermentation is an ATP-generating process in which 
organic compounds act both as the donor and as the 
acceptor of electrons. In the presence of oxygen, the most 
common situation in multicellular organisms and in 
many unicellular ones, pyruvate is metabolized to carbon 
dioxide and water through the citric acid cycle and the 


electron-transport chain with oxygen serving as the final 


FIGURE 16.9 Diverse fates of pyruvate. Ethanol and lactate 
can be formed by reactions that include NADH. Alternatively, a 


(Chapter 17) to form acetyl CoA. 


electron acceptor. We now take a closer look at these three 
possible fates of pyruvate. 


1. Ethanol is formed from pyruvate in yeast and several 


Further 
oxidation other microorganisms. The first step is the decarboxylation 


of pyruvate. This reaction is catalyzed by pyruvate decar- 
boxylase, which requires the coenzyme thiamine pyrophos- 
two-carbon unit from pyruvate can be coupled to coenzyme A phate, a coenzyme, derived from the vitamin thiamine (B,). 


The second step is the reduction of acetaldehyde to ethanol 


AG" inkJ mol~! AG inkJ mol7! 


Enzyme Reaction type (kcal mol!) (kcal mol™') 
Hexokinase Phosphory] transfer —16.7 (—4.0 —33.5 (—8.0) 
Phosphoglucose isomerase Isomerization +1.7 (+0.4 el ae ORG) 
Phosphofructokinase Phosphory] transfer —14.2 (-3.4 —22.2 (—5.3) 
Aldolase Aldol cleavage +23.8 (+5.7 =i, (0.3) 
Triose phosphate isomerase Isomerization +7.5(+1.8 +2.5(+0.6) 
Glyceraldehyde 3-phosphate Phosphorylation coupled +6.3(+1.5 —1.7 (—0.4) 

dehydrogenase to oxidation 
Phosphoglycerate kinase Phosphory] transfer —18.8 (—4.5 +1.3(+0.3) 
Phosphoglycerate mutase Phosphory] shift +4.6(41.1 +0.8 (+0.2) 
Enolase Dehydration +1.7 (+0.4 —3.3 (—0.8) 
Pyruvate kinase Phosphory] transfer =3il4(=7.5 —16.7 (—4.0) 


by NADH, in a reaction catalyzed by alcohol dehydrogenase. This reaction 


regenerates NAD™. 


9 NADH 
a Ht CO. + Ht  NAD* 
ae 2 - H_ 0 H OH 
Pyruvate Alcohol 
CH3 decarboxylase CHs dehydrogenase CH3 
Pyruvate Acetaldehyde Ethanol 


The active site of alcohol dehydrogenase contains a zinc ion that is coordi- 
nated to the sulfur atoms of two cysteine residues and a nitrogen atom of 
histidine (Figure 16.10). This zinc ion polarizes the carbonyl group of the 
substrate to favor the transfer of a hydride from NADH. 

The conversion of glucose into ethanol is an example of alcoholic fermen- 
tation. The net result of this anaerobic process is 


Glucose + 2P,+ 2 ADP +2H*t > 
2 ethanol + 2 CO, +2ATP+2 H,O 


Note that NAD*t and NADH do not appear in this equation, even 
though they are crucial for the overall process. NADH generated by the 
oxidation of glyceraldehyde 3-phosphate is consumed in the reduction 
of acetaldehyde to ethanol. Thus, there is no net oxidation—reduction in 
the conversion of glucose into ethanol (Figure 16.11). The ethanol formed 
in alcoholic fermentation provides a key ingredient for brewing and 
winemaking. 


2. Lactate is formed from pyruvate in a variety of microorganisms in a 
process called lactic acid fermentation. Certain types of skeletal muscles 
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FIGURE 16.10 Active site of alcohol 
dehydrogenase. The active site contains a 
zinc ion bound to two cysteine residues 
and one histidine residue. Notice that the 
zinc ion binds the acetaldehyde substrate 
through its oxygen atom, polarizing the 
substrate so that it more easily accepts a 
hydride from NADH. Only the nicotinamide 
ring of NADH is shown. 


FIGURE 16.11 Maintaining redox balance. 
The NADH produced by the glyceraldehyde 
3-phosphate dehydrogenase reaction must 
be reoxidized to NADt for the glycolytic 
pathway to continue. In alcoholic 
fermentation, alcohol dehydrogenase 
oxidizes NADH and generates ethanol. In 
lactic acid fermentation (not shown), lactate 
dehydrogenase oxidizes NADH while 
generating lactic acid. 
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Glucose 


Regeneration of NAD*. 


in most animals can also function anaerobically for short periods. For 
example, a specific type of muscle fiber, called fast twitch or type IIb 
fibers, performs short bursts of intense exercise. The ATP needs rise 
faster than the ability of the body to provide oxygen to the muscle. The 
muscle functions anaerobically until fatigue sets in, which is caused, in 
part, by lactate buildup. Indeed, the pH of resting type IIb muscle 
fibers, which is about 7.0, may fall to as low as 6.3 during the bout 
of exercise. The drop in pH inhibits phosphofructokinase (p. 469). 
A lactate/H* symporter allows the exit of lactate from the muscle cell. 
The reduction of pyruvate by NADH to form lactate is catalyzed by 
lactate dehydrogenase. 


NADH ; 
+ H* ~~ NAD* Ox. 9 


1 \ / | 
AC O oe 
ON oo HOCH 
O 7 Lactate | 


dehydrogenase 
CH; ydrogen, CH; 


Pyruvate Lactate 


The overall reaction in the conversion of glucose into lactate is 
Glucose + 2 P; + 2 ADP — 2 lactate + 2 ATP + 2H2O 


As in alcoholic fermentation, there is no net oxidation—reduction. The 
NADH formed in the oxidation of glyceraldehyde 3-phosphate is con- 
sumed in the reduction of pyruvate. The regeneration of NAD* in the reduc- 
tion of pyruvate to lactate or ethanol sustains the continued process of glycolysis 
under anaerobic conditions. 


3. Only a fraction of the energy of glucose is released in its anaerobic con- 
version into ethanol or lactate. Much more energy can be extracted aerobi- 
cally by means of the citric acid cycle and the electron-transport chain. The 
entry point to this oxidative pathway is acetyl coenzyme A (acetyl CoA), 
which is formed inside mitochondria by the oxidative decarboxylation 
of pyruvate. 


Pyruvate + NAD* + CoA = acetyl CoA + CO, + NADH + H* 


This reaction, which is catalyzed by the pyruvate dehydrogenase complex, 
will be considered in detail in Chapter 17. The NAD* required for this 
reaction and for the oxidation of glyceraldehyde 3-phosphate is regenerated 
when NADH ultimately transfers its electrons to O, through the electron- 
transport chain in mitochondria. 


Fermentations provide usable energy in the absence of oxygen 


Fermentations yield only a fraction of the energy available from the com- 
plete combustion of glucose. Why is a relatively inefficient metabolic path- 
way so extensively used? The fundamental reason is that oxygen is not 
required. The ability to survive without oxygen affords a host of living 
accommodations such as soils, deep water, and skin pores. Some organisms, 
called obligate anaerobes, cannot survive in the presence of O,,, a highly reac- 
tive compound. The bacterium Clostridium perfringens, the cause of gan- 
grene, is an example of an obligate anaerobe. Other pathogenic obligate 
anaerobes are listed in Table 16.2. Some organisms, such as yeast, are facul- 
tative anaerobes that metabolize glucose aerobically when oxygen is present 
and perform fermentation when oxygen is absent. 

Although we have considered only lactic acid and alcoholic fermenta- 
tion, microorganisms are capable of generating a wide array of molecules as 


TABLE 16.2 Examples of pathogenic obligate anaerobes 


Bacterium Result of infection 


Clostridium tetani Tetanus (lockjaw) 


Clostridium botulinum Botulism (an especially severe type of food poisoning) 


Gas gangrene (gas is produced as an end point of the fermentation, 
distorting and destroying the tissue) 


Clostridium perfringens 


Bartonella hensela Cat scratch fever (flu-like symptoms) 


Bacteroides fragilis Abdominal, pelvic, pulmonary, and blood infections 


end points of fermentation (Table 16.3). Indeed, many food products, 
including sour cream, yogurt, various cheeses, beer, wine, and sauerkraut, 
result from fermentation. 


The binding site for NAD* is similar in many dehydrogenases 


bs The three dehydrogenases—glyceraldehyde 3-phosphate dehydroge- 

nase, alcohol dehydrogenase, and lactate dehydrogenase—have quite 
different three-dimensional structures. However, their NAD*-binding 
domains are strikingly similar (Figure 16.12). This nucleotide-binding 
region is made up of four a helices and a sheet of six parallel B strands. 
Moreover, in all cases, the bound NAD* displays nearly the same confor- 
mation. This common structural domain is often called a Rossmann fold 
after Michael Rossmann, who first recognized it. This fold likely represents 
a primordial dinucleotide-binding domain that recurs in the dehydroge- 
nases of glycolysis and other enzymes because of their descent from a com- 
mon ancestor. 


Nicotinamide-binding domain 


Pyrophosphate 


Ribose 
\ 


Fructose is converted into glycolytic intermediates 
by fructokinase 


Although glucose is the most widely used monosaccharide, others also are 
important fuels. Let us consider how fructose is funneled into the glyco- 
lytic pathway (Figure 16.13). There are no catabolic pathways for metabo- 
lizing fructose, and so the strategy is to convert this sugar into a metabolite 
of glucose. 


TABLE 16.3 Starting and ending points 
of various fermentations 


Glucose > lactate 

Lactate > acetate 
Glucose > ethanol 
Ethanol > acetate 
Arginine > carbon dioxide 
Pyrimidines > carbon dioxide 
Purines > formate 
Ethylene glycol => acetate 
Threonine > propionate 
Leucine > 2-alkylacetate 
Phenylalanine > propionate 


Note: The products of some fermentations are the 
substrates for others. 


FIGURE 16.12 NAD*-binding region in 
dehydrogenases. Notice that the 
nicotinamide-binding half (yellow) is 
structurally similar to the adenine-binding 
half (red). The two domains together form a 
structural motif called a Rossmann fold. 
The NAD* molecule binds in an extended 
conformation. [Drawn from 3LDH.pdb.] 
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The main site of fructose metabolism is the liver, using 
the fructose 1-phosphate pathway (Figure 16.14). The first 
step is the phosphorylation of fructose to fructose 1-phosphate 
by fructokinase. Fructose 1-phosphate is then split into glyc- 
evaldehyde and dihydroxyacetone phosphate, an intermediate 
in glycolysis. This aldol cleavage is catalyzed by a specific 
fructose 1-phosphate aldolase. Glyceraldehyde is then phos- 
phorylated to glyceraldehyde 3-phosphate, a glycolytic inter- 
mediate, by triose kinase. In other tissues, such as adipose 
tissue, fructose can be phosphorylated to fructose 6-phosphate 
by hexokinase. 


ae : ' Gass Excessive fructose consumption can lead to 
( DHAP —( GAP } ee) pathological conditions 


|. 


Pyruvate 


e Fructose, acommonly used sweetener, is a component 
of sucrose and high fructose corn syrup (which con- 
tains approximately 55% fructose and 45% glucose). 
Epidemiological as well as clinical studies have linked exces- 


FIGURE 16.13 Entry points in glycolysis for fructose sive fructose consumption to fatty liver, insulin insensitivity, 


and galactose. 


Fructose 


ATP 


+ ADP 


Fructose 1-phosphate 


Fructokinase 


Fructose 1-phosphate 
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FIGURE 16.14 Fructose metabolism. 
Fructose enters the glycolytic pathway 
in the liver through the fructose 
1-phosphate pathway. 
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and obesity. These conditions may eventually result in type 
2 diabetes (Chapter 27). Studies have shown that these disorders are not 
necessarily the result of simple excess energy consumption, but rather how 
fructose is processed by the liver. What aspects of liver fructose metabo- 
lism are the contributing factors then? Note that, as shown in Figure 16.14, 
the actions of fructokinase and triose kinase bypass the most important 
regulatory step in glycolysis, the phosphofructokinase-catalyzed reaction. 
The fructose-derived glyceraldehyde 3-phosphate and dihydroxyacetone 
phosphate are processed by glycolysis to pyruvate and subsequently to 
acetyl CoA in an unregulated fashion. As we will see in Chapter 22, this 
excess acetyl CoA is converted to fatty acids, which can be transported to 
adipose tissue, resulting in obesity. The liver also begins to accumulate 
fatty acids, resulting in fatty liver. The activity of the fructokinase and 
triose kinase can deplete the liver of ATP and inorganic phosphate, com- 
promising liver function. We will return to the topic of obesity and caloric 
homeostasis in Chapter 27. 


Galactose is converted into glucose 6-phosphate 


Like fructose, galactose is an abundant sugar that must be converted into 
metabolites of glucose (Figure 16.13). Galactose is converted into glucose 
6-phosphate in four steps. The first reaction in the galactose—glucose intercon- 
version pathway is the phosphorylation of galactose to galactose 1-phosphate 
by galactokinase. 


CH20OH ADP CH2OH 
ATP + H* | 
HO 0 HO Q 
on Galactokinase i fe) = 
OH QO 4 
Np 
OH OH IN 
' ~O 
O 
Galactose Galactose 1-phosphate 


Galactose 1-phosphate then acquires a uridyl group from uridine diphos- 
phate glucose (UDP-glucose), an activated intermediate in the synthesis of 
carbohydrates (p. 331 and Section 21.4). 


CH OH CH OH 467 
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The products of this reaction, which is catalyzed by galactose 1-phosphate 
uridyl transferase, are UDP-galactose and glucose 1-phosphate. The galactose 
moiety of UDP-galactose is then epimerized to glucose. The configuration of 
the hydroxyl group at carbon 4 is inverted by UDP-galactose 4-epimerase. 

The sum of the reactions catalyzed by galactokinase, the transferase, and 
the epimerase is 


Galactose + ATP —> glucose 1-phosphate + ADP + H~ 


Note that UDP-glucose is not consumed in the conversion of galactose into 
glucose, because it is regenerated from UDP-galactose by the epimerase. 
This reaction is reversible, and the product of the reverse direction also is 
important. The conversion of UDP-glucose into UDP-galactose is essential for 
the synthesis of galactosyl residues in complex polysaccharides and glycopro- 
teins if the amount of galactose in the diet is inadequate to meet these needs. 

Finally, glucose 1-phosphate, formed from galactose, is isomerized to 
glucose 6-phosphate by phosphoglucomutase. 


Phosphoglucomutase 


Glucose 1-phosphate === = glucose 6-phosphate 


We shall return to this reaction when we consider the synthesis and degra- 
dation of glycogen, which proceeds through glucose 1-phosphate, in 
Chapter 21. 


Many adults are intolerant of milk because they are deficient in lactase 


e Many adults are unable to metabolize the milk sugar lactose and 
experience gastrointestinal disturbances if they drink milk. Lactose 
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Scanning electron micrograph of 
Lactobacillus. The anaerobic bacterium 
Lactobacillus is shown here. As suggested by 
its name, this genus of bacteria ferments glu- 
cose into lactic acid and is widely used in the 
food industry. Lactobacillus is also a component 
of the normal human bacterial flora of the 
urogenital tract where, because of its ability 
to generate an acidic environment, it prevents 
the growth of harmful bacteria. [Power and 
Syred/ Science Photo Library.] 


intolerance, or hypolactasia, is most commonly caused by a deficiency of the 
enzyme lactase, which cleaves lactose into glucose and galactose. 


CH20H CH20OH CH,0H CH2OH 
ei H 2 H ° Lactase i 2 ‘a e 
oO O oO aH OH + OH 
OH HO OH 
OH OH OH OH 
Lactose Galactose Glucose 


“Deficiency” is not quite the appropriate term, because a decrease in lactase 
is normal in the course of development in all mammals. As children are 
weaned and milk becomes less prominent in their diets, lactase activity nor- 
mally declines to about 5% to 10% of the level at birth. This decrease is not 
as pronounced with some groups of people, most notably Northern 
Europeans, and people from these groups can continue to ingest milk with- 
out gastrointestinal difficulties. With the development of dairy farming, an 
adult with active lactase would have a selective advantage in being able to 
consume calories from the readily available milk. Indeed, estimates suggest 
that people with the mutation would produce almost 20% more fertile off- 
spring. Because dairy farming appeared in northern Europe about 10,000 
years ago, the evolutionary selective pressure on lactase persistence must 
have been substantial, attesting to the biochemical value of being able to use 
milk as an energy source into adulthood. 

What happens to the lactose in the intestine of a lactase-deficient per- 
son? The lactose is a good energy source for microorganisms in the colon, 
and they ferment it to lactic acid while generating methane (CH,) and 
hydrogen gas (H,). The gas produced creates the uncomfortable feeling of 
gut distension and the annoying problem of flatulence. The lactate pro- 
duced by the microorganisms is osmotically active and draws water into the 
intestine, as does any undigested lactose, resulting in diarrhea. If severe 
enough, the gas and diarrhea hinder the absorption of other nutrients such 
as fats and proteins. The simplest treatment is to avoid the consumption of 
products containing much lactose. Alternatively, the enzyme lactase can be 
ingested with milk products. 


Galactose is highly toxic if the transferase is missing 


Less common than lactose intolerance are disorders that interfere 

with the metabolism of galactose. The disruption of galactose 
metabolism is referred to as galactosemia. The most common form, called 
classic galactosemia, is an inherited deficiency in galactose 1-phosphate 
uridyl transferase activity. Afflicted infants fail to thrive. They vomit or 
have diarrhea after consuming milk, and enlargement of the liver and 
jaundice are common, sometimes progressing to cirrhosis. Cataracts will 
form, and lethargy and retarded mental development also are common. 
The blood-galactose level is markedly elevated, and galactose is found in 
the urine. The absence of the transferase in red blood cells is a definitive 
diagnostic criterion. 

The most common treatment is to remove galactose (and lactose) from 
the diet. An enigma of galactosemia is that, although elimination of galac- 
tose from the diet prevents liver disease and cataract development, the 
majority of patients still suffer from central nervous system malfunction, 
most commonly a delayed acquisition of language skills. Female patients 
also display ovarian failure. 


Cataract formation is better understood. A cataract is the clouding of the 
normally clear lens of the eye (Figure 16.15). If the transferase is not active 
in the lens of the eye, the presence of aldose reductase causes the accumulat- 
ing galactose to be reduced to galactitol. 
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Galactitol is poorly metabolized and accumulates in the lens. Water will dif- 
fuse into the lens to maintain osmotic balance, triggering the formation of 
cataracts. In fact, there is a high incidence of cataract formation with age in 
populations that consume substantial amounts of milk into adulthood. 


16.2 The Glycolytic Pathway Is Tightly Controlled 


The glycolytic pathway has a dual role: it degrades glucose to generate ATP 
and it provides building blocks for biosynthetic reactions. The rate of con- 
version of glucose into pyruvate is regulated to meet these two major cellu- 
lar needs. In metabolic pathways, enzymes catalyzing essentially irreversible 
reactions are potential sites of control. In glycolysis, the reactions catalyzed 
by hexokinase, phosphofructokinase, and pyruvate kinase are virtually irre- 
versible, and each of them serves as a control site. These enzymes become 
more active or less so in response to the reversible binding of allosteric effec- 
tors or to covalent modification. In addition, the amounts of these impor- 
tant enzymes are varied by the regulation of transcription to meet changing 
metabolic needs. The time required for allosteric control, regulation by 
phosphorylation, and transcriptional control is measured typically in milli- 
seconds, seconds, and hours, respectively. We will consider the control of 
glycolysis in two different tissues—skeletal muscle and liver. 


Glycolysis in muscle is regulated to meet the need for ATP 


Glycolysis in skeletal muscle provides ATP primarily to power contraction. 
Consequently, the primary control of muscle glycolysis is the energy charge of 
the cell—the ratio of ATP to AMP. Let us examine how each of the key 
regulatory enzymes responds to changes in the amounts of ATP and AMP 
present in the cell. 


Phosphofructokinase. Phosphofructokinase is the most important control site 
in the mammalian glycolytic pathway (Figure 16.16). High levels of ATP 
allosterically inhibit the enzyme (a 340-kDa tetramer). ATP binds to a spe- 
cific regulatory site that is distinct from the catalytic site. The binding of 
ATP lowers the enzyme’s affinity for fructose 6-phosphate. Thus, a high 
concentration of ATP converts the hyperbolic binding curve of fructose 
6-phosphate into a sigmoidal one (Figure 16.17). AMP reverses the inhibi- 
tory action of ATP, and so the activity of the enzyme increases when the 
ATP/AMP ratio is lowered. In other words, glycolysis is stimulated as the 
energy charge falls. A decrease in pH also inhibits phosphofructokinase 


FIGURE 16.15 Cataracts are evident as 

the clouding of the lens. (A) A healthy eye. 
(B) An eye with a cataract. [(A) © Imageafter; 
(B) SPL/Photo Researchers. ] 
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Low [ATP] 


Reaction velocity — 


[Fructose 6-phosphate] —> 


FIGURE 16.17 Allosteric regulation of 
phosphofructokinase. A high level of ATP 
inhibits the enzyme by decreasing its affinity 
for fructose 6-phosphate. 
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hk FIGURE 16.16 Structure of phosphofructokinase. The structure of phosphofructokinase 
from E. coli comprises a tetramer of four identical subunits. Notice the separation of the 
catalytic and allosteric sites. Each subunit of the human liver enzyme consists of two domains 
that are similar to the E. coli enzyme. [Drawn from 1PFK.pdb.] 


activity by augmenting the inhibitory effect of ATP. The pH might fall 
when fast twitch muscle is functioning anaerobically, producing excessive 
quantities of lactic acid. The inhibitory effect protects the muscle from 
damage that would result from the accumulation of too much acid. 

Why is AMP and not ADP the positive regulator of phosphofructoki- 
nase? When ATP is being utilized rapidly, the enzyme adenylate kinase 
(Section 9.4) can form ATP from ADP by the following reaction: 


ADP + ADP == ATP + AMP 


Thus, some ATP is salvaged from ADP, and AMP becomes the signal for 
the low-energy state. Moreover, the use of AMP as an allosteric regulator 
provides an especially sensitive control. We can understand why by consid- 
ering, first, that the total adenylate pool ([ATP], [ADP], [AMP]]) ina cell is 
constant over the short term and, second, that the concentration of ATP is 
greater than that of ADP and the concentration of ADP is, in turn, greater 
than that of AMP. Consequently, small-percentage changes in [ATP] result 
in larger-percentage changes in the concentrations of the other adenylate 
nucleotides. This magnification of small changes in [ATP] to larger changes 
in [AMP] leads to tighter control by increasing the range of sensitivity of 
phosphofructokinase (Problem 46). 


Hexokinase. Phosphofructokinase is the most prominent regulatory enzyme 
in glycolysis, but it is not the only one. Hexokinase, the enzyme catalyzing 
the first step of glycolysis, is inhibited by its product, glucose 6-phosphate. 
High concentrations of this molecule signal that the cell no longer requires 
glucose for energy or for the synthesis of glycogen, a storage form of glucose 


(Chapter 21), and the glucose will be left in the blood. A rise in glucose 
6-phosphate concentration is a means by which phosphofructokinase com- 
municates with hexokinase. When phosphofructokinase is inactive, the con- 
centration of fructose 6-phosphate rises. In turn, the level of glucose 6-phos- 
phate rises because it is in equilibrium with fructose 6-phosphate. Hence, the 
inhibition of phosphofructokinase leads to the inhibition of hexokinase. 

Why is phosphofructokinase rather than hexokinase the pacemaker of 
glycolysis? The reason becomes evident on noting that glucose 6-phosphate 
is not solely a glycolytic intermediate. In muscle, glucose 6-phosphate can 
also be converted into glycogen. The first irreversible reaction unique to the 
glycolytic pathway, the committed step (Section 10.1), is the phosphorylation 
of fructose 6-phosphate to fructose 1,6-bisphosphate. Thus, it is highly 
appropriate for phosphofructokinase to be the primary control site in gly- 
colysis. In general, the enzyme catalyzing the committed step in a metabolic 
sequence is the most important control element in the pathway. 


Pyruvate kinase. Pyruvate kinase, the enzyme catalyzing the third irrevers- 
ible step in glycolysis, controls the outflow from this pathway. This final step 
yields ATP and pyruvate, a central metabolic intermediate that can be oxi- 
dized further or used as a building block. ATP allosterically inhibits pyru- 
vate kinase to slow glycolysis when the energy charge is high. When the pace 
of glycolysis increases, fructose 1,6-bisphosphate, the product of the preced- 
ing irreversible step in glycolysis, activates the kinase to enable it to keep 
pace with the oncoming high flux of intermediates. A summary of the regu- 
lation of glycolysis in resting and active muscle is shown in Figure 16.18. 
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DURING EXERCISE 
(glycolysis stimulated) 
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ye Fructose 1,6-bisphosphate Fructose 1,6-bisphosphate 
IN a 
v 
i High energy 
charge 
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FIGURE 16.18 Regulation of glycolysis in muscle. At rest (left), glycolysis is not very active 
(thin arrows). The high concentration of ATP inhibits phosphofructokinase (PFK), pyruvate 
kinase, and hexokinase. Glucose 6-phosphate is converted into glycogen (Chapter 21). During 
exercise (right), the decrease in the ATP/AMP ratio resulting from muscle contraction activates 
phosphofructokinase and hence glycolysis. The flux down the pathway is increased, as 
represented by the thick arrows. 
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FIGURE 16.19 Regulation of 
phosphofructokinase by fructose 
2,6-bisphosphate. In high concentrations, 
fructose 6-phosphate (F-6P) activates the 
enzyme phosphofructokinase (PFK) through 
an intermediary, fructose 2,6-bisphosphate 
(F-2,6-BP). 
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FIGURE 16.20 Activation of 
phosphofructokinase by fructose 
2,6-bisphosphate. (A) The sigmoidal 
dependence of velocity on substrate 
concentration becomes hyperbolic in 
the presence of 1 ~M fructose 
2,6-bisphosphate. (B) ATP, acting as a 
substrate, initially stimulates the reaction. 
As the concentration of ATP increases, it 
acts as an allosteric inhibitor. The inhibitory 
effect of ATP is reversed by fructose 
2,6-bisphosphate. [Data from E. Van 
Schaftingen, M. F. Jett, L. Hue, and 

H. G. Hers, Proc. Natl. Acad. Sci. 

U. S. A. 78:3483-3486, 1981.] 


The regulation of glycolysis in the liver illustrates the biochemical 
versatility of the liver 


The liver has more diverse biochemical functions than does muscle. 
Significantly, the liver maintains blood-glucose levels: it stores glucose as 
glycogen when glucose is plentiful, and it releases glucose when supplies 
are low. It also uses glucose to generate reducing power for biosynthesis 
(Section 20.3) as well as to synthesize a host of biochemicals. So, although 
the liver has many of the regulatory features of muscle glycolysis, the regula- 
tion of glycolysis in the liver is more complex. 


Phosphofructokinase. Liver phosphofructokinase can be regulated by 
ATP as in muscle, but such regulation is not as important since the liver 
does not experience the sudden ATP needs that a contracting muscle does. 
Likewise, low pH is not an important metabolic signal for the liver enzyme, 
because lactate is not normally produced in the liver. Indeed, as we will see, 
lactate is converted into glucose in the liver. 

Glycolysis in the liver furnishes carbon skeletons for biosyntheses, and 
so a signal indicating whether building blocks are abundant or scarce should 
also regulate phosphofructokinase. In the liver, phosphofructokinase is inhib- 
ited by citrate, an early intermediate in the citric acid cycle (Chapter 17). 
A high level of citrate in the cytoplasm means that biosynthetic precursors 
are abundant, and so there is no need to degrade additional glucose for this 
purpose. Citrate inhibits phosphofructokinase by enhancing the inhibitory 
effect of ATP. 

The key means by which glycolysis in the liver responds to changes in 
blood glucose is through the signal molecule fructose 2,6-bisphosphate 
(F-2,6-BP), a potent activator of phosphofructokinase (Figure 16.19). In 
the liver, the concentration of fructose 6-phosphate rises when blood- 
glucose concentration is high, and the abundance of fructose 6-phosphate 
accelerates the synthesis of F-2,6-BP (Figure 16.20). Hence, an abundance 
of fructose 6-phosphate leads to a higher concentration of F-2,6-BP. The 
binding of fructose 2,6-bisphosphate increases the affinity of phosphofruc- 
tokinase for fructose 6-phosphate and diminishes the inhibitory effect of 
ATP. Glycolysis is thus accelerated when glucose is abundant. Such a pro- 
cess is called feedforward stimulation. We will turn to the synthesis and 
degradation of this important regulatory molecule after we have considered 
gluconeogenesis. 
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Hexokinase and glucokinase. The hexokinase reac- 
tion in the liver is controlled as in the muscle. 
However, the liver, in keeping with its role as monitor 
of blood-glucose levels, possesses another specialized 
isozyme of hexokinase, called glucokinase, which is 
not inhibited by glucose 6-phosphate. The role of 
glucokinase is to provide glucose 6-phosphate for 
the synthesis of glycogen and for the formation of 
fatty acids (Section 22.1). Remarkably, glucokinase 
displays the sigmoidal kinetics characteristic of an 
allosteric enzyme even though it functions as amono- 
mer. Glucokinase phosphorylates glucose only when 
glucose is abundant because the affinity of glucoki- 
nase for glucose is about 50-fold lower than that of 
hexokinase. Moreover, when glucose concentration is 
low, glucokinase is inhibited by the liver-specific 
glucokinase regulatory protein. The low affinity of 
glucokinase for glucose gives the brain and muscles 
first call on glucose when its supply is limited, and it 
ensures that glucose will not be wasted when it is 
abundant. Drugs that activate liver glucokinase are 
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FIGURE 16.21 Control of the catalytic activity of pyruvate 


kinase. Pyruvate kinase is regulated by allosteric effectors and covalent 


modification. Fructose 1,6-bisphosphate allosterically stimulates the 


enzyme, while ATP and alanine are allosteric inhibitors. Glucagon, secreted 
in response to low blood glucose, promotes phosphorylation and inhibition 
of the enzyme. When blood glucose levels are adequate, the enzyme is 


dephosphorylated and activated. 


being evaluated as a treatment for type 2 or insulin-insensitive diabetes. 
Glucokinase is also present in the B cells of the pancreas, where the increased 
formation of glucose 6-phosphate by glucokinase when blood-glucose levels 
are elevated leads to the secretion of the hormone insulin. Insulin signals the 
need to remove glucose from the blood for storage as glycogen or conversion 


into fat. 


Pyruvate kinase. Several isozymic forms of pyruvate kinase (a tetramer 
of 57-kDa subunits) encoded by different genes are present in mammals: 
the L type predominates in the liver, and the M type in muscle and the 
brain. The L and M forms of pyruvate kinase have many properties in 
common. Indeed, the liver enzyme behaves much like the muscle enzyme 
with regard to allosteric regulation except that the liver enzyme is also 
inhibited by alanine (synthesized in one step from pyruvate), a signal that 
building-blocks are available. Moreover, the isozymic forms differ in their 
susceptibility to covalent modification. The catalytic properties of the L 
form—but not of the M form—are also controlled by reversible phos- 
phorylation (Figure 16.21). When the blood-glucose level is low, the 
glucagon-triggered cyclic AMP cascade (p. 483) leads to the phosphoryla- 
tion of pyruvate kinase, which diminishes its activity. This hormone- 
triggered phosphorylation prevents the liver from consuming glucose 
when it is more urgently needed by the brain and muscle. We see here a 
clear-cut example of how isoenzymes contribute to the metabolic diver- 
sity of different organs. We will return to the control of glycolysis after 


considering gluconeogenesis. 


A family of transporters enables glucose to enter and 


leave animal cells 


Several glucose transporters mediate the thermodynamically downhill 
movement of glucose across the plasma membranes of animal cells. Each 
member of this protein family, named GLUT1 to GLUTS, consists of a 
single polypeptide chain about 500 residues long (Table 16.4). Each glucose 
transporter has a 12-transmembrane-helix structure similar to that of lac- 


tose permease (Section 13.3). 
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TABLE 16.4 Family of glucose transporters 


Name Tissue location Kyy Comments 


GLUT1 ~All mammalian tissues 1mM Basal glucose uptake 
GLUT2 Liver and pancreatic B cells 15-20mM_ Inthe pancreas, plays a role in the 
regulation of insulin 
In the liver, removes excess glucose 


from the blood 
GLUT3 — All mammalian tissues 1mM Basal glucose uptake 
GLUT4 Muscle and fat cells 5mM Amount in muscle plasma membrane 


increases with endurance training 


GLUTS5 — Small intestine _ Primarily a fructose transporter 


The members of this family have distinctive roles: 


1. GLUT1 and GLUT3, present in nearly all mammalian cells, are 
responsible for basal glucose uptake. Their K,, value for glucose is about 1 
mM, significantly less than the normal serum-glucose level, which typically 
ranges from 4 mM to 8 mM. Hence, GLUT1 and GLUT3 continually 
transport glucose into cells at an essentially constant rate. 


2. GLUT2, present in liver and pancreatic B cells, is distinctive in having 
a very high K,, value for glucose (15-20 mM). Hence, glucose enters these 
tissues at a biologically significant rate only when there is much glucose in the 
blood. The pancreas can sense the glucose level and accordingly adjust the 
rate of insulin secretion. The high K,, value of GLUT2 also ensures that 
glucose rapidly enters liver cells only in times of plenty. 


3. GLUT4, which has a K,, value of 5 mM, transports glucose into muscle 
and fat cells. The number of GLUT4 transporters in the plasma membrane 
increases rapidly in the presence of insulin, which signals the fed state. 
Hence, insulin promotes the uptake of glucose by muscle and fat. Endurance 
exercise training increases the amount of this transporter present in muscle 
membranes. 


4. GLUTS5, present in the small intestine, functions primarily as a fructose 
transporter. 


Aerobic glycolysis is a property of rapidly growing cells. 


e Tumors have been known for decades to display enhanced rates of 

glucose uptake and glycolysis. Indeed, rapidly growing tumor cells 
will metabolize glucose to lactate even in the presence of oxygen, a process 
called aerobic glycolysis or the Warburg effect, after Otto Warburg, the 
biochemist who first noted this characteristic of cancer cells in the 1920s. 
In fact, tumors with a high glucose uptake are particularly aggressive, and 
the cancer is likely to have a poor prognosis. A nonmetabolizable glucose 
analog, 2-'SF-2-D-deoxyglucose, detectable by a combination of positron 
emission tomography (PET) and computer-aided tomography (CAT), 
easily visualizes tumors and allows monitoring of the effectiveness of 
treatment (Figure 16.22). 

What selective advantage does aerobic glycolysis offer the tumor over 
the energetically more efficient oxidative phosphorylation? Research is 
being actively pursued to answer the question, but we can speculate on the 
benefits. First, aerobic glycolysis generates lactic acid that is then secreted. 
Acidification of the tumor environment has been shown to facilitate tumor 
invasion and inhibit the immune system from attacking the tumor. 
However, even leukemias perform aerobic glycolysis, and leukemia is not 
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FIGURE 16.22 Tumors can be visualized with 2-'8F-2-D-deoxyglucose (FDG) and 
positron emission tomography. (A) A nonmetabolizable glucose analog infused into a 
patient and detected by a combination of positron emission and computer-aided tomography 
reveals the presence of a malignant tumor (T). (B) After 4 weeks of treatment with a tyrosine 
kinase inhibitor (Section 14.5), the tumor shows no uptake of FDG, indicating decreased 
metabolism. Excess FDG, which is excreted in the urine, also visualizes the kidneys (K) and 
bladder (B). [Images courtesy of A. D. Van den Abbeele, Dana-Farber Cancer Institute, Boston.] 


an invasive cancer. Second, and perhaps more importantly, the increased 
uptake of glucose and formation of glucose 6-phosphate provides sub- 
strates for another metabolic pathway, the pentose phosphate pathway 
(Chapter 20), that generates biosynthetic reducing power. Moreover, the 
pentose phosphate pathway, in cooperation with glycolysis, produces pre- 
cursors for biomolecules necessary for growth, such as nucleotides. Finally, 
cancer cells grow more rapidly than the blood vessels that nourish them; 
thus, as solid tumors grow, the oxygen concentration in their environment 
falls. In other words, they begin to experience hypoxia, a deficiency of 
oxygen. The use of aerobic glycolysis reduces the dependence of cell 
growth on oxygen. Not all of the precursor needs are met by enhanced 
glucose metabolism. Cancer cells also require glutamine, which is chan- 
neled into the mitochondria to replenish citric acid cycle components used 
for biosynthesis. 

What biochemical alterations facilitate the switch to aerobic glycolysis? 
Again, the answers are not complete, but changes in gene expression of 
isozymic forms of two glycolytic enzymes may be crucial. Tumor cells 
express an isozyme of hexokinase that binds to mitochondria. There, the 
enzyme has ready access to any ATP generated by oxidative phosphoryla- 
tion and is not susceptible to feedback inhibition by its product, glucose 
6-phosphate. More importantly, an embryonic isozyme of pyruvate kinase, 
pyruvate kinase M, is also expressed. Remarkably, this isozyme has a lower 
catalytic rate than normal pyruvate kinase and creates a bottleneck, allowing 
the use of glycolytic intermediates for biosynthetic processes required for 


TABLE 16.5 Proteins in glucose 
metabolism encoded by 
genes regulated by 
hypoxia-inducible factor 
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Hexokinase 

Phosphofructokinase 

Aldolase 
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FIGURE 16.23 Alteration of gene 
expression in tumors owing to 

hypoxia. The hypoxic conditions inside a 
tumor mass lead to the activation of the 
hypoxia-inducible transcription factor 
(HIF-1), which induces metabolic adaptation 
(an increase in glycolytic enzymes) and 
activates angiogenic factors that stimulate 
the growth of new blood vessels. [Information 
from C. V. Dang and G. L. Semenza, Trends 
Biochem. Sci. 24:68-72, 1999.] 
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cell proliferation. The need for biosynthetic precursors is greater than the 
need for ATP, suggesting that even glycolysis at a reduced rate produces 
sufficient ATP to allow cell proliferation. Although originally observed in 
cancer cells, the Warburg effect is also seen in noncancerous, rapidly divid- 
ing cells. 


Cancer and endurance training affect glycolysis in a similar fashion 


The hypoxia that some tumors experience with rapid growth activates a 
transcription factor, hypoxia-inducible transcription factor (HIF-1). HIF-1 
increases the expression of most glycolytic enzymes and the glucose trans- 
porters GLUT1 and GLUT3 (Table 16.5). These adaptations by the cancer 
cells enable a tumor to survive until blood vessels can grow. HIF-1 also 
increases the expression of signal molecules, such as vascular endothelial 
growth factor (VEGF), that facilitate the growth of blood vessels that will 
provide nutrients to the cells (Figure 16.23). Without new blood vessels, 
a tumor would cease to grow and either die or remain harmlessly small. 
Efforts are underway to develop drugs that inhibit the growth of blood 
vessels in tumors. 

Interestingly, anaerobic exercise training—forcing muscles to rely on 
lactic acid fermentation for ATP production—also activates HIF-1, pro- 
ducing the same effects as those seen in the tumor—enhanced ability to 
generate ATP anaerobically and a stimulation of blood-vessel growth. 
These biochemical effects account for the improved athletic performance 
that results from training and demonstrate how behavior can affect bio- 
chemistry. Other signals from sustained muscle contraction trigger muscle 
mitochondrial biogenesis, allowing for more efficient aerobic energy gen- 
eration and forestalling the need to resort to lactic acid fermentation for 
ATP synthesis (Chapter 27). 


16.3 Glucose Can Be Synthesized from 
Noncarbohydrate Precursors 


We now turn to the synthesis of glucose from noncarbohydrate precursors, 
a process called gluconeogenesis. Maintaining levels of glucose is impor- 
tant because the brain depends on glucose as its primary fuel and red 
blood cells use glucose as their only fuel. The daily glucose requirement 
of the brain in a typical adult human being is about 120 g, which 
accounts for most of the 160 g of glucose needed daily by the whole body. 
The amount of glucose present in body fluids is about 20 g, and that 
readily available from glycogen is approximately 190 g. Thus, the direct 
glucose reserves are sufficient to meet glucose needs for about a day. 
Gluconeogenesis is especially important during a longer period of fast- 
ing or starvation (Section 27.5). 

The gluconeogenic pathway converts pyruvate into glucose. Noncarbohydrate 
precursors of glucose are first converted into pyruvate or enter the pathway 
at later intermediates such as oxaloacetate and dihydroxyacetone phosphate 
(Figure 16.24). The major noncarbohydrate precursors are lactate, amino 
acids, and glycerol. Lactate is formed by active skeletal muscle when the rate 
of glycolysis exceeds the rate of oxidative metabolism. Lactate is readily 
converted into pyruvate by the action of lactate dehydrogenase (p. 464). 
Amino acids are derived from proteins in the diet and, during starvation, 
from the breakdown of proteins in skeletal muscle (Section 23.1). The 
hydrolysis of triacylglycerols (Section 22.2) in fat cells yields glycerol and 
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FIGURE 16.24 Pathway of 
gluconeogenesis. The reactions and 
enzymes unique to gluconeogenesis 
are shown in red. The other reactions 
are common to glycolysis. The enzymes 
for gluconeogenesis are located in the 
cytoplasm, except for pyruvate 
carboxylase (in the mitochondria) and 
glucose 6-phosphatase (membrane 
bound in the endoplasmic reticulum). 
The entry points for lactate, glycerol, 
and amino acids are shown. 


478 


CHAPTER 16 Glycolysis and 
Gluconeogenesis 


fatty acids. Glycerol is a precursor of glucose, but animals cannot convert 
fatty acids into glucose, for reasons that will be given later. Glycerol may 
enter either the gluconeogenic or the glycolytic pathway at dihydroxyace- 
tone phosphate. 


ADP NADH 
ATP + Ht NADt + H* 
vias \ ass Re ot jfth0H 
HO—-C——-H > HO—-C—-H > O=C 
Glycerol | Glycerol \ 
kinase 2 phosphate CH>,OPO-2- 
CH20H CH20PO3 dehydrogenase : 
Glycerol Glycerol Dihydroxyacetone 
phosphate phosphate 


The major site of gluconeogenesis is the liver, with a small amount 
also taking place in the kidney. Little gluconeogenesis takes place in 
the brain, skeletal muscle, or heart muscle. Rather, gluconeogenesis in 
the liver and kidney helps to maintain the glucose level in the blood so that 
the brain and muscle can extract sufficient glucose from it to meet their meta- 
bolic demands. 


Gluconeogenesis is not a reversal of glycolysis 


In glycolysis, glucose is converted into pyruvate; in gluconeogenesis, pyru- 
vate is converted into glucose. However, gluconeogenesis is not a reversal of 
glycolysis. Several reactions must differ because the equilibrium of glycolysis 
lies far on the side of pyruvate formation. The actual free energy change for 
the formation of pyruvate from glucose is about —90 kJ mol~! (—22 kcal 
mol” ') under typical cellular conditions. Most of the decrease in free energy 
in glycolysis takes place in the three essentially irreversible steps catalyzed 
by hexokinase, phosphofructokinase, and pyruvate kinase. 


Hexokinase 


Glucose + ATP ———> Glucose 6-phosphate + ADP 
AG = —33kJ mol! (—8.0 kcal mol!) 


Phosphofructokinase 


Fructose 6-phosphate + ATP 
fructose 1,6-bisphosphate + ADP 
AG = —22kJ mol! (—5.3 kcal mol!) 


Pyruvate kinase 


Phosphoenolpyruvate + ADP pyruvate + ATP 


AG = —17kJ mol! (—4.0 kcal mol!) 


In gluconeogenesis, these virtually irreversible reactions of glycolysis 
must be bypassed. 


The conversion of pyruvate into phosphoenolpyruvate begins 
with the formation of oxaloacetate 


The first step in gluconeogenesis 1s the carboxylation of pyruvate to form 
oxaloacetate at the expense of a molecule of ATP, a reaction catalyzed by 
pyruvate carboxylase. This reaction occurs in the mitochondria. 


O. _coo- 
oO Sc7 
= i Te | 
wv + Oy. AP = ho a + ADP + P; + 2Ht 
| coo- 


CH; 
Pyruvate Oxaloacetate 


| 
O 
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Carboxybiotin covalently bound to 
€-amino group of a lysine 


Pyruvate carboxylase requires biotin, a covalently attached prosthetic 
group, which serves as the carrier of activated CO,. The carboxylate group 
of biotin is linked to the e-amino group of a specific lysine residue by an 
amide bond (Figure 16.25). Recall that, in aqueous solutions, CO, exists 
primarily as HCO, with the aid of carbonic anhydrase (Section 9.2). 

The carboxylation of pyruvate takes place in three stages: 


HCO; + ATP = = HOCO,-PO;’” + ADP 
Biotin-enzyme + HOCO,-PO;?~ == CO, — biotin-enzyme + P; 
CO, — biotin-enzyme + pyruvate == biotin-enzyme + oxaloacetate 


Pyruvate carboxylase functions as a tetramer composed of four identical 
subunits, and each subunit consists of four domains (Figure 16.26). The 
biotin carboxylase domain (BC) catalyzes the formation of carboxyphos- 
phate and the subsequent attachment of CO, to the second domain, the 
biotin carboxyl carrier protein (BCCP), the site of the covalently attached 
biotin. Once bound to CO, BCCP leaves the biotin carboxylase active site 
and swings almost the entire length of the subunit (~75A) to the active site 
of the pyruvate carboxylase domain (PC), which transfers the CO, to pyru- 
vate to form oxaloacetate. BCCP in one subunit interacts with the active sites 
on an adjacent subunit. The fourth domain (PT) facilitates the formation of 
the tetramer. 
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FIGURE 16.25 Structure of biotin and 
carboxybiotin. 


FIGURE 16.26 A subunit of pyruvate 
carboxylase. Biotin, covalently attached to 
the BCCP, transports CO, from the BC active 
site to the PC active site of an adjacent 
subunit. (BC) biotin carboxylase; (BCCP) 
biotin carboxyl carrier protein; (PC) pyruvate 
carboxylase; (PT) pyruvate carboxylase 
tetramerization domain. [Information from 

G. Lasso, L.P.C. Yu, D. Gil, S. Xiang, L. Tong, 
and M. Valle, Structure 18:1300-1310, 2010.] 
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Cytoplasm 


Pyruvate 
CO, + ATP 
[ 
“> ADP + P; 
¢ 
Oxaloacetate — 
NADH + H* 


~ ~~» NAD* 
WV 
Malate 


Malate 
_. NAD* 
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v 
FIGURE 16.27 Compartmental 
cooperation. Oxaloacetate used in 
the cytoplasm for gluconeogenesis is 
formed in the mitochondrial matrix by the 
carboxylation of pyruvate. Oxaloacetate 
leaves the mitochondrion by a specific 
transport system (not shown) in the form of 
malate, which is reoxidized to oxaloacetate 
in the cytoplasm. 


The first partial reaction of pyruvate carboxylase, the formation of car- 
boxybiotin, depends on the presence of acetyl CoA. Biotin is not carboxyl- 
ated unless acetyl CoA is bound to the enzyme. Acetyl CoA has no effect on 
the second partial reaction. The allosteric activation of pyruvate carboxylase 
by acetyl CoA is an important physiological control mechanism that will be 
discussed in Section 17.4. 


Oxaloacetate is shuttled into the cytoplasm and converted into 
phosphoenolpyruvate 


Oxaloacetate must thus be transported to the cytoplasm to complete the 
synthesis of phosphoenolpyruvate. Oxaloacetate is first reduced to malate 
by malate dehydrogenase. Malate is transported across the mitochondrial 
membrane and reoxidized to oxaloacetate by a cytoplasmic NAD*-linked 
malate dehydrogenase (Figure 16.27). The formation of oxaloacetate from 
malate also provides NADH for use in subsequent steps in gluconeogene- 
sis. Finally, oxaloacetate is simultaneously decarboxylated and phosphory- 
lated by phosphoenolpyruvate carboxykinase (PEPCK) to generate 
phosphoenolpyruvate. The phosphoryl donor is GTP. The CO, that was 
added to pyruvate by pyruvate carboxylase comes off in this step. 
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The sum of the reactions catalyzed by pyruvate carboxylase and phos- 
phoenolpyruvate carboxykinase is 


Pyruvate + ATP + GTP + H,O —> 
phosphoenolpyruvate + ADP + GDP + P. + 2H* 


This pair of reactions bypasses the irreversible reaction catalyzed by pyru- 
vate kinase in glycolysis. 

Why is a carboxylation and a decarboxylation required to form phospho- 
enolpyruvate from pyruvate? Recall that, in glycolysis, the presence of a 
phosphory] group traps the unstable enol isomer of pyruvate as phosphoenol- 
pyruvate (p. 461). However, the addition of a phosphoryl group to pyruvate 
is a highly unfavorable reaction: the AG®’ of the reverse of the glycolytic reac- 
tion catalyzed by pyruvate kinase is +31 kJ mol! (+7.5 kcal mol~!). In glu- 
coneogenesis, the use of the carboxylation and decarboxylation steps results 
in a much more favorable AG°’. The formation of phosphoenolpyruvate from 
pyruvate in the gluconeogenic pathway hasa AG” of +0.8kJ mol! (+0.2 kcal 
mol~!). A molecule of ATP is used to power the addition of a molecule of 
CO, to pyruvate in the carboxylation step. That CO, is then removed to 
power the formation of phosphoenolpyruvate in the decarboxylation step. 
Decarboxylations often drive reactions that are otherwise highly endergonic. 
This metabolic motif is used in the citric acid cycle (Chapter 17), the pentose 
phosphate pathway (Chapter 20), and fatty acid synthesis (Section 22.4). 


The conversion of fructose 1,6-bisphosphate into fructose 
6-phosphate and orthophosphate is an irreversible step 


On formation, phosphoenolpyruvate is metabolized by the enzymes of gly- 
colysis but in the reverse direction. These reactions are near equilibrium 


under intracellular conditions; so, when conditions favor gluconeogenesis, 
the reverse reactions will take place until the next irreversible step is 
reached. This step is the hydrolysis of fructose 1,6-bisphosphate to fructose 
6-phosphate and P.. 


Fructose 1,6-bisphosphatase 


Fructose 1,6-bisphosphate + H,O 
fructose 6-phosphate + P. 


The enzyme responsible for this step is fructose 1,6-bisphosphatase. Like its 
glycolytic counterpart, it is an allosteric enzyme that participates in the 
regulation of gluconeogenesis. We will return to its regulatory properties 
later in the chapter. 


The generation of free glucose is an important control point 


The fructose 6-phosphate generated by fructose 1,6-bisphosphatase is 
readily converted into glucose 6-phosphate. In most tissues, gluconeo- 
genesis ends here. Free glucose is not generated; rather, the glucose 
6-phosphate is processed in some other fashion, notably to form glyco- 
gen. One advantage to ending gluconeogenesis at glucose 6-phosphate is 
that, unlike free glucose, the molecule is not transported out of the cell. 
The enzyme responsible for the conversion of glucose 6-phosphate into 
glucose, glucose 6-phosphatase, is present only in 
tissues whose metabolic duty is to maintain 
blood-glucose homeostasis—tissues that release 
glucose into the blood. These tissues are the liver 
and to a lesser extent the kidney. 

This final step in the generation of glucose does 
not take place in the cytoplasm. Rather, glucose 
6-phosphate is transported into the lumen of the 
endoplasmic reticulum, where it is hydrolyzed to 
glucose by glucose 6-phosphatase, which is bound 


H,0+ — glucose 
6-phosphate 
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Cytoplasmic side 


ER lumen 


to the membrane (Figure 16.28). Glucose and P. FIGURE 16.28 Generation of glucose from glucose 
1 


are then shuttled back to the cytoplasm by a pair of 
transporters. The glucose transporter in the endo- 


6-phosphate. Several endoplasmic reticulum (ER) proteins play a role in the 
generation of glucose from glucose 6-phosphate. T1 transports glucose 
6-phosphate into the lumen of the ER, whereas T2 and T3 transport P, and 


plasmic reticulum membrane is like those found in glucose, respectively, back into the cytoplasm. [Information from A. Buchell 
the plasma membrane. and |. D. Waddel, Biochem. Biophys. Acta 1092:129-137, 1991.] 


Six high-transfer-potential phosphoryl groups are 
spent in synthesizing glucose from pyruvate 


The formation of glucose from pyruvate is energetically unfavorable unless 
it is coupled to reactions that are favorable. Compare the stoichiometry of 
gluconeogenesis with that of the reverse of glycolysis. 

The stoichiometry of gluconeogenesis is 


2 Pyruvate + 4ATP + 2GTP + 2NADH + 6H,0 > 
glucose + 4 ADP + 2GDP + 6P; + 2NAD* + 2H” 
AG?’ = —48kJ mol! (—11 kcal mol!) 


In contrast, the stoichiometry for the reversal of glycolysis is 


2 Pyruvate + 2 ATP + NADH + 2H,0 > 
glucose + 2 ADP + 2P; + 2NAD* + 2H~ 
AG?’ = +90 kJ mol”! (+22 kcal mol!) 


Note that six nucleoside triphosphate molecules are hydrolyzed to synthesize 
glucose from pyruvate in gluconeogenesis, whereas only two molecules of 
ATP are generated in glycolysis in the conversion of glucose into pyruvate. 


TABLE 16.6 Reactions of gluconeogenesis 


Reaction 
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Pyruvate + CO, + ATP + H,O ——> oxaloacetate + ADP + P, + 2H™ 


Oxaloacetate + GTP ———= 


phosphoenolpyruvate + GDP + CO, 


Phosphoenolpyruvate + H,O == 2-phosphoglycerate 


2-Phosphoglycerate 


3-phosphoglycerate 


3-Phosphoglycerate + ATP == 1,3-bisphosphoglycerate + ADP 


1,3-Bisphosphoglycerate + NADH + H* 


Glyceraldehyde 3-phosphate 


Glyceraldehyde 3-phosphate + dihydroxyacetone phosphate 


glyceraldehyde 3-phosphate + NAD* + P. 


dihydroxyacetone phosphate 


fructose 1,6-bisphosphate 


Fructose 1,6-bisphosphate + H,0 ——-> fructose 6-phosphate + P; 


Fructose 6-phosphate 


glucose 6-phosphate 


Glucose 6-phosphate + H,O ——> + glucose + P; 


Thus, the extra cost of gluconeogenesis is four high-phosphory]l-transfer- 
potential molecules for each molecule of glucose synthesized from pyruvate. 
The four additional molecules having high phosphoryl-transfer potential 
are needed to turn an energetically unfavorable process (the reversal of gly- 
colysis) into a favorable one (gluconeogenesis). Here we have a clear exam- 
ple of the coupling of reactions: NTP hydrolysis is used to power an 
energetically unfavorable reaction. The reactions of gluconeogenesis are 
summarized in Table 16.6. 


16.4 Gluconeogenesis and Glycolysis Are 
Reciprocally Regulated 


Gluconeogenesis and glycolysis are coordinated so that, within a cell, one 
pathway is relatively inactive while the other is highly active. If both sets of 
reactions were highly active at the same time, the net result would be the 
hydrolysis of four nucleoside triphosphates (two ATP molecules plus two 
GTP molecules) per reaction cycle. Both glycolysis and gluconeogenesis are 
highly exergonic under cellular conditions, and so there is no thermody- 
namic barrier to such simultaneous activity. However, the amounts and 
activities of the distinctive enzymes of each pathway are controlled so that 
both pathways are not highly active at the same time. The rate of glycolysis 
is also determined by the concentration of glucose, and the rate of gluconeo- 
genesis by the concentrations of lactate and other precursors of glucose. The 
basic premise of the reciprocal regulation is that, when energy is needed or 
glycolytic intermediates are needed for biosynthesis, glycolysis will pre- 
dominate. When there is a surplus of energy and glucose precursors, gluco- 
neogenesis will take over. 


Energy charge determines whether glycolysis or gluconeogenesis 
will be most active 


The first important regulation site is the interconversion of fructose 
6-phosphate and fructose 1,6-bisphosphate (Figure 16.29). Consider first a 
situation in which energy is needed. In this case, the concentration of AMP 
is high. Under this condition, AMP stimulates phosphofructokinase but 
inhibits fructose 1,6-bisphosphatase. Thus, glycolysis is turned on and glu- 
coneogenesis is inhibited. Conversely, high levels of ATP and citrate indi- 
cate that the energy charge is high and that biosynthetic intermediates are 
abundant. ATP and citrate inhibit phosphofructokinase, whereas citrate 
activates fructose 1,6-bisphosphatase. Under these conditions, glycolysis is 
nearly switched off and gluconeogenesis is promoted. Why does citrate take 
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part in this regulatory scheme? As we will see in Chapter 17, citrate reports 
on the status of the citric acid cycle, the primary pathway for oxidizing fuels 
in the presence of oxygen. High levels of citrate indicate an energy-rich situ- 
ation and the presence of precursors for biosynthesis. 

Glycolysis and gluconeogenesis are also reciprocally regulated at the 
interconversion of phosphoenolpyruvate and pyruvate in the liver. The 
glycolytic enzyme pyruvate kinase is inhibited by allosteric effectors ATP 
and alanine, which signal that the energy charge is high and that building 
blocks are abundant. Conversely, pyruvate carboxylase, which catalyzes the 
first step in gluconeogenesis from pyruvate, is inhibited by ADP. Likewise, 
ADP inhibits phosphoenolpyruvate carboxykinase. Pyruvate carboxylase is 
activated by acetyl CoA, which, like citrate, indicates that the citric acid 
cycle is producing energy and biosynthetic intermediates (Chapter 17). 
Hence, gluconeogenesis is favored when the cell is rich in biosynthetic pre- 
cursors and ATP. 


The balance between glycolysis and gluconeogenesis in the 
liver is sensitive to blood-glucose concentration 


In the liver, rates of glycolysis and gluconeogenesis are adjusted to maintain 
blood-glucose levels. The signal molecule fructose 2,6-bisphosphate strongly 
stimulates phosphofructokinase (PFK) and inhibits fructose 1, 6-bisphosphatase. 
When blood glucose is low, fructose 2,6-bisphosphate loses a phosphoryl 
group to form fructose 6-phosphate, which is not an allosteric effector of 
PFK. How is the concentration of fructose 2,6-bisphosphate controlled to 
rise and fall with blood-glucose levels? Two enzymes regulate the concentra- 
tion of this molecule: one phosphorylates fructose 6-phosphate and the other 
dephosphorylates fructose 2,6-bisphosphate. Fructose 2,6-bisphosphate is 
formed in a reaction catalyzed by phosphofructokinase 2 (PFK2), a different 
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FIGURE 16.29 Reciprocal regulation of 
gluconeogenesis and glycolysis in the 
liver. The level of fructose 
2,6-bisphosphate is high in the fed state 
and low in starvation. Another important 
control is the inhibition of pyruvate kinase 
by phosphorylation during starvation. 


Kinase domain Phosphatase domain 


enzyme from phosphofructokinase. Fructose 6-phosphate 
is formed through the hydrolysis of fructose 2,6- 
bisphosphate by a specific phosphatase, fructose bisphos- 
phatase 2 (FBPase2). The striking finding is that both 
PFK2 and FBPase2 are present in a single 55-kDa polypep- 
tide chain (Figure 16.30). This bifunctional enzyme contains 
an N-terminal regulatory domain, followed by a kinase 
domain and a phosphatase domain. PFK2 resembles ade- 
nylate kinase in having a P-loop NTPase domain (Section 
9.4), whereas FBPase2 resembles phosphoglycerate mutase 


a (p. 460). Recall that the mutase is essentially a phospha- 

1 32 250 470 tase. In the bifunctional enzyme, the phosphatase activity 

Regulatory evolved to become specific for F-2,6-BP. The bifunctional 

eomala enzyme itself probably arose by the fusion of genes encod- 
FIGURE 16.30 Domain structure of the bifunctional enzyme ing the kinase and phosphatase domains. 

phosphofructokinase 2. The kinase domain (purple) is fused to What controls whether PFK2 or FBPase2 dominates 


the phosphatase domain (red). The kinase domain is a P-loop NTP 
hydrolase domain, as indicated by the purple shading (Section 


the bifunctional enzyme’s activities in the liver? The activ- 


9.4). The bar represents the amino acid sequence of the ities of PFK2 and FBPase2 are reciprocally controlled by 


enzyme. [Drawn from 1BIF.pdb.] 


FIGURE 16.31 Control of the synthesis 
and degradation of fructose 
2,6-bisphosphate. A low blood-glucose 
level as signaled by glucagon leads to 
the phosphorylation of the bifunctional 
enzyme and hence to a lower level of 
fructose 2,6-bisphosphate, slowing 
glycolysis. Insulin accelerates the 
formation of fructose 2,6-bisphosphate 
by facilitating the dephosphorylation of 
the bifunctional enzyme. 
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phosphorylation of a single serine residue. When glucose is 

scarce, such as during a night’s fast, a rise in the blood level 
of the hormone glucagon triggers acyclic AMP signal cascade (Section 14.1), 
leading to the phosphorylation of this bifunctional enzyme by protein 
kinase A (Figure 16.31). This covalent modification activates FBPase2 and 
inhibits PFK2, lowering the level of F-2,6-BP. Gluconeogenesis predomi- 
nates. Glucose formed by the liver under these conditions is essential for the 
viability of the brain. Glucagon stimulation of protein kinase A also inacti- 
vates pyruvate kinase in the liver (p. 473). 

Conversely, when blood-glucose levels are high, such as after a meal, glu- 
coneogenesis is not needed. Insulin is secreted and initiates a signal pathway 
that activates a protein phosphatase, which removes the phosphoryl group 
from the bifunctional enzyme. This covalent modification activates PFK2 
and inhibits FBPase2. The resulting rise in the level of F-2,6-BP accelerates 
glycolysis. In liver, a key role of glycolysis is to generate metabolites for 
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biosynthesis. The coordinated control of glycolysis and gluconeogenesis is 
facilitated by the location of the kinase and phosphatase domains on the same 
polypeptide chain as the regulatory domain. 

The hormones insulin and glucagon also regulate the amounts of essen- 
tial enzymes. These hormones alter gene expression primarily by changing 
the rate of transcription. Insulin levels rise subsequent to eating, when 
there is plenty of glucose for glycolysis. To encourage glycolysis, insulin 
stimulates the expression of phosphofructokinase, pyruvate kinase, and the 
bifunctional enzyme that makes and degrades F-2,6-BP. Glucagon rises 
during fasting, when gluconeogenesis is needed to replace scarce glucose. 
To encourage gluconeogenesis, glucagon inhibits the expression of the 
three regulated glycolytic enzymes and stimulates instead the production 
of two key gluconeogenic enzymes, phosphoenolpyruvate carboxykinase 
and fructose 1,6-bisphosphatase. Transcriptional control in eukaryotes is 
much slower than allosteric control, taking hours or days instead of seconds 
to minutes. 


Substrate cycles amplify metabolic signals and produce heat 


A pair of reactions such as the phosphorylation of fructose 6-phosphate to 
fructose 1,6-bisphosphate and its hydrolysis back to fructose 6-phosphate is 
called a substrate cycle. As already mentioned, both reactions are not fully 
active at the same time because of reciprocal allosteric controls. However, 
isotope-labeling studies have shown that some fructose 6-phosphate is 
phosphorylated to fructose 1,6-bisphosphate even during gluconeogenesis. 
There also is a limited degree of cycling in other pairs of opposed irrevers- 
ible reactions. This cycling was regarded as an imperfection in metabolic 
control, and so substrate cycles have sometimes been called futile cycles. 
In malignant hyperthermia, there is rapid uncontrolled hydrolysis of ATP, 
which generates heat and can raise the body temperature to 44 °C (111°F). 
Muscles may become rigid and suffer substantial tissue damage. 

Despite such extraordinary circumstances, substrate cycles now seem 
likely to be biologically important. One possibility is that substrate cycles 
amplify metabolic signals. Suppose that the rate of conversion of A into B is 
100 and of B into A is 90, giving an initial net flux of 10. Assume that an 
allosteric effector increases the A > B rate by 20% to 120 and reciprocally 
decreases the B > A rate by 20% to 72. The new net flux is 48, and so a 20% 
change in the rates of the opposing reactions has led to a 380% increase in 
the net flux. In the example shown in Figure 16.32, this amplification is 
made possible by the rapid hydrolysis of ATP. The flux down the glycolytic 
pathway has been suggested to increase as much as 1000-fold at the initia- 
tion of intense exercise. Because the allosteric activation of enzymes alone 
seems unlikely to explain this increased flux, the existence of substrate 
cycles may partly account for the rapid rise in the rate of glycolysis. 


Lactate and alanine formed by contracting muscle are 
used by other organs 


Lactate produced by active skeletal muscle and erythrocytes is a source of 
energy for other organs. Erythrocytes lack mitochondria and can never 
oxidize glucose completely. In contracting fast-twitch skeletal muscle 
fibers during vigorous exercise, the rate at which glycolysis produces 
pyruvate exceeds the rate at which the citric acid cycle oxidizes it. In these 
cells, lactate dehydrogenase reduces excess pyruvate to lactate to restore 
redox balance (p. 464). However, lactate is a dead end in metabolism. It 
must be converted back into pyruvate before it can be metabolized. Both 
pyruvate and lactate diffuse out of these cells through carriers into the 
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FIGURE 16.32 Substrate cycle. This ATP- 
driven cycle operates at two different rates. 
A small change in the rates of the two 
opposing reactions results in a large 
change in the net flux of product B. 
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FIGURE 16.33 The Cori cycle. Lactate formed by active 
muscle is converted into glucose by the liver. This cycle shifts 
part of the metabolic burden of active muscle to the liver. 
The symbol ~P represents nucleoside triphosphates. 


FIGURE 16.34 PATHWAY INTEGRATION: 


Cooperation between glycolysis and 
gluconeogenesis during a sprint. 
Glycolysis and gluconeogenesis are 
coordinated, in a tissue-specific fashion, 
to ensure that the energy needs of all cells 
are met. Consider a sprinter. In skeletal 

leg muscle, glucose will be metabolized 
aerobically to CO, and H,O or, more likely 
(thick arrows) during a sprint, anaerobically 
to lactate. In cardiac muscle, the lactate 
can be converted into pyruvate and used 
as a fuel, along with glucose, to power the 
heartbeats to keep the sprinter’s blood 
flowing. Gluconeogenesis, a primary 
function of the liver, will be taking place 
rapidly (thick arrows) to ensure that enough 
glucose is present in the blood for skeletal 
and cardiac muscle, as well as for other 
tissues. Glycogen, glycerol, and amino 
acids are other sources of energy that we 
will learn about in later chapters. 
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gluconeogenic 


blood. In contracting skeletal muscle, the formation and 
release of lactate lets the muscle generate ATP in the absence 
of oxygen and shifts the burden of metabolizing lactate from 
muscle to other organs. The pyruvate and lactate in the 
bloodstream have two fates. In one fate, the plasma mem- 
branes of some cells, particularly cells in cardiac muscle and 
slow-twitch (type 1) skeletal muscle, contain carriers that 
make the cells highly permeable to lactate and pyruvate. 
These molecules diffuse from the blood into such perme- 
able cells. Once inside these well-oxygenated cells, lactate 
can be reverted back to pyruvate and metabolized through 
the citric acid cycle and oxidative phosphorylation to gener- 
ate ATP. The use of lactate in place of glucose by these cells 
makes more circulating glucose available to the active mus- 
cle cells. In the other fate, excess lactate enters the liver and 
is converted first into pyruvate and then into glucose by the 
pathway. Contracting skeletal muscle supplies lactate to the 


liver, which uses it to synthesize and release glucose. Thus, the liver restores 
the level of glucose necessary for active muscle cells, which derive ATP from 
the glycolytic conversion of glucose into lactate. These reactions constitute the 
Cori cycle (Figure 16.33). 

Studies have shown that alanine, like lactate, is a major precursor of 


glucose in the 
skeletons of so 


liver. The alanine is generated in muscle when the carbon 
me amino acids are used as fuels. The nitrogens from these 


amino acids are transferred to pyruvate to form alanine; the reverse reaction 
takes place in the liver. This process also helps maintain nitrogen balance. 


The interplay 


between glycolysis and gluconeogenesis is summarized in 


Figure 16.34, which shows how these pathways help meet the energy needs 


of different cell types. 
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Bloodstream 


MA Isozymic forms of lactate dehydrogenase in different tissues catalyze the 

interconversions of pyruvate and lactate (Section 10.2). Lactate dehy- 
drogenase is a tetramer of two kinds of 35-kDa subunits encoded by similar 
genes: the H type predominates in the heart, and the homologous M type in 
skeletal muscle and the liver. These subunits associate to form five types of 
tetramers: H,, H,M,, H,M,, H,M,, and M,. The H, isozyme (type 1) has 
higher affinity for substrates than that of the M, isozyme (type 5) and, unlike 
M,, is allosterically inhibited by high levels of pyruvate. The other isozymes 
have intermediate properties, depending on the ratio of the two kinds of 
chains. The H, isozyme oxidizes lactate to pyruvate, which is then used as 
a fuel by the heart through aerobic metabolism. Indeed, heart muscle never 
functions anaerobically. In contrast, M, is optimized to operate in the reverse 
direction, to convert pyruvate into lactate to allow glycolysis to proceed under 
anaerobic conditions. Interestingly, endurance training increases the H, 
isozyme in slow-twitch muscle fibers, enabling these fibers to use the lactate 
generated by other fiber types. 


Glycolysis and gluconeogenesis are evolutionarily intertwined 


Ma The metabolism of glucose has ancient origins. Organisms living in 
the early biosphere depended on the anaerobic generation of energy 
until significant amounts of oxygen began to accumulate 2 billion years ago. 
Glycolytic enzymes were most likely derived independently rather than by 
gene duplication, because glycolytic enzymes with similar properties do not 
have similar amino acid sequences. Although there are four kinases and two 
isomerases in the pathway, sequence and structural comparisons suggest that 
these sets of enzymes are not related to one another by divergent evolution. 
The common dinucleotide-binding domain found in the dehydrogenases 
(Figure 16.12) and the a barrels are the only major recurring elements. 

We can speculate on the relationship between glycolysis and gluconeo- 
genesis if we think of glycolysis as consisting of two segments: the metabo- 
lism of hexoses (the upper segment) and the metabolism of trioses (the 
lower segment). The enzymes of the upper segment are different in some 
species and are missing entirely in some archaea, whereas enzymes of the 
lower segment are quite conserved. In fact, four enzymes of the lower seg- 
ment are present in all species. This lower part of the pathway is common to 
glycolysis and gluconeogenesis. This common part of the two pathways may 
be the oldest part, constituting the core to which the other steps were added. 
The upper part would have varied according to the sugars that were avail- 
able to evolving organisms in particular niches. Interestingly, this core part 
of carbohydrate metabolism can generate triose precursors for ribose sugars, 
a component of RNA and a critical requirement for the RNA world. Thus, 
we are left with the unanswered question, Was the original core pathway 
used for energy conversion or biosynthesis? 


SUMMARY 


16.1 Glycolysis Is an Energy-Conversion Pathway in Many Organisms 
Glycolysis is the set of reactions that converts glucose into pyruvate. 
The 10 reactions of glycolysis take place in the cytoplasm. In the first 
stage, glucose is converted into fructose 1,6-bisphosphate by a phos- 
phorylation, an isomerization, and a second phosphorylation reaction. 
Fructose 1,6-bisphosphate is then cleaved by aldolase into dihydroxy- 
acetone phosphate and glyceraldehyde 3-phosphate, which are readily 
interconvertible. Two molecules of ATP are consumed per molecule 
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of glucose in these reactions. In the second stage, ATP is generated. 
Glyceraldehyde 3-phosphate is oxidized and phosphorylated to form 
1,3-bisphosphoglycerate, an acyl phosphate with a high phosphoryl- 
transfer potential. This molecule transfers a phosphoryl group to ADP 
to form ATP and 3-phosphoglycerate. A phosphoryl shift and a dehy- 
dration form phosphoenolpyruvate, a second intermediate with a high 
phosphoryl-transfer potential. Another molecule of ATP is generated 
as phosphoenolpyruvate is converted into pyruvate. There is a net gain 
of two molecules of ATP in the formation of two molecules of pyru- 
vate from one molecule of glucose. 

The electron acceptor in the oxidation of glyceraldehyde 3-phosphate 
is NAD*, which must be regenerated for glycolysis to continue. In aero- 
bic organisms, the NADH formed in glycolysis transfers its electrons to 
O, through the electron-transport chain, which thereby regenerates 
NAD*. Under anaerobic conditions and in some microorganisms, 
NAD* is regenerated by the reduction of pyruvate to lactate. In other 
microorganisms, NAD* is regenerated by the reduction of pyruvate to 
ethanol. These two processes are examples of fermentations. 


The Glycolytic Pathway Is Tightly Controlled 

The glycolytic pathway has a dual role: it degrades glucose to generate 
ATP, and it provides building blocks for the synthesis of cellular com- 
ponents. The rate of conversion of glucose into pyruvate is regulated to 
meet these two major cellular needs. Under physiological conditions, 
the reactions of glycolysis are readily reversible except for those cata- 
lyzed by hexokinase, phosphofructokinase, and pyruvate kinase. 
Phosphofructokinase, the most important control element in glycoly- 
sis, is inhibited by high levels of ATP and citrate, and it is activated by 
AMP and fructose 2,6-bisphosphate. In the liver, this bisphosphate 
signals that glucose is abundant. Hence, phosphofructokinase is active 
when either energy or building blocks are needed. Hexokinase 1s inhib- 
ited by glucose 6-phosphate, which accumulates when phosphofructo- 
kinase is inactive. ATP and alanine allosterically inhibit pyruvate 
kinase, the other control site, and fructose 1,6-bisphosphate activates 
the enzyme. Consequently, pyruvate kinase is maximally active when 
the energy charge is low and glycolytic intermediates accumulate. 


Glucose Can Be Synthesized from Noncarbohydrate Precursors 
Gluconeogenesis, which occurs primarily in the liver, is the synthesis 
of glucose from noncarbohydrate sources, such as lactate, amino acids, 
glycerol, and alanine produced from pyruvate by active skeletal mus- 
cle. Several of the reactions that convert pyruvate into glucose are 
common to glycolysis. Gluconeogenesis, however, requires four new 
reactions to bypass the essential irreversibility of three reactions in 
glycolysis. In two of the new reactions, pyruvate is carboxylated in 
mitochondria to oxaloacetate, which in turn is decarboxylated and 
phosphorylated in the cytoplasm to phosphoenolpyruvate. ATP and 
GTP are consumed in these reactions, which are catalyzed by pyru- 
vate carboxylase and phosphoenolpyruvate carboxykinase, respec- 
tively. The other distinctive reactions of gluconeogenesis are the 
hydrolyses of fructose 1,6-bisphosphate and glucose 6-phosphate, 
which are catalyzed by specific phosphatases. 


Gluconeogenesis and Glycolysis Are Reciprocally Regulated 

Gluconeogenesis and glycolysis are reciprocally regulated so that 
one pathway is relatively inactive while the other is highly active. 
Phosphofructokinase and fructose 1,6-bisphosphatase are key 


control points. Fructose 2,6-bisphosphate, an intracellular signal mole- 489 


cule present at higher levels when glucose is abundant, activates 


Problems 


glycolysis and inhibits gluconeogenesis by regulating these enzymes. 
Pyruvate kinase and pyruvate carboxylase are regulated by other effec- 
tors so that both are not maximally active at the same time. Allosteric 
regulation and reversible phosphorylation, which are rapid, are comple- 
mented by transcriptional control, which takes place in hours or days. 


KEY TERMS 


glycolysis (p. 449) 

lactic acid fermentation (p. 449) 
alcoholic fermentation (p. 449) 
gluconeogenesis (p. 449) 

hexokinase (p. 451) 

kinase (p. 453) 

phosphofructokinase (PFK) (p. 454) 
thioester intermediate (p. 457) 
substrate-level phosphorylation (p. 460) 


PROBLEMS 


pyruvate kinase (p. 461) 
fermentation (p. 462) 

obligate anaerobe (p. 464) 
facultative anaerobe (p. 464) 
Rossmann fold (p. 465) 
committed step (p. 471) 
feedforward stimulation (p.472) 
glucokinase (p. 473) 

aerobic glycolysis (p. 474) 


pyruvate carboxylase (p. 478) 

biotin (p. 479) 

phosphoenolpyruvate carboxykinase 
(p. 480) 

fructose 1,6-bisphosphatase (p. 481) 

glucose 6-phosphatase (p. 481) 

bifunctional enzyme (p. 484) 

substrate cycle (p. 485) 

Cori cycle (p. 486) 


1. Gross versus net. The gross yield of ATP from the metab- 
olism of glucose to two molecules of pyruvate is four mole- 
cules of ATP. However, the net yield is only two molecules 
of ATP. Why are the gross and net values different? 


2. Go together like the owl and the pussycat. Match each term 
with its description. 


(a) Hexokinase 1. Forms fructose 1,6- 
bisphosphate 
(b) Phosphoglucose 2. Generates the first 
isomerase high-phosphoryl-transfer- 


potential compound that 
isnot ATP 

(c) Phosphofructokinase 3. Converts glucose 
6-phosphate into fructose 
6-phosphate 


(d) Aldolase 4, Phosphorylates glucose 

(e) Triose phosphate 5. Generates the second 
isomerase molecule of ATP 

(f) Glyceraldehyde 6. Cleaves fructose 1, 6- 
3-phosphate dehy- bisphosphate 
drogenase 


(g) Phosphoglycerate 7. Generates the second 
kinase high-phosphoryl-transfer- 
potential compound that 
isnot ATP 


(h) Phosphoglycerate 8. Catalyzes the interconver- 
sion of three-carbon isomers 


mutase 
(i) Enolase 9. Converts 3-phosphoglycerate 
into 2-phosphoglycerate 
(j) Pyruvate kinase 10. Generates the first molecule 
of ATP 


3. Who takes? Who gives? Lactic acid fermentation and 
alcoholic fermentation are oxidation—reduction reactions. 
Identify the ultimate electron donor and electron acceptor. 


4, ATP yield. Each of the following molecules is processed 
by glycolysis to lactate. How much ATP is generated from 
each molecule? 


(a) Glucose 6-phosphate 

(b) Dihydroxyacetone phosphate 
(c) Glyceraldehyde 3-phosphate 
(d) Fructose 

(e) Sucrose 


5. Enzyme redundancy? Why is it advantageous for the liver 
to have both hexokinase and glucokinase to phosphorylate 
glucose? 


6. Magic? The interconverison of DHAP and GAP greatly 
favors the formation of DHAP at equilibirum. Yet the con- 
version of DHAP by triose phosphate isomerase proceeds 


readily. Why? 


7. Between two extremes. What is the role of a thioester in the 
formation of ATP in glycolysis? 


8. Corporate sponsors. Some of the early research on gly- 
colysis was supported by the brewing industry. Why would 
the brewing industry be interested in glycolysis? 


9. Recommended daily allowance. The recommended daily 
allowance for the vitamin niacin is 15 mg per day. How 
would glycolysis be affected by niacin deficiency? 


10. Who’s on first? Although both hexokinase and phos- 
phofructokinase catalyze irreversible steps in glycolysis and 
the hexokinase-catalyzed step is first, phosphofructokinase 


490 
CHAPTER 16 Glycolysis and Gluconeogenesis 


is nonetheless the pacemaker of glycolysis. What does this 
information tell youabout the fate of the glucose 6- phosphate 
formed by hexokinase? 


11. The tortoise and the hare. Why is the regulation of phos- 
phofructokinase by energy charge not as important in the 
liver as it is in muscle? 


12. Like Minneapolis and St. Paul. Match each term with its 
description. 


(a) Lactate 1. Generates oxaloacetate 

(b) Pyruvate carboxylase 2. Readily converted into 
DHAP 

(c) Acetyl CoA 3. Generates a high- 


phosphoryl-transfer- 
potential compound 
(d) Phosphoenolpyruvate 4. Found predominantly 


carboxykinase in liver 
(e) Glycerol 5. Gluconeogenic counter- 
part of PFK 
(f) Fructose 1,6- 6. Readily converted into 
bisphosphatase pyruvate 


(g) Glucose 6-phosphatase 7: Required for pyruvate 
carboxylase activity 


13. Running in reverse. Why can’t the reactions of the 
glycolytic pathway simply be run in reverse to synthesize 
glucose? 


14. Road blocks. What reactions of glycolysis are not read- 
ily reversible under intracellular conditions? 


15. No pickling. Why is it in the muscle’s best interest to 
export lactic acid into the blood during intense exercise? 


16. Aprés vous. Why is it physiologically advantageous for 
the pancreas to use GLUT2, with a high K,,, as the trans- 
porter that allows glucose entry into 8 cells? 


17. Bypass. In the liver, fructose can be converted into glyc- 
eraldehyde 3-phosphate and dihydroxyacetone phosphate 
without passing through the phosphofructokinase- 
regulated reaction. Show the reactions that make this 
conversion possible. Why might ingesting high levels of 
fructose have deleterious physiological effects? 


18. Match ’em 1. The following sequence is a part of the 
sequence of reactions in gluconeogenesis. 


Pyruvate ——> Oxaloacetate —> Malate —> 
A B Cc 
Oxaloacetate —— Phosphoenolpyruvate 
D 


Match the capital letters representing the reaction in the 
gluconeogenic pathway with parts a, b, c, etc. 


(a) takes place in mitochondria 
(b) takes place in the cytoplasm 
(c) produces CO, 


d) consumes CO, 

e) requires NADH 

f) produces NADH 

g) requires ATP 

h) requires GTP 

i) requires thiamine 

(j) requires biotin 

(k) is regulated by acetyl CoA. 


ps ee, es 


19. Like Batman and Robin. Match parts a through j with 
parts 1 through 10. 


(a) Glucose 6-phosphate 1. Inhibits phosphofructo- 
kinase in the liver 


(b) [ATP] < [AMP] 2. Glucokinase 
(c) Citrate 3. GLUT2 
(d) Low pH 4. Inhibits hexokinase 
(e) Fructose 1, 6- 5. Inhibits phosphofructo- 
bisphosphate kinase in muscle 
(f) Fructose 2, 6 bisphos- 6. Inhibits phosphofructo- 
phate kinase 
(g) Insulin 7. Stimulates pyruvate 
kinase 
(h) Hasahigh Kj, for glu- 8. Stimulates phosphofruc- 
cose tokinase in the liver 
(i) Transporter specificto 9. Causes the insertion of 
liver and pancreas GLUT4 into cell 
membranes 
Gj) [ATP] > [AMP] 10. Stimulates phosphofruc- 
tokinase 


20. Trouble ahead. Suppose that a microorganism that was 
an obligate anaerobe suffered a mutation that resulted in the 
loss of triose phosphate isomerase activity. How would this 
loss affect the ATP yield of fermentation? Could such an 
organism survive? 


21. Kitchen chemistry. Sucrose is commonly used to preserve 
fruits. Why is glucose not suitable for preserving foods? 


22. Tracing carbon atoms 1. Glucose labeled with #C at C-1 
is incubated with the glycolytic enzymes and necessary 
cofactors. 


(a) What is the distribution of '*C in the pyruvate that is 
formed? (Assume that the interconversion of glyceralde- 
hyde 3-phosphate and dihydroxyacetone phosphate is very 
rapid compared with the subsequent step.) 


(b) Ifthe specific activity of the glucose substrate is 10 mCi 
mmol! (millicuries per mole, a measure of radioactivity 
per mole), what is the specific activity of the pyruvate that 
is formed? 


23. Lactic acid fermentation. 


(a) Write a balanced equation for the conversion of glucose 
into lactate. 


(b) Calculate the standard free-energy change of this reaction 
by using the data given in Table 16.1 and the fact that AG®” is 
—25 kJ mol~! (—6 kcal mol!) for the following reaction: 


Pyruvate + NADH + H* == lactate + NAD* 


What is the free-energy change (AG, not AG®’) of this reac- 
tion when the concentrations of reactants are: glucose, 5 mM; 
lactate, 0.05 mM; ATP, 2mM; ADP, 0.2 mM; and P., 1 mM? 


24. High potential. What is the equilibrium ratio of phos- 
phoenolpyruvate to pyruvate under standard conditions 
when [ATP]/[ADP] = 10? 


25. Hexose-triose equilibrium. What are the equilibrium 
concentrations of fructose 1,6-bisphosphate, dihydroxyac- 
etone phosphate, and glyceraldehyde 3-phosphate when 1 
mM fructose 1,6-bisphosphate is incubated with aldolase 
under standard conditions? 


26. Double labeling. 3-Phosphoglycerate labeled uniformly 
with '4C is incubated with 1,3-BPG labeled with 32P at 
C-1. What is the radioisotope distribution of the 2,3-BPG 
that is formed on addition of BPG mutase? 


27. An informative analog. Xylose has the same structure as 
that of glucose except that it has a hydrogen atom at G-5 in 
place of a hydroxymethyl group. The rate of ATP hydroly- 
sis by hexokinase is markedly enhanced by the addition of 
xylose. Why? 


28. Distinctive sugars. The intravenous infusion of fructose 
into healthy volunteers leads to a two- to fivefold increase in 
the level of lactate in the blood, a far greater increase than that 
observed after the infusion of the same amount of glucose. 


(a) Why is glycolysis more rapid after the infusion of 
fructose? 


(b) Fructose has been used in place of glucose for intrave- 
nous feeding. Why is this use of fructose unwise? 


29. It is not hard to meet expenses. They are everywhere. 
What energetic barrier prevents glycolysis from simply 
running in reverse to synthesis glucose? What is the ener- 
getic cost to overcome this barrier? 


30. Match ’em 2. Indicate which of the conditions listed in 
the right-hand column increase the activity of the glycolytic 
and gluconeogenic pathways. 


(a) glycolysis_ 1. increase in ATP 

(b) gluconeogenesis . increase in AMP 
increase in fructose 
2,6-bisphosphate 
increase in citrate 
increase in acetyl CoA 
increase in insulin 
increase in glucagon 
fasting 

fed 
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31. Waste not, want not. Why is the conversion of lactic 
acid from the blood into glucose in the liver in an organism’s 
best interest? 


32. Road blocks bypassed. How are the irreversible reactions 
of glycolysis bypassed in gluconeogenesis? 


33. Pointlessness averted. What are the regulatory means 
that prevent high levels of activity in glycolysis and gluco- 
neogenesis simultaneously? 


34. Different needs. Liver is primarily a gluconeogenic tis- 
sue, whereas muscle is primarily glycolytic. Why does this 
division of labor make good physiological sense? 


35. Metabolic mutants. What would be the effect on an 
organism’s ability to use glucose as an energy source if a 
mutation inactivated glucose 6-phosphatase in the liver? 


36. Never let me go. Why does the lack of glucose 
6-phosphatase activity in the brain and muscle make good 
physiological sense? 


37. Counting high-energy compounds 1. How many NTP 
molecules are required for the synthesis of one molecule of 
glucose from two molecules of pyruvate? How many 
NADH molecules? 


38. Counting high-energy compounds 2. How many NTP 
molecules are required to synthesize glucose from each of 
the following compounds? 


(a) Glucose 6-phosphate 

(b) Fructose 1,6-bisphosphate 

(c) Two molecules of oxaloacetate 

(d) Two molecules of dihydroxyacetone phosphate 

39. Lending a hand. How might enzymes that remove 


amino groups from alanine and aspartate contribute to glu- 
coneogenesis? 


40. More metabolic mutants. Predict the effect of each of the 
following mutations on the pace of glycolysis in liver cells: 


(a) Loss of the allosteric site for ATP in phosphofructokinase 
(b) Loss of the binding site for citrate in phosphofructokinase 
(c) Loss of the phosphatase domain of the bifunctional 
enzyme that controls the level of fructose 2,6-bisphosphate 


(d) Loss of the binding site for fructose 1,6-bisphosphate 
in pyruvate kinase 


41. Yet another metabolic mutant. What are the likely con- 
sequences of a genetic disorder rendering fructose 
1,6-bisphosphatase in the liver less sensitive to regulation 
by fructose 2,6-bisphosphate? 


42. Biotin snatcher. Avidin, a 70-kDa protein in egg white, 
has very high affinity for biotin. In fact, it is a highly 
specific inhibitor of biotin enzymes. Which of the following 
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conversions would be blocked by the addition of avidin to a 
cell homogenate? 


(a) Glucose > pyruvate 
(b) Pyruvate — glucose 

(c) Oxaloacetate > glucose 

(d) Malate > oxaloacetate 

(e) Pyruvate > oxaloacetate 

(f) Glyceraldehyde 3-phosphate — fructose 1,6-bisphosphate 


43. Tracing carbon atoms 2. If cells synthesizing glucose 
from lactate are exposed to CO, labeled with '*C, what will 
be the distribution of label in the newly synthesized glucose? 


44, Arsenate poisoning. Arsenate.(AsO,3~) closely resem- 
bles P. in structure and reactivity. In the reaction catalyzed 
by glyceraldehyde 3-phosphate dehydrogenase, arsenate 
can replace phosphate in attacking the energy-rich thioester 
intermediate. The product of this reaction, 1-arseno- 
3-phosphoglycerate, is unstable. It and other acyl arsenates 
are rapidly and spontaneously hydrolyzed. What is the 
effect of arsenate on energy generation in a cell? 


45. Reduce, reuse, recycle. In the conversion of glucose into 
two molecules of lactate, the NADH generated earlier in the 
pathway is oxidized to NADt. Why is it not to the cell’s 
advantage to simply make more NAD* so that the regen- 
eration would not be necessary? After all, the cell would 
save much energy because it would no longer need to syn- 
thesize lactic acid dehydrogenase. 


46. Adenylate kinase again. Adenylate kinase, an enzyme 
considered in great detail in Chapter 9, is responsible for 
interconverting the adenylate nucleotide pool: 


ADP + ADP == ATP + AMP 


The equilibrium constant for this reaction is close to 1, 
inasmuch as the number of phosphoanhydride bonds is the 
same on each side of the equation. Using the equation for 
the equilibrium constant for this reaction, show why 
changes in [AMP] are a more effective indicator of the 
adenylate pool than [ATP]. 


47. Working at cross-purposes? Gluconeogenesis takes place 
during intense exercise, which seems counterintuitive. Why 
would an organism synthesize glucose and at the same time 
use glucose to generate energy? 


48. Powering pathways. Compare the stoichiometries of 
glycolysis and gluconeogenesis. Recall that the input of one 
ATP equivalent changes the equilibrium constant of a reac- 
tion by a factor of about 10° (Section 15.2). By what factor 
do the additional high-phosphoryl-transfer compounds 
alter the equilibrium constant of gluconeogenesis? 


Mechanism Problem 


49. Argument by analogy. Propose a mechanism for the 
conversion of glucose 6- phosphate into fructose 6-phosphate 


by phosphoglucose isomerase based on the mechanism of 
triose phosphate isomerase. 


Chapter Integration Problems 


50. Not just for energy. People with galactosemia display 
central nervous system abnormalities even if galactose is 
eliminated from the diet. The precise reason for it is not 
known. Suggest a plausible explanation. 


51. Power to the rbc! Hexokinase in red blood cells has a Ky, 
of approximately 50 wM. Because life is hard enough as it 
is, let's assume that the hexokinase displays Michaelis— 
Menten kinetics. What concentration of blood glucose 
would yield v, equal to 90% V_...? What does this result tell 
you if normal blood glucose levels range between approxi- 
mately 3.6 and 6.1 mM? 


52. State function. Fructose 2,6-bisphosphate is a potent 
stimulator of phosphofructokinase. Explain how fructose 
2,6-bisphosphate might function in the concerted model for 
allosteric enzymes. 


Data Interpretation Problems 


53. Now, that’s unusual. Phosphofructokinase has recently 
been isolated from the hyperthermophilic archaeon 
Pyrococcus furiosus. It was subjected to standard biochemi- 
cal analysis to determine basic catalytic parameters. The 
processes under study were of the form 


Fructose 6-phosphate + (x — P;) > 
fructose 1,6-bisphosphate + (x) 


The assay measured the increase in fructose 1,6-bisphos- 
phate. Selected results are shown in the adjoining graph. 
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[Data from J. E. Tuininga et al. J. Biol Chem. 274:21023-21028, 1999.] 


(a) How does the P. furiosus phosphofructokinase differ 
from the phosphofructokinase considered in this chapter? 
(b) What effects do AMP and ATP have on the reaction 
with ADP? 


54. Cool bees. In principle, a futile cycle that includes phos- 
phofructokinase and fructose 2,6-bisphosphatase could be 
used to generate heat. The heat could be used to warm 


tissues. For instance, certain bumblebees have been reported 
to use such a futile cycle to warm their flight muscles on 
cool mornings. Scientists undertook a series of experiments 
to determine if a number of species of bumblebee use this 
futile cycle. Their approach was to measure the activity of 
PFK and F-1,6-BPase in flight muscle. 


(a) What was the rationale for comparing the activities of 
these two enzymes? 

(b) The data below show the activites of both enzymes for a 
variety of bumblebee species (genera Bombus and Psithyrus). 
Do these results support the notion that bumblebees use 
futile cycles to generate heat? Explain. 

(c) Inwhich species might futile cycling take place? Explain 
your reasoning. 

(d) Do these results prove that futile cycling does not par- 
ticipate in heat generation? 
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55. Confused? The adjoining graph shows muscle phospho- 
fructokinase activity as a function of ATP concentration in 
the presence of a constant concentration of fructose 6-phos- 
phate. Explain these results and discuss how they relate to 
the role of phosphofructokinase in glycolysis. 
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The Citric Acid Cycle 


CHAPTER 


2 CO, 
ATP 


Roundabouts, or traffic circles, function as hubs to 
facilitate traffic flow. The citric acid cycle is the 
biochemical hub of the cell, oxidizing carbon fuels, 
usually in the form of acetyl CoA, as well as serving as 
a source of precursors for biosynthesis. [(Left) Lynn 


he metabolism of glucose to pyruvate in glycolysis, an anaerobic proc- 

ess, harvests but a fraction of the ATP available from glucose. Most of 
the ATP generated in metabolism is provided by the aerobic processing of 
glucose. This process starts with the complete oxidation of glucose deriva- 
tives to carbon dioxide. This oxidation takes place in a series of reactions 
called the citric acid cycle, also known as the tricarboxylic acid (TCA) cycle 
or the Krebs cycle. The citric acid cycle is the final common pathway for the 
oxidation of fuel molecules—carbohydrates, fatty acids, and amino acids. 
Most fuel molecules enter the cycle as acetyl coenzyme A. 
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OUTLINE 


17.1 The Pyruvate Dehydrogenase 
Complex Links Glycolysis to the 
Citric Acid Cycle 


17.2 The Citric Acid Cycle Oxidizes 
Two-Carbon Units 


17.3 Entry to the Citric Acid Cycle and 
Metabolism Through It Are 
Controlled 


17.4 The Citric Acid Cycle Is a Source 
of Biosynthetic Precursors 


17.5 The Glyoxylate Cycle Enables 
Plants and Bacteria to Grow on 
Acetate 
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FIGURE 17.1 Mitochondrion. The double 
membrane of the mitochondrion is evident 
in this electron micrograph. The numerous 
invaginations of the inner mitochondrial 
membrane are called cristae. The oxidative 
decarboxylation of pyruvate and the 
sequence of reactions in the citric acid 
cycle take place within the matrix. [(Left) 
Omikron/Photo Researchers.] 
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FIGURE 17.2 Overview of the citric acid 
cycle. The citric acid cycle oxidizes two- 
carbon units, producing two molecules of 
CO,, one molecule of ATP, and high-energy 
electrons in the form of NADH and FADH,. 


FIGURE 17.3 Cellular respiration. The 
citric acid cycle constitutes the first stage in 
cellular respiration, the removal of high- 
energy electrons from carbon fuels in the 
form of NADH and FADH, (blue pathway). 
These electrons reduce O, to generate a 
proton gradient (red pathway), which is 
used to synthesize ATP (green pathway). 
The reduction of O, and the synthesis of 
ATP constitute oxidative phosphorylation. 
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Under aerobic conditions, the pyruvate generated from glucose is oxida- 
tively decarboxylated to form acetyl CoA. In eukaryotes, the reactions of 
the citric acid cycle take place in the matrix of the mitochondria (Figure 17.1), 
in contrast with those of glycolysis, which take place in the cytoplasm. 


The citric acid cycle harvests high-energy electrons 


The citric acid cycle is the central metabolic hub of the cell. It is the gateway 
to the aerobic metabolism of any molecule that can be transformed into an 
acetyl group or a component of the citric acid cycle. The cycle is also an 
important source of precursors for the building blocks of many other mol- 
ecules such as amino acids, nucleotide bases, and porphyrin (the organic 
component of heme). The citric acid cycle component, oxaloacetate, is also 
an important precursor to glucose (Section 16.3). 

What is the function of the citric acid cycle in transforming fuel mole- 
cules into ATP? Recall that fuel molecules are carbon compounds that are 
capable of being oxidized—that is, of losing electrons (Chapter 15). The 
citric acid cycle includes a series of oxidation—reduction reactions that result 
in the oxidation of an acetyl group to two molecules of carbon dioxide. This 
oxidation generates high-energy electrons that will be used to power the 
synthesis of ATP. The function of the citric acid cycle is the harvesting of high- 
energy electrons from carbon fuels. 

The overall pattern of the citric acid cycle is shown in Figure 17.2. A four- 
carbon compound (oxaloacetate) condenses with a two-carbon acetyl unit to 
yield a six-carbon tricarboxylic acid. The six-carbon compound releases CO, 
twice in two successive oxidative decarboxylations that yield high-energy 
electrons. A four-carbon compound remains. This four-carbon compound is 
further processed to regenerate oxaloacetate, which can initiate another 
round of the cycle. Two carbon atoms enter the cycle as an acety] unit and two 
carbon atoms leave the cycle in the form of two molecules of CO,. 

Note that the citric acid cycle itself neither generates a large amount of 
ATP nor includes oxygen as a reactant (Figure 17.3). Instead, the citric acid 
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cycle removes electrons from acetyl CoA and uses these electrons to reduce 
NAD* and FAD to form NADH and FADH.,. Three hydride ions (hence, 
six electrons) are transferred to three molecules of nicotinamide adenine 
dinucleotide (NAD*), and one pair of hydrogen atoms (hence, two elec- 
trons) is transferred to one molecule of flavin adenine dinucleotide (FAD). 
Electrons released in the reoxidation of NADH and FADH, flow through 
a series of membrane proteins (referred to as the electron-transport chain) to 
generate a proton gradient across the inner mitochondrial membrane. These 
protons then flow through ATP synthase to generate ATP from ADP and 
inorganic phosphate. These electron carriers yield nine molecules of ATP 
when they are oxidized by O, in oxidative phosphorylation (Chapter 18). 
The citric acid cycle, in conjunction with oxidative phosphorylation, 
provides the preponderance of energy used by aerobic cells—in human 
beings, greater than 90%. It is highly efficient because the oxidation of a 
limited number of citric acid cycle molecules can generate large amounts of 
NADH and FADH,,. Note in Figure 17.2 that the four-carbon molecule, 
oxaloacetate, that initiates the first step in the citric acid cycle is regenerated 
at the end of one passage through the cycle. Thus, one molecule of oxaloac- 
etate is capable of participating in the oxidation of many acetyl molecules. 


17.1 The Pyruvate Dehydrogenase Complex Links 
Glycolysis to the Citric Acid Cycle 


Carbohydrates, most notably glucose, are processed by glycolysis into 
pyruvate (Chapter 16). Under anaerobic conditions, the pyruvate is con- 
verted into lactate or ethanol, depending on the organism. Under aerobic 
conditions, the pyruvate is transported into mitochondria by a specific car- 
rier protein embedded in the mitochondrial membrane. In the mitochon- 
drial matrix, pyruvate is oxidatively decarboxylated by the pyruvate 
dehydrogenase complex to form acetyl CoA. 


Pyruvate + CoA + NAD* — acetyl CoA + CO, + NADH + H* 


This irreversible reaction is the link between glycolysis and the citric acid cycle 
(Figure 17.4). Note that the pyruvate dehydrogenase complex produces 
CO, and captures high-transfer-potential electrons in the form of NADH. 
Thus, the pyruvate dehydrogenase reaction has many of the key features of 
the reactions of the citric acid cycle itself. 

The pyruvate dehydrogenase complex is a large, highly integrated com- 
plex of three distinct enzymes (Table 17.1). Pyruvate dehydrogenase com- 
plex is a member of a family of homologous complexes that include the 
citric acid cycle enzyme a-ketoglutarate dehydrogenase complex (p. 505). 
These complexes are giant, larger than ribosomes, with molecular masses 
ranging from 4 million to 10 million kDa (Figure 17.5). As we will see, their 


TABLE 17.1 Pyruvate dehydrogenase complex of E. coli 


Number of Prosthetic 


Enzyme Abbreviation chains group Reaction catalyzed 

Pyruvate E, 24 TPP Oxidative decarboxylation 
dehydrogenase of pyruvate 
component 

Dihydrolipoyl E, 24 Lipoamide Transfer of acetyl group 
transacetylase to CoA 

Dihydrolipoyl E, 12 FAD Regeneration of the 


dehydrogenase oxidized form of lipoamide 
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FIGURE 17.4 The link between glycolysis 
and the citric acid cycle. Pyruvate 
produced by glycolysis is converted into 
acetyl CoA, the fuel of the citric acid cycle. 
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FIGURE 17.5 Electron micrograph of the 
pyruvate dehydrogenase complex from 
E. coli. [Courtesy of Dr. Lester Reed.] 
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Mechanism: The synthesis of acetyl coenzyme A from pyruvate 
requires three enzymes and five coenzymes 


The mechanism of the pyruvate dehydrogenase reaction is wonderfully 
complex, more so than is suggested by its simple stoichiometry. The 
reaction requires the participation of the three enzymes of the pyruvate 
dehydrogenase complex and five coenzymes. The coenzymes thiamine 
pyrophosphate (TPP), lipoic acid, and FAD serve as catalytic cofactors, 
and CoA and NAD? are stoichiometric cofactors, cofactors that function 
as substrates. 
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The conversion of pyruvate into acetyl CoA consists of three steps: 
decarboxylation, oxidation, and transfer of the resultant acetyl group to 
CoA. A fourth step is required to regenerate the active enzyme. 
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These steps must be coupled to preserve the free energy derived from the 
decarboxylation step to drive the formation of NADH and acetyl CoA: 


1. Decarboxylation. Pyruvate combines with TPP and is then decarboxyl- 
ated to yield hydroxyethyl-TPP (Figure 17.6). 
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This reaction, the rate limiting step in acetyl CoA synthesis, is catalyzed by 
the pyruvate dehydrogenase component (E,) of the multienzyme complex. 
TPP is the prosthetic group of the pyruvate dehydrogenase component. 


2. Oxidation. The hydroxyethyl group attached to TPP is oxidized to form 
an acetyl group while being simultaneously transferred to lipoamide, a 
derivative of lipoic acid that is linked to the side chain of a lysine residue by 
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an amide linkage. Note that this transfer results in the formation of an 
energy-rich thioester bond. 
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The oxidant in this reaction is the disulfide group of lipoamide, which is 
reduced to its disulfhydryl form. This reaction, also catalyzed by the pyru- 
vate dehydrogenase component E,, yields acetyllipoamide. 


3. Formation of Acetyl CoA. The acetyl group is transferred from acetylli- 
poamide to CoA to form acetyl CoA. 
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Dihydrolipoyl transacetylase (E,) catalyzes this reaction. The energy-rich 
thioester bond is preserved as the acetyl group is transferred to CoA. Recall 
that CoA serves as a carrier of many activated acyl groups, of which acetyl 
is the simplest (Section 15.3). Acetyl CoA, the fuel for the citric acid cycle, 
has now been generated from pyruvate. 


4. Regeneration of oxidized lipoamide. The pyruvate dehydrogenase com- 
plex cannot complete another catalytic cycle until the dihydrolipoamide is 
oxidized to lipoamide. In the fourth step, the oxidized form of lipoamide is 
regenerated by dihydrolipoyl dehydrogenase (E,). Two electrons are trans- 
ferred to an FAD prosthetic group of the enzyme and then to NAD™. 


NADt 


+ FADH> Ss FAD + NADH + Ht 


H * R” 


Dihydrolipoamide Lipoamide 


CH 4 Ss CH 5 
Hydroxyethyl-TPP 


FIGURE 17.6 Mechanism of the E, 
decarboxylation reaction. E, is the 
pyruvate dehydrogenase component of the 
pyruvate dehydrogenase complex. A key 
feature of the prosthetic group, TPP, is that 
the carbon atom between the nitrogen and 
sulfur atoms in the thiazole ring is much 
more acidic than most =C— groups, with 
a pK, value near 10. (1) This carbon center 
ionizes to form a carbanion. (2) The 
carbanion readily adds to the carbonyl 
group of pyruvate. (8) This addition is 
ollowed by the decarboxylation of 
pyruvate. The positively charged ring of 
TPP acts as an electron sink that stabilizes 
he negative charge that is transferred to 
he ring as part of the decarboxylation. (4) 
Protonation yields hydroxyethyl- TPP. 
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FIGURE 17.7 Schematic representation 
of the pyruvate dehydrogenase 
complex. The transacetylase core (E,) is 
shown in red, the pyruvate dehydrogenase 
component (E,) in yellow, and the 
dihydrolipoyl dehydrogenase (E,) in green. 


FIGURE 17.8 Structure of the 
transacetylase (E,) core. Each red ball 
represents a trimer of three E, subunits. 
Notice that each subunit consists of three 
domains: a lipoamide-binding domain, a 
small domain for interaction with E,, and 
a large transacetylase catalytic domain. 
The transacetylase domains of three 
identical subunits are shown, with one 
depicted in red and the others in white 
in the ribbon representation. 


This electron transfer from FAD to NAD* is unusual because the common 
role for FAD is to receive electrons from NADH. The electron-transfer 
potential of FAD is increased by its chemical environment within the 
enzyme, enabling it to transfer electrons to NAD*. Proteins tightly associ- 
ated with FAD or flavin mononucleotide (FMN) are called flavoproteins. 


Flexible linkages allow lipoamide to move between 
different active sites 


The structures of all of the component enzymes of the bacterial pyruvate 
dehydrogenase complex are known, albeit from different complexes and 
species. Thus, it is now possible to construct an atomic model of the com- 
plex to understand its activity (Figure 17.7). 

The core of the complex is formed by the transacetylase component E,. 
Transacetylase consists of eight catalytic trimers assembled to form a hollow 
cube. Each of the three subunits forming a trimer has three major domains 
(Figure 17.8). At the amino terminus is a small domain that contains a bound 
flexible lipoamide cofactor attached to a lysine residue. This domain is homol- 
ogous to biotin-binding domains such as that of pyruvate carboxylase (Figure 
16.26). The lipoamide domain is followed by a small domain that interacts 
with E within the complex. A larger transacetylase domain completes an E, 
subunit. E, is an a8, tetramer, and E, is an af dimer. Multiple copies of E, 
and E, surround the E, core. In mammals, this core contains another protein, 
E,-binding protein (E,-BP), which facilitates the interaction between E, and 
E,. IfE5-BP is missing, the complex has greatly reduced activity. 

How do the three distinct active sites work in concert (Figure 17.9)? The 
key is the long, flexible lipoamide arm of the E, subunit, which carries sub- 
strate from active site to active site: 


1. Pyruvate is decarboxylated at the active site of E,, forming the hydroxy- 
ethyl-TPP intermediate, and CO, leaves as the first product. This active 
site lies deep within the E, soniplex. connected to the enzyme surface by a 
20-A-long hydrophobic channel 
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2. E, inserts the lipoamide arm of the lipoamide domain into the deep 
channel in E, leading to the active site. 


3. E, catalyzes the transfer of the acetyl group to the lipoamide. The acety- 
lated arm then leaves E, and enters the E, cube to visit the active site of E,, 
located deep in the cube at the subunit interface. 


4, The acetyl moiety is then transferred to CoA, and the second product, 
acetyl CoA, leaves the cube. The reduced lipoamide arm then swings to the 
active site of the E, flavoprotein. 


5. At the E, active site, the lipoamide is oxidized by coenzyme FAD. The 
reactivated lipoamide is ready to begin another reaction cycle. 


6. The final product, NADH, 1s produced with the reoxidation of FADH, 
to FAD. 


The structural integration of three kinds of enzymes and the long, flexible 
lipoamide arm make the coordinated catalysis of a complex reaction possible. 
The proximity of one enzyme to another increases the overall reaction rate 
and minimizes side reactions. All the intermediates in the oxidative decar- 
boxylation of pyruvate remain bound to the complex throughout the reac- 
tion sequence and are readily transferred as the flexible arm of E, calls on 
each active site in turn. 


17.2 The Citric Acid Cycle Oxidizes Two-Carbon Units 


The conversion of pyruvate into acetyl CoA by the pyruvate dehydrogenase 
complex is the link between glycolysis and cellular respiration because 
acetyl CoA is the fuel for the citric acid cycle. Indeed, all fuels are ultimately 
metabolized to acetyl CoA or components of the citric acid cycle. 


FIGURE 17.9 Reactions of the pyruvate 
dehydrogenase complex. At the top (left), 
the enzyme (represented by a yellow, a 
green, and two red spheres) is unmodified 
and ready for a catalytic cycle. (1) Pyruvate 
is decarboxylated to form hydroxyethyl- 
TPP. (2) The lipoamide arm of E, moves 
into the active site of E,. (8) E, catalyzes 
the transfer of the two-carbon group to 
the lipoamide group to form the acetyl- 
lipoamide complex. (4) E, catalyzes 

the transfer of the acetyl moiety to CoA 

to form the product acetyl CoA. The 
dihydrolipoamide arm then swings to the 
active site of E;. E, catalyzes (5) the 
oxidation of the dihydrolipoamide and 

(6) the transfer of the protons and electrons 
to NADt to complete the reaction cycle. 
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Synthase 

An enzyme catalyzing a synthetic reaction in 
which two units are joined usually without the 
direct participation of ATP (or another 
nucleoside triphosphate). 


Citrate synthase forms citrate from oxaloacetate and acetyl coenzyme A 


The citric acid cycle begins with the condensation of a four-carbon unit, 
oxaloacetate, and a two-carbon unit, the acetyl group of acetyl CoA. 
Oxaloacetate reacts with acetyl CoA and H,O to yield citrate and CoA. 
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This reaction, which is an aldol condensation followed by a hydrolysis, is 
catalyzed by citrate synthase. Oxaloacetate first condenses with acetyl 
CoA to form citryl CoA, a molecule that is energy rich because it contains 
the thioester bond that originated in acetyl CoA. The hydrolysis of citryl 
CoA thioester to citrate and CoA drives the overall reaction far in the 
direction of the synthesis of citrate. In essence, the hydrolysis of the 


thioester powers the synthesis of a new molecule from two precursors. 


Mechanism: The mechanism of citrate synthase prevents 
undesirable reactions 


Because the condensation of acetyl CoA and oxaloacetate initiates the 
citric acid cycle, it is very important that side reactions, notably the 
hydrolysis of acetyl CoA to acetate and CoA, be minimized. Let us 
briefly consider how the citrate synthase prevents the wasteful hydrolysis 
of acetyl CoA. 

Mammalian citrate synthase is a dimer of identical 49-kDa subunits. 
Each active site is located in a cleft between the large and the small 
domains of a subunit, adjacent to the subunit interface. X-ray crystallo- 
graphic studies of citrate synthase and its complexes with several sub- 
strates and inhibitors have revealed that the enzyme undergoes large 
conformational changes in the course of catalysis. Citrate synthase exhib- 
its sequential, ordered kinetics: oxaloacetate binds first, followed by ace- 
tyl CoA. The reason for the ordered binding is that oxaloacetate induces a 
major structural rearrangement leading to the creation of a binding site for 
acetyl CoA. The binding of oxaloacetate converts the open form of the 
enzyme into a closed form (Figure 17.10). In each subunit, the small 
domain rotates 19 degrees relative to the large domain. Movements as large 
as 15 A are produced by the rotation of a helices elicited by quite small shifts 
of side chains around bound oxaloacetate. These structural changes create 
a binding site for acetyl CoA. 

Citrate synthase catalyzes the condensation reaction by bringing the 
substrates into close proximity, orienting them, and polarizing certain 
bonds (Figure 17.11). The donation and removal of protons transforms 
acetyl CoA into an enol intermediate. The enol attacks oxaloacetate to form 
a carbon-carbon double bond linking acetyl CoA and oxaloacetate. 
The newly formed citryl CoA induces additional structural changes in 
the enzyme, causing the active site to become completely enclosed. The 
enzyme cleaves the citryl CoA thioester by hydrolysis. CoA leaves the 
enzyme, followed by citrate, and the enzyme returns to the initial open 
conformation. 


Oxaloacetate- 
binding sites 


FIGURE 17.10 Conformational 


We can now understand how the wasteful hydrolysis of acetyl CoA is changes in citrate synthase on binding 
prevented. Citrate synthase is well suited to hydrolyze citryl CoA but not oxaloacetate. The small domain of each 
acetyl CoA. How is this discrimination accomplished? First, acetyl CoA subunit of the homodimer is shown in 
does not bind to the enzyme until oxaloacetate is bound and ready for con- vole tie lAige colal sak enei i 


, : : ‘ . blue. (Left) Open form of enzyme alone. 
densation. Second, the catalytic residues crucial for the hydrolysis of the (Right) Glosed:foru opine oalsacetate- 


thioester linkage are not appropriately positioned until citryl CoA is formed. enzyme complex. [Drawn from 5CSC.pdb 
As with hexokinase and triose phosphate isomerase (Section 16.1), induced and 4CTS.pdb_] 
fit prevents an undesirable side reaction. 
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FIGURE 17.11 Mechanism of synthesis of citryl CoA by citrate synthase. (1) In the 
substrate complex (left), His 274 donates a proton to the carbonyl oxygen of acetyl CoA to 
promote the removal of a methyl proton by Asp 375 to form the enol intermediate (center). 
(2) Oxaloacetate is activated by the transfer of a proton from His 320 to its carbonyl carbon 
atom. (3) Simultaneously, the enol of acetyl CoA attacks the carbonyl carbon of oxaloacetate 
to form a carbon-carbon bond linking acetyl CoA and oxaloacetate. His 274 is reprotonated. 
Citryl CoA is formed. His 274 participates again as a proton donor to hydrolyze the thioester 
(not shown), yielding citrate and CoA. 
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FIGURE 17.12 Binding of citrate to 
the iron-sulfur complex of aconitase. A 
4Fe-4S iron-sulfur cluster is a component 
of the active site of aconitase. Notice that 
one of the iron atoms of the cluster binds 
to a COO™ group and an OH group of 
citrate. [Drawn from 1C96.pdb.] 


Citrate is isomerized into isocitrate 


The hydroxyl group is not properly located in the citrate molecule for the 
oxidative decarboxylations that follow (Problem 27). Thus, citrate is isom- 
erized into isocitrate to enable the six-carbon unit to undergo oxidative 
decarboxylation. The isomerization of citrate is accomplished by a dehydya- 
tion step followed by a hydration step. The result is an interchange of an H 
and an OH. The enzyme catalyzing both steps is called aconitase because 
cis-dconitate is an intermediate. 


coo- Ccoo- 
—— Pd T00Gy, ial He on 
-00c—C—OH a | — -00C—C—H 
CH> -ooc~ cis. ke 
cOo- coo- lee: 
Citrate cis-Aconitate Isocitrate 


Aconitase is an iron-sulfur protein, or nonheme-iron protein, in that it con- 
tains iron that is not bonded to heme. Rather, its four iron atoms are com- 
plexed to four inorganic sulfides and three cysteine sulfur atoms, leaving 
one iron atom available to bind citrate through one of its COO™ groups and 
an OH group (Figure 17.12). This Fe-5 cluster participates in dehydrating 
and rehydrating the bound substrate. 


Isocitrate is oxidized and decarboxylated to alpha-ketoglutarate 


We come now to the first of four oxidation—reduction reactions in the citric 
acid cycle. The oxidative decarboxylation of isocitrate is catalyzed by isocit- 
rate dehydrogenase. 


Isocitrate + NAD* ——> a-ketoglutarate + CO, + NADH 


The intermediate in this reaction is oxalosuccinate, an unstable B-ketoacid. 
While bound to the enzyme, it loses CO, to form a-ketoglutarate. 
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This oxidation generates the first high-transfer-potential electron carrier, 
NADH, in the cycle. 


Succinyl coenzyme A is formed by the oxidative decarboxylation 
of alpha-ketoglutarate 


The conversion of isocitrate into a-ketoglutarate is followed by a second 
oxidative decarboxylation reaction, the formation of succinyl CoA from 
a-ketoglutarate. 
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This reaction is catalyzed by the a-ketoglutarate dehydrogenase com- 

plex, an organized assembly of three kinds of enzymes that is homol- 
ogous to the pyruvate dehydrogenase complex. In fact, the E, component 1s 
identical in both enzymes. The oxidative decarboxylation of a-ketoglutarate 
closely resembles that of pyruvate, also an a-ketoacid. 


Pyruvate dehydrogenase complex 


acetyl CoA + CO, + NADH + H* 


Pyruvate + CoA + NADt 


iy Ketoglutarate ee CoA + NAD+ a-Ketoglutarate dehydrogenase complex 
os —_—)]s  —- 
succinyl CoA + CO, + NADH 


Both reactions include the decarboxylation of an a-ketoacid and the subse- 
quent formation of a thioester linkage with CoA that has a high transfer 
potential. The reaction mechanisms are entirely analogous (p. 497). 


A compound with high phosphoryl-transfer potential is generated 
from succinyl coenzyme A 


Succinyl CoA is an energy-rich thioester compound. The AG®’ for the 
hydrolysis of succinyl CoA is about —33.5 kJ mol~! (—8.0 kcal mol7'‘), 
which is comparable to that of ATP (—30.5 kJ mol~!, or —7.3 kcal mol~!). 
In the citrate synthase reaction, the cleavage of the thioester bond powers 
the synthesis of the six-carbon citrate from the four-carbon oxaloacetate 
and the two-carbon fragment. The cleavage of the thioester bond of succinyl 
CoA is coupled to the phosphorylation of a purine nucleoside diphosphate, usu- 
ally ADP. This reaction, which is readily reversible, is catalyzed by succinyl 
CoA synthetase (succinate thiokinase). 
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FIGURE 17.13 Reaction mechanism of 


succinyl CoA synthetase. 


The reaction 


proceeds through a phosphorylated enzyme 


intermediate. (1) Orthophos| 
coenzyme A, which genera 


phate displaces 
es another 


energy-rich compound, succinyl phosphate. 
(2) A histidine residue removes the phosphory! 


group with the concomitan 
succinate and phosphohist 


generation of 
idine. (8) The 


phosphohistidine residue then swings over 
to a bound ADP, and (4) the phosphoryl 
group is transferred to form ATP. 
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This reaction is the only step in the citric acid cycle that directly yields 
a compound with high phosphoryl-transfer potential. In mammals, there 
are two isozymic forms of the enzyme, one specific for ADP and one for 
GDP. In tissues that perform large amounts of cellular respiration, such as 
skeletal and heart muscle, the ADP-requiring isozyme predominates. In 
tissues that perform many anabolic reactions, such as the liver, the GDP- 
requiring enzyme is common. The GDP-requiring enzyme is believed to 
work in reverse of the direction observed in the TCA cycle; that is, GTP is 
used to power the synthesis of succinyl CoA, which is a precursor for heme 
synthesis. The E. coli enzyme uses either GDP or ADP as the phosphoryl- 
group acceptor. 

Note that the enzyme nucleoside diphosphokinase, which catalyzes the 
following reaction, 


GTP + ADP == GDP + ATP 


allows the y phosphory] group to be readily transferred from GTP to form 
ATP, thereby allowing the adjustment of the concentration of GTP or ATP 
to meet the cell’s need. 


Mechanism: Succinyl coenzyme A synthetase transforms types 
of biochemical energy 


The mechanism of this reaction is a clear example of an energy trans- 
formation: energy inherent in the thioester molecule is transformed 
into phosphoryl-group-transfer potential (Figure 17.13). The first step 
is the displacement of coenzyme A by orthophosphate, which generates 
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another energy-rich compound, succinyl phosphate. A hist1- 
dine residue plays a key role as a moving arm that detaches the 
phosphoryl group, then swings over to a bound ADP and 
transfers the group to form ATP. The participation of high- 
energy compounds in all the steps is attested to by the fact 
that the reaction is readily reversible: AG°’ = —3.4 kJ mol7! 
(—0.8 kcal mol~'). The formation of ATP at the expense of 
succinyl CoA is an example of substrate-level phosphorylation 
(Section 16.1). 


NY Succinyl CoA synthetase is an a8, heterodimer; the func- 

tional unit is one aB pair. The enzyme mechanism shows 
that a phosphoryl group is transferred first to succinyl CoA 
bound in the a subunit and then to a nucleoside diphosphate 
bound in the B subunit. Examination of the three-dimensional 
structure of succinyl CoA synthetase reveals that each subunit 
comprises two domains (Figure 17.14). The amino-terminal 
domains of the two subunits have different structures, each char- 
acteristic of its role in the mechanism. The amino-terminal 
domain of the a subunit forms a Rossmann fold (Section 16.1), 
which binds the ADP substrate of succinyl CoA synthetase. The 
amino-terminal domain of the B subunit is an ATP-grasp 
domain, found in many enzymes, which here binds and activates 
ADP. Succinyl CoA synthetase has evolved by adopting these 
domains and harnessing them to capture the energy associated 
with succinyl CoA cleavage, which is used to drive the generation 
of a nucleoside triphosphate. 


Oxaloacetate is regenerated by the oxidation of succinate 
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hk FIGURE 17.14 Structure of succinyl CoA 
synthetase. The enzyme is composed of two subunits. 
The a subunit contains a Rossmann fold that binds the 
ADP component of CoA, and the B subunit contains a 
nucleotide-activating region called the ATP-grasp domain. 
The ATP-grasp domain is shown here binding a molecule 
of ADP. Notice that the histidine residue is between the 
CoA and the ADP. This histidine residue picks up the 
phosphoryl group from near the CoA and swings over to 
transfer it to the nucleotide bound in the ATP-grasp 
domain. [Drawn from 1CQI.pdb.] 


Reactions of four-carbon compounds constitute the final stage of the citric 


acid cycle: the regeneration of oxaloacetate. 
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The reactions constitute a metabolic motif that we will see again in fatty acid 
synthesis and degradation as well as in the degradation of some amino acids. 
A methylene group (CH,,) is converted into a carbonyl group C=O in three 
steps: an oxidation, a hydration, and a second oxidation reaction. 
Oxaloacetate is thereby regenerated for another round of the cycle, and 


more energy is extracted in the form of FADH, and NADH. 


Succinate is oxidized to fumarate by succinate dehydrogenase. The hydro- 
gen acceptor is FAD rather than NAD™, which is used in the other three 
oxidation reactions in the cycle. FAD is the hydrogen acceptor in this reac- 
tion because the free-energy change is insufficient to reduce NAD*. FAD 
is nearly always the electron acceptor in oxidations that remove two hydro- 
gen atoms from a substrate. In succinate dehydrogenase, the isoalloxazine 
ring of FAD is covalently attached to a histidine side chain of the enzyme 


(denoted E-FAD). 
E-FAD + succinate === E-FADH) + fumarate 
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Succinate dehydrogenase, like aconitase, is an iron-sulfur protein. 
Indeed, succinate dehydrogenase contains three different kinds of iron— 
sulfur clusters: 2Fe-2S (two iron atoms bonded to two inorganic sulfides), 
3Fe-48, and 4Fe-45. Succinate dehydrogenase—which consists of a 70-kDa 
and a 27-kDa subunit—differs from other enzymes in the citric acid cycle 
in being embedded in the inner mitochondrial membrane. In fact, succinate 
dehydrogenase is directly associated with the electron-transport chain, the link 
between the citric acid cycle and ATP formation. FADH, produced by the 
oxidation of succinate does not dissociate from the enzyme, in contrast with 
NADH produced in other oxidation—reduction reactions. Rather, two elec- 
trons are transferred from FADH, directly to iron-sulfur clusters of the 
enzyme, which in turn passes the electrons to coenzyme Q (CoQ). 
Coenzyme Q, an important member of the electron-transport chain, passes 
electrons to the ultimate acceptor, molecular oxygen, as we shall see in 
Chapter 18. 

The next step is the hydration of fumarate to form L-malate. Fumarase 
catalyzes a stereospecific trans addition of Ht and OH~. The OH™ group 
adds to only one side of the double bond of fumarate; hence, only the L iso- 
mer of malate is formed. 
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Fumarate L-Malate 


Finally, malate is oxidized to form oxaloacetate. This reaction is cata- 
lyzed by malate dehydrogenase, and NAD* is again the hydrogen acceptor. 


Malate + NAD* = oxaloacetate + NADH + H* 


The standard free energy for this reaction, unlike that for the other steps 
in the citric acid cycle, is significantly positive (AG°’ = +29.7 kJ mol~!, or 
+7.1 kcal mol~!). The oxidation of malate is driven by the use of the 
products—oxaloacetate by citrate synthase and NADH by the electron- 
transport chain. 


The citric acid cycle produces high-transfer-potential electrons, 
ATP, and CO, 


The net reaction of the citric acid cycle is 


Acetyl CoA + 3NAD* + FAD + ADP + P; + 2H,0 — 
2 CO, + 3NADH + FADH, + ATP + 2H* + CoA 


Let us recapitulate the reactions that give this stoichiometry (Figure 17.15 
and Table 17.2): 


1. Two carbon atoms enter the cycle in the condensation of an acetyl unit 
(from acetyl CoA) with oxaloacetate. Two carbon atoms leave the cycle in 
the form of CO, in the successive decarboxylations catalyzed by isocitrate 
dehydrogenase and a-ketoglutarate dehydrogenase. 


2. Four pairs of hydrogen atoms leave the cycle in four oxidation reactions. 
Two NAD? molecules are reduced in the oxidative decarboxylations of 
isocitrate and a-ketoglutarate, one FAD molecule is reduced in the oxidation 


a 509 
oO 
CoA CH> 17.2 Reactions of the Citric Acid Cycle 
H20 + MN ] 
HsC”'s-CoA -OOC—C—OH 
be coo- 
. P Brera 7 Aconitase H—C—OH 
Ke _COO y' coo- 
¢ : -OOC—C—-H 
] Citrate 
ct 
— a + 
NADH + Ht nica oe aa 
Oxaloacetate Isocitrate 
Malate Isocitrate 
dehydrogenase dehydrogenase NADH + CO> 
NADt -00c 
coo” eae 
HO—C—H vs 
CH bn, 
COO- be 
Malate a-Ketoglutarate 
a-Ketoglutarate NAD+ + CoA 
dehydrogenase 
Fumarase 
CoA—S. 
H20 H_ -COO- NADH + CO2 
{ 
-00C~ ~“H Succinate 60 nha CoA 
7 synthetase 2 
pinamis dehydrogenase ) Y Coo 
CH2 Succinyl CoA 
FADH, tas ADP + P; FIGURE 17.15 The citric acid cycle. 
] ATP Notice that since succinate is a symmetric 
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TABLE 17.2 Citric acid cycle 
AG” 
Prosthetic 
Step Reaction Enzyme group Type* kJ mol! kcal mol~! 
1 Acetyl CoA + oxaloacetate + H,O — Citrate a 31.4 7.5 
citrate + CoA + Ht synthase 
2a Citrate ==  cis-aconitate + H,O Aconitase Fe-S b +8.4 +2.0 
2b cis-Aconitate + H,O == isocitrate Aconitase Fe-S c —2.1 —0.5 
3 Isocitrate + NAD* == Isocitrate dt+e —8.4 —220 
a-ketoglutarate + CO, + NADH dehydrogenase 
4 a-Ketoglutarate + NAD? + CoA == a-Ketoglutarate Lipoic acid, dt+e —30.1 —7.2 
succinyl CoA + CO, + NADH dehydrogenase FAD, TPP 
complex 
5 Succinyl CoA + P; + ADP == Succinyl CoA if 3.3 —0.8 
succinate + ATP + CoA synthetase 
6 Succinate + FAD (enzyme-bound) == Succinate FAD, Fe-S e 0 0 
fumarate + FADH,(enzyme-bound) dehydrogenase 
Zi Fumarate + H,0 == L-malate Fumarase € =3,,8 —0.9 
8 L-Malate + NAD* == Malate e +29.7 +71 
oxaloacetate + NADH + Ht dehydrogenase 


*Reaction type: (a) condensation; (b) dehydration; (c) hydration; (d) decarboxylation; (e) oxidation; (f) substrate-level phosphorylation. 
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of succinate, and one NAD* molecule is reduced in the oxidation of malate. 
Recall also that one NAD* molecule is reduced in the oxidative decarbox- 
ylation of pyruvate to form acetyl CoA. 


3. One compound with high phosphoryl-transfer potential, usually ATP, 
is generated from the cleavage of the thioester linkage in succinyl CoA. 


4. Two water molecules are consumed: one in the synthesis of citrate by the 
hydrolysis of citryl CoA and the other in the hydration of fumarate. 


Isotope-labeling studies have revealed that the two carbon atoms that 
enter each cycle are not the ones that leave. The two carbon atoms that 
enter the cycle as the acetyl group are retained during the initial two 
decarboxylation reactions (Figure 17.15) and then remain incorporated in 
the four-carbon acids of the cycle. Note that succinate is a symmetric 
molecule. Consequently, the two carbon atoms that enter the cycle can 
occupy any of the carbon positions in the subsequent metabolism of the 
four-carbon acids. The two carbons that enter the cycle as the acetyl 
group will be released as CO, in subsequent trips through the cycle. To 
understand why citrate is not processed as a symmetric molecule, see 
Problems 34 and 35. 

Evidence is accumulating that the enzymes of the citric acid cycle are 
physically associated with one another. The close arrangement of enzymes 
enhances the efficiency of the citric acid cycle because a reaction product 
can pass directly from one active site to the next through connecting chan- 
nels, a process called substrate channeling. 

As will be considered in Chapter 18, the electron-transport chain oxi- 
dizes the NADH and FADH, formed in the citric acid cycle. The transfer 
of electrons from these carriers to O,, the ultimate electron acceptor, leads 
to the generation of a proton gradient across the inner mitochondrial 
membrane. This proton-motive force then powers the generation of ATP; 
the net stoichiometry is about 2.5 ATP per NADH, and 1.5 ATP per 
FADH,. Consequently, nine high-transfer-potential phosphoryl groups 
are generated when the electron-transport chain oxidizes 3 NADH mole- 
cules and 1 FADH, molecule, and one ATP is directly formed in one 
round of the citric acid cycle. Thus, one acetyl unit generates approxi- 
mately 10 molecules of ATP. In dramatic contrast, the anaerobic glycoly- 
sis of an entire glucose molecule generates only 2 molecules of ATP (and 
2 molecules of lactate). 

Recall that molecular oxygen does not participate directly in the citric 
acid cycle. However, the cycle operates only under aerobic conditions 
because NAD* and FAD can be regenerated in the mitochondrion only by 
the transfer of electrons to molecular oxygen. Glycolysis has both an aerobic 
and an anaerobic mode, whereas the citric acid cycle is strictly aerobic. 
Glycolysis can proceed under anaerobic conditions because NAD™ is 
regenerated in the conversion of pyruvate into lactate or ethanol. 


17.3 Entry to the Citric Acid Cycle and Metabolism 
Through It Are Controlled 


The citric acid cycle is the final common pathway for the aerobic oxidation 
of fuel molecules. Moreover, as we will see shortly (Section 17.4) and 
repeatedly elsewhere in our study of biochemistry, the cycle is an important 
source of building blocks for a host of important biomolecules. As befits its 
role as the metabolic hub of the cell, entry into the cycle and the rate of the 
cycle itself are controlled at several stages. 


The pyruvate dehydrogenase complex is regulated allosterically 
and by reversible phosphorylation 


As stated earlier, glucose can be formed from pyruvate (Section 16.3). 
However, the formation of acetyl CoA from pyruvate is an irreversible step in 
animals and thus they are unable to convert acetyl CoA back into glucose. The 
oxidative decarboxylation of pyruvate to acetyl CoA commits the carbon 
atoms of glucose to one of two principal fates: oxidation to CO, by the citric 
acid cycle, with the concomitant generation of energy, or incorporation into 
lipid (Figure 17.16). As expected of an enzyme at a critical branch point in 
metabolism, the activity of the pyruvate dehydrogenase complex is stringently 
controlled. High concentrations of reaction products inhibit the reaction: ace- 
tyl CoA inhibits the transacetylase component (E,) by binding directly, 
whereas NADH inhibits the dihydrolipoyl dehydrogenase (E;). High concen- 
trations of NADH and acetyl CoA inform the enzyme that the energy needs 
of the cell have been met or that fatty acids are being degraded to produce 
acetyl CoA and NADH. In either case, there is no need to metabolize pyru- 
vate to acetyl CoA. This inhibition has the effect of sparing glucose, because 
most pyruvate is derived from glucose by glycolysis (Section 16.1). 

The key means of regulation of the complex in eukaryotes is covalent 
modification (Figure 17.17). Phosphorylation of the pyruvate dehydrogenase 
component (E,) by pyruvate dehydrogenase kinase (PDK) switches off the 
activity of the complex. There are four isozymes of PDK that are expressed 
in a tissue-specific manner. Deactivation is reversed by the pyruvate dehydro- 
genase phosphatase (PDP), of which there are two isozymic forms. In mam- 
mals, the kinase and the phosphatase are associated with the E,-E,-BP 
complex, again highlighting the structural and mechanistic importance of 
this core. Both the kinase and the phosphatase are regulated. To see how this 
regulation works in biological conditions, consider muscle that is becoming 
active after a period of rest (Figure 17.18). At rest, the muscle will not have 
significant energy demands. Consequently, the NADH/NAD‘, acetyl 
CoA/CoA, and ATP/ADP ratios will be high. These high ratios promote 
phosphorylation and inactivation of the complex by activating PDK. In 
other words, high concentrations of immediate (acetyl CoA and NADH) 
and ultimate (ATP) products inhibit the activity. Thus, pyruvate dehydroge- 
nase is switched off when the energy charge is high. 
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FIGURE 17.16 From glucose to acetyl 
CoA. The synthesis of acetyl CoA by the 
pyruvate dehydrogenase complex is a key 
irreversible step in the metabolism of 
glucose. 


(B) LOW ENERGY CHARGE 
Pyruvate 


NAD* 


© 


Acetyl CoA 


(ac) ADP~ , ~ATP 


e 


FIGURE 17.17 Regulation of the pyruvate dehydrogenase 
complex. A specific kinase phosphorylates and inactivates pyruvate 
dehydrogenase (PDH), and a phosphatase activates the 
dehydrogenase by removing the phosphoryl group. The kinase 

and the phosphatase also are highly regulated enzymes. 


FIGURE 17.18 Response of the pyruvate dehydrogenase 
complex to the energy charge. The pyruvate dehydrogenase 
complex is regulated to respond to the energy charge of the cell. 
(A) The complex is inhibited by its immediate products, NADH and 
acetyl CoA, as well as by the ultimate product of cellular respiration, 
ATP. (B) The complex is activated by pyruvate and ADP, which inhibit 
the kinase that phosphorylates PDH. 
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FIGURE 17.19 Control of the citric acid 
cycle. The citric acid cycle is regulated 
primarily by the concentration of ATP and 
NADH. The key control points are the 
enzymes isocitrate dehydrogenase and 
a-ketoglutarate dehydrogenase. 


As exercise begins, the concentrations of ADP and pyruvate will 
increase as muscle contraction consumes ATP and glucose is converted into 
pyruvate to meet the energy demands. Both ADP and pyruvate activate the 
dehydrogenase by inhibiting the kinase. Moreover, the phosphatase is 
stimulated by Ca*t, the same signal that initiates muscle contraction. A rise 
in the cytoplasmic Ca?* level (Section 35.2) elevates the mitochondrial 
Ca’t level. The rise in mitochondrial Ca** activates the phosphatase, 
enhancing pyruvate dehydrogenase activity. 

In some tissues, the phosphatase is regulated by hormones. In liver, epi- 
nephrine binds to the a-adrenergic receptor to initiate the phosphatidylino- 
sitol pathway (Section 14.1), causing an increase in Ca?* concentration that 
activates the phosphatase. In tissues capable of fatty acid synthesis, such as 
the liver and adipose tissue, insulin, the hormone that signifies the fed state, 
stimulates the phosphatase, increasing the conversion of pyruvate into ace- 
tyl CoA. Acetyl CoA is the precursor for fatty acid synthesis (Section 22.4). 
In these tissues, the pyruvate dehydrogenase complex is activated to funnel 
glucose to pyruvate and then to acetyl CoA and ultimately to fatty acids. 


ee In people with a phosphatase deficiency, pyruvate dehydrogenase is 

always phosphorylated and thus inactive. Consequently, glucose is 
processed to lactate rather than acetyl CoA. This condition results in unre- 
mitting lactic acidosis—high blood levels of lactic acid. In such an acidic 
environment, many tissues malfunction, most notably the central nervous 
system (Problems 13 and 14). 


The citric acid cycle is controlled at several points 


The rate of the citric acid cycle is precisely adjusted to meet an animal cell’s 
needs for ATP (Figure 17.19). The primary control points are the allosteric 
enzymes isocitrate dehydrogenase and a-ketoglutarate dehydrogenase, the 
first two enzymes in the cycle to generate high-energy electrons. 

The first control site is isocitrate dehydrogenase. The enzyme is allosteri- 
cally stimulated by ADP, which enhances the enzyme’s affinity for sub- 
strates. The binding of isocitrate, NAD*t, Mg?t, and ADP is mutually 
cooperative. In contrast, ATP is inhibitory. The reaction product NADH 
also inhibits isocitrate dehydrogenase by directly displacing NAD*. It is 
important to note that several steps in the cycle require NAD* or FAD, 
which are abundant only when the energy charge is low. 

A second contyol site in the citric acid cycle is a-ketoglutarate dehydrogenase, 
which catalyzes the rate-limiting step in the citric acid cycle. Some aspects of 
this enzyme’s control are like those of the pyruvate dehydrogenase complex, as 
might be expected from the homology of the two enzymes. a-Ketoglutarate 
dehydrogenase is inhibited by succinyl CoA and NADH, the products of the 
reaction that it catalyzes. In addition, a-ketoglutarate dehydrogenase is inhib- 
ited by a high energy charge. Thus, the rate of the cycle is reduced when the cell 
has a high level of ATP. a-Ketoglutarate dehydrogenase deficiency is observed 
in anumber of neurological disorders, including Alzheimer disease. 

The use of isocitrate dehydrogenase and a-ketoglutarate dehydrogenase 
as control points integrates the citric acid cycle with other pathways and 
highlights the central role of the citric acid cycle in metabolism. For 
instance, the inhibition of isocitrate dehydrogenase leads to a buildup of 
citrate, because the interconversion of isocitrate and citrate is readily revers- 
ible under intracellular conditions. Citrate can be transported to the cyto- 
plasm, where it signals phosphofructokinase to halt glycolysis (Section 16.2) 
and where it can serve as a source of acetyl CoA for fatty acid synthesis 
(Section 22.4). The a-ketoglutarate that accumulates when a-ketoglutarate 


dehydrogenase is inhibited can be used as a precursor for several amino 
acids and the purine bases (Chapter 23 and Chapter 25). 

In many bacteria, the funneling of two-carbon fragments into the cycle 
also is controlled. The synthesis of citrate from oxaloacetate and acetyl CoA 
carbon units is an important control point in these organisms. ATP is an allo- 
steric inhibitor of citrate synthase. The effect of AT'P is to increase the value 
of Ky, for acetyl CoA. Thus, as the level of ATP increases, less of this 
enzyme is bound to acetyl CoA and so less citrate is formed. 


Defects in the citric acid cycle contribute to the development of cancer 


e Four enzymes crucial to cellular respiration are known to contribute 
to the development of cancer: succinate dehydrogenase, fumarase, 
pyruvate dehydrogenase kinase and isocitrate dehydrogenase. Mutations 
that alter the activity of the first three of these enzymes enhance aerobic 
glycolysis (Section 16.2). In aerobic glycolysis, cancer cells preferentially 
metabolize glucose to lactate even in the presence of oxygen. Defects in 
these enzymes share a common biochemical link: the transcription factor 
hypoxia inducible factor 1 (HIF-1). 

Normally, HIF-1 up-regulates the enzymes and transporters that 
enhance glycolysis only when oxygen concentration falls, a condition called 
hypoxia. Under normal conditions, HIF-1 is hydroxylated by prolyl 
hydroxylase 2 and is subsequently destroyed by the proteasome, a large 
complex of proteolytic enzymes (Chapter 23). The degradation of HIF-1 
prevents the stimulation of glycolysis. Prolyl hydroxylase 2 requires a-keto- 
glutarate, ascorbate, and oxygen for activity. Thus, when oxygen concentra- 
tion falls, the prolyl hydroxylase 2 is inactive, HIF-1 is not hydroxylated 
and not degraded, and the synthesis of proteins required for glycolysis is 
stimulated. As a result, the rate of glycolysis is increased. 

Recent research suggestions that defects in the enzymes of the citric acid 
cycle may significantly affect the regulation of prolyl hydroxylase 2. When 
either succinate dehydrogenase or fumarase is defective, succinate and fuma- 
rate accumulate in the mitochondria and spill over into the cytoplasm. Both 
succinate and fumarate are competitive inhibitors of prolyl hydroxylase 2. 
The inhibition of prolyl hydroxylase 2 results in the stabilization of HIF-1, 
since HIF-1 is no longer hydroxylated. Lactate, the end product of glycolysis, 
also appears to inhibit prolyl hydroxylase 2 by interfering with the action of 
ascorbate. In addition to increasing the amount of the proteins required for 
glycolysis, HIF-1 also stimulates the production of pyruvate dehydrogenase 
kinase (PDK). The kinase inhibits the pyruvate dehydrogenase complex, 
preventing the conversion of pyruvate into acetyl CoA. The pyruvate remains 
in the cytoplasm, further increasing the rate of aerobic glycolysis. Moreover, 
mutations in PDK that lead to enhanced activity contribute to increased aero- 
bic glycolysis and the subsequent development of cancer. By enhancing gly- 
colysis and increasing the concentration of lactate, the mutations in PDK 
result in the inhibition of hydroxylase and the stabilization of HIF-1. 

Mutations in isocitrate dehydrogenase result in the generation of an 
oncogenic metabolite, 2-hydroxyglutarate. The mutant enzyme catalyzes the 
conversion of isocitrate to a-ketoglutarate, but then reduces a-ketoglutarate 
to form 2-hydroxyglutarate. 2-Hydroxyglutarate alters the methylation 
patterns in DNA (Section 32.2) and reduces dependence on growth factors 
for proliferation. These changes alter gene expression and promote unre- 
strained cell growth. 

These observations linking citric acid cycle enzymes to cancer suggest 
that cancer is also a metabolic disease, not simply a disease of mutant growth 
factors and cell cycle control proteins. The realization that there is a metabolic 
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FIGURE 17.21 PATHWAY INTEGRATION: 
Pathways active during exercise after a 
night’s rest. The rate of the citric acid cycle 
increases during exercise, requiring the 
replenishment of oxaloacetate and acetyl 
CoA. Oxaloacetate is replenished by its 
formation from pyruvate. Acetyl CoA may 
be produced from the metabolism of both 
pyruvate and fatty acids. 


component to cancer opens the door to new thinking about the control of 
cancer. Indeed, preliminary experiments suggest that if cancer cells undergo- 
ing aerobic glycolysis are forced by pharmacological manipulation to use 
oxidative phosphorylation, the cancer cells lose their malignant properties. It 
is also interesting to note that the citric acid cycle, which has been studied for 
decades, still has secrets to be revealed by future biochemists. 


17.4 The Citric Acid Cycle Is a Source of 
Biosynthetic Precursors 


Thus far, discussion has focused on the citric acid cycle as the major degrada- 
tive pathway for the generation of ATP. As a major metabolic hub of the cell, 
the citric acid cycle also provides intermediates for biosyntheses (Figure 17.20). 
For example, most of the carbon atoms in porphyrins come from succinyl CoA. 
Many of the amino acids are derived from a-ketoglutarate and oxaloacetate. 
These biosynthetic processes will be considered in subsequent chapters. 
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FIGURE 17.20 Biosynthetic roles of the citric acid cycle. Intermediates are drawn off for 
biosyntheses (shown by red arrows) when the energy needs of the cell are met. Intermediates 
are replenished by the formation of oxaloacetate from pyruvate. 


The citric acid cycle must be capable of being rapidly replenished 


An important consideration is that citric acid cycle intermediates must be 
replenished if any are drawn off for biosyntheses. Suppose that much oxaloac- 
etate is converted into amino acids for protein synthesis and, subsequently, 
the energy needs of the cell rise. The citric acid cycle will operate to a 
reduced extent unless new oxaloacetate is formed because acetyl CoA can- 
not enter the cycle unless it condenses with oxaloacetate. Even though oxa- 
loacetate is recycled, a minimal level must be maintained to allow the cycle 
to function. 

How is oxaloacetate replenished? Mammals lack the enzymes for the net 
conversion of acetyl CoA into oxaloacetate or any other citric acid cycle 
intermediate. Rather, oxaloacetate is formed by the carboxylation of pyru- 
vate, in a reaction catalyzed by the biotin-dependent enzyme pyruvate car- 
boxylase (Figure 17.21). 


Pyruvate + CO, + ATP + H,O = oxaloacetate + ADP + P; + 2H* 


Recall that this enzyme plays a crucial role in gluconeogenesis (Section 
16.3). It is active only in the presence of acetyl CoA, which signifies the 
need for more oxaloacetate. If the energy charge is high, oxaloacetate is con- 
verted into glucose. If the energy charge is low, oxaloacetate replenishes the 
citric acid cycle. The synthesis of oxaloacetate by the carboxylation of pyru- 
vate is an example of an anaplerotic reaction (anaplerotic is of Greek origin, 
meaning to “fill up’’), a reaction that leads to the net synthesis, or replenish- 
ment, of pathway components. Note that because the citric acid cycle is a 
cycle, it can be replenished by the generation of any of the intermediates. 
Glutamine is an especially important source of citric acid cycle intermedi- 
ates in rapidly growing cells, including cancer cells. Glutamine is converted 
into glutamate and then into a-ketoglutarate. 


The disruption of pyruvate metabolism is the cause of beriberi and 
poisoning by mercury and arsenic 


e Beriberi, a neurologic and cardiovascular disorder, is caused by a 

dietary deficiency of thiamine (also called vitamin B,). The disease has 
been and continues to bea serious health problem in the Far East because rice, 
the major food, has a rather low content of thiamine. This deficiency is partly 
ameliorated if the whole rice grain is soaked in water before milling; some of 
the thiamine in the husk then leaches into the rice kernel. The problem is 
exacerbated if the rice is polished to remove the outer layer (that is, converted 
from brown to white rice), because only the outer layer contains significant 
amounts of thiamine. A form of beriberi, called Wernicke’s encephalopathy, 
is also occasionally seen in alcoholics who are severely malnourished and thus 
thiamine deficient. The disease is characterized by neurologic and cardiac 
symptoms. Damage to the peripheral nervous system is expressed as pain in 
the limbs, weakness of the musculature, and distorted skin sensation. The 
heart may be enlarged and the cardiac output inadequate. 

Which biochemical processes might be affected by a deficiency of thia- 
mine? Thiamine is the precursor of the cofactor thiamine pyrophosphate. 
This cofactor is the prosthetic group of three important enzymes: pyruvate dehy- 
drogenase, a-ketoglutarate dehydrogenase, and transketolase. Transketolase 
functions in the pentose phosphate pathway, which will be considered in 
Chapter 20. The common feature of enzymatic reactions utilizing TPP is the 
transfer of an activated aldehyde unit. In beriberi, the levels of pyruvate and 
a-ketoglutarate in the blood are higher than normal. The increase in the level 
of pyruvate in the blood is especially pronounced after the ingestion of glu- 
cose. A related finding is that the activities of the pyruvate and a-ketoglutarate 
dehydrogenase complexes in vivo are abnormally low. The low transketolase 
activity of red blood cells in beriberi is an easily measured and reliable diag- 
nostic indicator of the disease. 

Why does TPP deficiency lead primarily to neurological disorders? The 
nervous system relies essentially on glucose as its only fuel. The product of 
glycolysis, pyruvate, can enter the citric acid cycle only through the pyru- 
vate dehydrogenase complex. With that enzyme deactivated, the nervous 
system has no source of fuel. In contrast, most other tissues can use fats as a 
source of fuel for the citric acid cycle. 

Symptoms similar to those of beriberi appear in organisms exposed to 
mercury or arsenite (AsO,,°~). Both materials have a high affinity for neigh- 
boring sulfhydryls, such as those in the reduced dihydrolipoyl groups of the 
E, component of the pyruvate dehydrogenase complex (Figure 17.22). The 
binding of mercury or arsenite to the dihydrolipoy] groups inhibits the com- 
plex and leads to central nervous system pathologies. The proverbial phrase 
“mad as a hatter” refers to the strange behavior of poisoned hat makers who 
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17.4 A Source of Biosynthetic Precursors 


Beriberi 

A vitamin-deficiency disease first described in 
1630 by Jacob Bonitus, a Dutch physician 
working in Java: 


“A certain very troublesome affliction, which 
attacks men, is called by the inhabitants 
Beriberi (which means sheep). | believe 
those, whom this same disease attacks, with 
their knees shaking and the legs raised up, 
walk like sheep. It is a kind of paralysis, or 
rather Tremor: for it penetrates the motion 
and sensation of the hands and feet indeed 
sometimes of the whole body.” 


[The Granger Collection.] 


FIGURE 17.22 Arsenite poisoning. Arsenite inhibits the pyruvate 
dehydrogenase complex by inactivating the dihydrolipoamide 
component of the transacetylase. Some sulfhydryl reagents, such HS 
as 2,3-dimercaptoethanol, relieve the inhibition by forming a 

complex with the arsenite that can be excreted. 


+ As——O- 
HO 
Dihydrolipoamide Arsenite 
from pyruvate 
dehydrogenase 


component E, 


The manuscript proposing the citric acid 
cycle was submitted for publication to Nature 
but was rejected in June 1937. That same 
year it was published in Enzymologia. Dr. 
Krebs proudly displayed the rejection letter 
throughout his career as encouragement for 
young scientists. 


“The editor of NATURE presents his 
compliments to Dr. H. A. Krebs and regrets 
that as he has already sufficient letters to fill 
the correspondence columns of NATURE for 
seven or eight weeks, it is undesirable to 
accept further letters at the present time on 
account of the time delay which must occur 
in their publication. 

If Dr. Krebs does not mind much delay 
the editor is prepared to keep the letter until 
the congestion is relieved in the hope of 
making use of it. 

He returns it now, in case Dr. Krebs 
prefers to submit it for early publication to 
another periodical.” 
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2,3-Dimercaptopropanol 
(BAL) 


-0—As Excreted 


Arsenite chelate 
on enzyme 


Restored enzyme 


used mercury nitrate to soften and shape animal furs. This form of mercury 
is absorbed through the skin. Similar symptoms afflicted the early photog- 
raphers, who used vaporized mercury to create daguerreotypes. 

Treatment for these poisons is the administration of sulfhydryl reagents 
with adjacent sulfhydryl groups to compete with the dihydrolipoyl resi- 
dues for binding with the metal ion. The reagent—metal complex is then 
excreted in the urine. Indeed, 2,3-dimercaptopropanol (Figure 17.22) was 
developed after World War I as an antidote to lewisite, an arsenic-based 
chemical weapon. This compound was initially called BAL, for British 
anti-lewisite. 


The citric acid cycle may have evolved from preexisting pathways 


— How did the citric acid cycle come into being? Although definitive 

answers are elusive, informed speculation is possible. We can perhaps 
begin to comprehend how evolution might work at the level of biochemical 
pathways. 

The citric acid cycle was most likely assembled from preexisting reac- 
tion pathways. As noted earlier, many of the intermediates formed in the 
citric acid cycle are used in metabolic pathways for amino acids and por- 
phyrins. Thus, compounds such as pyruvate, a-ketoglutarate, and oxaloac- 
etate were likely present early in evolution for biosynthetic purposes. The 
oxidative decarboxylation of these a-ketoacids is quite favorable thermo- 
dynamically and can be used to drive the synthesis of both acyl CoA 
derivatives and NADH. These reactions almost certainly formed the core 
of processes that preceded the citric acid cycle evolutionarily. Interestingly, 
a-ketoglutarate and oxaloacetate can be interconverted by transamination 
of the respective amino acids by aspartate aminotransferase, another key 
biosynthetic enzyme. Thus, cycles comprising smaller numbers of interme- 
diates used for a variety of biochemical purposes could have existed before 
the present form evolved. 


17.5 The Glyoxylate Cycle Enables Plants and Bacteria 
to Grow on Acetate 


Acetyl CoA that enters the citric acid cycle has but one fate: oxidation to 
CO, and H,O. Most organisms thus cannot convert acetyl CoA into 
glucose. Although oxaloacetate, a key precursor to glucose, is formed in 
the citric acid cycle, the two decarboxylations that take place before 


the regeneration of oxaloacetate pre- 
clude the net conversion of acetyl 
CoA into glucose. 

In plants and in some microorgan- 
isms, there is a metabolic pathway 
that allows the conversion of acetyl | 
CoA generated from fats stores into CHa 
glucose. This reaction sequence, 
called the glyoxylate cycle, is similar to 
the citric acid cycle but bypasses the 
two decarboxylation steps of the cycle. 
Another important difference is that 
two molecules of acetyl CoA enter per NAD* 
turn of the glyoxylate cycle, compared 
with one in the citric acid cycle. 

The glyoxylate cycle(Figure 17.23), 
like the citric acid cycle, begins with | 
the condensation of acetyl CoA and | 
oxaloacetate to form citrate, which is | 
then isomerized to isocitrate. Instead CcOO- 
of being decarboxylated, as in the cit- Malate 
ric acid cycle, isocitrate is cleaved by 
isocitrate lyase into succinate and gly- 
oxylate. The ensuing steps regenerate 
oxaloacetate from glyoxylate. First, CoA 
acetyl CoA condenses with glyoxylate 
to form malate in a reaction catalyzed 
by malate synthase, and then malate 
is oxidized to oxaloacetate, as in the 
citric acid cycle. The sum of these 
reactions is 


2 Acetyl CoA + NAD* + 2H,O —> 
succinate + 2CoA + NADH + 2H~* 


In plants, these reactions take place in organelles called glyoxysomes. This 
cycle is especially prominent in oil-rich seeds, such as those from sunflow- 
ers, cucumbers, and castor beans. Succinate, released midcycle, can be 
converted into carbohydrates by a combination of the citric acid cycle and 
gluconeogenesis. The carbohydrates power seedling growth until the cell 
can begin photosynthesis. Thus, organisms with the glyoxylate cycle gain a 
metabolic versatility because they can use acetyl CoA as a precursor of glu- 
cose and other biomolecules. 


+H 0 


COoO- 


NADH + Ht Oxaloacetate 


Malate 
dehydrogenase 


Malate 
synthase 


SUMMARY 


The citric acid cycle is the final common pathway for the oxidation 
of fuel molecules. It also serves as a source of building blocks for 
biosyntheses. 


17.1. The Pyruvate Dehydrogenase Complex Links Glycolysis to the 
Citric Acid Cycle 
Most fuel molecules enter the cycle as acetyl CoA. The link between 
glycolysis and the citric acid cycle is the oxidative decarboxylation of 
pyruvate to form acetyl CoA. In eukaryotes, this reaction and those of 
the cycle take place inside mitochondria, in contrast with glycolysis, 
which takes place in the cytoplasm. 


Acetyl CoA 
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FIGURE 17.23 The glyoxylate cycle. 

The glyoxylate cycle allows plants and some 
microorganisms to grow on acetate because 
the cycle bypasses the decarboxylation 
steps of the citric acid cycle. The reactions 
of this cycle are the same as those of the 
citric acid cycle except for the ones 
catalyzed by isocitrate lyase and malate 
synthase, which are boxed in blue. 
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KEY TERMS 


17.2 The Citric Acid Cycle Oxidizes Two-Carbon Units 


17.3 


The cycle starts with the condensation of oxaloacetate (C,) and the acetyl 
unit (C,) of acetyl CoA to give citrate (C,), which is isomerized to isocit- 
rate (C,). Oxidative decarboxylation of this intermediate gives a- 
ketoglutarate (C). The second molecule of carbon dioxide comes off in 
the next reaction, in which a-ketoglutarate is oxidatively decarboxylated 
to succinyl CoA (C,). The thioester bond of succinyl CoA is cleaved by 
orthophosphate to yield succinate, and an ATP is concomitantly gener- 
ated. Succinate is oxidized to fumarate (C,), which is then hydrated to 
form malate (C,). Finally, malate is oxidized to regenerate oxaloacetate 
(C,). Thus, two carbon atoms from acetyl CoA enter the cycle, and two 
carbon atoms leave the cycle as CO, in the successive decarboxylations 
catalyzed by isocitrate dehydrogenase and a-ketoglutarate dehydroge- 
nase. In the four oxidation—reduction reactions in the cycle, three pairs 
of electrons are transferred to NAD* and one pair to FAD. These 
reduced electron carriers are subsequently oxidized by the electron- 
transport chain to generate approximately 9 molecules of ATP. In addi- 
tion, 1 molecule of ATP 1s directly formed in the citric acid cycle. Hence, 
a total of 10 molecules of ATP are generated for each two-carbon frag- 
ment that is completely oxidized to H,O and CO,. 


Entry to the Citric Acid Cycle and Metabolism Through It Are Controlled 
The citric acid cycle operates only under aerobic conditions because it 
requires a supply of NAD* and FAD. These electron acceptors are 
regenerated when NADH and FADH, transfer their electrons to O, 
through the electron-transport chain, with the concomitant produc- 
tion of ATP. Consequently, the rate of the citric acid cycle depends on 
the need for ATP. The irreversible formation of acetyl CoA from 
pyruvate is an important regulatory point for the entry of glucose- 
derived pyruvate into the citric acid cycle. The activity of the pyruvate 
dehydrogenase complex is stringently controlled by reversible phos- 
phorylation. In eukaryotes, the regulation of two enzymes in the cycle 
also is important for control. A high energy charge diminishes the 
activities of isocitrate dehydrogenase and a-ketoglutarate dehydroge- 
nase. These mechanisms complement each other in reducing the rate 
of formation of acetyl CoA when the energy charge of the cell is high 
and when biosynthetic intermediates are abundant. 


17.4 The Citric Acid Cycle Is a Source of Biosynthetic Precursors 


When the cell has adequate energy available, the citric acid cycle can 
also provide a source of building blocks for a host of important biomol- 
ecules, such as nucleotide bases, proteins, and heme groups. This use 
depletes the cycle of intermediates. When the cycle again needs to 
metabolize fuel, anaplerotic reactions replenish the cycle intermediates. 


17.5 The Glyoxylate Cycle Enables Plants and Bacteria to Grow on Acetate 


The glyoxylate cycle enhances the metabolic versatility of many plants 
and bacteria. This cycle, which uses some of the reactions of the citric 
acid cycle, enables these organisms to subsist on acetate because it 
bypasses the two decarboxylation steps of the citric acid cycle. 


citric acid (tricarboxylic acid, TCA; 
Krebs) cycle (p. 495) 

acetyl CoA (p. 495) 

oxidative phosphorylation (p. 497) 

pyruvate dehydrogenase complex (p. 497) 

flavoprotein (p. 500) 


citrate synthase (p. 502) beriberi (p. 515) 
iron-sulfur (nonheme iron) protein glyoxylate cycle (p. 517) 
(p. 504) isocitrate lyase (p. 517) 
isocitrate dehydrogenase (p. 504) malate synthase (p. 517) 

a-ketoglutarate dehydrogenase (p. 505) glyoxysome (p. 517) 


anaplerotic reaction (p. 515) 
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1. A one-way link. What reaction serves to link glycolysis 
and the citric acid cycle, and what is the enzyme that cata- 
lyzes the reaction? 


2. Naming names. What are the five enzymes (including 
regulatory enzymes) that constitute the pyruvate dehydro- 
genase complex? Which reactions do they catalyze? 


3. Coenzymes. What coenzymes are required by the pyru- 
vate dehydrogenase complex? What are their roles? 


4. More coenzymes. Distinguish between catalytic coen- 
zymes and stoichiometric coenzymes in the pyruvate dehy- 
drogenase complex. 


5. Waste and fraud? Figure 17.9 shows the steps in the 
pyruvate dehydrogenase complex reaction cycle. A key 
product, acetyl CoA, is released after the fourth step. What 
is the purpose of the remaining steps? 


6. Joined at the hip. List some of the advantages of organiz- 
ing the enzymes that catalyze the formation of acetyl CoA 
from pyruvate into a single large complex. 


7. Flow of carbon atoms. What is the fate of the radioac- 
tive label when each of the following compounds is 
added to a cell extract containing the enzymes and cofac- 
tors of the glycolytic pathway, the citric acid cycle, and 
the pyruvate dehydrogenase complex? (The !4C label is 
printed in red.) 


(a) ; (b) f 
Fs, ws 
H3C coo- H5C coo" 
(c) ; (d) 
Cc Uo 
H;C~ ~COO- H3C~ 5c 


(e) Glucose 6-phosphate labeled at C-1. 
8. C, + C,>C, 


(a) Which enzymes are required to get net synthesis of oxa- 
loacetate from acetyl CoA? 


(b) Write a balanced equation for the net synthesis. 


(c) Do mammalian cells contain the requisite enzymes? 


9. Driving force. What is the AG®’ for the complete oxida- 
tion of the acetyl unit of acetyl CoA by the citric acid cycle? 


10. Acting catalytically. The citric acid cycle itself, which is 
composed of enzymatically catalyzed steps, can be thought 
of essentially as a supramolecular enzyme. Explain. 


11. A potent inhibitor. Thiamine thiazolone pyrophosphate 
binds to pyruvate dehydrogenase about 20,000 times as 


strongly as does thiamine pyrophosphate, and it competi- 
tively inhibits the enzyme. Why? 


R’ 


H3C 
3 Ne 
ae 
R Ss 
TPP 


HC p 
3 
N 
| S=o 
R Ss 
Thiazolone analog 
of TPP 


12. Like Holmes and Watson. Match each term with its 


description. 


(a) Acetyl CoA 

(b) Citric acid cycle 

(c) Pyruvate dehydrogenase 
complex 

(d) Thiamine pyrophos- 
phate 

(e) Lipoic acid 

(f) Pyruvate dehydrogenase 


(g) Acetyllipoamide 

(h) Dihydrolipoy] trans- 
acetylase 

(i) Dihydrolipoyl dehydro- 
genase 

(j) Beriberi 


1. Catalyzes the link 
between glycolysis and 
the citric acid cycle. 

2. Coenzyme required by 
transacetylase 

3. Final product of pyruvate 
dehydrogenase 

4, Catalyzes the formation 
of acetyl CoA 

5. Regenerates active trans- 
acetylase 

6. Fuel for the citric acid 
cycle 

7. Coenzyme required by 
pyruvate dehydrogenase 

8. Catalyzes the oxidative 
decarboxylation of pyru- 
vate 

9. Due to a deficiency of 
thiamine 

10. Central metabolic hub 


13. Lactic acidosis. Patients in shock often suffer from 
lactic acidosis owing to a deficiency of O,. Why does a 


lack of O, lead to lactic acid accumulation? One treatment 
for shock is to administer dichloroacetate (DCA), which 
inhibits the kinase associated with the pyruvate dehydro- 
genase complex. What is the biochemical rationale for 
this treatment? 


14. DCA again. Patients with pyruvate dehydrogenase defi- 
ciency show high levels of lactic acid in the blood. However, 
in some cases, treatment with dichloroacetate (DCA) lowers 
lactic acid levels. 


(a) How does DCA act to stimulate pyruvate dehydroge- 
nase activity? 


(b) What does this suggest about pyruvate dehydrogenase 
activity in patients who respond to DCA? 


15. Energy rich. What are the thioesters in the reaction 
catalyzed by PDH complex? 
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16. Alternative fates. Compare the regulation of the pyru- 
vate dehydrogenase complex in muscle and in liver. 


17. Mutations. 


(a) Predict the effect of a mutation that enhances the activ- 
ity of the kinase associated with the PDH complex. 


(b) Predict the effect of a mutation that reduces the activity 
of the phosphatase associated with the PDH complex. 


18. Flaking paint, green wallpaper. Clare Boothe Luce, 
ambassador to Italy in the 1950s (and Connecticut con- 
gressperson, playwright, editor of Vanity Fair, and the 
wife of Henry Luce, founder of Time magazine and Sports 
Illustrated) became ill when she was staying at the ambas- 
sadorial residence in Rome. The paint on the dining room 
ceiling, an arsenic-based paint, was flaking; the wallpaper 
of her bedroom in the ambassadorial residence was colored 
a mellow green owing to the presence of cupric arsenite in 
the pigment. Suggest a possible cause of Ambassador 
Luce’s illness. 


19. Like Jack and Jill. Match each enzyme with its 
description. 


(a) Pyruvate dehydrogenase 1.Catalyzes the formation 
complex of isocitrate 


(b) Citrate synthase 2. Synthesizes succinyl CoA 
(c) Aconitase 3. Generates malate 
(d) Isocitrate dehydrogenase 4. Generates ATP 

5. Converts pyruvate into 
(e) a-Ketoglutarate dehy- acetyl CoA 


drogenase 6. Converts pyruvate into 


(f) Succinyl CoA synthetase —_ oxaloacetate 

7. Condenses oxaloacetate 
(g) Succinate dehydrogenase and acetyl CoA 

8. Catalyzes the formation 
(h) Fumarase of oxaloacetate 


(1) Malate dehydrogenase 9. Synthesizes fumarate 
10. Catalyzes the formation 
of a-ketoglutarate 


Qj) Pyruvate carboxylase 


20. A hoax, perhaps? The citric acid cycle is part of aero- 
bic respiration, but no O, 1s required for the cycle. Explain 
this paradox. 


21. One of a kind. How is succinate dehydrogenase unique 
compared with the other enzymes in the citric acid cycle? 


22. Coupling reactions. The oxidation of malate by NAD* to 
form oxaloacetate is a highly endergonic reaction under stan- 
dard conditions [AG°’ = 29 kJ mol~! (7 kcal mol~!)]. The 


reaction proceeds readily under physiological conditions. 


(a) Explain why the reaction proceeds as written under 
physiological conditions. 


(b) Assuming an [NAD*]/[NADH] ratio of 8 and a pH of 
7, what is the lowest [malate]/[oxaloacetate] ratio at which 
oxaloacetate can be formed from malate? 


23. Synthesizing a-ketoglutarate. It is possible, with the use 
of the reactions and enzymes considered in this chapter, to 
convert pyruvate into a-ketoglutarate without depleting 
any of the citric acid cycle components. Write a balanced 
reaction scheme for this conversion, showing cofactors and 
identifying the required enzymes. 


24. Seven o'clock roadblock. Malonate is a competitive 
inhibitor of succinate dehydrogenase. How will the concen- 
trations of citric acid cycle intermediates change immedi- 
ately after the addition of malonate? Why is malonate not a 
substrate for succinate dehydrogenase? 


COO- 


CH 


COoO- 
Malonate 


25. No signal, no activity. Why is acetyl CoA an especially 
appropriate activator for pyruvate carboxylase? 


26. Power differentials. As we will see in the next chap- 
ter, when NADH reacts with oxygen, 2.5 ATP are gener- 
ated. When FADH, reduces oxygen, only 1.5 ATP are 
generated. Why then does succinate dehydrogenase pro- 
duce FADH, and not NADH when succinate is reduced 


to fumarate? 


27. Back to Orgo (or do you say O chem?). Before any oxida- 
tion can occur in the citric acid cycle, citrate must be isom- 
erized into isocitrate. Why is this the case? 


28. A nod is as good as a wink to a blind horse. Explain why 
a GTP molecule, or another nucleoside triphosphate, is 
energetically equivalent toan ATP molecule in metabolism. 


29. One from two. The synthesis of citrate from acetyl CoA 
and oxaloacetate is a biosynthetic reaction. What is the 
energy source that drives formation of citrate? 


30. Versatility. What is the chief benefit of being able to 
perform the glyoxylate cycle? 


Chapter Integration Problems 


31. Fats into glucose? Fats are usually metabolized into ace- 
tyl CoA and then further processed through the citric acid 
cycle. In Chapter 16, we saw that glucose can be synthe- 
sized from oxaloacetate, a citric acid cycle intermediate. 
Why, then, after a long bout of exercise depletes our carbo- 
hydrate stores, do we need to replenish those stores by eat- 
ing carbohydrates? Why do we not simply replace them by 
converting fats into carbohydrates? 


32. Alternative fuels. As we will see (Chapter 22), fatty acid 
breakdown generates a large amount of acetyl CoA. What 
will be the effect of fatty acid breakdown on pyruvate dehy- 
drogenase complex activity? On glycolysis? 


Mechanism Problems 


33. Theme and variation. Propose a reaction mechanism for 
the condensation of acetyl CoA and glyoxylate in the glyox- 
ylate cycle of plants and bacteria. 


34. Symmetry problems. In experiments carried out in 
1941 to investigate the citric acid cycle, oxaloacetate 
labeled with !4C in the carboxyl carbon atom farthest from 
the keto group was introduced to an active preparation of 
mitochondria. 


Coo- 
Oxaloacetate 


Analysis of the a-ketoglutarate formed showed that 
none of the radioactive label had been lost. Decarboxylation 
of a-ketoglutarate then yielded succinate devoid of radioac- 
tivity. All the label was in the released CO,. Why were the 
early investigators of the citric acid cycle surprised that all 
the label emerged in the CO,’ 


35. Symmetric molecules reacting asymmetrically. The inter- 
pretation of the experiments described in Problem 34 was 
that citrate (or any other symmetric compound) cannot be 
an intermediate in the formation of a-ketoglutarate, because 
of the asymmetric fate of the label. This view seemed com- 
pelling until Alexander Ogston incisively pointed out in 
1948 that “it is possible that an asymmetric enzyme which 
attacks a symmetrical compound can distinguish between its 
identical groups [italics added].” For simplicity, consider a 
molecule in which two hydrogen atoms, a group X, and a 
different group Y are bonded to a tetrahedral carbon atom 
as a model for citrate. Explain how a symmetric molecule 
can react with an enzyme in an asymmetric way. 


Data Interpretation Problem 


36. A little goes a long way. As will become clearer in 
Chapter 18, the activity of the citric acid cycle can be 
monitored by measuring the amount of O, consumed. 
The greater the rate of O, consumption, the faster the 
rate of the cycle. Hans Krebs used this assay to investigate 
the cycle in 1937. He used as his experimental system 
minced pigeon-breast muscle, which is rich in mitochon- 
dria. In one set of experiments, Krebs measured the O, 
consumption in the presence of carbohydrate only and in 
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the presence of carbohydrate and citrate. The results are 
shown in the following table. 


Effect of citrate on oxygen consumption by minced 
pigeon-breast muscle 


Micromoles of oxygen consumed 


Carbohydrate Carbohydrate plus 
Time (min) only 3 pmol of citrate 
10 26 28 
60 43 62 
90 46 TE 
150 49 85 


(a) How much O, would be absorbed if the added citrate 
were completely oxidized to H,O and CO,? 


(b) On the basis of your answer to part a, what do the 
results given in the table suggest? 


37. Ayrsenite poisoning. The effect of arsenite on the experi- 
mental system of Problem 36 was then examined. 
Experimental data (not presented here) showed that the 
amount of citrate present did not change in the course of the 
experiment in the absence of arsenite. However, if arsenite 
was added to the system, different results were obtained, as 
shown in the following table. 


Disappearance of citric acid in pigeon-breast muscle 
in the presence of arsenite 


Micromoles 
of citrate used 


Micromoles of citrate 
found after 40 minutes 


Micromoles of 
citrate added 


22 0.6 ai 
44 20.0 24 
90 56.0 34 


(a) What is the effect of arsenite on the disappearance of 
citrate? 


(b) How is the action of arsenite altered by the addition of 
more citrate? 


(c) What do these data suggest about the site of action of 
arsenite? 


38. Isocitrate lyase and _ tuberculosis. The bacterium 
Mycobacterium tuberculosis, the cause of tuberculosis, can 
invade the lungs and persist in a latent state for years. During 
this time, the bacteria reside in granulomas—nodular scars 
containing bacteria and host-cell debris in the center and 
surrounded by immune cells. The granulomas are lipid-rich, 
oxygen-poor environments. How these bacteria manage to 
persist is something of a mystery. The results of recent 
research suggest that the glyoxylate cycle is required for the 
persistence. The following data show the amount of bacteria 
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[presented as colony-forming units (cfu)] in mice lungs in 
the weeks after an infection. 


(A) 
Wild type 

7 }_ 
> z 
Do 6 Isocitrate lyase 
S gene deleted 
oo 
o 5 


alLlL_™t | | | of 
0 2 4 6 8 10 12 14 #16 
Weeks postinfection 


In graph A, the black circles represent the results for 
wild-type bacteria and the red circles represent the results 
for bacteria from which the gene for isocitrate lyase was 


deleted. 
(a) What is the effect of the absence of isocitrate lyase? 


The techniques described in Chapter 5 were used to rein- 
sert the gene encoding isocitrate lyase into bacteria from 
which it had previously been deleted. 


In graph B, black circles represent bacteria into which 
the gene was reinserted and red circles represent bacteria in 
which the gene was still missing. 


(b) Do these results support those obtained in part a? 
(c) What is the purpose of the experiment in part b? 


(d) Why do these bacteria perish in the absence of the gly- 
oxylate cycle? 


(B) Isocitrate lyase 
gene restored 


6 
2 Isocitrate lyase 
5 
2 gene deleted 
= 
24 
3 


0 2 4 6 8 10 12 14 16 
Weeks postinfection 


[Data from KcKinney et al., Nature 406:735-738, 2000.] 


Oxidative Phosphorylation 


ATP 


he amount of ATP that human beings require to go about their lives is 

staggering. A sedentary male of 70 kg (154 lbs) requires about 8400 kJ 
(2000 kcal) for a day’s worth of activity. To provide this much energy 
requires 83 kg of ATP. However, human beings possess only about 250 g 
of ATP at any given moment. The disparity between the amount of ATP 
that we have and the amount that we require is compensated by recycling 
ADP back to ATP. Each ATP molecule is recycled approximately 300 
times per day. This recycling takes place primarily through oxidative 
phosphorylation. 

We begin our study of oxidative phosphorylation by examining the oxi- 
dation—reduction reactions that allow the flow of electrons from NADH 
and FADH, to oxygen. The electron flow takes place in four large protein 
complexes that are embedded in the inner mitochondrial membrane, 
together called the respiratory chain or the electron-transport chain. 


NADH +10, B*+—*H,0 + NAD* 
AG* = —220.1 kJ mol~! (—52.6 kcal mol~!) 


The overall reaction is exergonic. Importantly, three of the complexes of the 
electron-transport chain use the energy released by the electron flow to 
pump protons out of the mitochondrial matrix. In essence, energy is trans- 
formed. The resulting unequal distribution of protons generates a pH gradi- 
ent and a transmembrane electrical potential that creates a proton-motive 
force. ATP is synthesized when protons flow back to the mitochondrial 
matrix through an enzyme complex. 


ADP +2 oF i= ALP + HZ0 
AG*' = +30.5 kJ mol~! (+7.3 kcal mol7!) 
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Delivered by 
blood flow 


Mitochondria, stained green, form a network inside a 
fibroblast cell (left). Mitochondria oxidize carbon fuels to 
form cellular energy in the form of ATP. [(Left) Courtesy of 
Michael P. Yaffee, Department of Biology, University of 
California at San Diego.] 
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FIGURE 18.1 Overview of oxidative 
phosphorylation. Oxidation and ATP 
synthesis are coupled by transmembrane 
proton fluxes. Electrons flow from NADH and 
FADH, through four protein complexes to 
reduce oxygen to water (yellow tube). Three 
of the complexes pump protons from the 
mitochondrial matrix to the intermembrane 
space. The protons return to the matrix by 
flowing through another protein complex, 
ATP synthase (red structure), powering the 
synthesis of ATP. 


Respiration 

An ATP-generating process in which an 
inorganic compound (such as molecular 
oxygen) serves as the ultimate electron 
acceptor. The electron donor can be either 
an organic compound or an inorganic one. 
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+++ 


Matrix 


Membrane 


Thus, the oxidation of fuels and the phosphorylation of ADP are coupled 
by a proton gradient across the inner mitochondrial membrane (Figure 18.1). 

Collectively, the generation of high-transfer-potential electrons by the 
citric acid cycle, their flow through the respiratory chain, and the accompa- 
nying synthesis of ATP is called respiration or cellular respiration. 


18.1 Eukaryotic Oxidative Phosphorylation 
Takes Place in Mitochondria 


Recall that a biochemical role of the citric acid cycle, which takes place in 
the mitochondrial matrix, is the generation of high-energy electrons. It is 
fitting, therefore, that oxidative phosphorylation, which will convert the 
energy of these electrons into ATP, also takes place in mitochondria. 
Mitochondria are oval-shaped organelles, typically about 2 pm in length 
and 0.5 wm in diameter, about the size of a bacterium. 


Mitochondria are bounded by a double membrane 


Electron microscopic studies revealed that mitochondria have two mem- 
brane systems: an outer membrane and an extensive, highly folded inner 
membrane. The inner membrane is folded into a series of internal ridges 
called cristae. Hence, there are two compartments in mitochondria: (1) the 
intermembrane space between the outer and the inner membranes and (2) the 
matrix, which is bounded by the inner membrane (Figure 18.2). The mito- 
chondrial matrix is the site of most of the reactions of the citric acid cycle 
and fatty acid oxidation. In contrast, oxidative phosphorylation takes place 
in the inner mitochondrial membrane. The increase in surface area of the 
inner mitochondrial membrane provided by the cristae creates more sites 
for oxidative phosphorylation than would be the case with a simple, 
unfolded membrane. Humans contain an estimated 14,000 m2 of inner 
mitochondrial membrane, which is the approximate equivalent of three 
football fields in the United States. 

The outer membrane is quite permeable to most small molecules and 
ions because it contains mitochondrial porin, a 30- to 35-kDa pore- 
forming protein also known as VDAC, for voltage-dependent anion 
channel. VDAC, the most prevalent protein in the outer mitochondrial 
membrane, plays a role in the regulated flux of metabolites—usually 
anionic species such as phosphate, chloride, organic anions, and the 
adenine nucleotides—across the outer membrane. In contrast, the inner 
membrane is impermeable to nearly all ions and polar molecules. A large 
family of transporters shuttles metabolites such as ATP, pyruvate, and 
citrate across the inner mitochondrial membrane. The two faces of this 
membrane will be referred to as the matrix side and the cytoplasmic side 


Intermembrane _ Cristae 
space 


Matrix 


Inner membrane 


Outer membrane 


(B) 


(the latter because it is freely accessible to most small molecules in the 
cytoplasm). They are also called the N and P sides, respectively, because 
the membrane potential is negative on the matrix side and positive on the 
cytoplasmic side. 

In prokaryotes, the electron-driven proton pumps and ATP-synthesizing 
complexes are located in the cytoplasmic membrane, the inner of two 
membranes. The outer membrane of bacteria, like that of mitochondria, 1s 
permeable to most small metabolites because of the presence of porins. 


Mitochondria are the result of an endosymbiotic event 


Ne Mitochondria are semiautonomous organelles that live in an endo- 

symbiotic relation with the host cell. These organelles contain their 
own DNA, which encodes a variety of different proteins and RNAs. 
Mitochondrial DNA is usually portrayed as being circular, but recent 
research suggests that the mitochondrial DNA of many organisms may be 
linear. The genomes of mitochondria range broadly in size across species. 
The mitochondrial genome of the protist Plasmodium falciparum consists 
of fewer than 6000 base pairs (bp), whereas those of some land plants com- 
prise more than 200,000 bp (Figure 18.3). Human mitochondrial DNA 
comprises 16,569 bp and encodes 13 respiratory-chain proteins as well as 
the small and large ribosomal RNAs and enough tRNAs to translate all 
codons. However, mitochondria also contain many proteins encoded by 
nuclear DNA. Cells that contain mitochondria depend on these organelles 
for oxidative phosphorylation, and the mitochondria in turn depend on 
the cell for their very existence. How did this intimate symbiotic relation 
come to exist? 

An endosymbiotic event is thought to have occurred whereby a free-living 
organism capable of oxidative phosphorylation was engulfed by another 
cell. The double-membrane, circular DNA (with exceptions) and the 
mitochondrial-specific transcription and translation machinery all point to 
this conclusion. Thanks to the rapid accumulation of sequence data for 
mitochondrial and bacterial genomes, speculation on the origin of the 
“original” mitochondrion with some authority is now possible. The most 
mitochondrial-like bacterial genome is that of Rickettsia prowazekii, the 
cause of louse-borne typhus. The genome for this organism is more 
than 1 million base pairs in size and contains 834 protein-encoding genes. 
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FIGURE 18.2 Electron micrograph 

(A) and diagram (B) of a mitochondrion. 
[(A) Keith R. Porter/Science Source. 

(B) Information from Wolfe, Biology of 
the Cell, 2e, © 1981 Brooks/Cole, a part 
of Cengage Learning, Inc. Reproduced 
by permission www. cengage.com/ 
permission 3.] 
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FIGURE 18.3 Sizes of mitochondrial 
genomes. The sizes of three mitochondrial 
genomes compared with the genome of 
Rickettsia, a relative of the presumed 
ancestor of all mitochondria. For genomes 
of more than 60 kbp, the DNA coding 
region for genes with known function is 
shown in red. 


Acanthamoeba 


(an ameba) 
Schizosaccharomyces_ Porphyra 
(a yeast) (red alga) 


Plasmodium 
(a protozoan) 


rp!1,10,18,19,20,27,31,32,34 


FIGURE 18.4 Overlapping gene complements of mitochondria. 
The genes present within each oval are those present within the 
organism represented by the oval. Only rRNA- and protein-coding 
genes are shown. The genome of Reclinomonas contains all the 
protein-coding genes found in all the sequenced mitochondrial 
genomes. [Data from M. W. Gray, G. Burger, and B. F. Lang, Science 


283:1476-1481, 1999.] 


Voltmeter 


1 M H* in equilibrium 
with 1 atm H, gas 


Solution of 
1 MX and 1M xX" 


FIGURE 18.5 Measurement of redox 
potential. Apparatus for the measurement of 
the standard oxidation—reduction potential of 
a redox couple. Electrons flow through the 
wire connecting the cells, whereas ions flow 
through the agar bridge. 
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Marchantia (a protozoan) 
(a moss) 


Sequence data suggest that all extant mitochondria are 
derived from an ancestor of R. prowazekii as the result of a 
single endosymbiotic event. 

The evidence that modern mitochondria result from a 
single event comes from examination of the most bacteria- 
like mitochondrial genome, that of the protozoan 
Reclinomonas americana. Its genome contains 97 genes, of 
which 62 specify proteins. The genes encoding these pro- 
teins include all of the protein-coding genes found in all of 
the sequenced mitochondrial genomes (Figure 18.4). Yet, 
this genome encodes less than 2% of the protein-coding 
genes in the bacterium E. coli. In other words, a small frac- 
tion of bacterial genes—2%—1is found in all examined 
mitochondria. How is it possible that all mitochondria 
have the same 2% of the bacterial genome? It seems 
unlikely that mitochondrial genomes resulting from sev- 
eral endosymbiotic events could have been independently 
reduced to the same set of genes found in R. americana. 
Thus, the simplest explanation is that the endosymbiotic 
event took place just once and all existing mitochondria are 
descendants of that ancestor. 

Note that transient engulfment of prokaryotic cells by 
larger cells is not uncommon in the microbial world. In 
regard to mitochondria, such a transient relationship 
became permanent as the bacterial cell lost DNA, making 
it incapable of independent living, and the host cell became dependent on 
the ATP generated by its tenant. 


Reclinomonas 


18.2 Oxidative Phosphorylation Depends on 
Electron Transfer 


In Chapter 17, the generation of NADH and FADH, by the oxidation of 
acetyl CoA was identified as an important function of the citric acid cycle. 
In oxidative phosphorylation, electrons from NADH and FADH, are used 
to reduce molecular oxygen to water. The highly exergonic reduction is 
accomplished by a number of electron-transfer reactions, which take place 
in a set of membrane proteins known as the electron-transport chain. 


The electron-transfer potential of an electron is 
measured as redox potential 


In oxidative phosphorylation, the electron-transfer potential of NADH or 
FADH, is converted into the phosphoryl-transfer potential of ATP. To better 
understand this conversion, we need quantitative expressions for these 
forms of free energy. The measure of phosphoryl-transfer potential is 
already familiar to us: it is given by AG®’ for the hydrolysis of the activated 
phosphoryl compound. The corresponding expression for the electron- 
transfer potential is Ep, the reduction potential (also called the redox potential 
or oxidation—reduction potential). 

Consider a substance that can exist in an oxidized form X and a reduced 
form X~. Such a pair is called a redox couple and is designated X : X~. The 
reduction potential of this couple can be determined by measuring the elec- 
tromotive force generated by an apparatus called a sample half-cell con- 
nected to a standard reference half-cell (Figure 18.5). The sample half-cell 
consists of an electrode immersed in a solution of 1 M oxidant (X) and 1M 


reductant (X~). The standard reference half-cell consists of an electrode 
immersed in a 1 M H® solution that is in equilibrium with H, gas at 1 atmo- 
sphere (1 atm) of pressure. The electrodes are connected to a voltmeter, and 
an agar bridge allows ions to move from one half-cell to the other, establish- 
ing electrical continuity between the half-cells. Electrons then flow from 
one half-cell to the other through the wire connecting the two half-cells to 
the voltmeter. If the reaction proceeds in the direction 


Xo op > Ae GE, 
the reactions in the half-cells (referred to as half-reactions or couples) must be 
X > X+e7 HY +e" => 1B 


Thus, electrons flow from the sample half-cell to the standard reference 
half-cell, and the sample-cell electrode is taken to be negative with respect 
to the standard-cell electrode. The reduction potential of the X : X~ couple is 
the observed voltage at the start of the experiment (when X, X~, and H® are 
1 M with 1 atm of H,). The reduction potential of the H* : H, couple is 
defined to be 0 volts. In oxidation—reduction reactions, the donor of elec- 
trons, in this case X~, is called the reductant or reducing agent, whereas the 
acceptor of electrons, H* here, is called the oxidant or oxidizing agent. 

The meaning of the reduction potential is now evident. A negative 
reduction potential means that the oxidized form of a substance has lower 
affinity for electrons than does H.,,, as in the preceding example. A positive 
reduction potential means that the oxidized form of a substance has higher 
affinity for electrons than does H,. These comparisons refer to standard 
conditions—namely, 1 M oxidant, 1 M reductant, 1 M H™, and 1 atm la 
Thus, a strong reducing agent (such as NADH) is poised to donate electrons 
and has a negative reduction potential, whereas a strong oxidizing agent (such 
as O,,) is ready to accept electrons and has a positive reduction potential. 

The reduction potentials of many biologically important redox couples 
are known (Table 18.1). Table 18.1 is like those presented in chemistry 


TABLE 18.1 Standard reduction potentials of some reactions 
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Oxidant Reductant n E, (V) 
Succinate + CO, a-Ketoglutarate 2 —0.67 
Acetate Acetaldehyde 2 —0.60 
Ferredoxin (oxidized) Ferredoxin (reduced) 1 —0.43 
2H+ H, 2 —0.42 
NAD* NADH + Ht 2 —0.32 
NADP* NADPH + H* 2 —0.32 
Lipoate (oxidized) Lipoate (reduced) 2 —0.29 
Glutathione (oxidized) Glutathione (reduced) 2 —0.23 
FAD FADH, 2 —0.22 
Acetaldehyde Ethanol 2 =0.20 
Pyruvate Lactate 2 —0.19 
2H+ H, 2 0.00! 
Fumarate Succinate 2 +0.03 
Cytochrome b (+3) Cytochrome b (+2) 1 +0.07 
Dehydroascorbate Ascorbate 2 +0.08 
Ubiquinone (oxidized) Ubiquinone (reduced) 2 +0.10 
Cytochrome c (+3) Cytochrome c (+2) 1 +0.22 
Fe (+3) Fe (+2) 1 +0.77 
%O,+2H* H,O 2 +0.82 


Note: Ep is the standard oxidation—reduction potential (pH 7, 25°C) and n is the number of electrons 
transferred. E} refers to the partial reaction written as Oxidant + e~— reductant 


'Standard oxidation — reduction potential at pH = 0. 


18.2 Electron Transfer 
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textbooks, except that a hydrogen ion concentration of 107’ M (pH 7) 
instead of 1 M (pH 0) is the standard state adopted by biochemists. This 
difference is denoted by the prime in FE). Recall that the prime in AG®’ 
denotes a standard free-energy change at pH 7. 

The standard free-energy change AG®’ is related to the change in reduc- 
tion potential AE) by 


AG®’ = —nFAE} 


in which n is the number of electrons transferred, F is a proportionality 
constant called the Faraday constant [96.48 kJ mol~! V~! (23.06 kcal mol7! 
V- 4), AE) is in volts, and AG®’ is in kilojoules or kilocalories per mole. 

The free-energy change of an oxidation—reduction reaction can be read- 
ily calculated from the reduction potentials of the reactants. For example, 
consider the reduction of pyruvate by NADH, catalyzed by lactate dehy- 
drogenase. Recall that this reaction maintains redox balance in lactic acid 
fermentation (Figure 16.11). 


Pyruvate + NADH + H* = lactate + NAD* (A) 


The reduction potential of the NAD* : NADH couple, or half-reaction, is 
—0.32 V, whereas that of the pyruvate:lactate couple is —0.19 V. By conven- 
tion, reduction potentials (as in Table 18.1) refer to partial reactions written 
as reductions: oxidant + e~ — reductant. Hence, 


Pyruvate + 2H* +2e — lactate Eo = —0.19 V (B) 
NAD* +H* +2e NADH £E}=—0.32V (C) 


To obtain reaction A from reactions B and C, we need to reverse the direc- 
tion of reaction C so that NADH appears on the left side of the arrow. In 
doing so, the sign of Ej must be changed. 


Pyruvate + 2H* +2e —> lactate Ey = —0.19V (B) 
NADH > NAD* + H* + 2e E = +0.32V (D) 


For reaction B, the free energy can be calculated with n = 2. 


AG 


—2 X 96.48kJ mol} V7! x -0.19V 
= +36.7 kJ mol !(+8.8 kcal mol!) 


Likewise, for reaction D, 


AG?’ = —2 X 96.48kJ mol! V7! x +0.32V 
= —61.8kJ mol '(—14.8 kcal mol‘) 


Thus, the free energy for reaction A is given by 


AG®°’ = AG°' (for reaction B) + AG®' (for reaction D) 
+36.7kJ mol | — 61.8 kJ mol | 
= —25.1kJ mol !(—6.0 kcal mol!) 


A 1.14-volt potential difference between NADH and molecular 

oxygen drives electron transport through the chain and favors the 
formation of a proton gradient 

The driving force of oxidative phosphorylation is the electron-transfer 
potential of NADH or FADH, relative to that of O,. How much energy is 
released by the reduction of O, with NADH? Let us calculate AG®’ for this 
reaction. The pertinent half-reactions are 


,0,+2H*++2e—>H,O  E)= +0.82V (A) 
NAD*+ + H+ +2e-—»NADH ~ E} = -0.32V (B) 


The combination of the two half-reactions, as it proceeds in the electron- 
transport chain, yields 


1/,0, + NADH + H* — H,O + NADt (C) 


The standard free energy for this reaction is then given by 


AG°*' 


(—2 X 96.48kJ mol"! V~! x +0.82 V) 
+ (—2 X 96.48 kJ mol”! V7! x 0.32 V) 
= —158.2kJ mol”! + (—61.9) kJ mol”?! 
= —220.1 kJ mol! (—52.6 kcal mol!) 


This release of free energy is substantial. Recall that AG®’ for the 
synthesis of ATP is 30.5 kJ mol~! (7.3 kcal mol~!). The released energy 
is initially used to generate a proton gradient that is then used for the 
synthesis of ATP and the transport of metabolites across the mitochon- 
drial membrane. 

How can the energy associated with a proton gradient be quantified? 
Recall that the free-energy change for a species moving from one side of a 
membrane where it is at concentration c, to the other side where it is at a 
concentration c, 1s given by 


AG = RT In(c9/c;) + ZFAV 


in which Z is the electrical charge of the transported species and AV is the 
potential in volts across the membrane (Section 13.1). Under typical condi- 
tions for the inner mitochondrial membrane, the pH outside is 1.4 units 
lower than inside [corresponding to In (c,/c,) of 3.2] and the membrane 
potential is 0.14 V, the outside being positive. Because Z = +1 for protons, 
the free-energy change is (8.32 X 1073 kJ mol! K~! X 310 K X 3.2) + 
(+1 X 96.48 kJ mol7! V~! X 0.14 V) = 21.8 kJ mol7! (5.2 kcal mol7!). 
Thus, each proton that is transported out of the matrix to the cytoplasmic 
side corresponds to 21.8 kJ mol~! of free energy. 


18.3 The Respiratory Chain Consists of Four Complexes: 
Three Proton Pumps and a Physical Link to the 
Citric Acid Cycle 


Electrons are transferred from NADH to O, through a chain of three large 
protein complexes called NADH-Q oxidoreductase, Q-cytochrome c oxidore- 
ductase, and cytochrome c oxidase (Figure 18.6 and Table 18.2). Electron flow 
within these transmembrane complexes is highly exergonic and powers the 
transport of protons across the inner mitochondrial membrane. A fourth large 
protein complex, called succinate-Q reductase, contains the succinate dehy- 
drogenase that generates FADH, in the citric acid cycle. Electrons from this 
FADH, enter the electron-transport chain at Q-cytochrome c oxidoreduc- 
tase. Succinate-Q reductase, in contrast with the other complexes, does not 
pump protons. NADH-Q oxidoreductase, succinate-Q reductase, 
Q-cytochrome c oxidoreductase, and cytochrome c oxidase are also called 
Complex I, I, III, and IV, respectively. Complexes I, III, and IV appear to 
be associated in a supramolecular complex termed the respirasome. As we 
have seen before, such supramolecular complexes facilitate the rapid trans- 
fer of substrate and prevent the release of reaction intermediates. 
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FIGURE 18.6 Components of the electron- 
transport chain. Electrons flow down an 
energy gradient from NADH to O,. The flow 
is catalyzed by four protein complexes, and 200 0 & 
the energy released is used to generate a O H50 
proton gradient. [Information from D. Sadava : : 
et al., Life, 8th ed. (Sinauer, 2008), p. 150.] Reaction progress 


Two special electron carriers ferry the electrons from one complex to 
the next. The first is coenzyme Q (Q), also known as ubiquinone because 
it is a ubiquitous quinone in biological systems. Ubiquinone is a hydro- 
phobic quinone that diffuses rapidly within the inner mitochondrial 
membrane. Electrons are carried from NADH-Q oxidoreductase to 
Q-cytochrome c oxidoreductase, the third complex of the chain, by the 
reduced form of Q. Electrons from the FADH, generated by the citric 
acid cycle are transferred first to ubiquinone and then to the Q-cytochrome 
c oxidoreductase complex. 

Coenzyme Q is a quinone derivative with a long tail consisting of five- 
carbon isoprene units that account for its hydrophobic nature. The number 
of isoprene units in the tail depends on the species. The most common 
mammalian form contains 10 isoprene units (coenzyme Q,). For simplicity, 
the subscript will be omitted from this abbreviation because all varieties 


TABLE 18.2 Components of the mitochondrial electron-transport chain 


Oxidant or reductant 


Enzyme complex Mass (kDa) Subunits Prosthetic group Matrix side Membrane core Cytoplasmic side 


NADH-Q oxidoreductase >900 46 FMN NADH Q 
Fe-S 
Succinate-Q reductase 140 4 FAD Succinate Q 
Fe-S 
Q-cytochrome c oxidoreductase 250 11 Heme by; Q Cytochrome c 
Heme by 
Heme c, 
Fe-S 
Cytochrome c oxidase 160 13 Heme a Cytochrome c 


Heme a, 
Cu, and Cug 


Information from: J. W. DePierre and L. Ernster, Annu. Rev. Biochem. 46:215, 1977; Y. Hatefi, Annu Rev. Biochem. 54:1015, 1985; and J. E. Walker, 
Q. Rev. Biophys. 25:253, 1992. 
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FIGURE 18.7 Oxidation states of 

; : . ; : 2 ; : quinones. The reduction of ubiquinone 
function in an identical manner. Quinones can exist in several oxidation (Q) to ubiquinol (QH,) proceeds through a 
states. In the fully oxidized state (Q), coenzyme Q has two keto groups semiquinone intermediate (QH*). 


(Figure 18.7). The addition of one electron and one proton results in the 
semiquinone form (QH-:). The semiquinone can lose a proton to form a 
semiquinone radical anion (Q:~). The addition of a second electron and 
proton to the semiquinone generates ubiquinol (QH,), the fully reduced 
form of coenzyme Q, which holds its protons more tightly. Thus, for qui- 
nones, electron-transfer reactions are coupled to proton binding and release, a 
property that is key to transmembrane proton transport. Because ubiqui- 
none is soluble in the membrane, a pool of Q and QH,—the Q pool—is 
thought to exist in the inner mitochondrial membrane, although recent 
research suggests that the Q pool is confined to the respirasome. 

In contrast with Q, the second special electron carrier is a protein. 
Cytochromec, asmall soluble protein, shuttles electrons from Q-cytochrome 
c oxidoreductase to cytochrome c oxidase, the final component in the chain 
and the one that catalyzes the reduction of O,. 


lron-sulfur clusters are common components of the 
electron transport chain 


Tron-sulfur clusters in iron-sulfur proteins (also called nonheme iron pvoteins) 
play a critical role in a wide range of reduction reactions in biological systems. 
Several types of Fe-S clusters are known (Figure 18.8). In the simplest kind, a 
single iron ion is tetrahedrally coordinated to the sulfhydryl groups of four 
cysteine residues of the protein (Figure 18.8A). A second kind, denoted by 
2Fe-25, contains two iron ions, two inorganic sulfides, and usually four cyste- 
ine residues (Figure 18.8B). A third type, designated 4Fe-48, contains four 
iron ions, four inorganic sulfides, and four cysteine residues (Figure 18.8C). 
NADH-Q oxidoreductase contains both 2Fe-25 and 4Fe-45 clusters. Iron 
ions in these Fe-S complexes cycle between Fe?* (reduced) and Fe?* (oxi- 
dized) states. Unlike quinones and flavins, iron-sulfur clusters generally 
undergo oxidation—reduction reactions without releasing or binding protons. 


e The importance of Fe-S clusters is illustrated by the loss of function 

of the protein frataxin. Frataxin is a small (14.2-kDa) mitochondrial 
protein that is crucial for the synthesis of Fe-S clusters. Mutations in 
frataxin causes Friedreich’s ataxia, a disease of varying severity that affects 
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FIGURE 18.9 Oxidation states of flavins. 
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FIGURE 18.8 lron-sulfur clusters. (A) A single iron ion bound by four cysteine residues. 
(B) 2Fe-2S cluster with iron ions bridged by sulfide ions. (C) 4Fe-4S cluster. Each of these 
clusters can undergo oxidation-reduction reactions. 


the central and peripheral nervous as well as the heart and skeletal system. 
Severe cases lead to death in young adult life. The most common mutation 
is trinucleotide expansion (Section 28.5) in the gene for frataxin. 


The high-potential electrons of NADH enter the respiratory chain 
at NADH-Q oxidoreductase 


The electrons of NADH enter the chain at NADH-Q oxidoreductase 
(also called Complex I and NADH dehydrogenase), an enormous enzyme 
(>900 kDa) consisting of approximately 46 polypeptide chains. This pro- 
ton pump, like that of the other two in the respiratory chain, is encoded by 
genes residing in both the mitochondria and the nucleus. NADH-Q oxido- 
reductase is L-shaped, with a horizontal arm lying in the membrane and a 
vertical arm that projects into the matrix. 
The reaction catalyzed by this enzyme appears to be 


NADH + Q + 5 Ainatrix > NAD* + QH> + 4 Hojoplaam 


The initial step is the binding of NADH and the transfer of its two high- 
potential electrons to the flavin mononucleotide (FMN) prosthetic group, 
yielding the reduced form, FMNH, (Figure 18.9). The electron acceptor of 
FMN, the isoalloxazine ring, is identical with that of FAD. Electrons are 
then transferred from FMNH, toa series of iron-sulfur clusters, the second 
type of prosthetic group in NADH-Q oxidoreductase. 

Recent structural studies have suggested how Complex I acts as a proton 
pump. What are the structural elements required for proton-pumping? The 
membrane-embedded part of the complex has four proton half-channels 
consisting, in part, of vertical helices. One set of half-channels is exposed to 
the matrix and the other to the inner membrane space (Figure 18.10). The 
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vertical helices are linked on the matrix side by 
a long horizontal helix (HL) that connects the 
matrix half-channels, while the cytoplasmic 
half-channels are joined by a series of B-hair- 
pin-helix connecting elements (RH). An 
enclosed Q chamber, the site where Q accepts 
electrons from NADH, exists near the junc- 
tion of the hydrophilic portion and the mem- 
brane-embedded portion. Finally, a hydro- 
philic funnel connects the Q chamber to a 
water-lined channel (into which the half chan- 
nels open) that extends the entire length of the 
membrane-embedded portion. 

How do these structural elements cooperate 
to pump protons out of the matrix? When Q 
accepts two electrons from NADH, generating 
Q?-, the negative charges on Q?7 interact elec- 
trostatically with negatively-charged amino 
acid residues in the membrane-embedded arm, 
causing conformational changes in the long 
horizontal helix and the BH elements. These 
changes in turn alter the structures of the con- 
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FIGURE 18.10 Coupled electron-proton transfer reactions through NADH-Q 
oxidoreductase. Electrons flow in Complex | from NADH through FMN and a series 
of iron-sulfur clusters to ubiquinone (Q), forming Q2-. The charges on Q2> are 
electrostatically transmitted to hydrophilic amino acid residues (shown as red 
(glutamate) and blue (lysine or histidine) balls) that power the movement of HL and 
BH components. This movement changes the conformation of the transmembrane 
helices and results in the transport of four protons out of the mitochondrial matrix. 
(Information from R. Baradaran et al., Nature 494:443-448 |] 


nected vertical helices that change the pK, of 

amino acids, allowing protons from the matrix to first bind to the amino acids, 
then dissociate into the water-lined channel and finally enter the intermem- 
brane space. Thus, the flow of two electrons from NADH to coenzyme Q through 
NADH-Q oxidoreductase leads to the pumping of four hydrogen ions out of the 
matrix of the mitochondrion. Q?~ subsequently takes up two protons from the 
matrix as it is reduced to QH,. The removal of these protons from the matrix 
contributes to the formation of the proton motive force. The QH, subsequently 
leaves the enzyme for the Q pool, allowing another reaction cycle to occur. 

It is important to note that the citric acid cycle is not the only source of 
mitochondrial NADH. As we will see in Chapter 22, fatty acid degradation, 
which also takes place in mitochondria, is another crucial source of NADH 
for the electron-transport chain. Moreover, electrons from cytoplasmically 
generated NADH can be transported into mitochondria for use by the 
electron-transport chain (Section 18.5). 


Ubiquinol is the entry point for electrons from FADH, of flavoproteins 


FADH, enters the electron-transport chain at the second protein complex of 
the chain. Recall that FADH, is formed in the citric acid cycle, in the oxidation 
of succinate to fumarate by succinate dehydrogenase (Section 17.2). Succinate 
dehydrogenase is part of the succinate-Q reductase complex (Complex II), an 
integral membrane protein of the inner mitochondrial membrane. FADH, 
does not leave the complex. Rather, its electrons are transferred to Fe-5 centers 
and then finally to Q to form QH,, which then is ready to transfer electrons 
further down the electron-transport chain. The succinate-Q reductase com- 
plex, in contrast with NADH-Q oxidoreductase, does not pump protons from 
one side of the membrane to the other. Consequently, less ATP is formed from 
the oxidation of FADH, than from NADH. 


Electrons flow from ubiquinol to cytochrome c through 
Q-cytochrome c oxidoreductase 


What is the fate of ubiquinol generated by Complexes I and II? The electrons 
from QH, are passed on to cytochrome c (Cyt c), a water-soluble protein, by 
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kK FIGURE 18.11 Structure of 
Q-cytochrome c oxidoreductase. 
This enzyme is a homodimer with each 
monomer consisting of 11 distinct 
polypeptide chains. Some of the more 
prominent components in one monomer 
are colored while the other monomer is 
white. Although each monomer contains 
the same components, some are identified 
in one monomer or the other for ease of 
viewing. Notice that the major prosthetic 
groups, three hemes and a 2Fe-2S cluster, 
are located either near the cytoplasmic 
edge of the complex bordering the 
intermembrane space (top) or in the region 
embedded in the membrane (a helices 
represented by tubes). They are well 
positioned to mediate the electron-transfer 
reactions between quinones in the membrane 
and cytochrome c in the intermembrane 
space. [Drawn from 1BCC.pdb.] 


Rieske iron-sulfur center 


Cys His 


the second of the three proton pumps in the respiratory chain, Q-cytochrome 
c oxidoreductase (also known as Complex III and as cytochrome c reductase). 
The flow of a pair of electrons through this complex leads to the effective net 
transport of 2 H* to the cytoplasmic side, half the yield obtained with 
NADH-Q reductase because of a smaller thermodynamic driving force. 


QH, 2 Cyt Cox + 2 [; haar =F Q aes Cyt Cred + 4 [; Ree 


Q-cytochrome c oxidoreductase itself contains two types of cytochromes, 
named b and c, (Figure 18.11). A cytochrome is an electron-transferring protein 
that contains a heme prosthetic group. The heme prosthetic group in cyto- 
chromes b, c,, and c is iron-protoporphyrin IX, the same heme present in 
myoglobin and hemoglobin (Section 7.1). The iron ion of a cytochrome, in 
contrast to hemoglobin and myoglobin, alternates between a reduced ferrous 
(+2) state and an oxidized ferric (+3) state during electron transport. The 
two cytochrome subunits of Q-cytochrome c oxidoreductase contain a total 
of three hemes: two hemes within cytochrome b, termed heme b, (L for low 
affinity) and heme b,, (H for high affinity), and one heme within cytochrome 
c,. These identical hemes have different electron affinities because they are in 
different polypeptide environments. For example, heme b, , which is located 
in a cluster of helices near the cytoplasmic face of the membrane, has lower 
affinity for an electron than does heme b,;, which is near the matrix side. 

In addition to the hemes, the enzyme contains an iron-sulfur protein with 
a 2Fe-2S center. This center, termed the Rieske center, is unusual in that one 
of the iron ions is coordinated by two histidine residues rather than two cyste- 
ine residues. This coordination stabilizes the center in its reduced form, rais- 
ing its reduction potential so that it can readily accept electrons from QH,. 


The Q cycle funnels electrons from a two-electron carrier to a 
one-electron carrier and pumps protons 


QH, passes two electrons to Q-cytochrome c oxidoreductase, but the accep- 
tor of electrons in this complex, cytochrome c, can accept only one electron. 
How does the switch from the two-electron carrier ubiquinol to the one- 
electron carrier cytochrome c take place? The mechanism for the coupling of 
electron transfer from Q to cytochrome c to transmembrane proton trans- 
port is known as the Q cycle (Figure 18.12). Two QH, molecules bind to the 
complex consecutively, each giving up two electrons and two H*. These 
protons are released to the cytoplasmic side of the membrane. The first QH, to 
exit the Q pool binds to the first Q binding site (Q.,), and its two electrons 
travel through the complex to different destinations. One electron flows, 
first, to the Rieske 2Fe-25 cluster; then, to cytochrome c,; and, finally, to a 
molecule of oxidized cytochrome c, converting it into its reduced form. The 
reduced cytochrome c molecule is free to diffuse away from the enzyme to 
continue down the respiratory chain. 

The second electron passes through two heme groups of cytochrome b to 
an oxidized ubiquinone in a second Q binding site (Q..). The Qin the second 
binding site is reduced to a semiquinone radical anion (Q:~ ) by the electron 
from the first QH,. The now fully oxidized Q leaves the first Q site, free to 
reenter the Q pool. 

A second molecule of QH, binds First half of Q cycle 
to the Q,, site of Q-cytochrome c ox1- 
doreductase and reacts in the same 
way as the first. One of the electrons 
is transferred to cytochrome c and the 
second electron is transferred to the 
partly reduced ubiquinone bound in 
the Q, binding site. On the addition 
of the electron from the second QH, 
molecule, this quinone radical anion 
takes up two protons from the matrix 
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Second half of Q cycle 


side to form QH,. The removal of 
these two protons from the matrix con- 
tributes to the formation of the proton 
gradient. In sum, four protons are 
released on the cytoplasmic side, and 
two protons are removed from the 
mitochondrial matrix. 


2 H* 


FIGURE 18.12 Q cycle. The Q cycle takes place in Complex Ill, which is represented in outline 
form. In the first half of the cycle, two electrons of a bound QH, are transferred, one to cytochrome 
c and the other to a bound Q in a second binding site to form the semiquinone radical anion 

Q: . The newly formed Q dissociates and enters the Q pool. In the second half of the cycle, a 
second QH, also gives up its electrons to Complex Ill, one to a second molecule of cytochrome c 
and the other to reduce Q:” to QH,. This second electron transfer results in the uptake of two 
protons from the matrix. The path of electron transfer is shown in red. 


2 QH, + Q + 2Cytey, + 2 Hinatrix 
2Q + QH + 2 Cytaeg + 4 He oeies 


In one Q cycle, two QH, molecules are oxidized to form two Q mole- 
cules, and then one Q molecule is reduced to QH,. The problem of how to 
efficiently funnel electrons from a two-electron carrier (QH,) to a one- 
electron carrier (cytochrome c) is solved by the Q cycle. The cytochrome b 
component of the reductase is in essence a recycling device that enables both 
electrons of QH, to be used effectively. 


Cytochrome c oxidase catalyzes the reduction of 
molecular oxygen to water 


The last of the three proton-pumping assemblies of the respiratory chain is 
cytochrome c oxidase (Complex IV). Cytochrome c oxidase catalyzes the 
transfer of electrons from the reduced form of cytochrome c¢ to molecular 
oxygen, the final acceptor. 
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HO Tyr 


Histidine covalently bound to tyrosine 


Kk FIGURE 18.13 Structure of cytochrome c oxidase. This enzyme consists of 13 
polypeptide chains. Notice that most of the complex, as well as two major prosthetic groups 
(heme a and heme a,-Cug) are embedded in the membrane (a helices represented by vertical 
tubes). Heme a,—Cug is the site of the reduction of oxygen to water. The Cu,/Cu, prosthetic 
group is positioned near the intermembrane space to better accept electrons from 
cytochrome c. CO(bb) is a carbonyl group of the peptide backbone. [Drawn from 20CC.pdb.] 


4 Cyt Greg + 8 Hihatrix + Oo > 4 Cyt cox + 2H2O + 4 Hotoplasm 


The requirement of oxygen for this reaction is what makes “‘aerobic” organisms 
aerobic. To obtain oxygen for this reaction is the reason that human beings must 
breathe. Four electrons are funneled to O,, to completely reduce it to HO, and, 
concomitantly, protons are pumped from the matrix to the cytoplasmic side of 
the inner mitochondrial membrane. This reaction is quite thermodynamically 
favorable. From the reduction potentials in Table 18.1, the standard free-energy 
change for this reaction is calculated to be AG°’ = —231.8kJ mol~!(—55.4 kcal 
mol~!). As much of this free energy as possible must be captured in the form of 
a proton gradient for subsequent use in ATP synthesis. 

Bovine cytochrome c oxidase is reasonably well understood at the struc- 
tural level (Figure 18.13). It consists of 13 subunits, 3 of which are encoded by 
the mitochondrion’s own genome. Cytochrome c oxidase contains two heme 
A groups and three copper ions, arranged as two copper centers, designated 
A and B. One center, Cu,/Cu,, contains two copper ions linked by two 
bridging cysteine residues. This center initially accepts electrons from reduced 
cytochrome c. The remaining copper ion, Cup, is bonded to three histidine 
residues, one of which is modified by covalent linkage to a tyrosine residue. 
The copper centers alternate between the reduced Cu* (cuprous) form and 
the oxidized Cu?* (cupric) form as they accept and donate electrons. 

There are two heme A molecules, called heme a and heme a, in cyto- 
chrome c oxidase. Heme A differs from the heme in cytochrome c and ¢, in 
three ways: (1) a formyl group replaces a methyl group, (2) a C,, hydrocar- 
bon chain replaces one of the vinyl groups, and (3) the heme is not cova- 
lently attached to the protein. 
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Heme A 


Heme a and heme a; have distinct redox potentials because they are located 
in different environments within cytochrome c oxidase. An electron flows 
from cytochrome c to Cu af Cu,, to heme a, to heme a, to Cug, and finally 
to O,. Heme a, and Cug are directly adjacent. Together, heme a, and Cug 
form the active center at which O, is reduced to H,O. 

Four molecules of cytochrome c bind consecutively to the enzyme and 
transfer an electron to reduce one molecule of O, to H,O (Figure 18.14): 


1. Two molecules of cytochrome c sequentially transfer 2. Reduced Cug and Fe in 
electrons to reduce Cug and heme az. heme a; bind O, which 


aGpochemee forms a peroxide bridge. 


Heme a, 


2,0 2 Cytochrome c 


NY & 


& 
© HO. 


FIGURE 18.14 Cytochrome c oxidase 
mechanism. The cycle begins and ends 
with all prosthetic groups in their oxidized 
forms (shown in blue). Reduced forms are 
in red. Four cytochrome c molecules 
donate four electrons, which, in allowing 
the binding and cleavage of an O, 


molecule, also makes possible the import 


4. The addition of two more 3. The addition of two more is 
protons leads to the electrons and two more of four H™ from the matrix to form two 
release of water. protons cleaves the molecules of HO, which are released from 


peroxide bridge. the enzyme to regenerate the initial state. 
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FIGURE 18.15 Peroxide bridge. The 
oxygen bound to heme az is reduced to 
peroxide by the presence of Cuz. 


Cyt Creduced Cyt Coxidized 


4Ht 4Ht 
Pumped = Chemical 
protons _ protons 


FIGURE 18.16 Proton transport by 
cytochrome c oxidase. Four protons are 
taken up from the matrix side to reduce 
one molecule of O, to two molecules of 
H,O. These protons are called “chemical 
protons” because they participate in a 
clearly defined reaction with O,. Four 
additional “pumped” protons are 
transported out of the matrix and released 
on the cytoplasmic side in the course of 
the reaction. The pumped protons double 
the efficiency of free-energy storage in the 
form of a proton gradient for this final step 
in the electron-transport chain. 


1. Electrons from two molecules of reduced cytochrome c flow down an 
electron-transfer pathway within cytochrome c oxidase, one stopping at 
Cup and the other at heme a,. With both centers in the reduced state, they 
together can now bind an oxygen molecule. 


2. As molecular oxygen binds, it abstracts an electron from each of the 
nearby ions in the active center to form a peroxide (OF) bridge between 
them (Figure 18.15). 


3. Two more molecules of cytochrome c bind and release electrons that 
travel to the active center. The addition of an electron as well as H* to each 
oxygen atom reduces the two ion-oxygen groups to Cu,**—OH and 
Fe3+—OH. 


4. Reaction with two more H* ions allows the release of two molecules of 
H,O and resets the enzyme to its initial, fully oxidized form. 


4 Cyt Cred + 4 Hinatrix + O2 > 4 Cyt co, + 2H2O 


The four protons in this reaction come exclusively from the matrix. Thus, the 
consumption of these four protons contributes directly to the proton gradi- 
ent. Recall that each proton contributes 21.8 kJ mol~! (5.2 kcal mol~!) to 
the free energy associated with the proton gradient; so these four protons 
contribute 87.2 kJ mol~! (20.8 kcal mol~!), an amount substantially less 
than the free energy available from the reduction of oxygen to water. What 
is the fate of this missing energy? Remarkably, cytochrome c oxidase uses this 
energy to pump four additional protons from the matrix to the cytoplasmic side 
of the membrane in the course of each reaction cycle for a total of eight protons 
removed from the matrix (Figure 18.16). The details of how these protons are 
transported through the protein are still under study. However, two effects 
contribute to the mechanism. First, charge neutrality tends to be main- 
tained in the interior of proteins. Thus, the addition of an electron to a site 
inside a protein tends to favor the binding of H* to a nearby site. Second, 
conformational changes take place, particularly around the heme a,;—Cug 
center, in the course of the reaction cycle. Presumably, in one conformation, 
protons may enter the protein exclusively from the matrix side, whereas, in 
another, they may exit exclusively to the cytoplasmic side. Thus, the overall 
process catalyzed by cytochrome c oxidase is 


4 Cyt Crea + 8 Hihatrix + O2 > 4 Cyt cox + 2H2O + 4 Hetopiasm 


Figure 18.17 summarizes the flow of electrons from NADH and 
FADH, through the respiratory chain. This series of exergonic reactions is 
coupled to the pumping of protons from the matrix. As we will see shortly, 
the energy inherent in the proton gradient will be used to synthesize ATP. 


Toxic derivatives of molecular oxygen such as superoxide radicals 
are scavenged by protective enzymes 


As discussed earlier, molecular oxygen is an ideal terminal electron accep- 
tor, because its high affinity for electrons provides a large thermodynamic 
driving force. However, danger lurks in the reduction of O,. The transfer of 
four electrons leads to safe products (two molecules of H,O), but partial 
reduction generates hazardous compounds. In particular, the transfer of a 
single electron to O, forms superoxide ion, whereas the transfer of two electrons 
yields peroxide. 
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FIGURE 18.17 The electron-transport 
chain. High-energy electrons in the form 
of NADH and FADH, are generated by 
the citric acid cycle. These electrons 
flow through the respiratory chain, which 
powers proton pumping and results in 
Acetyl CoA the reduction of Os. 


Both compounds are potentially destructive. The strategy for the safe 
reduction of O, is clear: the catalyst does not release partly reduced intermedi- 
ates. Cytochrome c oxidase meets this crucial criterion by holding O, tightly 
between Fe and Cu ions. 


e Although cytochrome c oxidase and other proteins that reduce O, are 

remarkably successful in not releasing intermediates, small amounts of 
superoxide anion and hydrogen peroxide are unavoidably formed. Superoxide, 
hydrogen peroxide, and species that can be generated from them such as the 
hydroxyl radical (OH-) are collectively referred to as reactive oxygen species or 
ROS. Oxidative damage caused by ROS has been implicated in the aging 
process as well as in a growing list of diseases (‘Table 18.3). 

What are the cellular defense strategies against oxidative damage by 

ROS? Chief among them is the enzyme superoxide dismutase. This enzyme Dismutation 
scavenges superoxide radicals by catalyzing the conversion of two of these Peleac torn iy Wier et oingie Teaceant is 
radicals into hydrogen peroxide and molecular oxygen. SN 


Superoxide 
dismutase 


0; +2H* O; +O, 


Eukaryotes contain two forms of this enzyme, a manganese-containing 
version located in mitochondria and a copper-and-zinc-dependent cyto- 
plasmic form. These enzymes perform the dismutation reaction by a similar 


TABLE 18.3 Pathological conditions that may entail free-radical injury 


Atherogenesis 

Emphysema; bronchitis 
Parkinson disease 

Duchenne muscular dystrophy 
Cervical cancer 

Alcoholic liver disease 
Diabetes 

Acute renal failure 

Down syndrome 

Retrolental fibroplasia (conversion of the retina into a fibrous mass in premature infants) 
Cerebrovascular disorders 
Ischemia; reperfusion injury 


Information from Michael Lieberman and Allan D. Marks, Basic Medical Biochemistry: A Clinical Approach, 
4th ed. (Lippincott, Williams & Wilkins, 2012), p. 437. 
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FIGURE 18.18 Superoxide dismutase 
mechanism. The oxidized form of 
superoxide dismutase (M,,) reacts with 
one superoxide ion to form O, and 
generate the reduced form of the enzyme 
(M,eq): The reduced form then reacts with 
a second superoxide and two protons to 
form hydrogen peroxide and regenerate 
the oxidized form of the enzyme. 


mechanism (Figure 18.18). The oxidized form of the enzyme is reduced by 
superoxide to form oxygen. The reduced form of the enzyme, formed in this 
reaction, then reacts with a second superoxide ion to form peroxide, which 
takes up two protons along the reaction path to yield hydrogen peroxide. 
The hydrogen peroxide formed by superoxide dismutase and by other 
processes is scavenged by catalase, a ubiquitous heme protein that catalyzes 
the dismutation of hydrogen peroxide into water and molecular oxygen. 


Catalase 


2H,0, === 0, + 2H,O 


Superoxide dismutase and catalase are remarkably efficient, performing 
their reactions at or near the diffusion-limited rate (Section 8.4). Glutathione 
peroxidase also plays a role in scavenging H,O, (Section 20.5). Other cel- 
lular defenses against oxidative damage include the antioxidant vitamins, 
vitamins E and C. Because it is lipophilic, vitamin E is especially useful in 


protecting membranes from lipid peroxidation. 

e A long-term benefit of exercise may be to increase the amount of 
superoxide dismutase in the cell. The elevated aerobic metabolism 

during exercise causes more ROS to be generated. In response, the cell syn- 

thesizes more protective enzymes. The net effect is one of protection, 

because the increase in superoxide dismutase more effectively protects the 

cell during periods of rest (Problem 48). 

Despite the fact that reactive oxygen species are known hazards, recent 
evidence suggests that, under certain circumstances, the controlled genera- 
tion of thesemolecules may beanimportant component of signal-transduction 
pathways. For instance, growth factors have been shown to increase ROS 
levels as part of their signaling pathway, and ROS regulate channels and 
transcription factors. ROS have been implicated in the control of cell 
differentiation, the immune response, autophagy as well as other metabolic 
activities. The dual role of ROS is an excellent example of the wondrous 
complexity of biochemistry of living systems: even potentially harmful sub- 
stances can be harnessed to play useful roles. 


Electrons can be transferred between groups that 
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factor of 10 every 1.7 A (Figure 18.19). For groups in 
contact, electron-transfer reactions can be quite fast, with 


FIGURE 18.19 Distance dependence of electron-transfer rate. 
The rate of electron transfer decreases as the electron donor and 
the electron acceptor move apart. In a vacuum, the rate decreases 
by a factor of 10 for every increase of 0.8 A. In proteins, the rate 
decreases more gradually, by a factor of 10 for every increase of 
1.7 A. This rate is only approximate because variations in the 
structure of the intervening protein medium can affect the rate. 
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rates of approximately 10!% s~!. Within proteins in the 
electron-transport chain, electron-carrying groups are 
typically separated by 15 A beyond their van der Waals 
contact distance. For such separations, we expect elec- 
tron-transfer rates of approximately 104 s~! (1.e., electron 
transfer in less than 1 ms), assuming that all other factors 


are optimal. Without the mediation of the protein, an electron transfer over 
this distance would take approximately 1 day. 

The case is more complicated when electrons must be transferred 
between two distinct proteins, such as when cytochrome c accepts electrons 
from Complex III or passes them on to Complex IV. A series of hydro- 
phobic interactions bring the heme groups of cytochrome c and c, to within 
4.5 A of each other, with the iron atoms separated by 17.4 A. This distance 
could allow cytochrome c reduction at a rate of 8.3 X 10°s~!. 


The conformation of cytochrome c has remained essentially 
constant for more than a billion years 


ba Cytochrome c is present in all organisms having mitochondrial respi- 

ratory chains: plants, animals, and eukaryotic microorganisms. This 
electron carrier evolved more than 1.5 billion years ago, before the diver- 
gence of plants and animals. Its function has been conserved throughout this 
period, as evidenced by the fact that the cytochrome c of any eukaryotic species 
reacts in vitro with the cytochrome c oxidase of any other species tested thus far. 
For example, wheat-germ cytochrome c reacts with human cytochrome c 
oxidase. Additionally, some prokaryotic cytochromes, such as cytochrome c, 
from the photosynthetic bactertum Rhodospirillum rubrum and cytochrome 
Css, from the denitrifying bacterium Paracoccus denitrificans, closely resem- 
ble cytochrome c from tuna-heart mitochondria (Figure 18.20). This evi- 
dence attests to an efficient evolutionary solution to electron transfer 
bestowed by the structural and functional characteristics of cytochrome c. 


Rhodospirillum rubrum Paracoccus denitrificans 


The resemblance among cytochrome c molecules extends to the level of 
amino acid sequence. Because of the molecule’s small size and ubiquity, the 
amino acid sequences of cytochrome c from more than 80 widely ranging 
eukaryotic species have been determined by direct protein sequencing. The 
striking finding is that 21 of 104 residues have been invariant for more than 
one and a half billion years of evolution. A phylogenetic tree, constructed 
from the amino acid sequences of cytochrome c, reveals the evolutionary 
relationships between many animal species (Figure 18.21). 


18.4 A Proton Gradient Powers the Synthesis of ATP 


Thus far, we have considered the flow of electrons from NADH to O,, an 
exergonic process. 
NADH + 14,0, + H* = —H,0 + NADt 

AG°’ = —220.1 kJ mol~! (—52.6 kcal mol~!) 
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hk FIGURE 18.20 Conservation of 
the three-dimensional structure of 
cytochrome c. Notice the overall structural 
similarity of the different molecules from 
different sources. The side chains are 
shown for the 21 conserved amino acids 
as well as the centrally located planar 
heme. [Drawn from 3CYT.pdb, 3C2C.pdb, 
and 155C.pdb.] 
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FIGURE 18.21 Evolutionary tree 
constructed from sequences of 
cytochrome c. Branch lengths are 
proportional to the number of amino acid 
changes that are believed to have 
occurred. [Information from Walter M. Fitch 
and Emanuel Margoliash.] 
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Some have argued that, along with the 
elucidation of the structure of DNA, the 
discovery that ATP synthesis is powered by 
a proton gradient is one of the two major 
advances in biology in the twentieth 
century. Mitchell’s initial postulation of the 
chemiosmotic theory was not warmly 
received by all. Efraim Racker, one of the 
early investigators of ATP synthase, recalls 
that some thought of Mitchell as a court 
jester, whose work was of no consequence. 
Peter Mitchell was awarded the Nobel Prize 
in chemistry in 1978 for his contributions to 
understanding oxidative phosphorylation. 


FIGURE 18.22 Chemiosmotic hypothesis. 


Electron transfer through the respiratory 
chain leads to the pumping of protons from 
the matrix to the cytoplasmic side of the 
inner mitochondrial membrane. The pH 
gradient and membrane potential constitute 
a proton-motive force that is used to drive 
ATP synthesis. 


Next, we consider how this process is coupled to the synthesis of ATP, an 
endergonic process. 


ADP + P; + H* == ATP + H,O 
AG*®’ = +30.5 kJ mol7!(+7.3 kcal mol~') 


A molecular assembly in the inner mitochondrial membrane carries out the 
synthesis of ATP. This enzyme complex was originally called the mitochondrial 
ATPase or FF, ATPase because it was discovered through its catalysis of the 
reverse reaction, the hydrolysis of ATP. ATP synthase, its preferred name, 
emphasizes its actual role in the mitochondrion. It is also called Complex V. 

How is the oxidation of NADH coupled to the phosphorylation of 
ADP? Electron transfer was first suggested to lead to the formation of a 
covalent high-energy intermediate that serves as a compound having a high 
phosphoryl-transfer potential, analogous to the generation of ATP by the 
formation of 1,3-bisphosphoglycerate in glycolysis (Section 16.1). An alter- 
native proposal was that electron transfer aids the formation of an activated 
protein conformation, which then drives ATP synthesis. The search for 
such intermediates for several decades proved fruitless. 

In 1961, Peter Mitchell suggested a radically different mechanism, the 
chemiosmotic hypothesis. He proposed that electron transport and ATP syn- 
thesis are coupled by a proton gradient across the inner mitochondrial mem- 
brane. In his model, the transfer of electrons through the respiratory chain 
leads to the pumping of protons from the matrix to the cytoplasmic side of 
the inner mitochondrial membrane. The Ht concentration becomes lower 
in the matrix, and an electric field with the matrix side negative is generated 
(Figure 18.22). Protons then flow back into the matrix to equalize the dis- 
tribution. Mitchell’s idea was that this flow of protons drives the synthesis 
of ATP by ATP synthase. The energy-rich unequal distribution of protons 
is called the proton-motive force. The proton-motive force is composed of 
two components: a chemical gradient and a charge gradient. The chemical 
gradient for protons can be represented as a pH gradient. The charge gradi- 
ent is created by the positive charge on the unequally distributed protons 
forming the chemical gradient. Mitchell proposed that both components 
power the synthesis of ATP. 


Proton-motive force(Ap) = 
chemical gradient(ApH) + charge gradient( Ay) 


Mitchell’s highly innovative hypothesis that oxidation and phosphoryla- 
tion are coupled by a proton gradient is now supported by a wealth of evi- 
dence. Indeed, electron transport does generate a proton gradient across the 
inner mitochondrial membrane. The pH outside is 1.4 units lower than 
inside, and the membrane potential is 0.14 V, the outside being positive. As 
calculated on page 529, this membrane potential corresponds to a free 
energy of 21.8 kJ (5.2 kcal) per mole of protons. 
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FIGURE 18.23 Testing the chemiosmotic 


ATP ADP + Pj hypothesis. ATP is synthesized when 
synthase reconstituted membrane vesicles containing 
ATP bacteriorhodopsin (a light-driven proton 


pump) and ATP synthase are illuminated. 


An artificial system was created to elegantly demonstrate the basic 
principle of the chemiosmotic hypothesis. The role of the respiratory 
chain was played by bacteriorhodopsin, a membrane protein from 
halobacteria that pumps protons when illuminated. Synthetic vesicles 
containing bacteriorhodopsin and mitochondrial ATP synthase puri- 
fied from beef heart were created (Figure 18.23). When the vesicles 
were exposed to light, ATP was formed. This key experiment clearly 
showed that the respiratory chain and ATP synthase are biochemically 
separate systems, linked only by a proton-motive force. 


ATP synthase is composed of a proton-conducting unit 
and a catalytic unit 


Two parts of the puzzle of how NADH oxidation is coupled to ATP 
synthesis are now evident: (1) electron transport generates a proton- 
motive force, and (2) ATP synthesis by ATP synthase can be pow- 
ered by a proton-motive force. How is the proton-motive force 
converted into the high phosphoryl-transfer potential of ATP? 

Biochemical, electron microscopic, and crystallographic studies 
of ATP synthase have revealed many details of its structure 
(Figure 18.24). It is a large, complex enzyme resembling a ball on a 
stick. Much of the “stick” part, called the Fp subunit, is embedded in 
the inner mitochondrial membrane. The 85-A-diameter ball, called 
the F, subunit, protrudes into the mitochondrial matrix. The F, 
subunit contains the catalytic activity of the synthase. In fact, iso- 
lated F, subunits display ATPase activity. 

The F,, subunit consists of five types of polypeptide chains (a,, 
B3, y, 5, and €) with the indicated stoichiometry. The a and B sub- 
units, which make up the bulk of the F,,, are arranged alternately in a 
hexameric ring; they are homologous to one another and are mem- 
bers of the P-loop NTPase family (Section 9.4). Both bind nucleo- 
tides but only the B subunits are catalytically active. Just below the a 
and B subunits is a central stalk consisting of the y and e proteins. 
The y subunit includes a long helical coiled coil that extends into the 
center of the a8, hexamer. The y subunit breaks the symmetry of the K FIGURE 18.24 Structure of ATP synthase. A 
O58. hexamer: each of the B subunits is distinct by virtue of its interac- schematic structure is shown along with representations 
tion with a different face of y. Distinguishing the three B subunits is _ of the components for which structures have been 
crucial for understanding the mechanism of ATP synthesis. determined to high resolution. The P-loop NTPase 

The F) subunit is a hydrophobic segment that spans the inner Gomaine oh the esane| > subUniis areindieateh by Bue 

: : : shading. Notice that part of the enzyme complex is 
mitochondrial membrane. Fy contains the proton channel of the com- — orrhedded in the inner mitochondrial ease, 
plex. This channel consists of a ring comprising from 8 to 14 c sub- whereas the remainder resides in the matrix. [Drawn from 
units that are embedded in the membrane. A single a subunit binds —_1£79.pdb and 1COV.pdb]] 


Inner mitochondrial 
membrane 


FIGURE 18.25 ATPase assists in the 
formation of cristae. Oligomers of ATP ® 
synthase dimers facilitate cristae formation, ® 
creating an area where the protons (red 
balls) are concentrated and have ready 
access to the F, portion of the ATP CS) 
synthase. The electron transport chain is 
represented by the yellow cylinders 
embedded in the inner mitochondrial e® e@ 
membrane. [Information from K. M. Davies 
et al., Proc. Natl. Acad. Sci. U.S.A. 108: 


14121-14126, 2011.] 
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to the outside of the ring. The F, and F, subunits are connected in two 
ways: by the central ye stalk and by an exterior column. The exterior col- 
umn consists of one a subunit, two b subunits, and the 6 subunit. 

ATP synthases associate with one another to form dimers, which then 
associate to form large oligomers of dimers. This association stabilizes the 
individual enzymes to the rotational forces required for catalysis and facili- 
tates the curvature of the inner mitochondrial membrane. The formation of 
the cristae allows the proton pumps of the electron transport chain to local- 
ize the proton gradient in the vicinity of the synthases, which are located at 
the tips of the cristae, thereby enhancing the efficiency of ATP synthesis 
(Figure 18.25). 


Proton flow through ATP synthase leads to the release of tightly 
bound ATP: The binding-change mechanism 


ATP. synthase catalyzes the formation of ATP from ADP and 
orthophosphate. 


ADP?~ + HPO,” + H* =— = ATP*” +H,O 


The actual substrates are ADP and ATP complexed with Mg?*, as in 
all known phosphoryl-transfer reactions with these nucleotides. A terminal 
oxygen atom of ADP attacks the phosphorus atom of P. to form a pentacova- 
lent intermediate, which then dissociates into ATP and H,O (Figure 18.26). 

How does the flow of protons drive the synthesis of ATP? Isotopic- 
exchange experiments unexpectedly revealed that enzyme-bound ATP forms 
readily in the absence of a proton-motive force. When ADP and P. were added 
to ATP synthase in HO), '8O became incorporated into P; through the 
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FIGURE 18.26 ATP-synthesis mechanism. One of the oxygen atoms of ADP attacks the 
phosphorus atom of P; to form a pentacovalent intermediate, which then forms ATP and 
releases a molecule of H,O. 
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FIGURE 18.27 ATP forms without a proton-motive force but is not released. The results 
of isotopic-exchange experiments indicate that enzyme-bound ATP is formed from ADP and 
P; in the absence of a proton-motive force. 


synthesis of ATP and its subsequent hydrolysis (Figure 18.27). The rate of 
incorporation of !8O into P. showed that about equal amounts of bound 
ATP and ADP are in equilibrium at the catalytic site, even in the absence of 
a proton gradient. However, ATP does not leave the catalytic site unless 
protons flow through the enzyme. Thus, the role of the proton gradient is not 
to form ATP but to release it from the synthase. 

The fact that three B subunits are components of the F, moiety of the 
ATPase means that there are three active sites on the enzyme, each perform- 
ing one of three different functions at any instant. The proton-motive force 
causes the three active sites to sequentially change functions as protons flow 
through the membrane-embedded component of the enzyme. Indeed, we 
can think of the enzyme as consisting of a moving part and a stationary part: 
(1) the moving unit, or rotor, consists of the c ring and the ye stalk and (2) the 
stationary unit, or stator, is composed of the remainder of the molecule. 


How do the three active sites of ATP synthase respond to the flow of T 

protons? A number of experimental observations suggested a binding- FIGURE 18.28 ATP synthase nucleotide- 
change mechanism for proton-driven ATP synthesis. This proposal states binding sites are not equivalent. The + 
that a B subunit can perform each of three sequential steps in the synthesis subunit passes through the center of the 
of ATP by changing conformation. These steps are (1) ADP and P. binding, GP Hexarer anid meee the nels vtae- 

‘ . . . binding sites in the B subunits distinct from 
(2) ATP synthesis, and (3) ATP release. As already noted, interactions with one another Ths p subunits ara coloredite 
the y subunit make the three B subunits structurally distinct (Figure 18.28). distinguish them from one another. 


At any given moment, one B subunit will be in the L, or loose, conforma- 
tion. This conformation binds ADP and P.. A second subunit will be in the 
T, or tight, conformation. This conformation binds ATP with great avidity, 
so much so that it will convert bound ADP and P. into ATP. Both the T and 
L conformations are sufficiently constrained that they cannot release bound 
nucleotides. The final subunit will be in the O, or open, form. This form has Progressive alteration of the forms of the 
a more open conformation and can bind or release adenine nucleotides. Sipe armee site of ATE Syiumee 
The rotation of the y subunit drives the interconversion of these three CN ee eee 


; as Subunit2 O>L>T SO SLT... 
forms (Figure 18.29). ADP and P. bound in the subunit in the T form are Sik 3 T - ne ee i ce 


transiently combining to form ATP. Suppose that the y subunit is rotated by 
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FIGURE 18.29 Binding-change mechanism for ATP synthase. The rotation of the y 
subunit interconverts the three B subunits. The subunit in the T (tight) form interconverts ADP 
and P; and ATP but does not allow ATP to be released. When the y subunit is rotated by 
120 degrees in a counterclockwise (CCW) direction, the T-form subunit is converted into the 
O form, allowing ATP release. ADP and P; can then bind to the O-form subunit. An additional 
120-degree rotation (not shown) traps these substrates in an L-form subunit. 
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FIGURE 18.30 Direct observation of 
ATP-driven rotation in ATP 

synthase. The a8, hexamer of ATP 
synthase is fixed to a surface, with the y 
subunit projecting upward and linked to a 
fluorescently labeled actin filament. The 
addition and subsequent hydrolysis of ATP 
result in the counterclockwise rotation of 
the y subunit, which can be directly seen 
under a fluorescence microscope. 
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FIGURE 18.31 Components of the 
proton-conducting unit of ATP 
synthase. The c subunit consists of two a 
helices that span the membrane. In E. coli, 
an aspartic acid residue in one of the 


helices lies on the center of the membrane. 


The structure of the a subunit has not yet 
been directly observed, but it appears to 
include two half-channels that allow 
protons to enter and pass partway but not 
completely through the membrane. 
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Actin filament 


120 degrees in a counterclockwise direction (as viewed from the top). This 
rotation converts the T-form site into an O-form site with the nucleotide 
bound as ATP. Concomitantly, the L-form site is converted into a T-form 
site, enabling the transformation of an additional ADP and P. into ATP. The 
ATP in the O-form site can now depart from the enzyme to be replaced by 
ADP and P.. An additional 120-degree rotation converts this O-form site into 
an L-form site, trapping these substrates. Each subunit progresses from the T 
to the O to the L form with no two subunits ever present in the same confor- 
mational form. This mechanism suggests that ATP can be synthesized and 
released by driving the rotation of the y subunit in the appropriate direction. 


Rotational catalysis is the world’s smallest molecular motor 


Is it possible to observe the proposed rotation directly? Elegant experiments, 
using single-molecule techniques (Section 8.6), have demonstrated the rota- 
tion through the use of a simple experimental system consisting solely of 
cloned a8,y subunits (Figure 18.30). The ® subunits were engineered to 
contain amino-terminal polyhistidine tags, which have a high affinity for 
nickel ions (Section 3.1). This property of the tags allowed the a8, assembly 
to be immobilized on a glass surface that had been coated with nickel ions. 
The y subunit was linked to a fluorescently labeled actin filament to provide 
a long segment that could be observed under a fluorescence microscope. 
Remarkably, the addition of ATP caused the actin filament to rotate unidi- 
rectionally in a counterclockwise direction. The y subunit was rotating, driven 
by the hydrolysis of ATP. Thus, the catalytic activity of an individual mole- 
cule could be observed. The counterclockwise rotation is consistent with the 
predicted mechanism for hydrolysis because the molecule was viewed from 
below relative to the view shown in Figure 18.30. 

More-detailed analysis in the presence of lower concentrations of ATP 
revealed that the y subunit rotates in 120-degree increments. Each incre- 
ment corresponds to the hydrolysis of a single ATP molecule. In addition, 
from the results obtained by varying the length of the actin filament and 
measuring the rate of rotation, the enzyme appears to operate near 100% 
efficiency; that is, essentially all of the energy released by ATP hydrolysis is 
converted into rotational motion. 


Proton flow around the c ring powers ATP synthesis 


The direct observation of rotary motion of the y subunit is strong evidence 
for the rotational mechanism for ATP synthesis. The last remaining ques- 
tion 1s: How does proton flow through Fy drive the rotation of the y subunit? 
The mechanism depends on the structures of the a and c subunits of Fp 
(Figure 18.31). The stationary a subunit directly abuts the membrane- 
spanning ring formed by 8 to 14 ¢ subunits. The a subunit includes two 
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FIGURE 18.32 Proton motion across the membrane drives rotation of the c ring. A proton 
enters from the intermembrane space into the cytoplasmic half-channel to neutralize the charge 
on an aspartate residue in a c subunit. With this charge neutralized, the ¢ ring can rotate 
clockwise by one c subunit, moving an aspartic acid residue out of the membrane into the 
matrix half-channel. This proton can move into the matrix, resetting the system to its initial state. 


hydrophilic half-channels that do not span the membrane (Figure 18.31). 
Thus, protons can pass into either of these channels, but they cannot move 
completely across the membrane. The a subunit is positioned such that each 
half-channel directly interacts with one c subunit. 

The structure of the c subunit was determined both by NMR methods 
and by x-ray crystallography. Each polypeptide chain forms a pair of a helices 
that span the membrane. A glutamic acid (or aspartic acid) residue is found in 
the middle of one of the helices. If the glutamate is charged (unprotonated), 
the c subunit will not move into the membrane. The key to proton movement 
across the membrane is that, in a proton-rich environment, such as the cyto- 
plasmic side of the inner mitochondrial membrane, a proton will enter a chan- 
nel and bind the glutamate residue, while the glutamic acid in a proton-poor 
environment of the other half-channel will release a proton (Figure 18.32). 
The c subunit with the bound proton then moves into the membrane as the 
ring rotates by one c subunit. This rotation brings the newly deprotonated c 
subunit from the matrix half-channel to the proton rich cytoplasmic half- 
channel, where it can bind a proton. The movement of protons through the half- 
channels from the high proton concentration of the cytoplasm to the low proton 
concentration of the matrix powers the rotation of the c ring. The a unit remains 
stationary as the c ring rotates. Each proton that enters the cytoplasmic half- 
channel of the a unit moves through the membrane by riding around on the 
rotating c ring to exit through the matrix half-channel into the proton-poor 
environment of the matrix (Figure 18.33). 

How does the rotation of the c ring lead to the synthesis of ATP? The 
c ring is tightly linked to the y and « subunits. Thus, as the c ring turns, the 
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FIGURE 18.33 Proton path through the 
membrane. Each proton enters the 
cytoplasmic half-channel, follows a complete 
rotation of the ¢ ring, and exits through the 
other half-channel into the matrix. 
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CHAPTER 18 Oxidative Phosphorylation 


A little goes a long way 

Despite the various molecular machinations 
and the vast numbers of ATPs synthesized 
and protons pumped, a resting human being 
requires surprisingly little power. Approximately 
116 watts, the energy output of a typical light 
bulb, provides enough energy to sustain a 
resting person. 


FIGURE 18.34 Overview of oxidative 
phosphorylation. The electron-transport 


chain generates a proton gradient, which is 


used to synthesize ATP. 


y and ¢ subunits are turned inside the a8, hexamer unit of F,. The rotation 
of the y subunit in turn promotes the synthesis of ATP through the binding- 
change mechanism. The exterior column formed by the two b chains and 
the 8 subunit prevents the a8, hexamer from rotating in sympathy with the 
c ring. Recall that the number of c subunits in the c ring appears to range 
between 8 and 14. This number is significant because it determines the 
number of protons that must be transported to generate a molecule of ATP. 
Each 360-degree rotation of the yy subunit leads to the synthesis and release 
of three molecules of ATP. Thus, if there are 10 c subunits in the ring (as 
was observed in a crystal structure of yeast mitochondrial ATP synthase), 
each ATP generated requires the transport of 10/3 = 3.33 protons. Recent 
evidence shows that the c rings of all vertebrates are composed of 8 sub- 
units, making vertebrate ATP synthase the most-efficient ATP synthase 
known, with the transport of only 2.7 protons required for ATP synthesis. 
For simplicity, we will assume that three protons must flow into the matrix 
for each ATP formed, but we must keep in mind that the true value may 
differ. As we will see, the electrons from NADH pump enough protons to 
generate 2.5 molecules of ATP, whereas those from FADH, yield 1.5 mol- 
ecules of ATP. 

Let us return for a moment to the example with which we began this 
chapter. If a resting human being requires 85 kg of ATP per day for bodily 
functions, then 3.3 X 102° protons must flow through the ATP synthase per 
day, or 3.3 X 102! protons per second. Figure 18.34 summarizes the process 
of oxidative phosphorylation. 


ATP synthase and G proteins have several common features 


Wa The a and B subunits of ATP synthase are members of the P-loop 
NTPase family of proteins. In Chapter 14, we learned that the signal- 
ing properties of other members of this family, the G proteins, depend on 
their ability to bind nucleoside triphosphates and diphosphates with great 
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tenacity. They do not exchange nucleotides unless they are stimulated to do so 
by interaction with other proteins. The binding-change mechanism of ATP 
synthase is a variation on this theme. The P-loop regions of the B subunits 
will bind either ADP or ATP (or release ATP), depending on which of three 
different faces of the y subunit they interact with. The conformational 
changes take place in an orderly way, driven by the rotation of the yy subunit. 


18.5 Many Shuttles Allow Movement Across 
Mitochondrial Membranes 


The inner mitochondrial membrane must be impermeable to most mole- 
cules, yet much exchange has to take place between the cytoplasm and the 
mitochondria. This exchange is mediated by an array of membrane-spanning 
transporter proteins (Section 13.4). 


Electrons from cytoplasmic NADH enter mitochondria by shuttles 


One function of the respiratory chain is to regenerate NAD* for use in gly- 
colysis. NADH generated by the citric acid cycle and fatty acid oxidation is 
already in the mitochondrial matrix, but how is cytoplasmic NADH reoxi- 
dized to NAD* under aerobic conditions? NADH cannot simply pass into 
mitochondria for oxidation by the respiratory chain, because the inner mito- 
chondrial membrane is impermeable to NADH and NAD*. The solution is 
that electrons from NADH, rather than NADH itself, are carried across the 
mitochondrial membrane. One of several means of introducing electrons 
from NADH into the electron-transport chain is the glycerol 3-phosphate 
shuttle (Figure 18.35). The first step in this shuttle is the transfer of a pair of 
electrons from NADH to dihydroxyacetone phosphate, a glycolytic inter- 
mediate, to form glycerol 3-phosphate. This reaction is catalyzed by a glyc- 
erol 3-phosphate dehydrogenase in the cytoplasm. Glycerol 3-phosphate is 
reoxidized to dihydroxyacetone phosphate on the outer surface of the inner 
mitochondrial membrane by a membrane-bound isozyme of glycerol 
3-phosphate dehydrogenase. An electron pair from glycerol 3-phosphate is 
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FIGURE 18.35 Glycerol 3-phosphate shuttle. Electrons from NADH can enter the 
mitochondrial electron-transport chain by being used to reduce dihydroxyacetone phosphate 
to glycerol 3-phosphate. Glycerol 3-phosphate is reoxidized by electron transfer to an FAD 
prosthetic group in a membrane-bound glycerol 3-phosphate dehydrogenase. Subsequent 
electron transfer to Q to form QH, allows these electrons to enter the electron-transport chain. 
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transferred to an FAD prosthetic group in this enzyme to form FADH,. 
This reaction also regenerates dihydroxyacetone phosphate. 

The reduced flavin transfers its electrons to the electron carrier Q, which 
then enters the respiratory chain as QH,,. When cytoplasmic NADH trans- 
ported by the glycerol 3-phosphate shuttle is oxidized by the respiratory chain, 
1.5 rather than 2.5 molecules of ATP are formed. The yield is lower because 
FAD rather than NAD* is the electron acceptor in mitochondrial glycerol 
3-phosphate dehydrogenase. The use of FAD enables electrons from cyto- 
plasmic NADH to be transported into mitochondria against an NADH 
concentration gradient. The price of this transport is one molecule of ATP 
per two electrons. This glycerol 3-phosphate shuttle is especially prominent 
in muscle and enables it to sustain a very high rate of oxidative phosphoryla- 
tion. Indeed, some insects lack lactate dehydrogenase and are completely 
dependent on the glycerol 3-phosphate shuttle for the regeneration of cyto- 
plasmic NAD*. 

In the heart and liver, electrons from cytoplasmic NADH are brought 
into mitochondria by the malate—aspartate shuttle, which is mediated by 
two membrane carriers and four enzymes (Figure 18.36). Electrons are 
transferred from NADH in the cytoplasm to oxaloacetate, forming malate, 
which traverses the inner mitochondrial membrane in exchange for 
a-ketoglutarate and is then reoxidized by NAD* in the matrix to form 
NADH in a reaction catalyzed by the citric acid cycle enzyme malate dehy- 
drogenase. The resulting oxaloacetate does not readily cross the inner 
mitochondrial membrane and so a transamination reaction (Section 23.3) 
is needed to form aspartate, which can be transported to the cytoplasmic 
side in exchange for glutamate. Glutamate donates an amino group to oxa- 
loacetate, forming aspartate and a-ketoglutarate. In the cytoplasm, aspar- 
tate is then deaminated to form oxaloacetate and the cycle is restarted. 


The entry of ADP into mitochondria is coupled to the exit of 

ATP by ATP-ADP translocase 

The major function of oxidative phosphorylation is to generate ATP from 
ADP. ATP and ADP do not diffuse freely across the inner mitochondrial 
membrane. How are these highly charged molecules moved across the inner 
membrane into the cytoplasm? A specific transport protein, ATP-ADP 
translocase, enables these molecules to transverse this permeability barrier. 
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FIGURE 18.36 Malate-aspartate shuttle. 
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FIGURE 18.37 Mechanism of mitochondrial ATP-ADP translocase. The translocase 
catalyzes the coupled entry of ADP into the matrix and the exit of ATP from it. The binding 
of ADP (1) from the cytoplasm favors eversion of the transporter (2) to release ADP into the 
matrix (3). Subsequent binding of ATP from the matrix to the everted form (4) favors eversion 
back to the original conformation (5), releasing ATP into the cytoplasm (6). 


Most important, the flows of ATP and ADP are coupled. ADP enters the 
mitochondrial matrix only if ATP exits, and vice versa. This process is carried 
out by the translocase, an antiporter: 


3- 4— 3- 4- 
ADP cytoplasm + ATP matrix —> ALP matrix + ATP cytoplasm 


ATP-ADP translocase is highly abundant, constituting about 15% of the 
protein in the inner mitochondrial membrane. The abundance is a manifes- 
tation of the fact that human beings exchange the equivalent of their weight 
in ATP each day. Despite the fact that mitochondria are the sites of ATP 
synthesis, both the cytoplasm and the nucleus have more ATP than the 
mitochondria, a testament to the efficiency of transport. 

The 30-kDa translocase contains a single nucleotide-binding site that 
alternately faces the matrix and the cytoplasmic sides of the membrane 
(Figure 18.37). ATP and ADP bind to the translocase without Mg?*, and 
ATP has one more negative charge than that of ADP. Thus, in an actively 
respiring mitochondrion with a positive membrane potential, ATP trans- 
port out of the mitochondrial matrix and ADP transport into the matrix are 
favored. This ATP-ADP exchange is energetically expensive; about a quar- 
ter of the proton-motive force generated by the respiratory chain is con- 
sumed by this exchange process. The inhibition of the translocase leads to 
the subsequent inhibition of cellular respiration as well (p. 557). 


Mitochondrial transporters for metabolites have a 
common tripartite structure 


Examination of the amino acid sequence of the ATP-ADP translocase 
revealed that this protein consists of three tandem repeats of a 100-amino- 
acid module, each of which appears to have two transmembrane segments. 
This tripartite structure has recently been confirmed by the determination 
of the three-dimensional structure of this transporter (Figure 18.38). The 
transmembrane helices form a tepee-like structure with the nucleotide- 
binding site (marked by a bound inhibitor) lying in the center. Each of the 
three repeats adopts a similar structure. 
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hk FIGURE 18.38 Structure of 
mitochondrial transporters. The structure 
of the ATP-ADP translocase is shown. 
Notice that this structure comprises three 
similar units (shown in red, blue, and 
yellow) that come together to form a 
binding site, here occupied by 
atractyloside, an inhibitor of this transporter. 
Other members of the mitochondrial 
transporter family adopt similar tripartite 
structures. [Drawn from 1OKC.pdb.] 
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FIGURE 18.39 Mitochondrial 
transporters. Transporters (also called 
carriers) are transmembrane proteins that 
carry specific ions and charged metabolites 
across the inner mitochondrial membrane. 
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ATP-ADP translocase is but one of many mitochondrial transporters 
for ions and charged metabolites (Figure 18.39). The phosphate carrier, 
which works in concert with ATP-ADP translocase, mediates the electro- 
neutral exchange of H,PO, for OH . The combined action of these two 
transporters leads to the exchange of cytoplasmic ADP and P, for matrix 
ATP at the cost of the influx of one H* (owing to the transport of one OH 
out of the matrix). These two transporters, which provide ATP synthase 
with its substrates, are associated with the synthase to form a large complex 
called the ATP synthasome. 

Other homologous carriers also are present in the inner mitochondrial 
membrane. The dicarboxylate carrier enables malate, succinate, and fuma- 
rate to be exported from the mitochondrial matrix in exchange for P.. The 
tricarboxylate carrier exchanges citrate and H* for malate. In all, more than 
40 such carriers are encoded in the human genome. A different type of car- 
rier is responsible for pyruvate transport into the mitochondria. Pyruvate in 
the cytoplasm enters the mitochondrial membrane by way of a heterodimer 
composed of two small transmembrane proteins. 


18.6 The Regulation of Cellular Respiration Is Governed 
Primarily by the Need for ATP 


Because ATP is the end product of cellular respiration, the ATP needs of 
the cell are the ultimate determinant of the rate of respiratory pathways and 
their components. 


The complete oxidation of glucose yields about 30 molecules of ATP 


We can now estimate how many molecules of ATP are formed when glu- 
cose 1s completely oxidized to CO,. The number of ATP molecules formed 
in glycolysis and the citric acid cycle is unequivocally known because it is 
determined by the stoichiometries of chemical reactions. In contrast, the 
ATP yield of oxidative phosphorylation is less certain because the stoichi- 
ometries of proton pumping, ATP synthesis, and metabolite-transport 
processes need not be an integer or even have fixed values. As stated ear- 
lier, the best current estimates for the number of protons pumped out of 
the matrix by NADH-Q oxidoreductase, Q-cytochrome c oxidoreductase, 
and cytochrome c oxidase per electron pair are four, two, and four, respec- 
tively. The synthesis of a molecule of ATP is driven by the flow of about 
three protons through ATP synthase. An additional proton is consumed in 
transporting ATP from the matrix to the cytoplasm. Hence, about 2.5 
molecules of cytoplasmic ATP are generated as a result of the flow of a pair 
of electrons from NADH to O,,. For electrons that enter at the level of 
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ATP yield per i 
Reaction sequence see ee Enosphemiauen 
Glycolysis: Conversion of glucose into pyruvate (in the cytoplasm) 
Phosphorylation of glucose = 
Phosphorylation of fructose 6-phosphate 1 
Dephosphorylation of 2 molecules of 1,3-BPG +2 
Dephosphorylation of 2 molecules of phosphoenolpyruvate +2 


2 molecules of NADH are formed in the oxidation 
of 2 molecules of glyceraldehyde 3-phosphate 


Conversion of pyruvate into acetyl CoA (inside mitochondria) 
2 molecules of NADH are formed 


Citric acid cycle (inside mitochondria) 
2 molecules of adenosine triphosphate are formed 
from 2 molecules of succinyl CoA +2 
6 molecules of NADH are formed in the oxidation of 2 molecules 
each of isocitrate, a-ketoglutarate, and malate 
2 molecules of FADH, are formed in the oxidation of 2 molecules 
of succinate 


Oxidative phosphorylation (inside mitochondria) 
2 molecules of NADH formed in glycolysis; each 
yields 1.5 molecules of ATP (assuming transport of NADH by 


the glycerol 3-phosphate shuttle) a3) 
2 molecules of NADH formed in the oxidative decarboxylation 
of pyruvate; each yields 2.5 molecules of ATP tP5) 
2 molecules of FADH, formed in the citric acid cycle; each 
yields 1.5 molecules of ATP +3 
6 molecules of NADH formed in the citric acid cycle; each 
yields 2.5 molecules of ATP +15 
Net Yield per Molecule of Glucose +30 


Information on the ATP yield of oxidative phosphorylation is from values given in P. C. Hinkle, M. A. 
Kumar, A. Resetar, and D. L. Harris, Biochemistry 30:3576, 1991. 


Note: The current value of 30 molecules of ATP per molecule of glucose supersedes the earlier value of 36 
molecules of ATP. The stoichiometries of proton pumping, ATP synthesis, and metabolite transport should 
be regarded as estimates. About 2 more molecules of ATP are formed per molecule of glucose oxidized when 
the malate—aspartate shuttle rather than the glycerol 3-phosphate shuttle is used. 


Q-cytochrome c oxidoreductase, such as those from the oxidation of suc- 
cinate or cytoplasmic NADH transferred by the glycerol-phosphate shut- 
tle, the yield is about 1.5 molecules of ATP per electron pair. Hence, as 
tallied in Table 18.4, about 30 molecules of ATP ave formed when glucose is 
completely oxidized to CO,. Most of the ATP, 26 of 30 molecules formed, 
is generated by oxidative phosphorylation. Recall that the anaerobic 
metabolism of glucose yields only 2 molecules of ATP. One of the effects 
of endurance exercise, a practice that calls for much ATP for an extended 
period of time, is to increase the number of mitochondria and blood vessels 
in muscle and thus increase the extent of ATP generation by oxidative 
phosphorylation. 


The rate of oxidative phosphorylation is determined 
by the need for ATP 


How is the rate of the electron-transport chain controlled? Under most physi- 
ological conditions, electron transport is tightly coupled to phosphorylation. 
Electrons do not usually flow through the electron-transport chain to O, unless 
ADP is simultaneously phosphorylated to ATP. When ADP concentration 
rises, as would be the case in active muscle, the rate of oxidative phosphoryla- 
tion increases to meet the ATP needs of the muscle. The regulation of the rate 
of oxidative phosphorylation by the ADP level is called respiratory control or 
acceptor control. Experiments on isolated mitochondria demonstrate the 


* Supply of ADP 


ADP added nearly exhausted 


\ 


O, consumed 


Time 


FIGURE 18.40 Respiratory control. 
Electrons are transferred to O, only if ADP 
is concomitantly phosphorylated to ATP. 


FIGURE 18.41 Energy charge regulates 
the use of fuels. The synthesis of ATP 
from ADP and P; controls the flow of 
electrons from NADH and FADH, to 
oxygen. The availability of NAD+ and FAD 
in turn control the rate of the citric acid 
cycle (CAC). 
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importance of ADP level (Figure 18.40). The rate of oxygen consumption by 
mitochondria increases markedly when ADP is added and then returns to its 
initial value when the added ADP has been converted into ATP. 

The level of ADP likewise affects the rate of the citric acid cycle. At low 
concentrations of ADP, as in a resting muscle, NADH and FADH, are not 
consumed by the electron-transport chain. The citric acid cycle slows because 
there is less NAD* and FAD to feed the cycle. As the ADP level rises and 
oxidative phosphorylation speeds up, NADH and FADH, are oxidized, and 
the citric acid cycle becomes more active. Electrons do not flow from fuel mol- 
ecules to O, unless ATP needs to be synthesized. We see here another example 
of the regulatory significance of the energy charge (Figure 18.41). 


ATP 


P; + ADP 


Intermembrane 
space 

FADH, g 

NADH gradient ATP 
Matrix 


(Be) NAD*,FAD O, 4,0 


(oe) 7 


Acetyl A 


ATP synthase can be regulated 


Mitochondria contain an evolutionarily conserved protein, inhibitory factor 1 
(IF 1), that specifically inhibits the potential hydrolytic activity of the F)F, ATP 
synthase. What is the function of IF1? Consider a circumstance where tissues 
may be deprived of oxygen (ischemia). Without oxygen as the electron accep- 
tor, the electron transport chain will be unable to generate the proton-motive 
force. The ATP in the mitochondria would be hydrolyzed by the synthase, 
working in reverse (Problem 18.45). The role of IF'1 is to prevent the wasteful 
hydrolysis of ATP by inhibiting the hydrolytic activity of the synthase. 


ee IF 1 is overexpressed in many types of cancer. This over-expression 

plays a role in the induction of the Warburg effect, the switch from 
oxidative phosphorylation to aerobic glycolysis as the principle means for 
ATP synthesis (Section 16.2). 


Regulated uncoupling leads to the generation of heat 


Some organisms possess the ability to uncouple oxidative phosphorylation from 
ATP synthesis to generate heat. Such uncoupling is a means to maintain body 
temperature in hibernating animals, in some newborn animals (including 
human beings), and in many adult mammals, especially those adapted to cold. 
The skunk cabbage uses an analogous mechanism to heat its floral spikes in 
early spring, increasing the evaporation of odoriferous molecules that attract 
insects to fertilize its flowers. In animals, the uncoupling is in brown adipose 
tissue (BAT), which is specialized tissue for the process of nonshivering 


thermogenesis. In contrast, white adipose 
tissue (WAT), which constitutes the bulk 
of adipose tissue, plays no role in thermo- 
genesis but serves as an energy source and 
an endocrine gland (Chapters 26 and 27). 
Brown adipose tissue is very rich in 
mitochondria, often called brown fat mito- 
chondria. The tissue appears brown from 
the combination of the greenish-colored 
cytochromes in the numerous mitochon- 
dria and the red hemoglobin present in the 
extensive blood supply, which helps to 
carry the heat through the body. The inner 
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H™  H* ADP + P; > ATP 
FIGURE 18.42 Action of an uncoupling protein. Uncoupling protein 
(UCP-1) generates heat by permitting the influx of protons into the 
mitochondria without the synthesis of ATP. 


mitochondrial membrane of these mito- 

chondria contains a large amount of uncoupling protein (UCP-1), or thermo- 
genin, a dimer of 33-kDa subunits that resembles ATP-ADP translocase. 
UCP-1 transports protons from the cytoplasm to the matrix with the assis- 
tance of fatty acids. In essence, UCP-1 generates heat by short-circuiting the 
mitochondrial proton battery. The energy of the proton gradient, normally 
captured as ATP, is released as heat as the protons flow through UCP-1 to the 
mitochondrial matrix. This dissipative proton pathway is activated when the 
core body temperature begins to fall. In response to a temperature drop, 
a-adrenergic hormones stimulate the release of free fatty acids from triacylg- 
lycerols stored in cytoplasmic lipid granules (Section 22.2) (Figure 18.42). 
Long chain fatty acids bind to the cytoplasmic face of UCP-1, and the car- 
boxyl group binds a proton. This causes a structural change in UCP-1 so that 
the protonated carboxyl] now faces the proton-poor environment of the matrix, 
and the proton is released. Proton release resets UCP-1 to the initial state. 


e Until recently, adult humans were believed to lack brown fat tissue. 
However, new studies have established that adults, especially females, 

have brown adipose tissue in the neck and upper chest regions that is activated 

by cold (Figure 18.43). Obesity leads to a decrease in brown adipose tissue. 


(A) (B) 


FIGURE 18.43 Brown adipose tissue is revealed on exposure to cold. The results of 
PET-CT scanning show the uptake and distribution of '8F-fluorodeoxyglucose ('8F-FDG) in 
adipose tissue. The patterns of '8F-FDG uptake in the same subject are dramatically different 
under thermoneutral conditions (A) and after exposure to cold (B). [Republished with permission 
of American Society for Clinical Investigation from J Clin Invest. 2013;123(8):3395-3403. 
doi:10.1172/JCI68993. Permission conveyed through Copyright Clearance Center, Inc.] 
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FIGURE 18.44 Sites of action of some 
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We can witness the effects of a lack of nonshivering thermogenesis by 

examining pig behavior. Pigs are unusual mammals in that they have 
large litters and are the only ungulates (hoofed animals) that build nests for 
birth. These behavioral characteristics appear to be the result of a biochemical 
deficiency. Pigs lack UCP-1 and, hence, brown fat. Piglets must rely on other 
means of thermogenesis, such as nesting, large litter size, and shivering. 


a In addition to UCP-1, two other uncoupling proteins have been identi- 

fied. UCP-2, which is 56% identical in sequence with UCP-1, is found 
in a wide variety of tissues. UCP-3 (57% identical with UCP-1 and 73% iden- 
tical with UCP-2) is localized to skeletal muscle and brown fat. This family of 
uncoupling proteins, especially UCP-2 and UCP-3, may play a role in energy 
homeostasis. In fact, the genes for UCP-2 and UCP-3 map to regions of the 
human and mouse chromosomes that have been linked to obesity, supporting 
the notion that they function as a means of regulating body weight. 


Oxidative phosphorylation can be inhibited at many stages 


Many potent and lethal poisons exert their effect by inhibiting oxidative 
phosphorylation at one of a number of different locations (Figure 18.44): 


1. Inhibition of the electron-transport chain. Rotenone, which is used as a fish 
and insect poison, and amytal, a barbiturate sedative, block electron trans- 
fer in NADH-Q oxidoreductase and thereby prevent the utilization of 
NADH as a substrate. Rotenone, as an electron-transport-chain inhibitor, 
may play a role, along with genetic susceptibility, in the development of 
Parkinson disease. In the presence of rotenone and amytal, electron flow 
resulting from the oxidation of succinate is unimpaired, because these elec- 
trons enter through QH,, beyond the block. Antimycin A interferes with 
electron flow from cytochrome b,,; in Q-cytochrome c oxidoreductase. 
Furthermore, electron flow in cytochrome c oxidase can be blocked by cya- 
nide (CN7), azide (N3_), and carbon monoxide (CO). Cyanide and azide 
react with the ferric form of heme a3, whereas carbon monoxide inhibits the 
ferrous form. Inhibition of the electron-transport chain also inhibits ATP 
synthesis because the proton-motive force can no longer be generated. 


2. Inhibition of ATP synthase. Oligomycin, an antibiotic used as an antifun- 
gal agent, and dicyclohexylcarbodiimide (DCC) prevent the influx of pro- 
tons through ATP synthase by binding to the carboxylate group of the c 
subunits required for proton binding. Modification of only one c subunit by 
DCC is sufficient to inhibit the rotation of the entire c ring and hence ATP 
synthesis. If actively respiring mitochondria are exposed to an inhibitor of 
ATP synthase, the electron-transport chain ceases to operate. This obser- 
vation clearly illustrates that electron transport and ATP synthesis are 
normally tightly coupled. 


3. Uncoupling electron transport from ATP synthesis. The tight coupling of 
electron transport and phosphorylation in mitochondria can be uncoupled 
by 2,4-dinitrophenol (DNP) and certain other acidic aromatic compounds. 
These substances carry protons across the inner mitochondrial membrane, 
down their concentration gradient. In the presence of these uncouplers, 
electron transport from NADH to O, proceeds in a normal fashion, but 
ATP is not formed by mitochondrial ATP synthase, because the proton- 
motive force across the inner mitochondrial membrane is continuously dis- 
sipated. This loss of respiratory control leads to increased oxygen consump- 
tion and oxidation of NADH. Indeed, in the accidental ingestion of uncou- 
plers, large amounts of metabolic fuels are consumed, but no energy is 
captured as ATP. Rather, energy is released as heat. DNP is the active 


ingredient in some herbicides and fungicides. Remarkably, some people 
consume DNP as a weight-loss drug, despite the fact that the FDA banned 
its use in 1938. There are also reports that Soviet soldiers were given DNP 
to keep them warm during the long Russian winters. Chemical uncouplers 
are nonphysiological, unregulated counterparts of uncoupling proteins. 


e Drugs are being sought that would function as mild uncouplers, 
uncouplers not as potentially lethal as DNP, for use in treatment of 
obesity and related pathologies. Xanthohumol, a prenylated chalcone found 
in hops and beer, shows promise in this regard. Xanthohumol also scav- 
enges free radicals and is used for treatment of certain types of cancers. 


4. Inhibition of ATP export. ATP-ADP translocase is specifically inhibited 
by very low concentrations of atractyloside (a plant glycoside) or bongkrekic 
acid (an antibiotic from a mold). Atractyloside binds to the translocase 
when its nucleotide site faces the cytoplasm, whereas bongkrekic acid binds 
when this site faces the mitochondrial matrix. Oxidative phosphorylation 
stops soon after either inhibitor is added, showing that ATP-ADP translo- 
case is essential for maintaining adequate amounts of ADP to accept the 
energy associated with the proton-motive force. 


Mitochondrial diseases are being discovered 


The number of diseases that can be attributed to mitochondrial muta- 

tions is steadily growing in step with our growing understanding of 
the biochemistry and genetics of mitochondria. The prevalence of mito- 
chondrial diseases is estimated to be from 10 to 15 per 100,000 people, 
roughly equivalent to the prevalence of the muscular dystrophies. The first 
mitochondrial disease to be understood was Leber hereditary optic neu- 
ropathy (LHON), a form of blindness that strikes in midlife as a result of 
mutations in Complex I. Some of these mutations impair NADH utiliza- 
tion, whereas others block electron transfer to Q. Mutations in Complex I 
are the most frequent cause of mitochondrial diseases. The accumulation of 
mutations in mitochondrial genes in a span of several decades may contrib- 
ute to aging, degenerative disorders, and cancer. 

A human egg harbors several hundred thousand molecules of mitochon- 
drial DNA, whereas a sperm contributes only a few hundred and thus has 
little effect on the mitochondrial genotype. Because the maternally inher- 
ited mitochondria are present in large numbers and not all of the mitochon- 
dria may be affected, the pathologies of mitochondrial mutants can be quite 
complex. Even within a single family carrying an identical mutation, chance 
fluctuations in the percentage of mitochondria with the mutation lead to 
large variations in the nature and severity of the symptoms of the pathologi- 
cal condition as well as the time of onset. As the percentage of defective 
mitochondria increases, energy-generating capacity diminishes until, at 
some threshold, the cell can no longer function properly. Defects in cellular 
respiration are doubly dangerous. Not only does energy transduction 
decrease, but also the likelihood that reactive oxygen species will be gener- 
ated increases. Organs that are highly dependent on oxidative phosphoryla- 
tion, such as the nervous system, the retina, and the heart, are most vulner- 
able to mutations in mitochondrial DNA. 


Mitochondria play a key role in apoptosis 


In the course of development or in cases of significant cell damage, indi- 
vidual cells within multicellular organisms undergo programmed cell death, 
or apoptosis. Mitochondria act as control centers regulating this process. 
Although the details have not yet been established, the outer membrane of 


557 


18.6 Regulation of Oxidative 
Phosphorylation 


558 
CHAPTER 18 Oxidative Phosphorylation 


"Electron potential 
AE 


' PROTON GRADIENT 
Ap 


‘ 


FIGURE 18.45 The proton gradient is an 
interconvertible form of free energy. 


damaged mitochondria becomes highly permeable, a process referred to as 
mitochondrial outer membrane permeabilization (MOMP). This permeabili- 
zation is instigated by a family of proteins (Bcl family) that were initially 
discovered because of their role in cancer. One of the most potent activators 
of apoptosis, cytochrome c, exits the mitochondria and interacts with apop- 
totic peptidase-activating factor 1 (APAF-1), which leads to the formation of 
the apoptosome. The apoptosome recruits and activates a proteolytic enzyme 
called caspase 9, a member of the cysteine protease family (Section 9.1) that 
in turn activates a cascade of other caspases. Each caspase type destroys a 
particular target, such as the proteins that maintain cell structure. Another 
target is a protein that inhibits an enzyme that destroys DNA (an enzyme 
called caspase-activated DNAse or CAD), freeing CAD to cleave the genetic 
material. This cascade of proteolytic enzymes has been called “death by a 
thousand tiny cuts.” 


Power transmission by proton gradients is a central 
motif of bioenergetics 


The main concept presented in this chapter is that mito- 
chondrial electron transfer and ATP synthesis are linked 
by a transmembrane proton gradient. ATP synthesis in 
bacteria and chloroplasts also is driven by proton gradi- 
ents. In fact, proton gradients power a variety of energy- 
requiring processes such as the active transport of calcium 
ions by mitochondria, the entry of some amino acids and 
sugars into bacteria, the rotation of bacterial flagella, and 


a the transfer of electrons from NADP*t to NADPH. 


Proton gradients can also be used to generate heat, as in 
nonshivering thermogenesis. It is evident that proton gra- 
dients ave a central interconvertible currency of free energy in 
biological systems (Figure 18.45). Mitchell noted that the 
proton-motive force is a marvelously simple and effective 
store of free energy because it requires only a thin, closed 
lipid membrane between two aqueous phases. 


SUMMARY 


18.1 Eukaryotic Oxidative Phosphorylation Takes Place in Mitochondria 
Mitochondria generate most of the ATP required by aerobic cells 
through a joint endeavor of the reactions of the citric acid cycle, which 
take place in the mitochondrial matrix, and oxidative phosphoryla- 
tion, which takes place in the inner mitochondrial membrane. 
Mitochondria are descendants of a free-living bacterium that estab- 
lished a symbiotic relation with another cell. 


18.2 Oxidative Phosphorylation Depends on Electron Transfer 

In oxidative phosphorylation, the synthesis of ATP is coupled to the 
flow of electrons from NADH or FADH, to O, by a proton gradient 
across the inner mitochondrial membrane. Electron flow through three 
asymmetrically oriented transmembrane complexes results in the 
pumping of protons out of the mitochondrial matrix and the generation 
of a membrane potential. ATP is synthesized when protons flow back 
to the matrix through a channel in an ATP-synthesizing complex, 
called ATP synthase (also known as F)F',;-ATPase). Oxidative phos- 
phorylation exemplifies a fundamental theme of bioenergetics: the 
transmission of free energy by proton gradients. 


18.3 


18.4 


18.5 


18.6 


The Respiratory Chain Consists of Four Complexes: Three Proton 
Pumps and a Physical Link to the Citric Acid Cycle 

The electron carriers in the respiratory assembly of the inner mitochon- 
drial membrane are quinones, flavins, iron-sulfur complexes, heme 
groups of cytochromes, and copper ions. Electrons from NADH are 
transferred to the FMN prosthetic group of NADH-Q oxidoreductase 
(Complex I), the first of four complexes. This oxidoreductase also con- 
tains Fe-S centers. The electrons emerge in QH,, the reduced form of 
ubiquinone (Q). The citric acid cycle enzyme succinate dehydrogenase is 
a component of the succinate-Q reductase complex (Complex IT), which 
donates electrons from FADH, to Q to form QH,. This hydrophobic 
carrier transfers its electrons to Q-cytochrome c oxidoreductase 
(Complex IIT), a complex that contains cytochromes b and c, and an Fe-S 
center. This complex reduces cytochrome c, a water-soluble peripheral 
membrane protein. Cytochrome c transfers electrons to cytochrome 
c oxidase (Complex IV). This complex contains cytochromes a and a, 
and three copper ions. A heme iron ion and a copper ion in this oxidase 
transfer electrons to O,, the ultimate acceptor, to form H,O. 


A Proton Gradient Powers the Synthesis of ATP 

The flow of electrons through Complexes I, III, and IV leads to the 
transfer of protons from the matrix side to the cytoplasmic side of the 
inner mitochondrial membrane. A proton-motive force consisting of a 
pH gradient (matrix side basic) and a membrane potential (matrix side 
negative) is generated. The flow of protons back to the matrix side 
through ATP synthase drives ATP synthesis. The enzyme complex is 
a molecular motor made of two operational units: a rotating compo- 
nent and a stationary component. The rotation of the y subunit 
induces structural changes in the B subunit that result in the synthesis 
and release of ATP from the enzyme. Proton influx provides the force 
for the rotation. 

The flow of two electrons through NADH-Q oxidoreductase, 
Q-cytochrome c oxidoreductase, and cytochrome c oxidase generates 
a gradient sufficient to synthesize 1, 0.5, and 1 molecule of ATP, 
respectively. Hence, 2.5 molecules of ATP are formed per molecule of 
NADH oxidized in the mitochondrial matrix, whereas only 1.5 mol- 
ecules of ATP are made per molecule of FADH, oxidized, because its 
electrons enter the chain at QH,, after the first proton-pumping site. 


Many Shuttles Allow Movement Across Mitochondrial Membranes 
Mitochondria employ a host of transporters, or carriers, to move mol- 
ecules across the inner mitochondrial membrane. The electrons of 
cytoplasmic NADH are transferred into mitochondria by the glycerol 
phosphate shuttle to form FADH, from FAD or by the malate—aspar- 
tate shuttle to form mitochondrial NADH. The entry of ADP into the 
mitochondrial matrix is coupled to the exit of ATP by ATP-ADP 
translocase, a transporter driven by membrane potential. 


The Regulation of Cellular Respiration ls Governed 

Primarily by the Need for ATP 

About 30 molecules of ATP are generated when a molecule of glucose 
is completely oxidized to CO, and H,O. Electron transport is nor- 
mally tightly coupled to phosphorylation. NADH and FADH, are 
oxidized only if ADP is simultaneously phosphorylated to ATP, a 
form of regulation called acceptor or respiratory control. Proteins have 
been identified that uncouple electron transport and ATP synthesis 
for the generation of heat. 
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KEY TERMS 


oxidative phosphorylation (p. 523) 

electron-transport chain (p. 523) (p. 533) 

proton-motive force (p. 523) 

cellular respiration (p. 524) 

reduction (redox, oxidation—reduction, E5) 
potential (p. 526) 

coenzyme Q (Q, ubiquinone) (p. 530) 

Q pool (p. 531) 

iron-sulfur (nonheme iron) protein 


(p. 531) 
NADH-Q oxidoreductase (Complex I) 
(p. 532) 


flavin mononucleotide (FMN) (p. 532) 


Q cycle (p. 535) 


(p. 535) 


catalase (p. 540) 


| PROBLEMS 


succinate-Q reductase (Complex IT) 
cytochrome c (Cyt c) (p. 533) 


Q-cytochrome c oxidoreductase 
(Complex II) (p. 534) 


Rieske center (p. 534) 
cytochrome c oxidase (Complex IV) 
superoxide dismutase (p. 539) 


ATP synthase (Complex V, F,F, 
ATPase) (p. 542) 


glycerol 3-phosphate shuttle (p. 549) 
malate—aspartate shuttle (p. 550) 


ATP-ADP translocase (adenine nucleo- 
tide translocase, ANT) (p. 550) 


respiratory (acceptor) control (p. 553) 
uncoupling protein (UCP) (p. 555) 


programmed cell death (apoptosis) 
(p. 557) 


mitochondrial outer membrane 


permeabilization (MOMP) (p. 558) 
apoptosome (p. 558) 
caspase (p. 558) 


1. Breathe or ferment? Compare fermentation and respira- 
tion with respect to electron donors and electron acceptors. 


2. Reference states. The standard oxidation—reduction 
potential for the reduction of O, to H,O is given as 0.82 V 
in Table 18.1. However, the value given in textbooks of 
chemistry is 1.23 V. Account for this difference. 


3. Less energetic electrons. Why are electrons carried by 
FADH, not as energy rich as those carried by NADH? 
What is the consequence of this difference? 


4. Now prove it. Calculate the energy released by the reduc- 
tion of O, with FADH),. 


5. Thermodynamic constraint. Compare the AG®’ values for 
the oxidation of succinate by NAD* and by FAD. Use the 
data given in Table 18.1 to find the Ej of the NAD* - 
NADH and fumarate-succinate couples, and assume that 
Eo for the FAD—FADH, redox couple is nearly 0.05 V. 
Why is FAD rather than NAD* the electron acceptor in 
the reaction catalyzed by succinate dehydrogenase? 


6. Give and accept. Distinguish between an oxidizing agent 
and a reducing agent. 


7. The benefactor and beneficiary. Identify the oxidant and 
the reductant in the following reaction. 


Pyruvate + NADH + H* == lactate + NAD* 


8. Six of one, half dozen of the other. How is the redox 
potential (AE}) related to the free-energy change of a reac- 
tion (AG®°’)? 


9. Location, location, location. Iron is a component of many 
of the electron carriers of the electron-transport chain. How 
can it participate in a series of coupled redox reactions if the 
E6 value is +0.77 V, as seen in Table 18.1? 


10. Line up. Place the following components of the 
electron-transport chain in their proper order: 

(a) cytochrome c 

(b) Q-cytochrome c oxidoreductase 

(c) NADH-Q reductase 

(d) cytochrome c oxidase 
(e) ubiquinone 


11. Like mac and cheese. Match each term with its description. 


(a) Respiration 1. Converts reactive oxygen 
(b) Redox potential species into hydrogen 
(c) Electron-transport chain peroxide 
2. Electron flow from 
(d) Flavin mononucleotide NADH and FADH, to O, 
(FMN) 3. Facilitates electron flow 
(e) Iron—sulfur protein from FMN to coenzyme 
Q in Complex I 
f) Coenzyme Q 4. An ATP-generating 


process in which an inor- 
ganic compound serves as 
the final electron acceptor 

5. Measure of the tendency to 
accept or donate electrons 

6. Converts hydrogen perox- 
ide into oxygen and water 

7. Funnels electrons from 
a two-electron carrier to a 
one-electron carrier 

8. Lipid-soluble electron 
carrier 

9. Donates electrons to 
Complex IV 

10. Accepts electrons from 

NADH in Complex I 


g) Cytochrome c 


( 
( 
(h) Q cycle 
( 


i) Superoxide dismutase 


(j) Catalase 


12. Match ’em. 

(a) Complex I___ 
(b) Complex IT___ 
(c) Complex HI ___ 
(d) Complex IV ___ 
(e) Ubiquinone ___ 


Q-cytochrome c oxidoreductase 
Coenzyme Q 
Succinate-Q reductase 


NADH-Q oxidoreductase 


Cytochrome c oxidase 


“a Rw MN 


13. Structural considerations. Explain why coenzyme Q 
is an effective mobile electron carrier in the electron- 
transport chain. 


14. Inhibitors. Rotenone inhibits electron flow through 
NADH-Q oxidoreductase. Antimycin A blocks electron 
flow between cytochromes b and c,. Cyanide blocks elec- 
tron flow through cytochrome oxidase to O,. Predict the 
relative oxidation—reduction state of each of the following 
respiratory-chain components in mitochondria that are 
treated with each of the inhibitors: 


(a) NADt 
(b) NADH-Q oxidoreductase 


(d) cytochrome c, 
(e) cytochrome c 


(c) coenzyme Q (f) cytochrome a 


15. Rumored to be a favorite of Elvis. Amytal is a barbitu- 
rate sedative that inhibits electron flow through Complex I. 
How would the addition of amytal to actively respiring 
mitochondria affect the relative oxidation—reduction states 
of the components of the electron-transport chain and the 
citric acid cycle? 


16. Efficiency. What is the advantage of having Complexes 
I, III, and IV associated with one another in the form of a 
respirasome? 


17. Linked In. What citric acid cycle enzyme is also a com- 
ponent of the electron-transport chain? 


18. ROS, not ROUS. What are the reactive oxygen species 
and why are they especially dangerous to cells? 


19. Reclaim resources. Humans have only about 250 g 
of ATP, but even a couch potato needs about 83 kg of 
ATP to open the bag of chips and use the remote. How is 
this discrepancy between requirements and resources 
reconciled? 


20. Energy harvest. What is the yield of ATP when each of 
the following substrates is completely oxidized to CO, by a 
mammalian cell homogenate? Assume that glycolysis, the 
citric acid cycle, and oxidative phosphorylation are fully 
active. 


(a) Pyruvate (d) Phosphoenolpyruvate 


(b) Lactate (e) Galactose 


(c) Fructose 1,6-bisphosphate (f) Dihydroxyacetone 
phosphate 
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21. Potent poisons. What is the effect of each of the follow- 
ing inhibitors on electron transport and ATP formation by 
the respiratory chain? 


(a) Azide 
(b) Atractyloside 


(c) Rotenone 


(d) DNP 
(e) Carbon monoxide 
(f) Antimycin A 


22. A question of coupling. What is the mechanistic basis for 
the observation that the inhibitors of ATP synthase also 
lead to an inhibition of the electron-transport chain? 


23. A Brownian ratchet wrench. What causes the c subunits 
of ATP synthase to rotate? What determines the direction 
of rotation? 


24. An essential residue. The conduction of protons by the 
F, unit of ATP synthase is blocked by dicyclohexylcarbodi- 
imide, which reacts readily with carboxyl groups. What are 
the most likely targets of action of this reagent? How might 
you use site-specific mutagenesis to determine whether this 
residue is essential for proton conduction? 


25. Alternative routes. The most common metabolic sign of 
mitochondrial disorders is lactic acidosis. Why? 


26. Connections. How does the inhibition of ATP-ADP 
translocase affect the citric acid cycle? Glycolysis? 


27. O, consumption. Oxidative phosphorylation in mito- 
chondria is often monitored by measuring oxygen con- 
sumption. When oxidative phosphorylation is proceeding 
rapidly, the mitochondria will rapidly consume oxygen. If 
there is little oxidative phosphorylation, only small amounts 
of oxygen will be used. You are given a suspension of iso- 
lated mitochondria and directed to add the following com- 
pounds in the order from a to h. With the addition of each 
compound, all of the previously added compounds remain 
present. Predict the effect of each addition on oxygen con- 
sumption by the isolated mitochondria. 


(a) Glucose 
(b) ADP + P. 
(c) Citrate 


(e) Succinate 
(f) Dinitrophenol 
(g) Rotenone 


(d) Oligomycin (h) Cyanide 


28. P : O ratios. The number of molecules of inorganic 
phosphate incorporated into organic form per atom of oxy- 
gen consumed, termed the P : O ratio, was frequently used 
as an index of oxidative phosphorylation. 


(a) What is the relation of the P : O ratio to the ratio of the 
number of protons translocated per electron pair (Ht /2 e7) 
and the ratio of the number of protons needed to synthesize 
ATP and transport it to the cytoplasm (P/H*)? 


(b) What are the P : O ratios for electrons donated by 
matrix NADH and by succinate? 
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29. Cyanide antidote. The immediate administration of 
nitrite is a highly effective treatment for cyanide poisoning. 
What is the basis for the action of this antidote? (Hint: 
Nitrite oxidizes ferrohemoglobin to ferrihemoglobin.) 


30. Runaway mitochondria 1. Suppose that the mitochon- 
dria of a patient oxidize NADH irrespective of whether 
ADP is present. The P : O ratio for oxidative phosphoryla- 
tion by these mitochondria is less than normal. Predict the 
likely symptoms of this disorder. 


31. Recycling device. The cytochrome b component of 
Q-cytochrome c oxidoreductase enables both electrons of 
QH, to be effectively utilized in generating a proton-motive 
force. Cite another recycling device in metabolism that 
brings a potentially dead end reaction product back into 
the mainstream. 


32. Crossover point. The precise site of action of a 
respiratory-chain inhibitor can be revealed by the crossover 
technique. Britton Chance devised elegant spectroscopic 
methods for determining the proportions of the oxidized and 
reduced forms of each carrier. This determination is feasible 
because the forms have distinctive absorption spectra, as 
illustrated in the adjoining graph for cytochrome c. You are 
given a new inhibitor and find that its addition to respiring 
mitochondria causes the carriers between NADH and QH, 
to become more reduced and those between cytochrome c and 
O,, to become more oxidized. Where does your inhibitor act? 


Reduced 


Absorbance coefficient (M-! cm=! x 1075) 


| 
400 500 600 


Wavelength (nm) 


33. Runaway mitochondria 2. Years ago, uncouplers were 
suggested to make wonderful diet drugs. Explain why this 
idea was proposed and why it was rejected. Why might the 
producers of antiperspirants be supportive of the idea? 


34. Everything is connected. If actively respiring mitochon- 
dria are exposed to an inhibitor of ATP-ADP translocase, 
the electron-transport chain ceases to operate. Why? 


35. Identifying the inhibition. You are asked to determine 
whether a chemical is an electron-transport-chain inhibitor 
or an inhibitor of ATP synthase. Design an experiment to 
make this determination. 


36. To each according to its needs. It has been noted that the 
mitochondria of muscle cells often have more cristae than 
the mitochondria of liver cells. Provide an explanation for 
this observation. 


37. Opposites attract. An arginine residue (Arg 210) in the 
a subunit of the E. coli ATP synthase is near the aspartate 
residue (Asp 61) in the matrix-side proton channel. How 
might Arg 210 assist proton flow? 


38. Variable c subunits. Recall that the number of c subunits 
in the c ring appears to range between 8 and 14. This number 
is significant because it determines the number of protons 
that must be transported to generate a molecule of ATP. Each 
360-degree rotation of the y subunit leads to the synthesis and 
release of three molecules of ATP. Thus, if there are 10 c sub- 
units in the ring (as was observed in a crystal structure of yeast 
mitochondrial ATP synthase), each ATP generated requires 
the transport of 10/3 = 3.33 protons. How many protons are 
required to form ATP if the ring has 12 c subunits? 14? 


39. Counterintuitive. Under some conditions, mitochon- 
drial ATP synthase has been observed to actually run in 
reverse. How would that situation affect the proton- 
motive force? 


40. Etiology? What does that mean? What does the fact that 
rotenone appears to increase the susceptibility to Parkinson 
disease indicate about the etiology of Parkinson disease? 


41. Exaggerating the difference. Why must ATP-ADP 
translocase (also called adenine nucleotide translocase or 
ANT) use Mg? -free forms of ATP and ADP? 


42. Respiratory control. The rate of oxygen consumption by 
mitochondria increases markedly when ADP is added and 
then returns to its initial value when the added ADP has 
been converted into ATP (Figure 18.39). Why does the rate 
decrease? 


43. Same, but different. Why is the electroneutral exchange 
of H,PO,4 for OH indistinguishable from the electro- 
neutral symport of H,PO, and Ht? 


44, Multiple uses. Give an example of the use of the proton- 
motive force in ways other than for the synthesis of ATP. 


Chapter Integration Problems 


45. Just obeying the laws. Why do isolated F, subunits of 
ATP synthase catalyze ATP hydrolysis? 


46. The right location. Some cytoplasmic kinases, enzymes 
that phosphorylate substrates at the expense of ATP, bind 
to voltage-dependent anion channels. What might the 
advantage of this binding be? 


47. No exchange. Mice that completely lack ATP-ADP trans- 
locase (ANT~ /ANT7) can be made by using the knockout 
technique. Remarkably, these mice are viable but have the 
following pathological conditions: (1) high serum levels of 


lactate, alanine, and succinate; (2) little electron transport; and 
(3) a six- to eightfold increase in the level of mitochondrial 
H,O, compared with that in normal mice. Provide a possible 
biochemical explanation for each of these conditions. 


48. Maybe you shouldn’t take your vitamins. Exercise is 
known to increase insulin sensitivity and to ameliorate type 
2 diabetes (Chapter 27). Recent research suggests that 
taking antioxidant vitamins might mitigate the beneficial 
effects of exercise with respect to ROS protection. 


(a) What are the antioxidant vitamins? 
(b) How does exercise protect against ROS? 


(c) Explain why vitamins might counteract the effects of 
exercise. 


Chapter Integration and Data Interpretation Problem 


49. Monitoring energy sources. XF technology (Seahorse 
Bioscience) now allows the measurement of the rate of aero- 
bic respiration and lactic acid fermentation simultaneously in 
real time in cultured cells. The extent of aerobic respiration is 
determined by measuring the oxygen consumption rate 
(OCR, measured in picomoles of oxygen consumed per min- 
ute) while the rate of glycolysis correlates with the extracel- 
lular acidification rate [ECAR-milli pH per minute (the 
changes in pH that occur over time)]. The graph below shows 
the results of an experiment using the new technology. 
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Dinitrophenol (DNP), the glycolysis inhibitor 2-deoxyglu- 
cose (DG), and rotenone were added sequentially to cell 
cultures. 


(a) What is the effect on OCR and ECAR of adding DNP 


to the cell culture? Explain these results. 

(b) Explain the effect of the addition of 2-deoxyglucose. 
(c) Explain how 2-deoxyglucose acts as an inhibitor of 
glycolysis? 

(d) Explain the effect of the addition of rotenone. 


Data Interpretation Problem 


50. Mitochondrial disease. A mutation in a mitochondrial 
gene encoding a component of ATP synthase has been 


ECAR (mpH min") @ 
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identified. People who have this mutation suffer from 
muscle weakness, ataxia, and retinitis pigmentosa. A tissue 
biopsy was performed on each of three patients having this 
mutation, and submitochondrial particles were isolated that 
were capable of succinate-sustained ATP synthesis. First, 
the activity of the ATP synthase was measured on the addi- 
tion of succinate and the following results were obtained. 


ATP synthase activity 
(nmol of ATP formed min~! mg~') 


Controls 3.0 

Patient 1 0.25 
Patient 2 0.11 
Patient 3 0.17 


(a) What was the purpose of the addition of succinate? 


(b) What is the effect of the mutation on succinate-coupled 
ATP synthesis? 


Next, the ATPase activity of the enzyme was measured by 
incubating the submitochondrial particles with ATP in the 
absence of succinate. 


ATP hydrolysis 
(nmol of ATP hydrolyzed min~! mg~!) 
Controls 33 
Patient 1 30 
Patient 2 25 
Patient 3 31 


(c) Why was succinate omitted from the reaction? 
(d) What is the effect of the mutation on ATP hydrolysis? 


(e) What do these results, in conjunction with those 
obtained in the first experiment, tell you about the nature of 
the mutation? 


Mechanism Problem 


51. Chiral clue. ATPyS, a slowly hydrolyzed analog of 
ATP, can be used to probe the mechanism of phosphoryl- 
transfer reactions. Chiral ATPyS has been synthesized 
containing !8O in a specific y position and ordinary !°O 
elsewhere in the molecule. The hydrolysis of this chiral 
molecule by ATP synthase in '’O-enriched water yields 
inorganic [!°O,!’O,!8O]thiophosphate having the follow- 
ing absolute configuration. In contrast, the hydrolysis of 
this chiral ATPyS by a calctum-pumping ATPase from 
muscle gives thiophosphate of the opposite configuration. 
What is the simplest interpretation of these data? 


+ Ht 
H2'70 ADP 


Re : Sw ON 7 : 


/ ie 


18¢- ‘oO 18-1 de 
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The Light Reactions of 
Photosynthesis 


Free energy 


Chloroplasts (left) convert light energy into chemical energy. High-energy 
electrons in chloroplasts are transported through two photosystems 
(right). In this transit, which culminates in the generation of reducing 
power, ATP is synthesized in a manner analogous to mitochondrial ATP 
synthesis. In contrast with mitochondrial electron transport, however, 
electrons in chloroplasts are energized by light. [(Left) John Durham/ 
Science Source.] 


-E very year, Earth is bathed in photons with a total energy content of 
approximately 107+ kJ. By comparison, a major hurricane has 10~° that 
amount of energy. The source of the energy is, of course, the electromagnetic 
radiation from the sun. On our planet, there are organisms capable of collect- 
ing a fraction, approximately 1%, of this solar energy and converting it into 
chemical energy. Green plants are the most obvious of these organisms, 
although 60% of this conversion is carried out by algae and bacteria. Although 
only a small amount of the incident energy is captured, it is enough to power 
all life on Earth. This transformation is perhaps the most important of all of 
the energy transformations that we will see in our study of biochemistry; 
without it, life as we know it on our planet simply could not exist. 

The process of converting electromagnetic radiation into chemical 
energy is called photosynthesis, which uses light energy to convert carbon 
dioxide and water into carbohydrates and oxygen. 

Light 
CO, + H,O—>(CH,0) + 0, 
In this equation, CH,O represents carbohydrate, primarily sucrose and 
starch. Photosynthetic carbohydrate synthesis is the most common meta- 
bolic pathway on Earth. These carbohydrates provide not only the energy to 
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CHAPTER 19 The Light Reactions of 
Photosynthesis 


Photosynthetic yield 
“If a year’s yield of photosynthesis were 
amassed in the form of sugar cane, it would 
form a heap over two miles high and with a 
base 43 square miles.” 

—G. E. Fogge 
If all of this sugar cane were converted into 
sugar cubes (0.5 inch, or 1.27 cm, on a side) 
and stacked end to end, the sugar cubes 
would extend 1.6 x 101° miles (2.6 x 
101° kilometers) or to the dwarf planet Pluto. 


run the biological world, but also the carbon molecules to make a wide array 
of biomolecules. Photosynthetic organisms are called autotrophs (literally, 
“self-feeders”) because they can synthesize chemical fuels such as glucose 
from carbon dioxide and water by using sunlight as an energy source and 
then recover some of this energy from the synthesized glucose through the 
glycolytic pathway and aerobic metabolism. Organisms that obtain energy 
from chemical fuels only are called heterotrophs, which ultimately depend 
on autotrophs for their fuel. 

Photosynthesis is composed of two parts: the light reactions and the dark 
reactions. In the light reactions, light energy is transformed into two forms 
of biochemical energy with which we are already familiar: reducing power 
and ATP. The products of the light reactions are then used in the dark reac- 
tions to drive the reduction of CO, and its conversion into glucose and other 
sugars. The dark reactions are also called the Calvin cycle or light-independent 
reactions and will be discussed in Chapter 20. 


Photosynthesis converts light energy into chemical energy 


The light reactions of photosynthesis closely resemble the events of oxida- 
tive phosphorylation. In Chapters 17 and 18, we learned that cellular respi- 
ration 1s the oxidation of glucose to CO, with the reduction of O, to water, 
a process that generates ATP. In photosynthesis, this process must be 
reversed—reducing CO, and oxidizing H,O to synthesize glucose. 

Photosynthesis 


Energy + 6H,O + 6CO, 


Cellular 
respiration 


C,H,,0, + 60, 


Ce6H120¢ + 6 Or 6 O, + 6 H2O + energy 

Although the processes of respiration and photosynthesis are chemically 
opposite each other, the biochemical principles governing the two processes are 
nearly identical. The key to both processes is the generation of high-energy 
electrons. The citric acid cycle oxidizes carbon fuels to CO, to generate high- 
energy electrons. The flow of these high-energy electrons down an electron- 
transport chain generates a proton-motive force. This proton-motive force 1s 
then transduced by ATP synthase to form ATP. To synthesize glucose from 
CO,, high-energy electrons are required for two purposes: (1) to provide reduc- 
ing power in the form of NADPH to reduce CO, and (2) to generate ATP to 
power the reduction. How can high-energy electrons be generated without 
using a chemical fuel? Photosynthesis uses energy from light to boost electrons from 
a low-energy state to a high-energy state. In the high-energy, unstable state, 
nearby molecules can abscond with the excited electrons. These electrons are 
used to produce reducing power, and to generate a proton-motive force across 
a membrane, which subsequently drives the synthesis of ATP. The reactions 
that are powered by sunlight are called the light reactions (Figure 19.1). 


ADP | ATP 


Light NADP* NADPH. 


FIGURE 19.1 The light 
reactions of photosynthesis. 
Light is absorbed and the 
energy is used to drive 
electrons from water to 
generate NADPH and to drive 
protons across a membrane. 
These protons return through 
ATP synthase (purple 
structure) to make ATP. 


Ht 


Photosynthesis in green plants is mediated by two kinds of light reac- 
tions. Photosystem I generates reducing power in the form of NADPH, but, 
in the process, becomes electron deficient. Photosystem II oxidizes water 
and transfers the electrons to replenish the electrons lost by photosystem I. 
A side product of these reactions 1s O,. Electron flow from photosystem II 
to photosystem I generates the transmembrane proton gradient, augmented 
by the protons released by the oxidation of water, that drives the synthesis 
of ATP. In keeping with the similarity of their principles of operation, both 
processes take place in double-membrane organelles: mitochondria for cell- 
ular respiration and chloroplasts for photosynthesis. 


19.1 Photosynthesis Takes Place in Chloroplasts 


Photosynthesis, the means of converting light into chemical energy, takes 
place in organelles called chloroplasts, typically 5 wm long. Like a mitochon- 
drion, a chloroplast has an outer membrane and an inner membrane, with 
an intervening intermembrane space (Figure 19.2). The inner membrane 
surrounds a space called the stroma, which is the site of the dark reactions of 
photosynthesis (Section 20.1). In the stroma are membranous structures 
called thylakoids, which are flattened sacs, or discs. The thylakoid sacs are 
stacked to form a granum. Different grana are linked by regions of thylakoid 
membrane called stroma lamellae (Figure 19.3). The thylakoid membranes 
separate the thylakoid space from the stroma space. Thus, chloroplasts have 
three different membranes (outer, inner, and thylakoid membranes) and three 
separate spaces (intermembrane, stroma, and thylakoid spaces). In developing 
chloroplasts, thylakoids arise from budding of the inner membrane, and so 
they are analogous to the mitochondrial cristae. Like the mitochondrial 
cristae, they are the site of coupled oxidation—reduction reactions of the 
light reactions that generate the proton-motive force. 
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FIGURE 19.2 Diagram of a chloroplast. 


The primary events of photosynthesis take place 
in thylakoid membranes 


The thylakoid membranes contain the energy-transforming machinery: 
light-harvesting proteins, reaction centers, electron-transport chains, and 
ATP synthase. These membranes contain nearly equal amounts of lipids 
and proteins. The lipid composition is highly distinctive: about 75% of the 
total lipids are galactolipids and 10% are sulfolipids, whereas only 10% are 
phospholipids. The thylakoid membrane and the inner membrane, like the 


Photosynthetic catastrophe 

If photosynthesis were to cease, all higher 
forms of life would be extinct in about 25 
years. A milder version of such a catastrophe 
ended the Cretaceous period 65.1 million 
years ago when a large asteroid struck the 
Yucatan Peninsula of Mexico. Enough dust 
was sent into the atmosphere that 
photosynthetic capacity was greatly 
diminished, which apparently led to the 
disappearance of the dinosaurs and allowed 


the mammals to rise to prominence. 


FIGURE 19.3 Electron micrograph of a 
chloroplast from a spinach leaf. The 
thylakoid membranes pack together to 
form grana. [Courtesy of Dr. Kenneth Miller.] 
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Photosynthesis 


highly permeable to small molecules and ions. The stroma contains the sol- 
uble enzymes that utilize the NADPH and ATP synthesized by the thyla- 
koids to convert CO, into sugar. Plant-leaf cells contain between 1 and 100 
chloroplasts, depending on the species, cell type, and growth conditions. 


Chloroplasts arose from an endosymbiotic event 


aa Chloroplasts contain their own DNA and the machinery for replicat- 
ing and expressing it. However, chloroplasts are not autonomous: 
they also contain many proteins encoded by nuclear DNA. How did the 
intriguing relation between the cell and its chloroplasts develop? We now 
believe that, in a manner analogous to the evolution of mitochondria 
(Section 18.1), chloroplasts are the result of endosymbiotic events in which 
a photosynthetic microorganism, most likely an ancestor of a cyanobacte- 
rium (Figure 19.4), was engulfed by a eukaryotic host. Evidence suggests 
that chloroplasts in higher plants and green algae are derived from a single 
endosymbiotic event, whereas those in red and brown algae are derived 
from at least one additional event. 

The chloroplast genome is smaller than that of a cyanobacterium, but 
the two genomes have key features in common. Both are circular and have a 
FIGURE 19.4 Cyanobacteria. A colony single start site for DNA replication, and the genes of both are arranged in 
of the photosynthetic filamentous operons—sequences of functionally related genes under common control 
Eyanebe Uni ebabre Nil at (Chapter 31). In the course of evolution, many of the genes of the chloro- 
magnification. Ancestors of these bacteria . ; 

plast ancestor were transferred to the plant cell’s nucleus or, in some cases, 


are thought to have evolved into present-day : Oe. : 
chloroplasts. [Michael Abbey/Science Source. ] lost entirely, thus establishing a fully dependent relation. 


19.2 Light Absorption by Chlorophyll Induces 
Electron Transfer 


The trapping of light energy is the key to photosynthesis. The first event is 
the absorption of light by a photoreceptor molecule. The principal photore- 
ceptor in the chloroplasts of most green plants is the pigment molecule chlo- 
vophyll a, a substituted tetrapyrrole (Figure 19.5). The four nitrogen atoms of 


FIGURE 19.5 Chlorophyll. Like 
heme, chlorophyll a is a cyclic 
tetrapyrrole. One of the pyrrole 
rings (shown in red) is reduced, 
and an additional five-carbon 
ring (Shown in blue) is fused to 
another pyrrole ring. A phytol 
chain (shown in green) is 
connected by an ester linkage. 
Magnesium ion binds at the 
center of the structure. 


the pyrroles are coordinated to a magnesium ion. Unlike a porphyrin such as 
heme, chlorophyll has a reduced pyrrole ring and an additional 5-carbon ring 
fused to one of the pyrrole rings. Another distinctive feature of chlorophyll 
is the presence of phytol, a highly hydrophobic 20-carbon alcohol, esterified 
to an acid side chain. 

Chlorophylls are very effective photoreceptors because they contain 
networks of conjugated double bonds—alternating single and double 
bonds. Such compounds are called conjugated polyenes. In polyenes, the 
electrons are not localized to a particular atomic nucleus. Upon the 
absorption of light energy, the electron transitions from a low energy 
molecular orbital to a higher energy orbital. Chlorophylls have very strong 
absorption bands in the visible region of the spectrum, where the solar 
output reaching Earth is maximal (Figure 19.6). Chlorophyll a’s peak 
molar extinction coefficient (€), a measure of a compound’s ability to 
absorb light, is greater than 10° M~! cm7!, among the highest observed 
for organic compounds. 

What happens when light is absorbed by a pigment molecule? The 
energy from the light excites an electron from its ground energy level to an 
excited energy level (Figure 19.7). This high-energy electron can have one 
of two fates. For most compounds that absorb light, the electron simply 
returns to the ground state and the absorbed energy is converted into heat. 
However, if a suitable electron acceptor is nearby, as is the case for chloro- 
phyll in photosynthetic systems, the excited electron can move from the 
initial molecule to the acceptor (Figure 19.8). A positive charge forms on the 
initial molecule, owing to the loss of an electron, and a negative charge 
forms on the acceptor, owing to the gain of an electron. Hence, this process 
is referred to as photoinduced charge separation. 

In chloroplasts, the site at which the charge separation takes place within 
each photosystem is called the reaction center. The photosynthetic apparatus 
is arranged to maximize photoinduced charge separation and minimize an 
unproductive return of the electron to its ground state. The electron, 
extracted from its initial site by the absorption of light, now has reducing 
power: it can reduce other molecules to store the energy originally obtained 
from light in chemical forms. 
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FIGURE 19.8 Photoinduced charge separation. If a suitable electron acceptor is nearby, an 
electron that has been moved to a high energy level by light absorption can move from the 
excited molecule to the acceptor. 


A special pair of chlorophylls initiate charge separation 


Photosynthetic bacteria such as Rhodopseudomonas viridis contain a photo- 
synthetic reaction center that has been revealed at atomic resolution. 
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FIGURE 19.6 Light absorption by 
chlorophyll a. Chlorophyll a absorbs visible 
light efficiently, as judged by the extinction 
coefficient near 10° M~! cm~1. 
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FIGURE 19.7 Light absorption. The 
absorption of light leads to the excitation 
of an electron from its ground state to a 
higher energy level. 
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Nonheme iron ; yy Quinone 


Bacteriopheophytin 


Bacteriochlorophyll 


Special pair 


hk FIGURE 19.9 Bacterial photosynthetic reaction center. The core of the reaction center 
from Rhodopseudomonas viridis consists of two similar chains: L (red) and M (blue). An H chain 
(white) and a cytochrome subunit (yellow) complete the structure. Notice that the L and M 
subunits are composed largely of a helices that soan the membrane. A/so notice that a chain 
of electron-carrying prosthetic groups, beginning with a special pair of bacteriochlorophylls 
and ending at a bound quinone, runs through the structure from bottom to top in this view. 
[Drawn from 1PRC.pdb.] 


The bacterial reaction center consists of four polypeptides: L (31 kDa, red), 
M (36 kDa, blue), and H (28 kDa, white) subunits and C, a c-type cyto- 
chrome with four c-type hemes (yellow) (Figure 19.9). Sequence comparisons 
: and low-resolution structural studies have revealed that the bacterial reaction 
Bacteriochlorophyll b : 
(BChI-b) center is homologous to the more complex plant systems. Thus, many of our 
observations of the bacterial system apply to plant systems as well. 

The L and M subunits form the structural and functional core of the 
bacterial photosynthetic reaction center (Figure 19.9). Each of these homol- 
ogous subunits contains five transmembrane helices, in contrast with the 
H subunit, which has just one. The H subunit lies on the cytoplasmic side 
of the cell membrane, and the cytochrome subunit lies on the exterior 
face of the cell membrane, called the periplasmic side because it faces the 
periplasm, the space between the cell membrane and the cell wall. Four 
bacteriochlorophyll b (BChI-b) molecules, two bacteriopheophytin b (BPh) 
molecules, two quinones (Q, and Q,), and a ferrous ion are associated with 
the L and M subunits. 

Bacteriochlorophylls are photoreceptors similar to chlorophylls, except 
for the reduction of an additional pyrrole ring and other minor differences 
that shift their absorption maxima to the near infrared, to wavelengths as 
long as 1000 nm. Bacteriopheophytin is the term for a bacteriochlorophy]l 
that has two protons instead of a magnesium ion at its center. 

The reaction begins with light absorption by a pair of BChI-b mole- 
cules that lie near the periplasmic side of the membrane in the L-M 


Bacteriopheophytin 5 : : 
(BPh) dimer. The pair of BChI-b molecules is called the special pair because of 


Charge 


Absorption separation 


Quinone 
pool 


its fundamental role in photosynthesis. The special pair absorbs light 
maximally at 960 nm, and, for this reason, is often called P960 (P stands 
for pigment). After absorbing light, the excited special pair ejects an elec- 
tron, which is transferred through another BChlI-b to a bacteriopheophy- 
tin (Figure 19.10, steps 1 and 2). This initial charge separation yields a 
positive charge on the special pair (P960*) and a negative charge on BPh 
(BPh7). The electron ejection and transfer take place in less than 10 pico- 
seconds (107!!s). 

A nearby electron acceptor, a tightly bound quinone (Q,), quickly grabs 
the electron away from BPh™ before the electron has a chance to fall back 
to the P960 special pair. From Q,, the electron moves to a more loosely 
associated quinone, Qx. The absorption of a second photon and the move- 
ment of a second electron from the special pair through the bacteriopheo- 
phytin to the quinones completes the two-electron reduction of Qz from 
Q to QH,. Because the Q,-binding site lies near the cytoplasmic side of 
the membrane, two protons are taken up from the cytoplasm, contributing to 
the development of a proton gradient across the cell membrane (Figure 19.10, 
steps 5, 6, and 7). 

In their high-energy states, P960* and BPh~ could undergo charge 
recombination; that is, the electron on BPh~ could move back to neutralize 
the positive charge on the special pair. Its return to the special pair would 
waste a valuable high-energy electron and simply convert the absorbed light 
energy into heat. How is charge recombination prevented? Two factors in 
the structure of the reaction center work together to suppress charge recom- 
bination nearly completely (Figure 19.10, steps 3 and 4). First, the next 
electron acceptor (Q,) is less than 10 A away from BPh_, and so the elec- 
tron is rapidly transferred farther away from the special pair. Second, one of 


Cytochrome Cy oxidized 


FIGURE 19.10 Electron chain in the 
photosynthetic bacterial reaction 
center. The absorption of light by the 
special pair (P960) results in the rapid 
transfer of an electron from this site to a 
bacteriopheophytin (BPh), creating a 
photoinduced charge separation (steps 1 
and 2). (The asterisk on P960 stands for 
excited state.) The possible return of the 
electron from the pheophytin to the 
oxidized special pair is suppressed by the 
“hole” in the special pair being refilled with 
an electron from the cytochrome subunit 
and the electron from the pheophytin being 
transferred to a quinone (Q,) that is farther 
away from the special pair (steps 3 and 4). 
Q, passes the electron to Q,. The 
reduction of a quinone (Qz) on the 
cytoplasmic side of the membrane results 
in the uptake of two protons from the 
cytoplasm (steps 5 and 6). The reduced 
quinone can move into the quinone pool 
in the membrane (step 7). 
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FIGURE 19.11 Cyclic electron flow in the 
bacterial reaction center. Excited 
electrons from the P960 reaction center 
flow through bacteriopheophytin (BPh), a 
pair of quinone molecules (Q, and Qz,), 
cytochrome bc, complex, and finally 
through cytochrome c, to the reaction 
center. The cytochrome bc, complex 
pumps protons as a result of electron flow, 
which powers the formation of ATP. 
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FIGURE 19.12 Two photosystems. The absorption of photons by 


5 Cytochrome 


the hemes of the cytochrome subunit is less than 10 A away from the special 
pair, and so the positive charge on P960 is neutralized by the transfer of an 
electron from the reduced cytochrome. 


Cyclic electron flow reduces the cytochrome of the reaction center 


The cytochrome subunit of the reaction center must regain an electron to 
complete the cycle. It does so by taking back two electrons from reduced 
quinone (QH,). QH, first enters the Q pool in the membrane where it is 
reoxidized to Q by complex bc,, which is homologous to complex LI of the 
respiratory electron-transport chain. Complex bc, transfers the electrons 
from QH, to cytochrome c,, a water-soluble protein in the periplasm, and 
in the process pumps protons into the periplasmic space. The electrons now 
on cytochrome c, flow to the cytochrome subunit of the reaction center. The 
flow of electrons is thus cyclic (Figure 19.11). The proton gradient gener- 
ated in the course of this cycle drives the synthesis of ATP through the 
action of ATP synthase. 


19.3 Two Photosystems Generate a Proton Gradient and 
NADPH in Oxygenic Photosynthesis 


Photosynthesis is more complicated in green plants than in photosynthetic 
bacteria. In green plants, photosynthesis depends on the interplay of two 
kinds of membrane-bound, light-sensitive complexes—photosystem I (PS I) 
and photosystem IT (PS II), as shown in Figure 19.12. There are similarities 
in photosynthesis between green plants and photosynthetic bacteria. Both 
require light to energize reaction centers consisting of special pairs, called 
P700 for photosystem I and P680 for photosystem II, and both transfer 
electrons by using electron-transport chains. However, in plants, electron 
flow is not cyclic but progresses from photosystem IT to 
photosystem I under most circumstances. 


NADP* NADPH Photosystem I, which responds to light with wave- 


lengths shorter than 700 nm, uses light-derived high- 
energy electrons to create biosynthetic reducing power in 
the form of NADPH, a versatile reagent for driving bio- 
synthetic processes. The electrons for creating one mol- 
ecule of NADPH are taken from two molecules of water 
by photosystem II, which responds to wavelengths 
shorter than 680 nm. A molecule of O, is generated as a 
side product of the actions of photosystem II. The elec- 
trons travel from photosystem II to photosystem I 
through cytochrome bf, a membrane-bound complex 
homologous to Complex III in oxidative phosphoryla- 
tion. Cytochrome bf generates a proton gradient across 
the thylakoid membrane that drives the formation of 


two distinct photosystems (PS | and PS Il) is required for complete ATP. Thus, the two photosystems cooperate to produce 


electron flow from water to NADP*. 
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NADPH and ATP. 


Photosystem II transfers electrons from water to plastoquinone 
and generates a proton gradient 


Photosystem II, an enormous transmembrane assembly of more than 
20 subunits, catalyzes the light-driven transfer of electrons from water to 
plastoquinone. This electron acceptor closely resembles ubiquinone, acom- 
ponent of the mitochondrial electron-transport chain. Plastoquinone cycles 


between an oxidized form (Q) anda reduced form (QH,, plastoquinol). The 
overall reaction catalyzed by photosystem II is 


Light 


2Q+2H,O—>0, + 2QH, 


The electrons in QH, are at a higher redox potential than those in H,O. 
Recall that, in oxidative phosphorylation, electrons flow from ubiquinol to 
an acceptor, O,, which is at a lower potential. Photosystem II drives the 
reaction in a thermodynamically uphill direction by using the free energy 
of light. 

This reaction is similar to one catalyzed by the bacterial system in that a 
quinone is converted from its oxidized into its reduced form. Photosystem 
II is reasonably similar to the bacterial reaction center (Figure 19.13). The 
core of the photosystem is formed by D1 and D2, a pair of similar 32-kDa 
subunits that span the thylakoid membrane. These subunits are homolo- 
gous to the L and M chains of the bacterial reaction center. Unlike the bac- 
terial system, photosystem IJ contains a large number of additional subunits 
that bind more than 30 chlorophyll molecules altogether and increase the 
efficiency with which light energy is absorbed and transferred to the reac- 
tion center (Section 19.5). 


Stroma 


complex 


Special pair 


The photochemistry of photosystem II begins with 
excitation of a special pair of chlorophyll molecules that 
are bound by the D1 and D2 subunits (Figure 19.14). 
Because the chlorophyll a molecules of the special pair 
absorb light at 680 nm, the special pair is often called 
P680. On excitation, P680 rapidly transfers an electron to 
a nearby pheophytin. From there, the electron is trans- 
ferred first to a tightly bound plastoquinone at site Q,, and 
then to a mobile plastoquinone at site Q,. This electron 
flow is entirely analogous to that in the bacterial system. 
With the arrival of a second electron and the uptake of two 
protons, the mobile plastoquinone is reduced to QH,. At 
this point, the energy of two photons has been safely and 
efficiently stored in the reducing potential of QH,. 

The major difference between the bacterial system and 
photosystem II is the source of the electrons that are used to 
neutralize the positive charge formed on the special pair. 
P680*, a very strong oxidant, extracts electrons from water 
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hk FIGURE 19.13 The structure of 
photosystem Il. The D, (red) and D, 
(blue) subunits and the numerous bound 
chlorophyll molecules (green). Notice that 


the special pair and the water-oxidizing 


complex lay toward the thylakoid-lumen 
side of the membrane. [Drawn from 
1S5L.pdb.] 


2 H,0 O, 

FIGURE 19.14 Electron flow through photosystem II. Light 
absorption induces electron transfer from P680 down an electron- 
transfer pathway to an exchangeable plastoquinone. The positive 
charge on P680 is neutralized by electron flow from water 
molecules bound at the manganese center. 
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FIGURE 19.15 The core of the water-oxidizing complex. (A) The deduced core structure 
of the water-oxidizing complex (WOC), including four manganese ions and one calcium ion, 

is shown. The valence states of the individual manganese ions are not indicated because of 
uncertainty about the charge on the individual ions. The center is oxidized, one electron at a 
time, until two H,O molecules are oxidized to form a molecule of O,, which is then released 

from the complex. (B) The absorption of photons by the reaction center generates a tyrosine 
radical (red arrows), which then extracts electrons from the manganese ions. The structures 
are designated S, to S, to indicate the number of electrons that have been removed. 


molecules bound at the water-oxidizing complex (WOC), also called the manga- 
nese center. The core of this complex includes a calcium ion, four manganese 
ions, and four water molecules (Figure 19.15A). Manganese was apparently 
evolutionarily selected for this role because of its ability to exist in multiple 
oxidation states and to form strong bonds with oxygen-containing species. In 
its reduced form, the WOC oxidizes two molecules of water to form a single 
molecule of oxygen. Each time the absorbance of a photon powers the removal 
of an electron from P680, the positively charged special pair extracts an elec- 
tron from a tyrosine residue (often designated Z) of subunit D1 of the WOC, 
forming a tyrosine radical (Figure 19.15B). The tyrosine radical then removes 
an electron from a manganese ion. This process occurs four times, with the 
result that H,O is oxidized to generate O, and H*. Four photons must be 
absorbed to extract four electrons from a water molecule (Figure 19.16). The 
four electrons harvested from water are used to reduce two molecules of Q to 
QH,. All oxygenic phototrophs, the most common type of photosynthetic 
organism, use the same inorganic core and protein components for capturing 
the energy of sunlight. A single solution to the biochemical problem of extract- 
ing electrons from water evolved billions of years ago, and has been conserved 
Evolution of oxygen is evident by the generation for use under a wide variety of phylogenetic and ecological circumstances. 

of bubbles in the aquatic plant Elodea. [Colin Photosystem II spans the thylakoid membrane such that the site of qui- 
Milkins/Oxford Scientific Films/Photolibrary.] none reduction is on the side of the stroma, whereas WOC lies in the 


FIGURE 19.16 Four photons are 
required to generate one oxygen 
molecule. When dark-adapted chloroplasts 
are exposed to a brief flash of light, one 
electron passes through photosystem Il. 
Monitoring the O, released after each flash 
reveals that four flashes are required to 
generate each O, molecule. The peaks in 
O, release are after the 3rd, 7th, and 11th 
flashes because the dark-adapted Lt} | | | | | | fd 
chloroplasts start in the S, state—that is, 123 45 6 7 8 9 10 11 12 
the one-electron reduced state. Flash number 


O, evolved per flash 


thylakoid lumen. Thus, the two protons that are taken up with the reduction 
of Q to QH, come from the stroma, and the four protons that are liberated 
in the course of water oxidation are released into the lumen. This distribu- 
tion of protons generates a proton gradient across the thylakoid membrane 
characterized by an excess of protons in the thylakoid lumen compared with 
the stroma (Figure 19.17). 


Cytochrome bf links photosystem Il to photosystem | 


Electrons flow from photosystem II to photosystem I through the cyto- 
chrome bf complex. This complex catalyzes the transfer of electrons from 
plastoquinol (QH,) to plastocyanin (Pc), a small, soluble copper protein in 
the thylakoid lumen. 


QH> + 2 Pe(Cu?*) -> Q ae Pe(Cu™) a Hihylakoid lumen 


The two protons from plastoquinol are released into the thylakoid lumen. 
This reaction is reminiscent of that catalyzed by Complex III in oxidative 
phosphorylation, and most components of the cytochrome bf complex are 
homologous to those of Complex III. The cytochrome bf complex includes 
four subunits: a 23-kDa cytochrome with two b-type hemes, a 20-kDa 
Rieske-type Fe-S protein (p. 534), a 33-kDa cytochrome f with a c-type 
cytochrome, and a 17-kDa chain. 

This complex catalyzes the reaction by proceeding through the Q cycle 
(Figure 18.12). In the first half of the Q cycle, plastoquinol (QH,) 1s oxi- 
dized to plastoquinone (Q), one electron at a time. The electrons from plas- 
toquinol flow through the Fe-S protein to convert oxidized plastocyanin 
(Pc) into its reduced form. 

In the second half of the Q cycle, cytochrome bf reduces a molecule of 
plastoquinone from the Q pool to plastoquinol, taking up two protons from 
one side of the membrane, and then reoxidizes plastoquinol to release these 
protons on the other side. The enzyme is oriented so that protons are released 
into the thylakoid lumen and taken up from the stroma, contributing further 
to the proton gradient across the thylakoid membrane (Figure 19.18). 


Photosystem I uses light energy to generate reduced 
ferredoxin, a powerful reductant 


The final stage of the light reactions is catalyzed by photosystem I, a trans- 
membrane complex consisting of about 15 polypeptide chains and multiple 
associated proteins and cofactors (Figure 19.19). The core of this system is a 
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FIGURE 19.17 Proton-gradient 
direction. Photosystem II releases protons 
into the thylakoid lumen and takes them up 
from the stroma. The result is a pH gradient 
across the thylakoid membrane with an 
excess of protons (low pH) inside. 


2 Ht 


FIGURE 19.18 Cytochrome bf contribution 
to proton gradient. The cytochrome bf 
complex oxidizes QH, to Q through the Q 
cycle. Four protons are released into the 
thylakoid lumen in each cycle. 


k FIGURE 19.19 The structure of 
photosystem |. The psaA and psaB 
subunits are shown in red and blue, 
respectively. Notice the numerous bound 
chlorophyll molecules, shown in green, 
including the special pair, as well as the iron— 
sulfur clusters that facilitate electron transfer 
from the stroma. [Drawn from 1JBO.pdb.] 
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FIGURE 19.20 Electron flow through 
photosystem | to ferredoxin. Light 
absorption induces electron transfer from 
P700 down an electron-transfer pathway 
that includes a chlorophyll molecule, a 
quinone molecule, and three 4Fe-4S 
clusters to reach ferredoxin. The positive 
charge left on P700 is neutralized 

by electron transfer from reduced 
plastocyanin. 


576 


pair of similar subunits, psaA (83 kDa, red) and psaB (82 kDa, blue), which 
bind 80 chlorophyll molecules as well as other redox factors. These subunits 
are quite a bit larger than the core subunits of photosystem II and the bacte- 
rial reaction center. Nonetheless, they appear to be homologous; the termi- 
nal 40% of each subunit is similar to a corresponding subunit of photosystem 
II. A special pair of chlorophyll a molecules lie at the center of the structure 
and absorb light maximally at 700 nm. This center, called P700, initiates 
photoinduced charge separation (Figure 19.20). The electron travels from 
P700 down a pathway through chlorophyll at site Ay and quinone at site A, 
to a set of 4Fe—45 clusters. The next step is the transfer of the electron to 
ferredoxin (Fd), a soluble protein containing a 2Fe—25 cluster coordinated 
to four cysteine residues (Figure 19.21). Ferredoxin transfers electrons to 
NADP*. Meanwhile, P700* captures an electron from reduced plastocya- 
nin provided by photosystem II to return to P700 so that P700 can be 
excited again. Thus, the overall reaction catalyzed by photosystem I is a 
simple one-electron oxidation—reduction reaction. 
Light 


Pe(Gu*) 4: Fd ——> Pe(Cut* + Fdg 


Given that the reduction potentials for plastocyanin and ferredoxin are 
+0.37 V and —0.45 V, respectively, the standard free energy for this reac- 
tion is +79.1 kJ mol~! (+18.9 kcal mol~!). This uphill reaction is driven by 
the absorption of a 700-nm photon, which has an energy of 171 kJ mol~! 
(40.9 kcal mol~!). 


hk FIGURE 19.21 Structure of 
ferredoxin. In plants, ferredoxin contains a 
2Fe-2S cluster. This protein accepts 
electrons from photosystem | and carries 
them to ferredoxin-NADP* reductase. 
[Drawn from 1FXA.pdb.] 


Ferredoxin-NADP* reductase converts NADP* into NADPH 


The reduced ferredoxin generated by photosystem I is a strong reductant 
that is used as an electron source for a variety of reactions, most notably 
the fixation of N, into NH, (Section 24.1). However, ferredoxin is not 
useful for driving many reactions, in part because ferredoxin carries only 
one available electron. In contrast, NADPH, a two-electron reductant, is 
a widely used electron donor in biosynthetic processes, including the reac- 
tions of the Calvin cycle (Chapter 20). How is reduced ferredoxin used to 
drive the reduction of NADP* to NADPH? This reaction is catalyzed 
by ferredoxin-NADP* reductase, a flavoprotein with an FAD prosthetic 
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rk FIGURE 19.22 Structure and function of ferredoxin-NADP* reductase. (A) Structure 
of ferredoxin-NADP* reductase. This enzyme accepts electrons, one at a time, from ferredoxin 
(shown in orange). (B) Ferredoxin-NADP* reductase first accepts two electrons, generated by 
photosystem |, and two protons from the lumen to form two molecules of reduced ferredoxin 
(Fd). Reduced ferredoxin is then used to form FADH,, which then transfers two electrons and 
a proton to NADP? to form NADPH in the stroma. [Drawn from 1EWY.pdb.] 
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group (Figure 19.22A). The bound FAD moiety accepts two electrons and 
two protons from two molecules of reduced ferredoxin to form FADH 
(Figure 19.22B). The enzyme then transfers a hydride ion (H7~) to 
NADP* to form NADPH. This reaction takes place on the stromal 
side of the membrane. Hence, the uptake of a proton in the reduction of 
NADP? further contributes to the generation of the proton gradient across 
the thylakoid membrane. 

The cooperation between photosystem I and photosystem II creates a 
flow of electrons from H,O to NADP*. The pathway of electron flow is 
called the Z scheme of photosynthesis because the redox diagram from P680 
to P700* looks like the letter Z (Figure 19.23). 
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FIGURE 19.23 Pathway of electron flow from H,O to NADP* in photosynthesis. 

This endergonic reaction is made possible by the absorption of light by photosystem II (P680) 
and photosystem | (P700). Abbreviations: Ph, pheophytin; Q, and Qz, plastoquinone-binding 
proteins; Pc, plastocyanin; Ay and A,, acceptors of electrons from P700*; Fd, ferredoxin; 
WOC, water-oxidizing complex. 
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FIGURE 19.24 Jagendorf’s demonstration. 
Chloroplasts synthesize ATP after the 
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19.4 A Proton Gradient across the Thylakoid 
Membrane Drives ATP Synthesis 


In 1966, André Jagendorf showed that chloroplasts synthesize ATP in 
the dark when an artificial pH gradient is imposed across the thylakoid 
membrane. To create this transient pH gradient, he soaked chloroplasts 
in a pH 4 buffer for several hours and then rapidly mixed them with a pH 
8 buffer containing ADP and P.. The pH of the stroma suddenly increased 
to 8, whereas the pH of the thylakoid space remained at 4. A burst of 
ATP synthesis then accompanied the disappearance of the pH gradient 
across the thylakoid membrane (Figure 19.24). This incisive experiment 
was one of the first to unequivocally support the hypothesis put forth by 
Peter Mitchell that ATP synthesis is driven by proton-motive force 
(Section 18.4). 

The principles of ATP synthesis in chloroplasts are nearly identical with 
those in mitochondria. ATP formation is driven by a proton-motive force in 
both photophosphorylation and oxidative phosphorylation. We have seen how 
light induces electron transfer through photosystems II and I and the cyto- 
chrome bf complex. At various stages in this process, protons are released 
into the thylakoid lumen or taken up from the stroma, generating a proton 
gradient. The gradient is maintained because the thylakoid membrane is 
essentially impermeable to protons. The thylakoid space becomes markedly 
acidic, with the pH approaching 4. The light-induced transmembrane proton 
gradient is about 3.5 pH units. As discussed in Section 18.4, energy inherent 
in the proton gradient, called the proton-motive force (Ap), is described as the 
sum of two components: a charge gradient and a chemical gradient. In chlo- 
roplasts, nearly all of Ap arises from the pH gradient, whereas in mitochon- 
dria, the contribution from the membrane potential is larger. The reason for 
this difference is that the thylakoid membrane is quite permeable to Cl~ and 
M¢g?". The light-induced transfer of H* into the thylakoid space is accom- 
panied by the transfer of either Cl~ in the same direction or Mg?* (1 Mg?t 
per 2 H*) in the opposite direction. Consequently, electrical neutrality is 
maintained and no membrane potential is generated. The influx of Mg?* 
into the stroma plays a role in the regulation of the Calvin Cycle 
(Section 20.2). A pH gradient of 3.5 units across the thylakoid membrane 
corresponds to a proton-motive force of 0.20 V or a AG of —20.0 kJ mol~! 
(—4.8 kcal mol7!). 


The ATP synthase of chloroplasts closely resembles those of 
mitochondria and prokaryotes 


The proton-motive force generated by the light reactions is converted into 
ATP by the ATP synthase of chloroplasts, also called the CF,—CF,, complex 
(C stands for chloroplast; F’ for factor). CF,-CF) ATP synthase closely 
resembles the FF) complex of mitochondria (Section 18.4). CF) conducts 
protons across the thylakoid membrane, whereas CF, catalyzes the forma- 
tion of ATP from ADP and P.. 

CF, is embedded in the thylakoid membrane. It consists of four differ- 
ent polypeptide chains known as I (17 kDa), II (16.5 kDa), III (8 kDa), and 
IV (27 kDa) having an estimated stoichiometry of 1: 2: 10-14: 1. Subunits 
I and II have sequence similarity to subunit b of the mitochondrial Fp 
subunit, III corresponds to subunit c of the mitochondrial complex, and sub- 
unit I'V is similar in sequence to subunit a. CF, the site of ATP synthesis, 
has a subunit composition a,8,y5e. The B subunits contain the catalytic 
sites, similarly to the F, subunit of mitochondrial ATP synthase. 
Remarkably, the 8 subunits of ATP synthase in corn chloroplasts are more 
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FIGURE 19.25 Comparison of photosynthesis and oxidative phosphorylation. The light- 
induced electron transfer in photosynthesis drives protons into the thylakoid lumen. The 
excess protons flow out of the lumen through ATP synthase to generate ATP in the stroma. In 
oxidative phosphorylation, electron flow down the electron-transport chain pumps protons out 
of the mitochondrial matrix. Excess protons from the intermembrane space flow into the 
matrix through ATP synthase to generate ATP in the matrix. 


than 60% identical in amino acid sequence with those of human ATP 
synthase, despite the passage of approximately 1 billion years since the 
separation of the plant and animal kingdoms. 

Note that the membrane orientation of CF ,—CF, is reversed compared 
with that of the mitochondrial ATP synthase (Figure 19.25). However, the 
functional orientation of the two synthases is identical: protons flow 
from the lumen through the enzyme to the stroma or matrix where ATP is 
synthesized. Because CF, is on the stromal surface of the thylakoid mem- 
brane, the newly synthesized ATP is released directly into the stromal 
space. Likewise, NADPH formed by photosystem I is released into the 
stromal space. Thus, ATP and NADPH, the products of the light reactions of 
photosynthesis, are appropriately positioned for the subsequent dark reactions, 
in which CO, is converted into carbohydrate. 


The activity of chloroplast ATP synthase is regulated 


The activity of the ATP synthase is sensitive to the redox conditions in the 
chloroplast. For maximal activity, a specific disulfide bond in the y subunit 
must be reduced to two cysteines. The reductant is reduced thioredoxin, 


580 which is formed from ferredoxin generated in photosystem I, by ferredoxin— 


CHAPTER 19 The Light Reactions of thioredoxin reductase, an iron-sulfur containing enzyme. 
Photosynthesis 


Ferredoxin-thioredoxin 
reductase 


2 Reduced ferredoxin + thioredoxin disulfide 
2 oxidized ferredoxin + reduced thioredoxin + 2H* 


Conformational changes in the ¢ subunit also contribute to synthase 
regulation. The ¢ subunit appears to exist in two conformations. One con- 
formation inhibits ATP hydrolysis by the synthase, while the other, which 
is generated by an increase in the proton-motive force, allows ATP synthe- 
sis and facilitates the reduction of the disulfide bond in the y subunit. Thus, 
synthase activity is maximal when biosynthetic reducing power and a pro- 
ton gradient are available. We will see in Chapter 20 that redox regulation is 
also important in photosynthetic carbon metabolism. 


Cyclic electron flow through photosystem | leads to the 
production of ATP instead of NADPH 


On occasion, when the ratio of NADPH to NADP? is very high, as might be 
the case if there was another source of electrons to form NADPH (Section 20.3), 
NADP* may be unavailable to accept electrons from reduced ferredoxin. In 
these circumstances, specific large protein complexes allow cyclic electron flow 
that powers ATP synthesis. Electrons arising from P700, the reaction center of 
photosystem I, generate reduced ferredoxin. The electron in reduced ferredoxin 
is transferred to the cytochrome bf complex rather than to NADP*. This elec- 
tron then flows back through the cytochrome bf complex to reduce plastocyanin, 
which can then be reoxidized by P700* to complete a cycle. The net outcome of 
this cyclic flow of electrons is the pumping of protons by the cytochrome bf 
complex. The resulting proton gradient then drives the synthesis of ATP. In 
this process, called cyclic photophosphorylation, ATP is generated without the 
concomitant formation of NADPH (Figure 19.26). Photosystem IT does not par- 
ticipate in cyclic photophosphorylation, and so O, is not formed from HO. 
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FIGURE 19.26 Cyclic photophosphorylation. (A) In this pathway, electrons from reduced 
ferredoxin are transferred to cytochrome bf rather than to ferredoxin-NADP* reductase. The 
flow of electrons through cytochrome bf pumps protons into the thylakoid lumen. These 
protons flow through ATP synthase to generate ATP. Neither NADPH nor O, is generated by 
this pathway. (B) A scheme showing the energetic basis for cyclic photophosphorylation. 
Abbreviations: Fd, ferredoxin; Pc, plastocyanin. 


The absorption of eight photons yields one O,, two NADPH, 581 
and three ATP molecules 19.5 Accessory Pigments 


We can now estimate the overall stoichiometry for the light reactions. The 
absorption of four photons by photosystem II generates one molecule of O, 
and releases four protons into the thylakoid lumen. The two molecules of 
plastoquinol are oxidized by the Q cycle of the cytochrome bf complex to 
release eight protons into the lumen. Finally, the electrons from four mole- 
cules of reduced plastocyanin are driven to ferredoxin by the absorption of 
four additional photons. The four molecules of reduced ferredoxin generate 
two molecules of NADPH. Thus, the overall reaction is 


2H,O + 2NADP* + 10 Hgjoma > O2 + 2NADPH + 12 Hitmen 


The 12 protons released in the lumen can then flow through ATP synthase. 
Let us assume that there are 12 subunit II] components in CF. We expect 
that 12 protons must pass through CF, to complete one full rotation of CF ,. 
A single rotation generates three molecules of ATP. Given the ratio of 
3 ATP for 12 protons, the overall reaction is 


2H,O + 2NADP* + 10 Hgioma ——> O2 + 2 NADPH + 12 Hitmen 
3 ADP3~ + 3P?27 + 3H + 12 Hibmen —> 3ATP*” + 3H2O + 12 Heb oma 
2NADP* + 3ADP?~ + 3P?” + H* —>0O, + 2NADPH + 3ATP* + H,O 


Thus, eight photons are required to yield three molecules of ATP (2.7 pho- 
tons/ATP). 

Cyclic photophosphorylation is a somewhat more productive way to 
synthesize ATP than noncyclic photophosphorylation (the Z scheme). 
The absorption of four photons by photosystem I leads to the release 
of eight protons into the lumen by the cytochrome bf system. These pro- 
tons flow through ATP synthase to yield two molecules of ATP. Thus, 
each two absorbed photons yield one molecule of ATP. No NADPH 
is produced. 


19.5 Accessory Pigments Funnel Energy into 
Reaction Centers 


A light-harvesting system that relied only on the chlorophyll a molecules 
of the special pair would be rather inefficient for two reasons. First, chlo- 
rophyll a molecules absorb light only at specific wavelengths (Figure 19.6). 
A large gap is present in the middle of the visible region between approxi- 
mately 450 and 650 nm. This gap falls right at the peak of the solar spec- 
trum, and so failure to collect this light would constitute a considerable lost 
opportunity. Second, even on a cloudless day, many photons that can be 
absorbed by chlorophyll a pass through the chloroplast without being 
absorbed, because the density of chlorophyll a molecules in a reaction cen- 
ter is not very great. Accessory pigments, both additional chlorophylls and 
other classes of molecules, are closely associated with reaction centers. 
These pigments absorb light and funnel the energy to the reaction center for 
conversion into chemical forms. Accessory pigments prevent the reaction 
center from sitting idle. 

Chlorophyll b and carotenoids are important accessory pigments that fun- 
nel energy to the reaction center. Chlorophyll b differs from chlorophyll a in OCHs 
having a formyl group in place of a methyl group. This small difference R 
shifts its two major absorption peaks toward the center of the visible region. Chlorophyll b 
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FIGURE 19.27 Absorption spectra of 
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In particular, chlorophyll b efficiently absorbs light with wavelengths 
between 450 and 500 nm (Figure 19.27). 

Carotenoids are extended polyenes that absorb light between 400 and 
500 nm. The carotenoids are responsible for most of the yellow and red 
colors of fruits and flowers, and they provide the brilliance of fall, when 
the chlorophyll molecules degrade, revealing the carotenoids. Tomatoes 
are especially rich in lycopene, which accounts for their red color, whereas 
carrots and pumpkins have abundant B-carotene, accounting for their 
orange color. 
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In addition to their role in transferring energy to reaction centers, the 
carotenoids serve a safeguarding function. Carotenoids suppress damaging 
photochemical reactions, particularly those including oxygen, that can be 
induced by bright sunlight. This protection may be especially important in 
the fall, when the primary pigment chlorophyll is being degraded and thus 
not able to absorb light energy. Plants lacking carotenoids are quickly killed 
on exposure to light and oxygen. 


Resonance energy transfer allows energy to move from the site of 
initial absorbance to the reaction center 


How is energy funneled from accessory pigments to a reaction center? The 
absorption of a photon does not always lead to electron excitation and trans- 
fer. More commonly, excitation energy is transferred from one molecule to 
a nearby molecule through electromagnetic interactions through space 
(Figure 19.28). The rate of this process, called resonance energy transfer, 
depends strongly on the distance between the energy-donor and the energy- 
acceptor molecules; an increase in the distance between the donor and the 
acceptor by a factor of 2 typically results in a decrease in the energy-transfer 
rate by a factor of 2° = 64. For reasons of conservation of energy, energy 


FIGURE 19.28 Resonance energy transfer. (1) An electron can accept energy from 
electromagnetic radiation of appropriate wavelength and jump to a higher energy state. 
(2) When the excited electron falls back to its lower energy state, the absorbed energy is 
released. (3) The released energy can be absorbed by an electron in a nearby molecule, 
and this electron jumps to a high energy state. 


transfer must be from a donor in the excited state to an accep- 
tor of equal or lower energy. The excited state of the special 
pair of chlorophyll molecules is lower in energy than that of 
single chlorophyll molecules, allowing reaction centers to trap 
the energy transferred from other molecules. 

What is the structural relationship between the acces- 
sory pigment and the reaction center? The accessory pig- 
ments are arranged in numerous light-harvesting complexes 
that completely surround the reaction center (Figure 19.29). 
The 26-kDa subunit of light-harvesting complex II 
(LHC-II) is the most abundant membrane protein in chlo- 
roplasts. This subunit binds seven chlorophyll a molecules, 
six chlorophyll b molecules, and two carotenoid molecules. 
Similar light-harvesting assemblies exist in photosynthetic 
bacteria (Figure 19.30). 


The components of photosynthesis are highly organized 


The intricacy of photosynthesis, seen already in the elaborate 
interplay of complex components, extends even to the place- 
ment of thecomponents in the thylakoid membranes. Thylakoid 
membranes of most plants are differentiated into stacked 
(appressed) and unstacked (nonappressed) regions (Figures 19.2 
and 19.3). Stacking increases the amount of thylakoid mem- 
brane in a given chloroplast volume. Both regions surround a 
common internal thylakoid space, but only unstacked regions 
make direct contact with the chloroplast stroma. Stacked and 
unstacked regions differ in the nature of their photosynthetic 
assemblies (Figure 19.31). Photosystem I and ATP synthase 
are located almost exclusively in unstacked regions, whereas 
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Photochemical reaction here converts the energy of a 
photon into a separation of charge, initiating electron flow. 


FIGURE 19.29 Energy transfer from accessory pigments to 
reaction centers. Light energy absorbed by accessory 
chlorophyll molecules or other pigments can be transferred to 
reaction centers by resonance energy transfer (yellow arrows), 
where it drives photoinduced charge separation. [Source: 

D. L. Nelson and M. M. Cox, Lehninger Principles of Biochemistry, 
5th ed. (W. H. Freeman and Company, 2008), Fig. 19.52.] 


photosystem II is present mostly in stacked regions. The cytochrome bf com- 
plex is found in both regions. Plastoquinone and plastocyanin are the mobile 
carriers of electrons between assemblies located in different regions of the 
thylakoid membrane. A common internal thylakoid space enables protons 
liberated by photosystem II in stacked membranes to be utilized by ATP 
synthase molecules that are located far away in unstacked membranes. 


hk FIGURE 19.30 Structure of a 
bacterial light-harvesting complex. 
Eight polypeptides, each of which binds 
three chlorophyll molecules (green) and 
a carotenoid molecule (red), surround a 
central cavity that contains the reaction 
center (not shown). Notice the high 
concentration of accessory pigments that 
surround the reaction center. [Drawn from 
1LGH.pdb.] 
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FIGURE 19.31 Location of photosynthesis components. Photosynthetic assemblies are 
differentially distributed in the stacked (appressed) and unstacked (nonappressed) regions of 
thylakoid membranes. [Information from Dr. Jan M. Anderson and Dr. Bertil Andersson.] 


What is the functional significance of this lateral differentiation of the 
thylakoid membrane system? The positioning of photosystem I in the 
unstacked membranes also gives it direct access to the stroma for the reduc- 
tion of NADP*. ATP synthase is also located in the unstacked region to 
provide space for its large CF, globule and to give access to ADP. In con- 
trast, the tight quarters of the appressed region pose no problem for photo- 
system II, which interacts with a small polar electron donor (H,O) and a 
highly lipid soluble electron carrier (plastoquinone). 


Many herbicides inhibit the light reactions of photosynthesis 


Many commercial herbicides kill weeds by interfering with the action of 
photosystem II or photosystem I. Inhibitors of photosystem II block elec- 
tron flow, whereas inhibitors of photosystem I divert electrons from the 
terminal part of this photosystem. Photosystem II inhibitors include urea 
derivatives such as diuron and triazine derivatives such as atrazine. These 
chemicals bind to the Qs site of the D1 subunit of photosystem IT and block 
the formation of plastoquinol (QH,). 

Paraquat (1,1’-dimethyl-4-4’-bipyridinium) is an inhibitor of photosys- 
tem J. Paraquat, a dication, can accept electrons from photosystem I to 
become a radical. This radical reacts with O, to produce reactive oxygen 
species such as superoxide (O,,”) and hydroxyl radical (OH»). Such reactive 
oxygen species react with many biomolecules, including double bonds in 
membrane lipids, damaging the membrane. 


19.6 The Ability to Convert Light into Chemical 
Energy Is Ancient 

Ne The ability to convert light energy into chemical energy is a tremen- 

dous evolutionary advantage. Geological evidence suggests that oxy- 
genic photosynthesis became important approximately 2 billion years ago. 
Anoxygenic photosynthetic systems arose much earlier in the 3.5-billion- 
year history of life on Earth (Table 19.1). The photosynthetic system of the 
nonsulfur purple bacterium Rhodopseudomonas viridis has most features 
common to oxygenic photosynthetic systems and clearly predates them. 
Green sulfur bacteria such as Chlorobium thiosulfatophilum carry out a reac- 


tion that also seems to have appeared before oxygenic photosynthesis and is 
even more similar to oxygenic photosynthesis than the photosystem of 


TABLE 19.1 Major groups of photosynthetic prokaryotes 


Bacteria Photosynthetic electron donor O, use 
Green sulfur H,, HS, i) Anoxygenic 
Green nonsulfur Variety of amino acids and organic acids Anoxygenic 
Purple sulfur H,, H,5,5 Anoxygenic 
Purple nonsulfur Usually organic molecules Anoxygenic 
Cyanobacteria H,O Oxygenic 


R. viridis. Reduced sulfur species such as H,5 are electron donors in the 
overall photosynthetic reaction 


CO, + 2H,8 —> (CH,O) + 28 + H,O 


Nonetheless, photosynthesis did not evolve immediately at the origin of 
life. No photosynthetic organisms have been discovered in the domain of 
Archaea, implying that photosynthesis evolved in the domain of Bacteria 
after Archaea and Bacteria diverged from a common ancestor. All domains 
of life do have electron-transport chains in common, however. As we have 
seen, components such as the ubiquinone-cytochrome c oxidoreductase 
and cytochrome bf family are present in both respiratory and photosyn- 
thetic electron-transport chains. These components were the foundations 
on which light-energy-capturing systems evolved. 


Artificial photosynthetic systems may provide clean, renewable energy 


As we have learned, photosynthetic organisms use sunlight to oxidize HO, 
producing O, and protons used to power ATP synthesis and generate 
NADPH. Research is currently underway to try to mimic this process in 
order to provide clean energy. Photovoltaic cells can use light energy to ox1- 
dize water, producing O, as well as H,. Hydrogen gas is a fuel that, upon 
reaction with oxygen, generates energy and only water as a waste product. 
Major difficulties in creating efficient photovoltaic cells are that the materi- 
als required are not durable and often not readily available. Recent work 
suggests that semiconductors composed of organic—inorganic material show 
great promise. One such material, perovskite (CH,NH,PbI,), is remark- 
ably efficient at capturing sunlight. It is humbling to realize that, although 
photosynthetic organisms have been oxidizing water for billions of years, 
researchers today struggle to replicate the process. 


SUMMARY 


19.1 Photosynthesis Takes Place in Chloroplasts 
The proteins that participate in the light reactions of photosynthesis 
are located in the thylakoid membranes of chloroplasts. The light reac- 
tions result in (1) the creation of reducing power for the production of 
NADPH, (2) the generation of a transmembrane proton gradient for 
the formation of ATP, and (3) the production of O,. 


19.2 Light Absorption by Chlorophyll Induces Electron Transfer 
Chlorophyll molecules absorb light quite efficiently because they are 
polyenes. An electron excited to a high-energy state by the absorption of 
a photon can move to nearby electron acceptors. In photosynthesis, an 
excited electron leaves a pair of associated chlorophyll molecules known 
as the special pair. The functional core of photosynthesis, a reaction 
center from a photosynthetic bacterium, has been studied in great detail. 
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CHAPTER 19 The Light Reactions of 
Photosynthesis 


KEY TERMS 


19.4 


19.5 


19.6 


In this system, the electron moves from the special pair (containing bac- 
teriochlorophyll) to a bacteriopheophytin (a bacteriochlorophyll lacking 
the central magnesium ion) to quinones. The reduction of quinones 
leads to the generation of a proton gradient, which drives ATP synthesis 
in a manner analogous to that of oxidative phosphorylation. 


Two Photosystems Generate a Proton Gradient and NADPH in 
Oxygenic Photosynthesis 

Photosynthesis in green plants is mediated by two linked photosystems. 
In photosystem II, the excitation of a special pair of chlorophyll molecules 
called P680 leads to electron transfer to plastoquinone in a manner analo- 
gous to that of the bacterial reaction center. The electrons are replenished 
by the extraction of electrons from a water molecule at the water-oxidizing 
complex, which contains four manganese ions. One molecule of O, is 
generated at this center for each four electrons transferred. The plasto- 
quinol produced at photosystem II is reoxidized by the cytochrome bf 
complex, which transfers the electrons to plastocyanin, a soluble copper 
protein. From plastocyanin, the electrons enter photosystem I. In photo- 
system I, the excitation of special pair P700 releases electrons that flow to 
ferredoxin, a powerful reductant. Ferredoxin-NADP* reductase, a flavo- 
protein, then catalyzes the formation of NADPH. A proton gradient is 
generated as electrons pass through photosystem II, through the cyto- 
chrome bf complex, and through ferredoxin-NADP* reductase. 


A Proton Gradient across the Thylakoid Membrane Drives ATP Synthesis 
The proton gradient across the thylakoid membrane creates a proton- 
motive force, used by ATP synthase to form ATP. The ATP synthase 
of chloroplasts (also called the CF,;—CF,, complex) closely resembles the 
ATP-synthesizing assemblies of bacteria and mitochondria. If the 
NADPH:NADP* ratio is high, electrons transferred to ferredoxin by 
photosystem I can reenter the cytochrome bf complex. This process, called 
cyclic photophosphorylation, leads to the generation of a proton gradient 
by the cytochrome bf complex without the formation of NADPH or O,. 


Accessory Pigments Funnel Energy into Reaction Centers 
Light-harvesting complexes that surround the reaction centers con- 
tain additional molecules of chlorophyll a, as well as carotenoids and 
chlorophyll b molecules, which absorb light in the center of the visible 
spectrum. These accessory pigments increase the efficiency of light 
capture by absorbing light and transferring the energy to reaction cen- 
ters through resonance energy transfer. 


The Ability to Convert Light into Chemical Energy Is Ancient 

The photosystems have structural features in common that suggest a 
common evolutionary origin. Similarities in organization and molecular 
structure to those of oxidative phosphorylation suggest that the photo- 
synthetic apparatus evolved from an early energy-transduction system. 
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Problems 


1. A crucial prereq. Human beings do not produce 
energy by photosynthesis, yet this process is critical to 
our survival. Explain. 


2. The accounting. What is the overall reaction for the light 
reactions of photosynthesis? 


3. Like a fife and drum. Match each term with its description. 


Uses resonance energy 
transfer to reach the 
reaction center 

2. Powers the formation of 


(a) Light reactions 1. 
(b) Chloroplasts 


(c) Reaction center 


(d) Chlorophyll NADPH 
(e) Light-harvesting 3. Pumps protons 
complex 4. Site of photoinduced 
(f) Photosystem I charge separation 
(g) Photosystem II 5. Cellular location of 
(h) Cytochrome bf photosynthesis 
complex 6. CF,—CF, complex 
(1) Water-oxidizing 7. Generates ATP, 
complex NADPH, and O, 
(j) ATP synthase 8. Site of oxygen generation 
9. Transfers electrons from 
H,O to P680 
10. Primary photosynthetic 
pigment 


4. A single wavelength. Photosynthesis can be measured 
by measuring the rate of oxygen production. When 
plants are exposed to light of wavelength 680 nm, more 
oxygen is evolved than if the plants are exposed to light of 
700 nm. Explain. 


5. Combining wavelengths. If plants described in problem 4 
are illuminated by a combination of light of 680 nm and 700 
nm, the oxygen production exceeds that of either wave- 
length alone. Explain. 


6. Complementary powers. Photosystem I produces a pow- 
erful reductant, whereas photosystem IT produces a power- 
ful oxidant. Identify the reductant and oxidant and describe 
their roles. 


7. If a little is good. What is the advantage of having an 
extensive set of thylakoid membranes in the chloroplasts? 


8. Cooperation. Explain how light-harvesting complexes 
enhance the efficiency of photosynthesis. 


9. One thing leads to another. What is the ultimate elec- 
tron acceptor in photosynthesis? The ultimate electron 
donor? What powers the electron flow between the donor 
and the acceptor? 


10. Neutralization compensation. In chloroplasts, a greater 
pH gradient across the thylakoid membrane is required to 


power the synthesis of ATP than is required across the 
mitochondrial inner membrane. Explain this difference. 


11. Environmentally appropriate. Chlorophyll is a hydro- 
phobic molecule. Why is this property crucial for the func- 
tion of chlorophyll? 


12. Proton origins. What are the various sources of protons 
that contribute to the generation of a proton gradient in 
chloroplasts? 


13. Efficiency matters. What fraction of the energy of 700-nm 
light absorbed by photosystem I is trapped as high- 
energy electrons? 


14. That’s not right. Explain the defect or defects in the 
hypothetical scheme for the light reactions of photosynthesis 
depicted here. 
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15. Electron transfer. Calculate the AE) and AG®’ for the 
reduction of NADP* by ferredoxin. Use data given in 
Table 18.1. 


16. To boldly go. (a) It can be argued that, if life were 
to exist elsewhere in the universe, it would require some 
process like photosynthesis. Why is this argument reason- 
able? (b) If the starship Enterprise were to land on a 
distant planet and find no measurable oxygen in the atmo- 
sphere, could the crew conclude that photosynthesis is not 
taking place? 


17. Weed killer 1. Dichlorophenyldimethylurea(DCMU), 
a herbicide, interferes with photophosphorylation and 
O, evolution. However, it does not block O, evolution 
in the presence of an artificial electron acceptor such 
as ferricyanide. Propose a site for the inhibitory action 


of DCMU. 


18. Weed killer 2. Predict the effect of the herbicide dichlo- 
rophenyldimethylurea (DCMU) on a plant’s ability to per- 
form cyclic photophosphorylation. 


19. Infrared harvest. Consider the relation between the 
energy of a photon and its wavelength. 
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(a) Some bacteria are able to harvest 1000-nm light. What 
is the energy (in kilojoules or kilocalories) of a mole (also 
called an einstein) of 1000-nm photons? 


(b) What is the maximum increase in redox potential that 
can be induced by a 1000-nm photon? 


(c) What is the minimum number of 1000-nm photons 
needed to form ATP from ADP and P.? Assume a AG of 
50 kJ mol~! (12 kcal mol~!) for the phosphorylation reaction. 


20. Missing acceptors. Suppose that a bacterial reaction 
center containing only the special pair and the quinones 
has been prepared. Given the separation of 22 A between 
the special pair and the closest quinone, estimate the rate 
of electron transfer between the excited special pair and 
this quinone. 


21. Close approach. Suppose that energy transfer between 
two chlorophyll a molecules separated by 10 A takes place 
in 10 picoseconds. Suppose that this distance is increased to 
20 A with all other factors remaining the same. How long 
would energy transfer take? 


Chapter Integration Problems 


22. Functional equivalents. What structural feature of mito- 
chondria corresponds to the thylakoid membranes? 


23. Compare and contrast. Compare and contrast oxidative 
phosphorylation and photosynthesis. 


24. Energy accounts. On page 597, the balance sheet for 
the cost of the photosynthetically powered synthesis of 
glucose is presented. Eighteen molecules of ATP are 
required. Yet, when glucose undergoes combustion in 
cellular respiration, 30 molecules of ATP are produced. 
Account for the difference. 


25. Looking for a place to rest. Albert Szent-Gyérgyi, 
Nobel Prize-winning biochemist, once said, “Life is noth- 
ing but an electron looking for a place to rest.” Explain how 
this pithy statement applies to photosynthesis and cellular 
respiration. 


Mechanism Problem 


26. Hill reaction. In 1939, Robert Hill discovered that chlo- 
roplasts evolve O, when they are illuminated in the pres- 
ence of an artificial electron acceptor such as ferricyanide 


[Fe?*(CN),]°~. Ferricyanide is reduced to ferrocyanide 
[Fe?*(CN),]*~ in this process. No NADPH or reduced 


plastocyanin is produced. Propose a mechanism for the 
Hill reaction. 


Data Interpretation and Chapter Integration Problems 


27. The same, but different. The «38,7 complex of mito- 
chondrial or chloroplast ATP synthase functions as an 
ATPase in vitro. The chloroplast enzyme (both synthase 
and ATPase activity) is sensitive to redox control, whereas 
the mitochondrial enzyme is not. To determine where the 
enzymes differ, a segment of the mitochondrial y subunit 
was removed and replaced with the equivalent segment 
from the chloroplast -y subunit. The ATPase activity of the 
modified enzyme was then measured as a function of redox 
conditions. Dithiothreitol (DTT) is a reducing agent. 


(a) What is the redox regulator of the chloroplast ATP 
synthase in vivo? The adjoining graph shows the ATPase 
activity of modified and control enzymes under various 
redox conditions. 


Modified enzyme and thioredoxin 
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[Data from O. Bald et al. J. Biol, Chem. 275:12757-12762, 2000.] 


(b) What is the effect of increasing the reducing power of 
the reaction mixture for the control and the modified 
enzymes? 

(c) What is the effect of the addition of thioredoxin? How 
do these results differ from those in the presence of DTT 
alone? Suggest a possible explanation for the difference. 

(d) Did the researchers succeed in identifying the region of 
the y subunit responsible for redox regulation? 

(e) What is the biological rationale of regulation by high 
concentrations of reducing agents? 

(f£) What amino acids in the y subunit are most likely 
affected by the reducing conditions? 

(g) What experiments might confirm your answer to 
part e? 


The Calvin Cycle and the Pentose 
Phosphate Pathway 
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Atmospheric carbon dioxide measurements at Mauna Loa, Hawaii. These 
measurements show the increase in carbon dioxide since 1960. The saw- 
tooth appearance of the curve is the result of annual cycles resulting from 
seasonal variation in CO, fixation by the Calvin cycle in terrestrial plants (see 


insert). Much of this fixation takes place in rain forests, which account for 
approximately 50% of terrestrial fixation. [(Left) stillfx/ age fotostock. (Right) Data 
from http://www.esrl.noaa.gov/gmd/ccgg/trends] 


hotosynthesis proceeds in two parts: the light reactions and the dark 

reactions. The light reactions, discussed in Chapter 19, transform light 
energy into ATP and biosynthetic reducing power, NADPH. The dark 
reactions use the ATP and NADPH produced by the light reactions to 
reduce carbon atoms from their fully oxidized state as carbon dioxide to a 
more reduced state as a hexose. Carbon dioxide is thereby trapped in a form 
that is useful for many processes and most especially as a fuel. Together, the 
light reactions and dark reactions of photosynthesis cooperate to transform light 
energy into carbon fuel. The dark reactions are called either the Calvin— 
Benson cycle, after Melvin Calvin and Andrew Benson, the biochemists who 
elucidated the pathway, or simply the Calvin cycle. The components of the 
Calvin cycle are called the dark reactions because, in contrast with the light 
reactions, these reactions do not directly depend on the presence of light. 

The second half of this chapter examines a pathway common to all 
organisms, known variously as the pentose phosphate pathway, the hexose 
monophosphate pathway, the phosphogluconate pathway, or the pentose shunt. 
The pathway provides a means by which glucose can be oxidized to generate 
NADPH, the currency of readily available reducing power in cells. The phos- 
phory]l group on the 2'-hydroxyl group of one of the ribose units of NADPH 
distinguishes NADPH from NADH. There is a fundamental distinction 
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CHAPTER 20 The Calvin Cycle and the 
Pentose Phosphate Pathway 


Hexose 
sugars 


FIGURE 20.1 Calvin cycle. 

The Calvin cycle consists of three 
stages. Stage 1 is the fixation of 
carbon by the carboxylation of 
ribulose 1,5-bisphosphate. Stage 2 is 
the reduction of the fixed carbon to 
begin the synthesis of hexose. Stage 
3 is the regeneration of the starting 
compound ribulose 1,5-bisphosphate. 


between NADPH and NADH in biochemistry: NADH is oxidized by the respi- 
ratory chain to generate ATP, whereas NADPH serves as a reductant in bio- 
synthetic processes. The pentose phosphate pathway can also be used for the 
catabolism of pentose sugars from the diet, the synthesis of pentose sugars 
for nucleotide biosynthesis, and the catabolism and synthesis of less com- 
mon four- and seven-carbon sugars. The pentose phosphate pathway and 
the Calvin cycle have in common several enzymes and intermediates that 
attest to an evolutionary kinship. Like glycolysis and gluconeogenesis, these 
pathways are mirror images of each other: the Calvin cycle uses NADPH to 
reduce carbon dioxide to generate hexoses, whereas the pentose phosphate 
pathway oxidizes glucose to carbon dioxide to generate NADPH. 


20.1 The Calvin Cycle Synthesizes Hexoses from 
Carbon Dioxide and Water 


As stated in Chapter 16, glucose can be formed from noncarbohydrate pre- 
cursors, such as lactate and amino acids, by gluconeogenesis. The energy 
powering gluconeogenesis ultimately comes from previous catabolism of 
carbon fuels. In contrast, photosynthetic organisms can use the Calvin cycle 
to synthesize glucose from carbon dioxide gas and water, by using sunlight 
as an energy source. The Calvin cycle introduces into life all the carbon 
atoms that will be used as fuel and as the carbon backbones of biomolecules. 
Photosynthetic organisms are called autotrophs (literally, self-feeders) 
because they can convert sunlight into chemical energy, which they subse- 
quently use to power their biosynthetic processes. Organisms that obtain 
energy from chemical fuels only are called heterotrophs, and such organisms 
ultimately depend on autotrophs for their fuel. 
The Calvin cycle comprises three stages (Figure 20.1): 
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1. The fixation of CO, by ribulose 1,5-bisphosphate to form two mole- 
cules of 3-phosphoglycerate; 


2. The reduction of 3-phosphoglycerate to form hexose sugars; and 


3. The regeneration of ribulose 1,5-bisphosphate so that more CO, can 
be fixed. 


This set of reactions takes place in the stroma of chloroplasts, the photo- 
synthetic organelles. 


Carbon dioxide reacts with ribulose 1,5-bisphosphate to form 
two molecules of 3-phosphoglycerate 


The first step in the Calvin cycle is the fixation of CO,. This begins with the 
conversion of ribulose 1,5-bisphosphate into a highly reactive enediolate 
intermediate. The CO, molecule condenses with the enediol intermediate 
to form an unstable six-carbon compound, which is rapidly hydrolyzed to 
two molecules of 3-phosphoglycerate. 


CH>0PO32 
CH,0P032- | CH0P032- 
C—o- 
— H+ | CO2 ae bei H,0 
C——OH 
H—C—OH 
H—C—OH | H—C—OH 
CH 0PO32 
CH20PO37— CH,0PO032- 
Ribulose Enediolate Unstable 
1,5-bisphosphate intermediate intermediate 


This highly exergonic reaction [AG” = —51.9kJ mol~! (—12.4 kcal mol7!)] 
is catalyzed by ribulose 1,5-bisphosphate carboxylase/oxygenase (usually 
called rubisco), an enzyme located on the stromal surface of the thylakoid 
membranes of chloroplasts. This important reaction is the rate-limiting step 
in hexose synthesis. In plants and green algae, rubisco consists of eight large 
(L, 55-kDa) subunits and eight small (S, 15-kDa) ones (Figure 20.2). Each 
L chain contains a catalytic site and a regulatory site. The 5 chains enhance 
the catalytic activity of the L chains. This enzyme is abundant in chloro- 
plasts, accounting for approximately 30% of the total leaf protein in some 
plants. In fact, rubisco is the most abundant enzyme and probably the most 
abundant protein in the biosphere. Large amounts are present because 
rubisco is a slow enzyme; its maximal catalytic rate is only 3 s~!. 


Small 
subunit 


k FIGURE 20.2 Structure of rubisco. The 
enzyme ribulose 1,5-bisphosphate carboxylase/ 
oxygenase (rubisco) comprises eight large 
subunits (one shown in red and the others in 
yellow) and eight small subunits (one shown 
in blue and the others in white). The active sites 
lie in the large subunits. [Drawn from 1RXO.pdb.] 
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FIGURE 20.3 Role of the magnesium 
ion in the rubisco mechanism. Ribulose 
1,5-bisphosphate binds to a magnesium 
ion that is linked to rubisco through a 
glutamate residue, an aspartate residue, 
and the lysine carbamate. The coordinated 
ribulose 1,5-bisphosphate gives up a 
proton to form a reactive enediolate 
species that reacts with CO, to form 

a new carbon-carbon bond. 


FIGURE 20.4 Formation of 
3-phosphoglycerate. The overall 
pathway for the conversion of ribulose 1,5 
bisphosphate and CO, into two molecules 
of 3-phosphoglycerate. Although the free 
species are shown, these steps take place 
on the magnesium ion. 


Rubisco activity depends on magnesium and carbamate 


Rubisco requires a bound divalent metal ion for activity, usually magnesium 
ion. Like the zinc ion in the active site of carbonic anhydrase (Section 9.2), 
this metal ion serves to activate a bound substrate molecule by stabilizing 
a negative charge. Interestingly, a CO, molecule other than the substrate 
is required to complete the assembly of the Mg?* binding site in rubisco. 
This CO, molecule adds to the uncharged e-amino group of lysine as to 
forma carbamate: This negatively charged adduct then binds the Mg”* 


Glu 


Ribulose 1,5-bisphosphate Enediolate intermediate 


CO, 


The metal center plays a key role in binding ribulose 1,5-bisphosphate 
and activating it so that it reacts with CO, (Figure 20.3). Ribulose 1,5- 
bisphosphate binds to Mg?* through its ketog group and an adjacent feaeosyl 
group. This complex is readily deprotonated to form an enediolate interme- 
diate. This reactive species, analogous to the zinc—hydroxide species in car- 
bonic anhydrase, couples with CO, forming the new carbon-carbon bond. 
The resulting 2-carboxy-3-keto-D-arabinitol 1,5-bisphosphate is coordi- 
nated to the Mg?* ion through three groups, including the newly formed 
carboxylate. A molecule of H,O is then added to this B-ketoacid to form an 
intermediate that cleaves to form two molecules of 3-phosphoglycerate 
(Figure 20.4). 
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intermediate 


p-arabinitol intermediate 


1,5-bisphosphate 


Rubisco activase is essential for rubisco activity 


In the absence of CO,, rubisco binds to its substrate, ribulose 1,5-bisphos- 
phate, so tightly that its enzyme activity is inhibited. The enzyme rubisco 
activase uses ATP to induce structural changes in rubisco that allow 
release of the bound substrate and formation of the required carbamate. 
The ATP requirement of rubisco activase coordinates rubisco activity 
with the light reactions. 

The activase contains a P-loop domain (Section 9.4) and assembles into 
a hexameric ring—like structure. The enzyme binds to the C-terminal end of 
the polypeptide chain that forms that large subunit. Using the energy of 
ATP hydrolysis, the C-terminal is transiently pulled in to the central pore 
of the activase. This releases the bound ribulose 1,5-bisphosphate and acti- 
vates rubisco. Activase is a member of the large AAA family of ATPases 
that usually assemble into ring-shaped oligomeric structures and use the 
ATP hydrolysis to power conformational changes in their substrates. AAA 
ATPases play a variety of roles in biological systems. They are crucial for 
protein degradation (Chapter 23) and DNA replication (Chapter 28), and 
also function as molecular motors (Chapter 35). 


Rubisco also catalyzes a wasteful oxygenase reaction: 
Catalytic imperfection 


The reactive enediolate intermediate generated on the Mg?* ion sometimes 
reacts with O, instead of CO,; thus, rubisco also catalyzes a deleterious 
oxygenase reaction. The products of this reaction are phosphoglycolate and 
3-phosphoglycerate (Figure 20.5). The rate of the carboxylase reaction is four 
times that of the oxygenase reaction under normal atmospheric conditions 
at 25°C; the stromal concentration of CO, is then 10 pM and that of O, 1s 
250 pM. The oxygenase reaction, like the carboxylase reaction, requires 
that lysine 201 be in the carbamate form. Because this carbamate forms only 
in the presence of CO,, rubisco 1s prevented from catalyzing the oxygenase 
reaction exclusively when CO, is absent. 

Phosphoglycolate is not a versatile metabolite. A salvage pathway recovers 
part of its carbon skeleton (Figure 20.6). A specific phosphatase converts phos- 
phoglycolate into glycolate, which enters peroxisomes (also called microbodies; 
Figure 20.7). Glycolate is then oxidized to glyoxylate by glycolate oxidase, an 
enzyme with a flavin mononucleotide prosthetic group. The HO, produced in 
this reaction is cleaved by catalase to H,O and O,,. Transamination of glyoxyl- 
ate then yields glycine, which then enters the mitochondria. Two glycine mol- 
ecules can unite to form serine with the release of CO, and ammonium ion 
(NH,*). The ammonium ion, used in the synthesis of nitrogen-containing 
compounds, is salvaged by a glutamine synthetase reaction (Figure 20.6 and 
Section 23.3). Serine reenters the peroxisome,where it donates its ammonia to 
glyoxylate and is subsequently converted back into 3-phosphoglycerate. 

This salvage pathway serves to recycle three of the four carbon atoms of 
two molecules of glycolate. However, one carbon atom is lost as CO,. This 
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FIGURE 20.5 A wasteful side 
reaction. The reactive enediolate 
intermediate on rubisco also reacts 
with molecular oxygen to form a 
hydroperoxide intermediate, which 
then proceeds to form one molecule 
of 3-phosphoglycerate and one 
molecule of phosphoglycolate. 
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CHAPTER 20 The Calvin Cycle and the 
Pentose Phosphate Pathway 


FIGURE 20.7 Electron micrograph of 
three peroxisomes. The peroxisomes are 
surrounded by rough endoplasmic reticulum. 
[Don W. Fawcett/Science Source.] 


CHLOROPLAST 
-——— Ribulose 1,5-bisphosphate 
Glutamine 
CO2 Di O2 
3-Phosphoglycerate Glutamine 7 ADP 
K Ho synthetase ATP 
ane ae 
Glutamate 
Phosphoglycolate 
H20 
- 
PEROXISOME MITOCHONDRION 
ae 
He 
Glycerate -00C~ ~o~ 
Glycolate 
O2 
H202 
H NHa4t* 
( 
Hydroxypyruvate -9QQ9C~ “oO CO 
t Glyoxylate 2 Glycine 
oxy! a Vh 2 
NH3 Serine 


FIGURE 20.6 Photorespiratory reactions. Phosphoglycolate is formed as a product of 

the oxygenase reaction in chloroplasts. After dephosphorylation, glycolate is transported 

into peroxisomes where it is converted into glyoxylate and then glycine. In mitochondria, two 
glycines are converted into serine, after losing a carbon as CO, and ammonium ion. The serine 
is converted back into 3-phosphoglycerate and the ammonium ion is salvaged in chloroplasts. 


process is called photorespiration because O, is consumed and CO, is released. 
Photorespiration is wasteful because organic carbon is converted into CO 
without the production of ATP, NADPH, or another energy-rich metabolite. 
Although evolutionary processes have presumably enhanced the preference 
of rubisco for carboxylation—for instance, the rubisco of higher plants is 
eightfold as specific for carboxylation as that of photosynthetic bacteria— 
photorespiration still accounts for the loss of up to 25% of the carbon fixed. 

Much research has been undertaken to generate recombinant forms of 
rubisco that display reduced oxygenase activity, but all such attempts have 
failed. This raises the following question: What is the biochemical basis of 
this inefficiency? Structural studies show that when the reactive enediolate 
intermediate is formed, loops close over the active site to protect the enedio- 
late. A channel to the environment is maintained to allow access by CO). 
However, like CO,, O, is a linear molecule that also fits the channel. In 
essence, the problem lies not with the enzyme but in the unremarkable struc- 
ture of CO,. CO, lacks any chemical features that would allow discrimina- 
tion between it and other gases such as O,,, and thus the oxygenase activity is 
an inevitable failing of the enzyme. Another possibility exists, however. The 
oxygenase activity may not be an imperfection of the enzyme, but rather an 
imperfection in our understanding. Perhaps the oxygenase activity performs 
a biochemically important role that we have not yet discovered. 


Hexose phosphates are made from phosphoglycerate, 
and ribulose 1,5-bisphosphate is regenerated 


The 3-phosphoglycerate product of rubisco is next converted into fructose 
6-phosphate, which readily isomerizes to glucose 1-phosphate and glucose 
6-phosphate. The mixture of the three phosphorylated hexoses is called the 


hexose monophosphate pool. The steps in this conversion (Figure 20.8) are 
like those of the gluconeogenic pathway (Figure 16.24), except that glycer- 
aldehyde 3-phosphate dehydrogenase in chloroplasts, which generates 
glyceraldehyde 3-phosphate (GAP), is specific for NADPH rather than 
NADH. These reactions and that catalyzed by rubisco bring CO, to the 
level of a hexose, converting CO, into a chemical fuel at the expense of 
NADPH and ATP generated from the light reactions. 

The third phase of the Calvin cycle is the regeneration of ribulose 
1,5-bisphosphate, the acceptor of CO, in the first step. The problem is to 
construct a five-carbon sugar from six-carbon and three-carbon sugars. A 
transketolase and an aldolase play the major role in the rearrangement of the 
carbon atoms. The transketolase, which we will see again in the pentose 
phosphate pathway, requires the coenzyme thiamine pyrophosphate (TPP) 
to transfer a two-carbon unit (CO—CH,OH) from a ketose to an aldose. 
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Aldolase, which we have already encountered in glycolysis (Section 16.1), 
catalyzes an aldol condensation between dihydroxyacetone phosphate 
(DHAP) and an aldehyde. This enzyme is highly specific for dihydroxyac- 
etone phosphate, but it accepts a wide variety of aldehydes. 
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With these enzymes, the construction of the five-carbon sugar proceeds 
as shown in Figure 20.9. 

Finally, ribose 5-phosphate is converted into ribulose 5-phosphate by 
phosphopentose isomerase, whereas xylulose 5-phosphate is converted into 
ribulose 5-phosphate by phosphopentose epimerase. Ribulose 5-phosphate is 
converted into ribulose 1,5-bisphosphate through the action of phosphoribu- 
lokinase (Figure 20.10). The sum of these reactions is 


Fructose 6-phosphate + 2 glyceraldehyde 3-phosphate 
+ dihydroxyacetone phosphate + 3 ATP —~> 
3 ribulose 1,5-bisphosphate + 3 ADP 


Figure 20.11 presents the required reactions with the proper stoichiometry 
to convert three molecules of CO, into one molecule of DHAP. However, 
two molecules of DHAP are required for the synthesis of a member of the 
hexose monophosphate pool. Consequently, the cycle as presented must 
take place twice to yield a hexose monophosphate. The outcome of the 
Calvin cycle is the generation of a hexose and the regeneration of the start- 
ing compound, ribulose 1,5-bisphosphate. In essence, ribulose 1,5-bisphos- 
phate acts catalytically, similarly to oxaloacetate in the citric acid cycle. 


Hexose monophosphate 
pool 


_—, 


¢ Glucose 1-phosphate ) 


a 


| Glucose 6-phosphate | 


| 
\ Fructose 6-phosphate ) 


— ee ee ee 


1 Fructose 1,6-bisphosphate 


1 Glyceraldehyde 1 Dihydroxyacetone 
3-phosphate ~ phosphate 


NADP* 

NADPH 
2 1,3-Bisphosphoglycerate 

ADP 


AP 


2 3-Phosphoglycerate 
FIGURE 20.8 Hexose phosphate 
formation. 3-Phosphoglycerate is 


converted into fructose 6-phosphate in a 
pathway parallel to that of gluconeogenesis. 
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FIGURE 20.10 Regeneration of ribulose 
1,5-bisphosphate. Both ribose 
5-phosphate and xylulose 5-phosphate are 
converted into ribulose 5-phosphate, which 
is then phosphorylated to complete the 
regeneration of ribulose 1,5-bisphosphate. 
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FIGURE 20.9 Formation of five-carbon sugars. First, transketolase converts a six-carbon 
sugar and a three-carbon sugar into a four-carbon sugar and a five-carbon sugar. Then, 
aldolase combines the four-carbon product and a three-carbon sugar to form a seven-carbon 
sugar. Finally, this seven-carbon sugar reacts with another three-carbon sugar to form two 


additional five-carbon sugars. 
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Three ATP and two NADPH molecules are used to bring carbon 
dioxide to the level of a hexose 


What is the energy expenditure for synthesizing a hexose? Six rounds of the 
Calvin cycle are required, because one carbon atom is reduced in each 
round. Twelve molecules of ATP are expended in phosphorylating 12 mol- 
ecules of 3-phosphoglycerate to 1,3-bisphosphoglycerate, and 12 molecules 
of NADPH are consumed in reducing 12 molecules of 1,3-bisphosphoglyc- 
erate to glyceraldehyde 3-phosphate. An additional six molecules of ATP 
are spent in regenerating ribulose 1,5-bisphosphate. 

We can now write a balanced equation for the net reaction of the Calvin 
cycle: 


6CO, + 18 ATP + 12 NADPH + 12 H,O—> 
C.H,,0, + 18 ADP + 18 P. + 12 NADP* + 6H” 


Thus, three molecules of ATP and two molecules of NADPH are con- 
sumed in incorporating a single CO, molecule into a hexose such as glucose 
or fructose. 


Starch and sucrose are the major carbohydrate stores in plants 


What are the fates of the members of the hexose monophosphate pool? 
These molecules are used in a variety of ways, but there are two primary 
roles. Plants contain two major storage forms of sugar: starch and sucrose. 
Starch, like its animal counterpart glycogen, is a polymer of glucose resi- 
dues, but it is less branched than glycogen because it contains a smaller 
proportion of a-1,6-glycosidic linkages (Section 11.2). Another difference 
is that ADP-glucose, not UDP-glucose, is the activated precursor. Starch is 
synthesized and stored in chloroplasts. 
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FIGURE 20.11 Calvin 
cycle. The diagram shows 
the reactions necessary with 
the correct stoichiometry to 
convert three molecules of 
CO, into one molecule of 
dihydroxyacetone phosphate 
(DHAP). The cycle is not as 
simple as presented in Figure 
20.1; rather, it entails many 
reactions that lead ultimately 
to the synthesis of glucose 
and the regeneration of 
ribulose 1,5-bisphosphate. 
{Information from J. R. Bowyer 
and R. C. Leegood. 
“Photosynthesis,” in Plant 
Biochemistry, P. M. Dey and 

J. B. Harborne, Eds. (Academic 
Press, 1997), p. 85.] 


Triose phosphates 
(from chloroplasts) 


FIGURE 20.12 Synthesis of sucrose. 
Sucrose 6-phosphate is formed by the 
reaction between fructose 6-phosphate 
and the activated intermediate uridine 
diphosphate glucose (UDP-glucose). 
Sucrose phosphatase subsequently 
generates free sucrose (not shown). 
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FIGURE 20.13 Light regulation of the 
Calvin cycle. The light reactions of 
photosynthesis transfer electrons out of the 
thylakoid lumen into the stroma and transfer 
protons from the stroma into the thylakoid 
lumen. As a consequence of these 
processes, the concentrations of NADPH, 
reduced ferredoxin (Fd), and Mg?* in the 
stroma are higher in the light than in the 
dark. The stromal pH is also increased (the 
concentration of Ht is lowered) in the light 
as a result of proton pumping from the 
stroma to the thylakoid lumen. Each of these 
concentration changes helps couple the 
Calvin cycle reactions to the light reactions. 
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In contrast, sucrose (common table sugar), a disaccharide, is synthesized 
in the cytoplasm. Plants lack the ability to transport hexose phosphates 
across the chloroplast membrane, but they are able to transport triose phos- 
phates from chloroplasts to the cytoplasm. Triose phosphate intermediates 
such as glyceraldehyde 3-phosphate cross into the cytoplasm in exchange for 
phosphate through the action of an abundant triose phosphate-phosphate 
antiporter. Fructose 6-phosphate formed from triose phosphates joins the 
glucose unit of UDP-glucose to form sucrose 6-phosphate (Figure 20.12). 
The hydrolysis of the phosphate ester by sucrose phosphatase yields sucrose, 
a readily transportable and mobilizable sugar that is stored in many plant 
cells, such as those in sugar beets and sugar cane. 


20.2 The Activity of the Calvin Cycle Depends on 
Environmental Conditions 


How do the light reactions communicate with the dark reactions to regulate 
this crucial process of fixing CO, into biomolecules? The principal means of 
regulation is alteration of the stromal environment by the light reactions. The 
light reactions lead to an increase in stromal pH (a decrease in the Ht 
concentration), as well as an increase in the stromal concentrations of Mg’, 
NADPH, and reduced ferredoxin—changes that contribute to the activa- 
tion of certain Calvin-cycle enzymes, located in the stroma (Figure 20.13). 


Rubisco is activated by light-driven changes in proton and 
magnesium ion concentrations 


As stated earlier, the rate-limiting step in the Calvin cycle is the carboxylation 
of ribulose 1,5-bisphosphate to form two molecules of 3-phosphoglycerate. 
The activity of rubisco increases markedly on illumination because light facili- 
tates the carbamate formation necessary for enzyme activity (p. 593). In the 
stroma, the pH increases from 7 to 8, and the level of Mg?*rises. Both 
effects are consequences of the light-driven pumping of protons into the 
thylakoid space. Mg?* ions from the thylakoid space are released into the 


TABLE 20.1 Enzymes regulated by thioredoxin 


Enzyme 


Rubisco 
Fructose 1,6-bisphosphatase 


Pathway 


Carbon fixation in the Calvin cycle 
Gluconeogenesis 

Glyceraldehyde 3-phosphate dehydrogenase Calvin cycle, gluconeogenesis, glycolysis 
Sedoheptulose 1,7-bisphosphatase Calvin cycle 
Glucose 6-phosphate dehydrogenase Pentose phosphate pathway 


Phenylalanine ammonia lyase Lignin synthesis 


Phosphoribulokinase Calvin cycle 
NADP*-malate dehydrogenase C, pathway 
CF,—-CF, ATP synthase The light reactions 


stroma to compensate for the influx of protons. Carbamate formation is 
favored at alkaline pH. CO, adds to a deprotonated form of lysine 201 of 
rubisco, and Mg?* ion binds to the carbamate to generate the active form of 


the enzyme. Thus, light leads to the generation of regulatory signals as well 
as ATP and NADPH. 


Thioredoxin plays a key role in regulating the Calvin cycle 


Light-driven reactions lead to electron transfer from water to ferredoxin 
and, eventually, to NADPH. The presence of reduced ferredoxin and 
NADPH are good signals that conditions are right for biosynthesis. One 
way in which this information is conveyed to biosynthetic enzymes is by 
thioredoxin, a 12-kDa protein containing neighboring cysteine residues 
that cycle between a reduced sulfhydryl and an oxidized disulfide form 
(Figure 20.14). The reduced form of thioredoxin activates many biosyn- 
thetic enzymes, including the chloroplast ATP synthase (p. 579), by 
reducing disulfide bridges that control their activity. Reduced thioredoxin 
also inhibits several degradative enzymes by the same means (Table 20.1). 
In chloroplasts, oxidized thioredoxin is reduced by ferredoxin in a reac- 
tion catalyzed by ferredoxin—thioredoxin reductase (p. 580). Thus, the 
activities of the light and dark reactions of photosynthesis are coordinated 
through electron transfer from reduced ferredoxin to thioredoxin and then to 
component enzymes containing regulatory disulfide bonds (Figure 20.15). We 
shall return to thioredoxin when we consider the reduction of ribonucleo- 
tides (Section 25.3). 

NADPH is a signal molecule that activates two biosynthetic enzymes, 
phosphoribulokinase and glyceraldehyde 3-phosphate dehydrogenase. In 
the dark, these enzymes are inhibited by association with an 8.5-kDa pro- 
tein called CP12, an intrinsically disordered protein (Section 2.6). NADPH 
disrupts this association by promoting the formation of two disulfide bonds 
in CP12, leading to the release of the active enzymes. 


The C, pathway of tropical plants accelerates photosynthesis 
by concentrating carbon dioxide 


The oxygenase activity of rubisco presents a biochemical challenge to 
tropical plants because the oxygenase activity increases more rapidly with 
temperature than does the carboxylase activity. How, then, do plants that 
grow in hot climates,such as sugar cane, prevent very high rates of wasteful 
photorespiration? Their solution to this problem is to achieve a high local 
concentration of CO, at the site of the Calvin cycle in their photosynthetic 
cells. The essence of this process, which was elucidated by Marshall 
Davidson Hatch and C. Roger Slack, is that four-carbon (C,) compounds 
such as oxaloacetate and malate carry CO, from mesophyll cells, which are in 


kh FIGURE 20.14 Thioredoxin. The 
oxidized form of thioredoxin contains a 
disulfide bond. When thioredoxin is 
reduced by reduced ferredoxin, the 
disulfide bond is converted into two free 
sulfhydryl groups. Reduced thioredoxin 
can cleave disulfide bonds in enzymes, 
activating certain Calvin cycle enzymes 
and inactivating some degradative 
enzymes. [Drawn from 1F9M.pdb.] 
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FIGURE 20.15 Enzyme activation by 


thioredoxin. Reduced thioredoxin activates 


certain Calvin cycle enzymes by cleaving 
regulatory disulfide bonds. 
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FIGURE 20.16 C, pathway. Carbon 
dioxide is concentrated in bundle-sheath 
cells by the expenditure of ATP in 
mesophyll cells. 


Air Mesophyll cell Bundle-sheath cell 


Oxaloacetate —==> Malate — ==> Malate 
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CO, ==> CO, | 
ai" S05" 
i ys 
Phosphoenol- x Pyruvate <===— Pyruvate 
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contact with air, to bundle-sheath cells, which are the major sites of photosyn- 
thesis (Figure 20.16). The decarboxylation of the four-carbon compound in 
a bundle-sheath cell maintains a high concentration of CO, at the site of 
the Calvin cycle. The three-carbon product returns to the mesophyll cell 
for another round of carboxylation. 

The C, pathway for the transport of CO, starts in a mesophyll cell with 
the condensation of CO, and phosphoenolpyruvate to form oxaloacetate in a 
reaction catalyzed by phosphoenolpyruvate carboxylase. In some species, oxa- 
loacetate is converted into malate by an NADP*-linked malate dehydroge- 
nase. Malate enters the bundle-sheath cell and is oxidatively decarboxylated 
within the chloroplasts by an NADP*—linked malate dehydrogenase. The 
released CO, enters the Calvin cycle in the usual way by condensing with 
ribulose 1,5-bisphosphate. Pyruvate formed in this decarboxylation reaction 
returns to the mesophyll cell. Finally, phosphoenolpyruvate is formed from 
pyruvate by pyruvate-P. dikinase. 

The net reaction of this C, pathway is 


CO, (in mesophyll cell) + ATP + 2H,O—> 
CO, (in bundle-sheath cell) + AMP + 2 P.+ 2H* 


Thus, the energetic equivalent of two ATP molecules is consumed in trans- 
porting CO, to the chloroplasts of the bundle-sheath cells. In essence, this 
process is active transport: the pumping of CO, into the bundle-sheath cell 
is driven by the hydrolysis of one molecule of ATP to one molecule of 
AMP and two molecules of orthophosphate. ‘The CO, concentration can be 
20-fold as great in the bundle-sheath cells as in the mesophyll cells. 

When the C, pathway and the Calvin cycle operate together, the net 
reaction 1s 


6CO, + 30 ATP + 12 NADPH + 24H,O —> 
C0, + 30ADP +302 +12 NADP* + 18 H* 


Note that 30 molecules of ATP are consumed per hexose molecule formed 
when the C, pathway delivers CO, to the Calvin cycle, in contrast with 18 
molecules of ATP per hexose molecule in the absence of the C, pathway. 
The high concentration of CO, in the bundle-sheath cells of C, plants, 
which is the result of the expenditure of the additional 12 molecules of ATP, 
is critical for their rapid photosynthetic rate, because CO, 1s limiting when 
light is abundant. A high CO, concentration also minimizes the energy loss 
caused by photorespiration. 

Tropical plants with a C, pathway do little photorespiration because the 
high concentration of CO, in their bundle-sheath cells accelerates the carbox- 
ylase reaction relative to the oxygenase reaction. This effect is especially 
important at higher temperatures. The geographic distribution of plants 


having this pathway (C, plants) and those lacking it (C plants) can now be 
understood in molecular terms. C, plants have the advantage in a hot envi- 
ronment and under high illumination, which accounts for their prevalence 
in the tropics. C; plants, which consume only 18 molecules of ATP per 
hexose molecule formed in the absence of photorespiration (compared with 
30 molecules of ATP for C 4 plants), are more efficient at temperatures lower 
than about 28°C, and so they predominate in temperate environments. 


av Rubisco is present in bacteria, eukaryotes, and even archaea, although 

other photosynthetic components have not been found in archaea. 
Thus, rubisco emerged early in evolution, when the atmosphere was rich in 
CO, and almost devoid of O,. The enzyme was not originally selected to 
operate in an environment like the present one, which is almost devoid of 
CO, and rich in O,,. Photorespiration became significant about 600 million 
years ago, when the CO, concentration fell to present levels. The C, path- 
way is thought to have evolved in response no more than 30 million years 
ago and possibly as recently as 7 million years ago. It is interesting that none 
of the enzymes are unique to C, plants, suggesting that this pathway made 
use of already existing enzymes. 


Crassulacean acid metabolism permits growth in arid ecosystems 


Many plants, including some that grow in hot, dry climates, keep the stomata 
of their leaves closed in the heat of the day to prevent water loss (Figure 20.17). 
As a consequence, CO, cannot be absorbed during the daylight hours, when 
it is needed for glucose synthesis. Rather, CO, enters the leaf when the sto- 
mata open at the cooler temperatures of night. To store the CO, until it can be 
used during the day, such plants make use of an adaptation called crassulacean 
acid metabolism (CAM), named after the genus Crassulacea (the succulents). 
Carbon dioxide is fixed by the C, pathway into malate, which is stored in 
vacuoles. During the day, malate is decarboxylated and the CO, becomes 
available to the Calvin cycle. In contrast with C, plants, CAM plants separate 
CO, accumulation from CO, utilization temporally rather than spatially. 

Although CAM plants do prevent water loss, the use of malate as the 
sole source of CO, comes at a metabolic cost. Because the malate storage is 
limited, CAM plants cannot generate CO, as rapidly as it can be imported 
by C, and C, plants. Consequently, the growth rate of CAM plants is 
slower than that of C, and C, plants. The saguaro cactus, which can live up 
to 200 years and reach a height of 60 feet, can take 15 years to grow to only 
1 foot in height. 


20.3 The Pentose Phosphate Pathway Generates NADPH 
and Synthesizes Five-Carbon Sugars 


Photosynthetic organisms can use the light reactions for generation of some 
NADPH. Another pathway, present in all organisms, meets the NADPH 
needs of nonphotosynthetic organisms and of the nonphotosynthetic tissues 
in plants. The pentose phosphate pathway, which occurs in the cytoplasm, is 
a crucial source of NADPH for use in reductive biosynthesis (Table 20.2) as 
well as for protection against oxidative stress. This pathway consists of two 
phases: (1) the oxidative generation of NADPH and (2) the nonoxidative 
interconversion of sugars (Figure 20.18). In the oxidative phase, NADPH is 
generated when glucose 6-phosphate is oxidized to ribulose 5-phosphate. 


Glucose 6-phosphate + 2 NADP* + H,QO —> 
ribulose 5-phosphate + 2 NADPH + 2H*+ CO, 
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FIGURE 20.17 Scanning electron 
micrograph of an open stoma and a 
closed stoma. [Power and Syred/Science 
Photo Library] 


TABLE 20.2 Pathways requiring NADPH 


Synthesis 

Fatty acid biosynthesis 
Cholesterol biosynthesis 
Neurotransmitter biosynthesis 
Nucleotide biosynthesis 


Detoxification 
Reduction of oxidized glutathione 
Cytochrome P450 monooxygenases 
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FIGURE 20.18 Pentose phosphate pathway. The 
pathway consists of (1) an oxidative phase that generates 
NADPH and (2) a nonoxidative phase that interconverts 
phosphorylated sugars. 


In the nonoxidative phase, the pathway catalyzes 
the interconversion of three-, four-, five-, six-, and 
seven-carbon sugars in a series of nonoxidative reac- 
tions. Excess five-carbon sugars may be converted 
into intermediates of the glycolytic pathway. All 
these reactions take place in the cytoplasm. These 
interconversions rely on the same reactions that lead 
to the regeneration of ribulose 1,5-bisphosphate in 
the Calvin cycle. 


Two molecules of NADPH are generated in 
the conversion of glucose 6-phosphate into 
ribulose 5-phosphate 


The oxidative phase of the pentose phosphate path- 
way starts with the dehydrogenation of glucose 
6-phosphate at carbon 1, a reaction catalyzed by glu- 
cose 6-phosphate dehydrogenase (Figure 20.19). This 
enzyme is highly specific for NADP*; the K,, for 
NAD* is about a thousand times as great as that for 
NADP? . The product is 6-phosphoglucono-8-lactone, 
which is an intramolecular ester between the C-1 car- 
boxyl group and the C-5 hydroxyl group. The next 
step is the hydrolysis of 6-phosphoglucono-6-lactone 
by a specific lactonase to give 6-phosphogluconate. 
This six-carbon sugar acid is then oxidatively decar- 
boxylated by 6-phosphogluconate dehydrogenase to 
yield ribulose 5-phosphate. NADP *is again the elec- 
tron acceptor. 


The pentose phosphate pathway and glycolysis 
are linked by transketolase and transaldolase 


The preceding reactions yield two molecules of 
NADPH and one molecule of ribulose 5-phosphate 
for each molecule of glucose 6-phosphate oxidized. 
The ribulose 5-phosphate is subsequently isomer- 
ized to ribose 5-phosphate by phosphopentose 
isomerase. 
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FIGURE 20.19 Oxidative phase of the pentose phosphate pathway. Glucose 6-phosphate 
is oxidized to 6-phosphoglucono-8-lactone to generate one molecule of NADPH. The lactone 
product is hydrolyzed to 6-phosphogluconate, which is oxidatively decarboxylated to ribulose 
5-phosphate with the generation of a second molecule of NADPH. 


Ribose 5-phosphate and its derivatives are components of RNA and DNA, as 
wellas of ATP, NADH, FAD, and coenzyme A. Although ribose 5-phosphate 
is a precursor to many biomolecules, many cells need NADPH for reductive 
biosyntheses much more than they need ribose 5-phosphate for incorporation 
into nucleotides. For instance, adipose tissue, the liver, and mammary glands 
require large amounts of NADPH for fatty acid synthesis (Chapter 22). In 
these cases, ribose 5-phosphate is converted into the glycolytic intermediates 
glyceraldehyde 3-phosphate and fructose 6-phosphate by transketolase and 
transaldolase. These enzymes create a reversible link between the pentose phos- 
phate pathway and glycolysis by catalyzing these three successive reactions. 


Transketolase 


C,+C, C,+G, 


Transaldolase 
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C,+C, 


Transketolase 
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The net result of these reactions is the formation of two hexoses and one 
triose from three pentoses: 


ces 


26 2, 


The first of the three reactions linking the pentose phosphate pathway 
and glycolysis is the formation of glyceraldehyde 3-phosphate and sedoheptu- 
lose 7-phosphate from two pentoses. 
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The donor of the two-carbon unit in this reaction is xylulose 5-phosphate, 
an epimer of ribulose 5-phosphate. A ketose is a substrate of transketolase 
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only if its hydroxyl group at C-3 has the configuration of xylulose rather 
than ribulose. Ribulose 5-phosphate is converted into the appropriate epi- 
mer for the transketolase reaction by phosphopentose epimerase in the reverse 
reaction of that which takes place in the Calvin cycle. 


CH20H CH2OH 
c=o0 _ 

oa io ay a 

H—C—OH ne pokee 
CH,0P032- bores 


Ribulose 5-phosphate Xylulose 5-phosphate 


Glyceraldehyde 3-phosphate and sedoheptulose 7-phosphate generated 
by the transketolase then react to form fructose 6-phosphate and erythrose 
4-phosphate. 
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This synthesis of a four-carbon sugar and a six-carbon sugar is catalyzed 
by transaldolase. 

In the third reaction, transketolase catalyzes the synthesis of fructose 
6-phosphate and glyceraldehyde 3-phosphate from erythrose 4-phosphate and 
xylulose 5-phosphate. 
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The sum of these reactions is 


2 Xylulose 5-phosphate + ribose 5-phosphate == 

2 fructose 6-phosphate + glyceraldehyde 3-phosphate 
Xylulose 5-phosphate can be formed from ribose 5-phosphate by the 
sequential action of phosphopentose isomerase and phosphopentose epim- 
erase, and so the net reaction starting from ribose 5-phosphate is 
3 Ribose 5-phosphate === 

2 fructose 6-phosphate + glyceraldehyde 3-phosphate 


Thus, excess ribose 5-phosphate formed by the pentose phosphate pathway can 
be completely converted into glycolytic intermediates. Moreover, any ribose 


TABLE 20.3 Pentose phosphate pathway 


Reaction 


Enzyme 


Oxidative phase 

Glucose 6-phosphate + NADP* —> 6-phosphoglucono-8-lactone + NADPH + Ht 
6-Phosphoglucono-8-lactone + H,O0 —> 6-phosphogluconate + Ht 
6-Phosphogluconate + NADP* —> ribulose 5-phosphate + CO, + NADPH + H* 


Nonoxidative Phase 
Ribulose 5-phosphate == ribose 5-phosphate 
Ribulose 5-phosphate == xylulose 5-phosphate 


Xylulose 5-phosphate + ribose 5-phosphate == 
sedoheptulose 7-phosphate + glyceraldehyde 3-phosphate 


Sedoheptulose 7-phosphate + glyceraldehyde 3-phosphate == 
fructose 6-phosphate + erythrose 4-phosphate 


Xylulose 5-phosphate + erythrose 4-phosphate == 
fructose 6-phosphate + glyceraldehyde 3-phosphate 


ingested in the diet can be processed into glycolytic intermediates by this 
pathway. It is evident that the carbon skeletons of sugars can be extensively 
rearranged to meet physiological needs (‘Table 20.3). 


Mechanism: Transketolase and transaldolase stabilize carbanionic 
intermediates by different mechanisms 


The reactions catalyzed by transketolase and transaldolase are distinct, yet simi- 
lar in many ways. One difference is that transketolase transfers a two-carbon 
unit, whereas transaldolase transfers a three-carbon unit. Each of these units is 
transiently attached to the enzyme in the course of the reaction and thus these 
enzymes are examples of double displacement reactions (Section 8.4). 


Transketolase reaction. Transketolase contains a tightly bound thiamine 
pyrophosphate as its prosthetic group. The enzyme transfers a two-carbon 
glycoaldehyde from a ketose donor to an aldose acceptor. The site of the 
addition of the two-carbon unit is the thiazole ring of TPP. Transketolase is 
homologous to the E, subunit of the pyruvate dehydrogenase complex 
(Section 17.1) and the reaction mechanism is similar (Figure 20.20). 
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FIGURE 20.20 Transketolase 


mechanism. (1) Thiamine pyrophosphate 
(TPP) ionizes to form a carbanion. (2) The 


carbanion of TPP attacks the ketose 


substrate. (8) Cleavage of a carbon-carbon 
bond frees the aldose product and leaves a 
two-carbon fragment joined to TPP. (4) This 


activated glycoaldehyde intermediate 
attacks the aldose substrate to form a 


new carbon-carbon bond. (5) The ketose 


product is released, freeing the TPP for 
the next reaction cycle. 
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FIGURE 20.21 Transaldolase mechanism. 
1) The reaction begins with the formation 
of a Schiff base between a lysine residue in 
transaldolase and the ketose substrate. 

2) Protonation of the Schiff base occurs. 

3) Deprotonation leads to the release of 
the aldose product, leaving a three-carbon 
fragment attached to the lysine residue. 

4) This intermediate adds to the aldose 
substrate. (6) Protonation occurs, forming a 
new carbon-carbon bond. (6) Subsequent 
deprotonation and (7) hydrolysis of the 
Schiff base releases the ketose product 
from the lysine side chain, completing the 
reaction cycle. 


The reaction proceeds as follows: 


1. The C-2 carbon atom of bound TPP readily ionizes to give a carbanion. 


2. The negatively charged carbon atom of this reactive intermediate attacks 
the carbonyl group of the ketose substrate. 


3. The resulting addition compound releases the aldose product to yield an 
activated glycoaldehyde unit. 


4. The positively charged nitrogen atom in the thiazole ring acts as an elec- 
tron sink in the development of this activated intermediate. The carbonyl 
group of a suitable aldose acceptor then condenses with the activated glyco- 
aldehyde unit to form a new ketose. 


5. The ketose is then released from the enzyme. 


Transaldolase reaction. Transaldolase transfers a three-carbon dihy- 
droxyacetone unit from a ketose donor to an aldose acceptor. Transaldolase, 
in contrast to transketolase, does not contain a prosthetic group. Rather, a 
Schiff base is formed between the carbonyl group of the ketose substrate and the 
€-amino group of a lysine residue at the active site of the enzyme (Figure 20.21). 
This kind of covalent enzyme-substrate intermediate is like that formed 
in fructose 1,6-bisphosphate aldolase in the glycolytic pathway (Section 16.1) 
and, indeed, the enzymes are homologous. (1) The reaction begins with the 
formation of the Schiff base. (2) The Schiff base becomes protonated and the 
bond between C-3 and C-4 is split. (3) Upon reprotonation, the aldose 
product is released leaving a three-carbon fragment bound to the enzyme. 
The negative charge on the Schiff-base carbanion moiety is stabilized by 
resonance (Figure 20.22). The positively charged nitrogen atom of the 
protonated Schiff base acts as an electron sink. (4) The Schiff-base 
adduct is stable until a suitable aldose becomes bound. (5) The dihy- 
droxyacetone moiety then reacts with the carbonyl] group of the aldose. 
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FIGURE 20.22 Carbanion intermediates. For transketolase and transaldolase, a carbanion 
intermediate is stabilized by resonance. In transketolase, TPP stabilizes this intermediate; 
in transaldolase, a protonated Schiff base plays this role. 


Protonation allows the formation of a new carbon-carbon bond. 
(6) Subsequent deprotonation followed by (7) hydrolysis of the Schiff 
base releases the ketose product. The nitrogen atom of the protonated 
Schiff base plays the same role in transaldolase as the thiazole-ring nitro- 
gen atom does in transketolase. In each enzyme, a group within an inter- 
mediate reacts like a carbanion in attacking a carbonyl group to form a 
new carbon-carbon bond. In each case, the charge on the carbanion is 
stabilized by resonance (Figure 20.22). 


20.4 The Metabolism of Glucose 6-Phosphate by 
the Pentose Phosphate Pathway Is Coordinated 
with Glycolysis 


Glucose 6-phosphate is metabolized by both the glycolytic pathway 
(Chapter 16) and the pentose phosphate pathway. How is the processing of 
this important metabolite partitioned between these two metabolic routes? 
The cytoplasmic concentration of NADP* plays a key role in determining 
the fate of glucose 6-phosphate. 


The rate of the pentose phosphate pathway is 
controlled by the level of NADP* 


The first reaction in the oxidative branch of the pentose phosphate path- 
way, the dehydrogenation of glucose 6-phosphate, is essentially irrevers- 
ible. In fact, this reaction is rate limiting under physiological conditions 
and serves as the control site for the oxidative branch of the pathway. 
The most important regulatory factor is the level of NADP*. Low levels 
of NADP* reduce the dehydrogenation of glucose 6-phosphate because 
it is needed as the electron acceptor. The effect of low levels of NADP* 
is intensified by the fact that NADPH competes with NADP* in bind- 
ing to the enzyme. The ratio of NADP* to NADPH in the cytoplasm of 
a liver cell from a well-fed rat is about 0.014. The marked effect of 
the NADP* level on the rate of the oxidative phase ensures that 
NADPH is not generated unless the supply needed for reductive biosyn- 
theses or protection against oxidative stress is low. The nonoxidative 
phase of the pentose phosphate pathway is controlled primarily by the 
availability of substrates. 
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20.4 Coordination of the Pentose 
Phosphate Pathway 
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The flow of glucose 6-phosphate depends on the need for NADPH, 
ribose 5-phosphate, and ATP 


We can grasp the intricate interplay between glycolysis and the pentose 
phosphate pathway by examining the metabolism of glucose 6-phosphate in 
four different metabolic situations (Figure 20.23). 


Mode 1. Much move ribose 5-phosphate than NADPH is required. For exam- 
ple, rapidly dividing cells need ribose 5-phosphate for the synthesis of 
nucleotide precursors of DNA. Most of the glucose 6-phosphate is converted 
into fructose 6-phosphate and glyceraldehyde 3-phosphate by the glycolytic 
pathway. Transaldolase and transketolase then convert two molecules of 
fructose 6-phosphate and one molecule of glyceraldehyde 3-phosphate into 
three molecules of ribose 5-phosphate by a reversal of the reactions described 
earlier. The stoichiometry of mode 1 is 


5 Glucose 6-phosphate + ATP —> 6 ribose 5-phosphate + ADP + 2H* 


Mode2. The needs for NADPH and for ribose 5-phosphate are balanced. Under 

these conditions, glucose 6-phosphate is processed to one molecule of ribu- 

lose 5-phosphate while generating two molecules of NADPH. Ribulose 
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TABLE 20.4 Tissues with active pentose phosphate pathways 


Tissue Function 

Adrenal gland Steroid synthesis 

Liver Fatty acid and cholesterol synthesis 
Testes Steroid synthesis 

Adipose tissue Fatty acid synthesis 

Ovary Steroid synthesis 

Mammary gland Fatty acid synthesis 

Red blood cells Maintenance of reduced glutathione 


5-phosphate is then converted into ribose 5-phosphate. The stoichiometry 
of mode 2 is 


Glucose 6-phosphate + 2 NADP* + H,O—> 
ribose 5-phosphate + 2 NADPH + 2 H* + CO, 


Mode 3. Much more NADPH than ribose 5-phosphate is required. For exam- 
ple, adipose tissue requires a high level of NADPH for the synthesis of fatty 
acids (‘Table 20.4). In this case, glucose 6-phosphate is completely oxidized to 
CO,,. Three groups of reactions are active in this situation. First, the oxidative 
phase of the pentose phosphate pathway forms two molecules of NADPH 
and one molecule of ribulose 5-phosphate. Then, ribulose 5-phosphate is 
converted into fructose 6-phosphate and glyceraldehyde 3-phosphate by 
transketolase and transaldolase. Finally, glucose 6-phosphate is resynthesized 
from fructose 6-phosphate and glyceraldehyde 3-phosphate by the gluconeo- 
genic pathway. The stoichiometries of these three sets of reactions are 


6 Glucose 6-phosphate + 12 NADP* + 6 H,O —> 
6 Ribose 5-phosphate + 12 NADPH + 12 H* + 6CO, 
6 Ribose 5-phosphate ——> 
4 Fructose 6-phosphate + 2 glyceraldehyde 3-phosphate 
4 Fructose 6-phosphate + 2 glyceraldehyde 3-phosphate + H,Q ——> 
5-Glucose 6-phosphate + P; 
The sum of the mode 3 reactions is 


Glucose 6-phosphate + 12 NADP* + 7H,O—~> 
6 CO, + 12 NADPH + 12 H* + P, 


Thus, the equivalent of glucose 6-phosphate can be completely oxidized to CO, 
with the concomitant generation of NADPH. In essence, ribose 5-phosphate 
produced by the pentose phosphate pathway is recycled into glucose 
6-phosphate by transketolase, transaldolase, and some of the enzymes of the 
gluconeogenic pathway. 


Mode 4. Both NADPH and ATP are required. Alternatively, ribulose 
5-phosphate formed from glucose 6-phosphate can be converted into pyru- 
vate. Fructose 6-phosphate and glyceraldehyde 3-phosphate derived from 
ribose 5-phosphate enter the glycolytic pathway rather than reverting to 
glucose 6-phosphate. In this mode, ATP and NADPH are concomitantly gen- 
erated, and five of the six carbons of glucose 6-phosphate emerge in pyruvate. 


3 Glucose 6-phosphate + 6 NADP* + 5 NAD* + 5 P. + 8 ADP —> 
5 pyruvate + 3CO, + 6 NADPH + 5 NADH 
+ 8 ATP + 20,0 +8 H* 


Pyruvate formed by these reactions can be oxidized to generate more ATP 
or it can be used as a building block in a variety of biosyntheses. 
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20.4 Coordination of the Pentose 
Phosphate Pathway 
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y-Glutamate 


Cysteine 


Glycine 


Glutathione (reduced) 
(y-Glutamylcysteinylglycine) 


The pentose phosphate pathway is required for rapid cell growth 


Rapidly dividing cells, such as cancer cells, require ribose 5-phosphate for 
nucleic acid synthesis and NADPH for fatty acid synthesis, which in turn is 
required to form membrane lipids (Section 26.1). Recall that rapidly dividing 
cells switch to aerobic glycolysis to meet their ATP needs (Section 16.2). 
Glucose 6-phosphate and glycolytic intermediates are then used to generate 
NADPH and ribose 5-phosphate using the pentose phosphate pathway as 
described in Modes 1 and 3. The diversion of glycolytic intermediates into the 
nonoxidative phase is facilitated by the expression of the gene for a pyruvate 
kinase isozyme, PKM. PKM has a low catalytic activity, creating a bottle- 
neck in the glycolytic pathway. Glycolytic intermediates accumulate and are 
then used by the pentose phosphate pathway to synthesize NADPH and ribose 
5-phosphate. The shunting of phosphorylated intermediates into the nonoxida- 
tive phase of the pentose phosphate pathway is further enabled by the inhibition 
of triose phosphate isomerase by phosphoenolpyruvate, the substrate of PKM. 


Through the looking-glass: The Calvin cycle and the pentose 
phosphate pathway are mirror images 


aa The complexities of the Calvin cycle and the pentose phosphate path- 

way are easier to comprehend if we consider them as functional mirror 
images of each other. The Calvin cycle begins with the fixation of CO, and 
proceeds to use NADPH in the synthesis of glucose. The pentose phosphate 
pathway begins with the oxidation of a glucose-derived carbon atom to CO, 
and concomitantly generates NADPH. The regeneration phase of the Calvin 
cycle converts C, and C, molecules back into the starting material—the C, 
molecule ribulose 1,5-bisphosphate. The pentose phosphate pathway con- 
verts a C, molecule, ribulose 5-phosphate, into C, and C; intermediates of 
the glycolytic pathway. Not surprisingly, in photosynthetic organisms, many 
enzymes are common to the two pathways. We see the economy of evolution: 
the use of identical enzymes for similar reactions with different ends. 


20.5 Glucose 6-Phosphate Dehydrogenase Plays a Key 
Role in Protection Against Reactive Oxygen Species 


The NADPH generated by the pentose phosphate pathway plays a vital role in 
protecting the cells from reactive oxygen species (ROS). Reactive oxygen spe- 
cies generated in oxidative metabolism inflict damage on all classes of macro- 
molecules and can ultimately lead to cell death. Indeed, ROS are implicated in 
a number of human diseases (Table 18.3). Reduced glutathione (GSH), a tripep- 
tide with a free sulfhydryl group, combats oxidative stress by reducing ROS to 
harmless forms. Its task accomplished, the glutathione is now in the oxidized 
form (GSSG) and must be reduced to regenerate GSH. The reducing power is 
supplied by the NADPH generated by glucose 6-phosphate dehydrogenase in 
the pentose phosphate pathway. Indeed, cells with reduced levels of glucose 
6-phosphate dehydrogenase are especially sensitive to oxidative stress. 


Glucose 6-phosphate dehydrogenase deficiency causes a drug- 
induced hemolytic anemia 


e The importance of the pentose phosphate pathway is highlighted by 
some people’s anomalous responses to certain drugs. For instance, 
pamaquine, the first synthetic antimalarial drug introduced in 1926, was 
associated with the appearance of severe and mysterious ailments. Most 
patients tolerated the drug well, but a few developed severe symptoms 
within a few days after therapy was started. Their urine turned black, 


jaundice developed, and the hemoglobin content of 
the blood dropped sharply. In some cases, massive 
destruction of red blood cells caused death. 

This drug-induced hemolytic anemia was shown 
30 years later to be caused by a deficiency of glucose 
6-phosphate dehydrogenase, the enzyme catalyzing 
the first step in the oxidative branch of the pentose 
phosphate pathway. The result is a dearth of NADPH 
in all cells, but this deficiency is most acute in red 
blood cell because they lack mitochondria and have 
no alternative means of generating reducing 
power. This defect, which is inherited on the X chro- 
mosome, is the most common disease that results 
from an enzyme malfunction, affecting hundreds of 
millions of people. 

Pamaquine sensitivity is not simply a historical 
oddity about malaria treatment many decades ago. 
Primaquine, an antimalarial closely related to pama- 
quine, is widely used in malaria-infested regions of 
the world. Vicine, a purine glycoside of fava beans 


Vicine 


Vicia faba. The Mediterranean plant Vicia faba is a source of fava 


(which are consumed in countries surrounding the beans that contain the purine glycoside vicine. [FLPA/RichardBecker/ 


Mediterranean), also induces hemolysis. People defi- _age footstock|] 

cient in glucose 6-phosphate dehydrogenase suffer 

hemolysis from eating fava beans or inhaling the pollen of the fava flowers, 
a response called favism. How can we explain hemolysis caused by pama- 
quine, primaquine, and vicine biochemically? These chemicals are oxida- 
tive agents that generate peroxides, reactive oxygen species that can damage 
membranes as well as other biomolecules. Peroxides are usually eliminated 
by the enzyme glutathione peroxidase, which uses reduced glutathione as a 
reducing agent. 


Glutathione peroxidase 


2 GSH + ROOH GSSG + H,O + ROH 


The major role of NADPH in red cells is to reduce the disulfide form of 
glutathione to the sulfhydryl form. The enzyme that catalyzes the regenera- 
tion of reduced glutathione is glutathione reductase. 


y-Glu——Cys — Gly 
| Glutathione 


S reductase 2 y-Glu Cys — Gly 
| + NADPH + Ht —————= | + NADPt 
Ss SH 
y-Glu——Cys — Gly 
Oxidized glutathione (GSSG) Reduced glutathione (GSH) 


Red blood cells with a lowered level of reduced glutathione are more 
susceptible to hemolysis. In the absence of glucose 6-phosphate dehydroge- 
nase, peroxides continue to damage membranes because no NADPH is 
being produced to restore reduced glutathione.Thus, the answer to our 
question is that glucose 6-phosphate dehydrogenase is required to maintain 
reduced glutathione levels to protect against oxidative stress. In the absence of 
oxidative stress, however, the deficiency is quite benign. The sensitivity to 
oxidative agents of people having this dehydrogenase deficiency also clearly 
demonstrates that atypical reactions to drugs may have a genetic basis. 

Reduced glutathione is also essential for maintaining the normal 
structure of red blood cells by maintaining the structure of hemoglobin. 
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FIGURE 20.24 Red blood cells with 
Heinz bodies. The light micrograph shows 
red blood cells obtained from a person 
deficient in glucose 6-phosphate 
dehydrogenase. The dark particles, called 
Heinz bodies, inside the cells are clumps of 
denatured hemoglobin that adhere to the 
plasma membrane and stain with basic 
dyes. Red blood cells in such people are 
highly susceptible to oxidative 

damage. [Courtesy of Dr. Stanley Schrier.] 


The reduced form of glutathione serves as a sulfhydryl buffer that keeps 
the residues of hemoglobin in the reduced sulfhydryl form. Without 
adequate levels of reduced glutathione, the hemoglobin sulfhydryl 
groups can no longer be maintained in the reduced form. Hemoglobin 
molecules then cross-link with one another to form aggregates called 
Heinz bodies on cell membranes (Figure 20.24). Membranes damaged by 
Heinz bodies and reactive oxygen species become deformed, and the cell 
is likely to undergo lysis. 


A deficiency of glucose 6-phosphate dehydrogenase confers an 
evolutionary advantage in some circumstances 


e The incidence of the most common form of glucose 6-phosphate 
dehydrogenase deficiency, characterized by a 10-fold reduction 
in enzymatic activity in red blood cells, is 11% among Americans of 
African heritage. This high frequency suggests that the deficiency may 
be advantageous under certain environmental conditions. Indeed, glucose 
6-phosphate dehydrogenase deficiency protects against falciparum malaria. 
The parasites causing this disease require NADPH for growth. Moreover, 
infection by the parasites induces oxidative stress in the infected cell. 
Because the pentose phosphate pathway is compromised, the cell and 
parasite die from oxidative damage. Thus, glucose 6-phosphate dehydro- 
genase deficiency is a mechanism of protection against malaria, which 
accounts for its high frequency in malaria-infested regions of the world. 
We see here once again the interplay of heredity and environment in the 
production of disease. 

The ability of glucose 6-phosphate dehydrogenase deficiency to protect 
against malaria does, however, create a public health conundrum. Primaquine 
is a commonly used and highly effective antimalarial drug. However, indis- 
criminate use of primaquine causes hemolysis in individuals deficient in glu- 
cose 6-phosphate dehydrogenase. A solution to this problem may be in the 
offing, as recent work shows that an anti-malaria vaccine may be within reach. 


SUMMARY 


20.1 The Calvin Cycle Synthesizes Hexoses from Carbon Dioxide and Water 
ATP and NADPH formed in the light reactions of photosynthesis are 
used to convert CO, into hexoses and other organic compounds. The 
dark phase of photosynthesis, called the Calvin cycle, starts with the 
reaction of CO, and ribulose 1,5-bisphosphate to form two molecules 
of 3-phosphoglycerate. This reaction is catalyzed by rubisco (ribulose 
1,5-bisphosphate carboxylase/oxygenase). The steps in the conver- 
sion of 3-phosphoglycerate into fructose 6-phosphate and glucose 
6-phosphate are like those of gluconeogenesis, except that glyceralde- 
hyde 3-phosphate dehydrogenase in chloroplasts is specific for 
NADPH rather than NADH. Ribulose 1,5-bisphosphate is regener- 
ated from fructose 6-phosphate, glyceraldehyde 3-phosphate, and 
dihydroxyacetone phosphate by a complex series of reactions. Several 
of the steps in the regeneration of ribulose 1,5-bisphosphate are like 
those of the pentose phosphate pathway. Three molecules of ATP and 
two molecules of NADPH are consumed for each molecule of CO, 
converted into a hexose. Starch in chloroplasts and sucrose in the cyto- 
plasm are the major carbohydrate stores in plants. 

Rubisco also catalyzes a competing oxygenase reaction, which pro- 
duces phosphoglycolate and 3-phosphoglycerate. The recycling of 
phosphoglycolate leads to the release of CO, and further consumption 
of O, ina process called photorespiration. 


20.2 


20.3 


20.4 


20.5 


The Activity of the Calvin Cycle Depends on Environmental Conditions 
Reduced thioredoxin formed by the light-driven transfer of electrons 
from ferredoxin activates enzymes of the Calvin cycle by reducing 
disulfide bridges. The light-induced increase in pH and Mg?* level of 
the stroma is important in stimulating the carboxylation of ribulose 
1,5-bisphosphate by rubisco. Photorespiration is minimized in tropical 
plants, which have an accessory pathway—the C, pathway—for con- 
centrating CO, at the site of the Calvin cycle. This pathway enables 
tropical plants to take advantage of high levels of light and minimize 
the oxygenation of ribulose 1,5-bisphosphate. Other plants use crassu- 
lacean acid metabolism (CAM) to prevent dehydration. In CAM 
plants, the C, pathway is active during the night, when the plant 
exchanges gases with the air. During the day, gas exchange is elimi- 
nated and CO, is generated from malate stored in vacuoles. 


The Pentose Phosphate Pathway Generates NADPH and Synthesizes 
Five-Carbon Sugars 

Whereas the Calvin cycle is present only in photosynthetic organisms, 
the pentose phosphate pathway is present in all organisms. The pen- 
tose phosphate pathway consists of two phases: an oxidative phase and 
a nonoxidative phase. The oxidative phase generates NADPH and 
ribulose 5-phosphate in the cytoplasm. NADPH is used in reductive 
biosyntheses. The oxidative phase starts with the dehydrogenation of 
glucose 6-phosphate to form a lactone, which is hydrolyzed to give 
6-phosphogluconate and then oxidatively decarboxylated to yield 
ribulose 5-phosphate. NADP? is the electron acceptor in both of these 
oxidations. The nonoxidative phase begins with the isomerization of 
ribulose 5-phosphate (a ketose) to ribose 5-phosphate (an aldose). 
Ribose 5-phosphate is converted into glyceraldehyde 3-phosphate and 
fructose 6-phosphate by transketolase and transaldolase. These two 
enzymes create a reversible link between the pentose phosphate path- 
way and glycolysis. Xylulose 5-phosphate, sedoheptulose 7-phosphate, 
and erythrose 4-phosphate are intermediates in these interconversions. 
When NADPH needs are paramount, fructose 6-phosphate and glyc- 
eraldehyde 3-phosphate from the nonoxidative phase are used to syn- 
thesize glucose 6-phosphate for metabolism in the oxidative phase. In 
this way, 12 molecules of NADPH can be generated for each molecule 
of glucose 6-phosphate that is completely oxidized to CO,. 


The Metabolism of Glucose 6-Phosphate by the Pentose Phosphate 
Pathway Is Coordinated with Glycolysis 

Only the nonoxidative branch of the pathway is significantly active 
when much more ribose 5-phosphate than NADPH must be synthe- 
sized. Under these conditions, fructose 6-phosphate and glyceralde- 
hyde 3-phosphate (formed by the glycolytic pathway) are converted 
into ribose 5-phosphate without the formation of NADPH. 
Alternatively, ribose 5-phosphate formed by the oxidative branch can 
be converted into pyruvate through fructose 6-phosphate and glyceral- 
dehyde 3-phosphate. In this mode, ATP and NADPH are generated, 
and five of the six carbons of glucose 6-phosphate emerge in pyruvate. 
The interplay of the glycolytic and pentose phosphate pathways enables 
the levels of NADPH, ATP, and building blocks such as ribose 5-phos- 
phate and pyruvate to be continuously adjusted to meet cellular needs. 


Glucose 6-Phosphate Dehydrogenase Plays a Key Role in Protection 
Against Reactive Oxygen Species 

NADPH generated by glucose 6-phosphate dehydrogenase maintains 
the appropriate levels of reduced glutathione required to combat 
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KEY TERMS 


cell. Cells with diminished glucose 6-phosphate dehydrogenase activ- 
ity are especially sensitive to oxidative stress. 


Calvin cycle (dark reactions) (p. 589) 

autotroph (p. 590) 

heterotroph (p. 590) 

rubisco (ribulose 1,5-bisphosphate 
carboxylase/oxygenase) (p. 591) 


aldolase (p. 595) 
starch (p. 597) 
sucrose (p. 597) 


peroxisome (microbody) (p. 593) 
photorespiration (p. 594) C, plant (p. 601) 


hexose monophosphate pool (p. 595) C, plant (p. 601) 


PROBLEMS 


transketolase (p. 595) 


thioredoxin (p. 599) 
C, pathway (p. 599) 


crassulacean acid metabolism 


(CAM) (p. 601) 
pentose phosphate pathway (p. 601) 


glucose 6-phosphate dehydrogenase 
(p. 602) 
glutathione (p. 610) 


1. A vital cycle. Why is the Calvin cycle crucial to the func- 
tioning of all life forms? 


2. Be nice to plants. Differentiate between autotrophs and 
heterotrophs. 


3. Cabalistic reactions? Why are the reactions of the Calvin 
cycle sometimes referred to as the dark reactions? Do they 
take place only at night or are they grim, secret reactions? 


4. Compare and contrast. Identify the similarities and dif- 
ferences between the Krebs cycle and the Calvin cycle. 


5. Labeling experiments. When Melvin Calvin performed 
his initial experiments on carbon fixation, he exposed 
algae to radioactive carbon dioxide. After 5 seconds, only a 
single organic compound contained radioactivity but, after 
60 seconds, many compounds had incorporated radioac- 
tivity. (a) What compound initially contained the radioac- 
tivity? (b) What compounds contained radioactivity after 
60 seconds? 


6. Three-part harmony. The Calvin cycle can be thought of 
as occurring in three parts or stages. Describe the stages. 


7. Not always to the swiftest. Suggest a reason why rubisco 
might be the most abundant enzyme in the world. 


8. A requirement. In an atmosphere devoid of CO, but rich 
in O,, the oxygenase activity of rubisco disappears. Why? 


9. Reduce locally. Glyceraldehyde 3-phosphate dehy- 
drogenase in chloroplasts uses NADPH to participate in 
the synthesis of glucose. In gluconeogenesis in the cyto- 
plasm, the isozyme of the dehydrogenase uses NADH. 
Why is it advantageous for the chloroplast enzyme to use 


NADPH? 


10. Total eclipse. An illuminated suspension of the green 
alga Chlorella is actively carrying out photosynthesis. 


Suppose that the light is suddenly switched off. How would 
the levels of 3-phosphoglycerate and ribulose 1,5-bisphos- 
phate change in the next minute? 


11. CO, deprivation. An illuminated suspension of 
Chlorella is actively carrying out photosynthesis in the 
presence of 1% CO,. The concentration of CO, is abruptly 
reduced to 0.003%. What effect would this reduction 
have on the levels of 3-phosphoglycerate and ribulose 1,5- 
bisphosphate in the next minute? 


12. Salvage operation. Write a balanced equation for the 
transamination of glyoxylate to yield glycine. 


13. Dog days of August. Before the days of pampered lawns, 
most homeowners practiced horticultural Darwinism. A 
result was that the lush lawns of early summer would often 
convert into robust cultures of crabgrass in the dog days of 
August. Provide a possible biochemical explanation for this 
transition. 


14. Is it hot in here, or is it just me? Why is the C, pathway 
valuable for tropical plants? 


15. No free lunch. Explain why maintaining a high concen- 
tration of CO, in the bundle-sheath cells of C, plants is an 
example of active transport. How much ATP is required 
per CO, to maintain a high concentration of CO, in the 
bundle-sheath cells of C, plants? 


16. Breathing pictures? What is photorespiration, what is its 
cause, and why is it believed to be wasteful? 


17. Global warming. C, plants are most common in higher 
latitudes and become less common at latitudes near the 
equator. The reverse is true of C, plants. How might global 
warming affect this distribution? 


18. Communication. What are the light-dependent changes 
in the stroma that regulate the Calvin cycle? 


19. Match ‘em. Match each term with its description. 


a) Calvincycle____—_ 1. Catalyzes CO, fixation 

b) Rubisco ____ 2. Storage form of carbohydrates 
c) Carbamate___—— 3. a-1,4 linkages only 

d) Starch 4. 3-Phosphoglycerate is formed 
e) Sucrose after carbon fixation 

f) Amylose The dark reactions 


. Includes a-1,6 linkages 
. Required for rubisco activity 
. Carbon fixation results in 
oxaloacetate formation 
9. Allow exchange of gases 
10. Transport form of 
carbohydrates 


g) Amylopectin____ 
h) C, plants____ 

i) C,plants___ 

(j) Stomata_ 


20. Linked in. Describe how the pentose phosphate 
pathway and glycolysis are linked by transaldolase and 
transketolase. 


21. Biochemical taxonomy. 

a) Identify 6-phosphoglucono-6-lactone._ 
b) Which reactions produce NADPH? 
c) Identify ribulose 5-phosphate._ 

d) What reaction generates CO,?__ 

e) Identify 6-phosphogluconate._ 


f) Which reaction is catalyzed by phosphopentose 
isomerase? 


g) Identify ribose 5-phosphate 
h) Which reaction is catalyzed by lactonase? 
i) Identify glucose 6-phosphate. 


(j) Which reaction is catalyzed by 6-phosphogluconate 
dehydrogenase?_ 


k) Which reaction is catalyzed by glucose 6-phosphate 
dehydrogenase? 
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23. Recurring decarboxylations. Which reaction in the citric 
acid cycle is most analogous to the oxidative decarboxylation 
of 6-phosphogluconate to ribulose 5-phosphate? What kind 
of enzyme-bound intermediate is formed in both reactions? 


24. Synthetic stoichiometries. What is the stoichiometry of 
the synthesis of (a) ribose 5-phosphate from glucose 6- 
phosphate without the concomitant generation of NADPH? 
(b) NADPH from glucose 6-phosphate without the con- 
comitant formation of pentose sugars? 


25. Offal or awful? Liver and other organ meats contain 
large quantities of nucleic acids. In the course of digestion, 
RNA is hydrolyzed to ribose, among other chemicals. 
Explain how ribose can be used as a fuel. 


26. A required ATP. The metabolism of glucose 6-phosphate 
into ribose 5-phosphate by the joint efforts of the pentose 
phosphate pathway and glycolysis can be summarized by 
the following equation. 


5 Glucose 6-phosphate + ATP —~> 
6 ribose 5-phosphate + ADP 


Which reaction requires the ATP? 


27. No respiration. Glucose is normally completely oxi- 
dized to CO, in the mitochondria. In what circumstance can 
glucose be completely oxidized to CO, in the cytoplasm? 


28. Watch your diet, doctor. The noted psychiatrist Hannibal 
Lecter once remarked to FBI Agent Clarice Starling that he 
enjoyed liver with some fava beans and a nice Chianti. Why 
might this diet be dangerous for some people? 


29. No redundancy. Why do deficiencies in glucose 
6-phosphate dehydrogenase frequently present as anemia? 


30. Damage control. What is the role of glutathione in pro- 
tection against damage by reactive peroxides? Why is the 
pentose phosphate pathway crucial to this protection? 


fe) H 
SC Ox So 
CH,OPO37- CH,OPO37- CH20H i 
fe) 2 | 
H—C—OH SH H—C—OH 
fe) re) 
H H H HO—-C—H | SP 
H B H ‘ D F H—C—OH H H—C—OH 
——————}> ———™4™ 
ian ‘cam ae 2a = 
HO OH HO | | 
iar aa H-—C-—OH CHOPO;32- CH,0P032- 
CH»OPO32- 
A Cc E G I 


22. Tracing glucose. Glucose labeled with !*C at C-6 is 
added to a solution containing the enzymes and cofactors of 
the oxidative phase of the pentose phosphate pathway. 
What is the fate of the radioactive label? 


31. Reductive power. What ratio of NADPH to NADP? is 
required to sustain [GSH] = 10 mM and [GSSG] = 1 mM? 
Use the redox potentials given in Table 18.1. 
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Mechanism Problems 


32. An alternative approach. The mechanisms of some aldol- 
ases do not include Schiff-base intermediates. Instead, these 
enzymes require bound metal ions. Propose such a mecha- 
nism for the conversion of dihydroxyacetone phosphate and 
glyceraldehyde 3-phosphate into fructose 1,6-bisphosphate. 


33. A recurring intermediate. Phosphopentose isomerase 
interconverts the aldose ribose 5-phosphate and the ketose 
ribulose 5-phosphate. Propose a mechanism. 


Chapter Integration Problems 


34. Catching carbons. Radioactive-labeling experiments 
can yield estimates of how much glucose 6-phosphate is 
metabolized by the pentose phosphate pathway and how 
much is metabolized by the combined action of glycolysis 
and the citric acid cycle. Suppose that you have samples of 
two different tissues as well as two radioactively labeled 
glucose samples, one with glucose labeled with '4C at C-1 
and the other with glucose labeled with '4C at C-6. Design 
an experiment that would enable you to determine the rela- 
tive activity of the aerobic metabolism of glucose compared 
with metabolism by the pentose phosphate pathway. 


35. You do what you can do. Red blood cells lack mitochon- 
dria. These cells process glucose to lactate, but they also gen- 
erate CO,. What is the purpose of producing lactate? How 
can red blood cells generate CO, if they lack mitochondria? 


36. Better to focus on one. Rubisco catalyzes both a carbox- 
ylation reaction and a wasteful oxygenase reaction. Below 
are the kinetic parameters for the two reactions. 


Ky0°2 Ky,°2 hoe? Rest? 
(uM) (uM) os (s71) 
10 500 3 2 


(a) Determine the values of ee oe and k, EOey 
Rages as s~'M7!, 

(b) In light of the k,,,/ Ky, values for the two reactions, why 
does the oxygenation reaction occur? 


37. Photosynthetic efficiency. Use the following information 
to estimate the efficiency of photosynthesis. 


The AG®’ for the reduction of CO, to the level of hexose is 
+477 kJ mol! (+114 kcal mol~!). 

A mole of 600-nm photons has an energy content of 199 kJ 
(47.6 kcal). 


Assume that the proton gradient generated in producing 
the required NADPH is sufficient to drive the synthesis of 
the required ATP. 


38. A violation of the First Law? The complete combustion 
of glucose to CO, and H,O yields 30 ATP, as shown in 


Table 18.4. However, the synthesis of glucose requires only 
18 ATP. How is it possible that glucose synthesis from CO, 
and H,O requires only 18 ATP, but combustion to CO, 
and HO yields 30 ATP? Is it a violation of the First Law of 
Thermodynamics, or perhaps a miracle? 


Data Interpretation Problem 


39. Deciding between 3 and 4. Graph A shows the photo- 
synthetic activity of two species of plant, one a C, plant and 
the other a C, plant, as a function of leaf temperature. 


foe} 
o 


(A) [- 


a 
oO 


Photosynthetic activity 
N 
oO 
] 


(micromoles of CO, assimilated per 
square meter of leaf area per second) 
aS 
Oo 
] 


| | | | | 
10 20 30 40 50 


Leaf temperature (°C) 


oO 


(a) Which data were most likely generated by the C, plant 
and which by the C, plant? Explain. 


(b) Suggest some possible explanations for why the photo- 
synthetic activity falls at higher temperatures. 


Graph B illustrates how the photosynthetic activity of C, 
and C, plants varies with CO, concentration when tem- 
perature (30°C) and light intensity (high) are constant. 
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(c) Why can C, plants thrive at CO, concentrations that do 
not support the growth of C, plants? 


(d) Suggest a plausible explanation why C, plants continue 
to increase photosynthetic activity at higher CO, concen- 
trations, whereas C, plants reach a plateau. 


CHAPTER 


Glycogen Metabolism 


Epinephrine 


Glycogen _ Glucose for energy 


Signaling cascades lead to the mobilization of 
glycogen to produce glucose, an energy source for 
cyclists. [(Left) Jonathan Devich/Epic Images. ] 


: . ; OUTLINE 
lucose is an important fuel and, as we will see, a key precursor for the 
biosynthesis of many molecules. However, glucose cannot be stored, 21.1 Glycogen Breakdown Reauires 
because high concentrations of glucose disrupt the osmotic balance of the the Interplay of Several Enzymes 
cell, which would cause cell damage or death. How can adequate stores of Ho Pieschenies G Requned by 
glucose be maintained without damaging the cell? The solution to this prob- iksateries lneetiona aed 
lem is to store glucose as a nonosmotically active polymer called glycogen. Reversible Phosphorylation 


Glycogen is a readily mobilized storage form of glucose. It is a very large, 
branched polymer of glucose residues that can be broken down to yield glucose 
molecules when energy is needed (Figure 21.1). A glycogen molecule has 


21.3 Epinephrine and Glucagon Signal 
the Need for Glycogen 


approximately 12 layers of glucose molecules and can be as large as 40 nm, ida 

containing approximately 55,000 glucose residues. Most of the glucose resi- 21.4 Glycogen Is Synthesized and 
dues in glycogen are linked by a-1,4-glycosidic bonds (Figure 21.2). Branches Degraded by Different Pathways 
at about every 12 residues are created by a-1,6-glycosidic bonds. Recall that 21.5 Glycogen Breakdown and 
a-glycosidic linkages form open helical polymers, whereas B linkages produce Synthesis Are Reciprocally 
nearly straight strands that form structural fibrils, as in cellulose (Figure 11.14). Regulated 


Glycogen is not as reduced as fatty acids are and consequently are not as 
energy rich. Why isn’t all excess fuel stored as fatty acids rather than as glyco- 
gen? The controlled release of glucose from glycogen maintains blood-glucose 
concentration between meals. The circulating blood keeps the brain supplied 
with glucose, which is virtually the only fuel used by the brain, except during 
prolonged starvation (Chapter 27). Moreover, the readily mobilized glucose 
from glycogen is a good source of energy for sudden, strenuous activity. 
Unlike fatty acids, the released glucose can be metabolized in the absence of 
oxygen and can thus supply energy for anaerobic activity (Section 16.1). 
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FIGURE 21.1 Glycogen. At the core of the 
glycogen molecule is the protein glycogenin 
(yellow). Each line represents glucose 
molecules joined by an a-1,4-glycosidic 
linkage. The nonreducing ends of the 
glycogen molecule form the surface of the 
ycogen granule. Degradation takes place 
this surface. [Information from R. Melendez 
al. Biophys. J. 77:13827-1332, 1999.] 


ov @aQ 


FIGURE 21.3 Electron micrograph 

of a liver cell. The dark particles in the 
cytoplasm are glycogen granules. [Courtesy 
of Dr. George Palade.] 


618 


CH20H CH20H CH20OH 


O 0 ie) 
22k, (OH OH OH 1 ye linkage 
Nonreducing 
mnds eis . - O. a-1,4 linkage 


CH2OH CHJOH OH CH>OH OH CH2OH OH © CHa CH,0H 
re) re) re) O O O 
OH OH OH OH OH} 4 OH 
HO fe) re) O re) re) R 
OH OH OH OH OH OH 


FIGURE 21.2 Glycogen structure. In this structure of two outer branches of a glycogen 
molecule, the residues at the nonreducing ends are shown in red and the residue that starts a 
branch is shown in green. The rest of the glycogen molecule is represented by R. 


Glycogen is present in bacteria, archae, and eukaryotes. Recall that plants 
store glucose as starch, a similar chemical. Thus, storing energy as glucose poly- 
mers is common to all forms of life. In humans, most tissues have some glyco- 
gen, although the two major sites of glycogen storage are the liver and skeletal 
muscle. The concentration of glycogen is higher in the liver than in muscle (10% 
versus 2% by weight), but more glycogen is stored in skeletal muscle overall 
because of muscle’s much greater mass. Glycogen is present in the cytoplasm, 
with the molecule appearing as granules (Figure 21.3). In the liver, glycogen 
synthesis and degradation are regulated to maintain blood-glucose levels as 
required to meet the needs of the organism as a whole. In contrast, in muscle, 
these processes are regulated to meet the energy needs of the muscle itself. 


Glycogen metabolism is the regulated release and storage of glucose 


Glycogen degradation and synthesis are simple biochemical processes. 
Glycogen degradation consists of three steps: (1) the release of glucose 
1-phosphate from glycogen, (2) the remodeling of the glycogen substrate to 
permit further degradation, and (3) the conversion of glucose 1-phosphate 
into glucose 6-phosphate for further metabolism. The glucose 6-phosphate 
derived from the breakdown of glycogen has three possible fates (Figure 21.4): 
(1) It can be metabolized by glycolysis, (2) it can be converted into free 
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Glucose 1-phosphate 
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released into the blood; 

(3) be processed by the 4 

pentose phosphate pathway 

to generate NADPH or Blood for use by 

ribose in a variety of tissues. other tissues 


glucose for release into the bloodstream, and (3) it can be processed by the 
pentose phosphate pathway to yield NADPH and ribose derivatives. The 
conversion of glycogen into free glucose takes place mainly in the liver. 

Glycogen synthesis, which takes place when glucose is abundant, 
requires an activated form of glucose, uridine diphosphate glucose (UDP- 
glucose), formed by the reaction of UTP and glucose 1-phosphate. As is the 
case for glycogen degradation, the glycogen molecule must be remodeled for 
continued synthesis. 

The regulation of glycogen degradation and synthesis is complex and is 
facilitated by the fact that all of the enzymes involved in glycogen metabo- 
lism and its regulation are associated with the glycogen particle. Several 
enzymes taking part in glycogen metabolism allosterically respond to 
metabolites that signal the energy needs of the cell. Through these allosteric 
responses, enzyme activity is adjusted to meet the needs of the cell. In addition, 
hormones may initiate signal cascades that lead to the reversible phosphoryl- 
ation of enzymes, which alters their catalytic rates. Regulation by hormones 
adjusts glycogen metabolism to meet the needs of the entire organism. 


21.1 Glycogen Breakdown Requires the Interplay 
of Several Enzymes 


The efficient breakdown of glycogen to provide glucose 6-phosphate for 
further metabolism requires four enzyme activities: one to degrade glycogen, 
two to remodel glycogen so that it can remain a substrate for degradation, 
and one to convert the product of glycogen breakdown into a form suitable 
for further metabolism. We will examine each of these activities in turn. 


Phosphorylase catalyzes the phosphorolytic cleavage of glycogen to 
release glucose 1-phosphate 


Glycogen phosphorylase, the key enzyme in glycogen breakdown, cleaves its 
substrate by the addition of orthophosphate (P.) to yield glucose 1-phosphate. 
The cleavage of a bond by the addition of orthophosphate is referred to as 
phosphorolysis. 


Glycogen + P. == glucose 1-phosphate + glycogen 
(n residues) (n — 1 residues) 


Phosphorylase catalyzes the sequential removal of glucosyl residues from 
the nonreducing ends of the glycogen molecule (the ends with a free OH 
group on carbon 4). Orthophosphate splits the glycosidic linkage between 
C-1 of the terminal residue and C-4 of the adjacent one. Specifically, it 
cleaves the bond between the C-1 carbon atom and the glycosidic oxygen 
atom, and the a configuration at C-1 is retained. 


CH20H CH,0OH CH 20H CH20OH 


6 0 HPO,7- 0 O 
OH OH ais OH af OH 
HO O OR HO OPO37- HO OR 
OH OH OH OH 


Glycogen Glucose 1-phosphate Glycogen 
(n residues) (n - 1 residues) 


Glucose 1-phosphate released from glycogen can be readily converted into 
glucose 6-phosphate, an important metabolic intermediate, by the enzyme 
phosphoglucomutase. 
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hk FIGURE 21.5 Structure of glycogen 
phosphorylase. This enzyme forms a 
homodimer: one subunit is shown in white 
and the other in yellow. Each catalytic site 
includes a pyridoxal phosphate (PLP) 
group, linked to lysine 680 of the enzyme. 
The binding site for the phosphate (P,) 
substrate is shown. Notice that the catalytic 
site lies between the C-terminal domain 
and the glycogen-binding site. A narrow 
crevice, which binds four or five glucose 
units of glycogen, connects the two sites. 
The separation of the sites allows the 
catalytic site to phosphorolyze several 
glucose units before the enzyme must 
rebind the glycogen substrate. [Drawn 
from 1NOI.pdb.] 


Glycogen- 
binding 
site 


The reaction catalyzed by phosphorylase is readily reversible in vitro. At 
pH 6.8, the equilibrium ratio of orthophosphate to glucose 1-phosphate is 
3.6. The value of AG®’ for this reaction is small because a glycosidic bond is 
replaced by a phosphoryl ester bond that has a nearly equal transfer poten- 
tial. However, phosphorolysis proceeds far in the direction of glycogen 
breakdown in vivo because the [P.]/[glucose 1-phosphate] ratio is usually 
greater than 100, substantially favoring phosphorolysis. We see here an 
example of how the cell can alter the free-energy change to favor a reaction’s 
occurrence by altering the ratio of substrate and product. 

The phosphorolytic cleavage of glycogen is energetically advantageous 
because the released sugar is already phosphorylated. In contrast, a hydrolytic 
cleavage would yield glucose, which would then have to be phosphorylated 
at the expense of a molecule of ATP to enter the glycolytic pathway. An 
additional advantage of phosphorolytic cleavage for muscle cells is that no 
transporters exist for glucose 1-phosphate, which is negatively charged 
under physiological conditions, and so it cannot be transported or diffuse 
out of the cell. 


Mechanism: Pyridoxal phosphate participates in the 
phosphorolytic cleavage of glycogen 


The special challenge faced by phosphorylase is to cleave glycogen phospho- 
rolytically rather than hydrolytically to save the ATP required to phospho- 
rylate free glucose. Thus, water must be excluded from the active site. 
Phosphorylase is a dimer of two identical 97-kDa subunits. Each subunit is 
compactly folded into an amino-terminal domain (480 residues) containing a 
glycogen-binding site and acarboxyl-terminal domain (360 residues; Figure 21.5). 
The catalytic site in each subunit is located in a deep crevice formed by resi- 
dues from both domains. Substrates bind synergistically, causing the crevice 
to narrow, thereby excluding water. What is the mechanistic basis of the 
phosphorolytic cleavage of glycogen? 
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Several clues suggest a mechanism. First, both the glycogen substrate 
and the glucose 1-phosphate product have an a configuration at C-1. 
A direct attack by phosphate on C-1 of a sugar would invert the configura- 
tion at this carbon atom because the reaction would proceed through a 
pentacovalent transition state. The fact that the glucose 1-phosphate 
formed has an a rather than a B configuration suggests that an even number 
of steps (most simply, two) 1s required. A likely explanation for these results 
is that a carbonium ion intermediate is formed from the glucose residue. 

A second clue to the catalytic mechanism of phosphorylase is its require- 
ment for the coenzyme pyridoxal phosphate (PLP), a derivative of pyridox- 
ine (vitamin B,, Section 15.4). The aldehyde group of this coenzyme forms 
a Schiff-base linkage with a specific lysine side chain of the enzyme (Figure 
21.6). Structural studies indicate that the reacting orthophosphate group 
takes a position between the 5’-phosphate group of PLP and the glycogen 
substrate (Figure 21.7). The 5’-phosphate group of PLP acts in tandem with 
orthophosphate by serving as a proton donor and then as a proton acceptor (i.€., 
as a general acid-base catalyst). Orthophosphate (in the HPO,?~ form) 
donates a proton to the oxygen atom attached to carbon 4 of the departing 
glycogen chain and simultaneously acquires a proton from PLP. The carbo- 
nium ion intermediate formed in this step is then attacked by orthophos- 
phate to form a-glucose 1-phosphate, with the concomitant return of a 
proton to pyridoxal phosphate. The special role of pyridoxal phosphate in 
the reaction is necessary because water is excluded from the active site. 

The glycogen-binding site is 30 A away from the catalytic site 
(Figure 21.5), but it is connected to the catalytic site by a narrow crevice able 
to accommodate four or five glucose units. The large separation between the 
binding site and the catalytic site enables the enzyme to phosphorolyze 
many residues without having to dissociate and reassociate after each cata- 
lytic cycle. An enzyme that can catalyze many reactions without having to 
dissociate and reassociate after each catalytic step is said to be processive—a 
property of enzymes that synthesize and degrade large polymers. We will 
see such enzymes again when we consider DNA and RNA synthesis. 


A debranching enzyme also is needed for the breakdown of glycogen 


Glycogen phosphorylase acting alone degrades glycogen to a limited extent. 
The enzyme can break a-1,4-glycosidic bonds on glycogen branches but 
soon encounters an obstacle. The a-1,6-glycosidic bonds at the branch 
points are not susceptible to cleavage by phosphorylase. Indeed, phospho- 
rylase stops cleaving a-1,4 linkages when it reaches a terminal residue four 
residues away from a branch point. Because about 1 in 12 residues is 
branched, cleavage by the phosphorylase alone would come to a halt after 
the release of eight glucose molecules per branch. 
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FIGURE 21.6 PLP-Schiff-base linkage. 
A pyridoxal phosphate (PLP) group (red) 
forms a Schiff base with a lysine residue 
(blue) at the active site of phosphorylase, 
where it functions as a general acid-base 
catalyst. 


A Schiff base, also called an imine, is a 
compound containing a carbon-nitrogen 
double bond with the nitrogen atom bonded to 
an organic compound, not a hydrogen atom. 
A Schiff base is formed by the reaction of a 
primary amine with an aldehyde or ketone. 


FIGURE 21.7 Phosphorylase mechanism. 
A bound HPO,?7 group (red) favors the 
cleavage of the glycosidic bond by donating 
a proton to the C-4 oxygen of the departing 
glycosyl group (black). This reaction results 
in the formation of a carbonium ion and is 
favored by the transfer of a proton from the 
protonated phosphate group of the bound 
pyridoxal phosphate (PLP) group (blue). The 
carbonium ion and the orthophosphate 
combine to form glucose 1-phosphate. 
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FIGURE 21.8 Glycogen remodeling. First, 
a-1,4-glycosidic bonds on each branch 

are cleaved by phosphorylase, leaving 

four residues along each branch. The 
transferase shifts a block of three glucosyl 
residues from one outer branch to the 
other. In this reaction, the a-1,4-glycosidic 
link between the blue and the green 
residues is broken and a new a-1,4 link 
between the blue and the yellow residues is 
formed. The green residue is then removed 
by a-1,6-glucosidase, leaving a linear chain 
with all a-1,4 linkages, suitable for further 
cleavage by phosphorylase. 


How can the remainder of the glycogen molecule be mobilized for use as 
a fuel? Two additional enzymes, a transferase and a-1,6-glucosidase, remodel 
the glycogen for continued degradation by the phosphorylase (Figure 21.8). 
The transferase shifts a block of three glucosyl residues from one outer branch 
to another. This transfer exposes a single glucose residue joined by an 
a-1,6-glycosidic linkage. a-1,6-Glucosidase, also known as the debranch- 
ing enzyme, hydrolyzes the a-1,6-glycosidic bond. 
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A free glucose molecule is released and then phosphorylated by the glyco- 
lytic enzyme hexokinase. Thus, the transferase and a-1,6-glucosidase con- 
vert the branched structure into a linear one, which paves the way for 
further cleavage by phosphorylase. In eukaryotes, the transferase and the 
a-1,6-glucosidase activities are present in a single 160-kDa 
polypeptide chain, providing yet another example of a 
core _ bifunctional enzyme (Figure 16.29). 


Phosphoglucomutase converts glucose 1-phosphate 
into glucose 6-phosphate 


Glucose 1-phosphate formed in the phosphorolytic cleavage 
of glycogen must be converted into glucose 6-phosphate to 
enter the metabolic mainstream. This shift of a phosphoryl 
group is catalyzed by phosphoglucomutase. Recall that this 
enzyme is also used in galactose metabolism (Section 16.1). 
To effect this shift, the enzyme exchanges a phosphoryl 
group with the substrate (Figure 21.9). The catalytic site of 
an active mutase molecule contains a phosphorylated serine 
residue. The phosphoryl group is transferred from the serine 
residue to the C-6 hydroxyl group of glucose 1-phosphate to 
form glucose 1,6-bisphosphate. The C-1 phosphoryl group 
of this intermediate is then shuttled to the same serine resi- 
due, resulting in the formation of glucose 6-phosphate and 
the regeneration of the phosphoenzyme. 

These reactions are like those of phosphoglycerate mutase, a glycolytic 
enzyme (Section 16.1). The role of glucose 1,6-bisphosphate in the intercon- 
version of the phosphoglucoses is like that of 2,3-bisphosphoglycerate (2,3- 
BPG) in the interconversion of 2-phosphoglycerate and 3-phosphoglycerate 
in glycolysis. A phosphoenzyme intermediate participates in both reactions. 


The liver contains glucose 6-phosphatase, a hydrolytic 
enzyme absent from muscle 


A major function of the liver is to maintain a nearly constant level of glucose 
in the blood. The liver releases glucose into the blood between meals and 
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during muscular activity. The released glucose is taken up primarily by the 
brain, skeletal muscle, and red blood cells. In contrast with unmodified 
glucose, however, the phosphorylated glucose produced by glycogen 
breakdown is not transported out of cells. The liver contains a hydrolytic 
enzyme, glucose 6-phosphatase, that enables glucose to leave that organ. 
This enzyme cleaves the phosphory] group to form free glucose and ortho- 
phosphate. This glucose 6-phosphatase is the same enzyme that releases 
free glucose at the conclusion of gluconeogenesis. It is located on the lume- 
nal side of the smooth endoplasmic reticulum membrane. Recall that glu- 
cose 6-phosphate is transported into the endoplasmic reticulum; glucose 
and orthophosphate formed by hydrolysis are then shuttled back into the 
cytoplasm (Section 16.1). 


Glucose 6-phosphate + HzO —— glucose + P; 


Glucose 6-phosphatase is absent from most other tissues. Muscle tissues 
retain glucose 6-phosphate for the generation of ATP. In contrast, glucose 
is not a major fuel for the liver. 


21.2 Phosphorylase Is Regulated by Allosteric Interactions 
and Reversible Phosphorylation 


Glycogen degradation is precisely controlled by multiple interlocking 
mechanisms. The focus of this control is the enzyme glycogen phosphor- 
ylase. Phosphorylase is regulated by several allosteric effectors that signal the 
energy state of the cell as well as by reversible phosphorylation, which is 
responsive to hormones such as insulin, epinephrine, and glucagon. We will 
examine the differences in the control of two isozymic forms of glycogen 
phosphorylase: one specific to liver and one specific to skeletal muscle. 
These differences are due to the fact that the liver maintains glucose homeo- 
stasis of the organism as a whole, whereas the muscle uses glucose to produce 
energy for itself. 


Liver phosphorylase produces glucose for use by other tissues 


The dimeric phosphorylase exists in two interconvertible forms: a usually 
active phosphorylase a and a usually inactive phosphorylase b (Figure 21.10). 
Each of these two forms exists in equilibrium between an active relaxed (R) 
state and a much less active tense (T) state, but the equilibrium for phos- 
phorylase a favors the active R state, whereas the equilibrium for phos- 
phorylase b favors the less-active T state (Figure 21.11). The role of 
glycogen degradation in the liver is to form glucose for export to other tis- 
sues when the blood-glucose level is low. Consequently, we can think of the 
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FIGURE 21.9 Reaction catalyzed by 
phosphoglucomutase. A phosphoryl 


group is transferred from the enzyme to the 
substrate, and a different phosphoryl group 
is transferred back to restore the enzyme to 


its initial state. 
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Phosphorylase a (in R state) Phosphorylase b (in T state) 


k FIGURE 21.10 Structures of phosphorylase a and phosphorylase b. Phosphorylase 


a is phosphorylated on serine 14 of each subunit. This modification favors the structure of 
the more-active R state. One subunit is shown in white, with helices and loops important for 
regulation shown in blue and red. The other subunit is shown in yellow, with the regulatory 
structures shown in orange and green. Phosphorylase 6 is not phosphorylated and exists 
predominantly in the T state. Notice that the catalytic sites are partly occluded in the T state. 
[Drawn from 1GPA.pdb and 1NOJ.pdb.] 
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FIGURE 21.11 Phosphorylase regulation. Both 
phosphorylase b and phosphorylase a exist as 
equilibria between an active R state and a less- 
active T state. Phosphorylase 6 is usually inactive 
because the equilibrium favors the T state. 
Phosphorylase a is usually active because the 
equilibrium favors the R state. Regulatory 
structures are shown in blue and green. 
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default state of liver phosphorylase as being the a form: glucose is to 
be generated unless the enzyme is signaled otherwise. The liver phos- 
phorylase a form thus exhibits the most responsive R <> T transition 
(Figure 21.12). The binding of glucose to the active site shifts the a 
form from the active R state to the less-active T state. In essence, the 
enzyme reverts to the low-activity T state only when it detects the 
presence of sufficient glucose. If glucose is present in the diet, there 
is no need to degrade glycogen. 
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(liver) 


2 Glucose (@) 


T state R state 


FIGURE 21.12 Allosteric regulation of liver phosphorylase. The binding of glucose 
to phosphorylase a shifts the equilibrium to the T state and inactivates the enzyme. 
Thus, glycogen is not mobilized when glucose is already abundant. 


Muscle phosphorylase is regulated by the intracellular Phosphorylase b 
energy charge (muscle) 


In contrast to the liver isozyme, the default state of muscle 
phosphorylase is the b form, owing to the fact that, for 
muscle, phosphorylase must be active primarily during _Nucleotide- 


muscle contraction. Muscle phosphorylase bis activated by _Pinding sites ene 
the presence of high concentrations of AMP, which binds 2 ATP 
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to a nucleotide-binding site and stabilizes the conforma- 
tion of phosphorylase b in the active R state (Figure 21.13). 
Thus, when a muscle contracts and ATP is converted into 
AMP by the sequential action of myosin (Section 9.4) and 
adenylate kinase (Section 16.2), the phosphorylase is sig- 
naled to degrade glycogen. ATP acts as a negative allosteric FIGURE 21.13 Allosteric regulation of muscle phosphorylase. A 
effector by competing with AMP. Thus, the transition of low energy charge, represented by high concentrations of AMP, 
phosphorylase b between the active R state and the less-active _@V0FS the transition to the R state. ATP and glucose 6-phosphate 
T state is controlled by the energy charge of the muscle cell. pele telly 
Glucose 6-phosphate also binds at the same site as ATP 
and stabilizes the less-active state of phosphorylase b, an example of feed- 
back inhibition. Under most physiological conditions, phosphorylase b is 
inactive because of the inhibitory effects of ATP and glucose 6-phosphate. In 
contrast, phosphorylase a is fully active, regardless of the levels of AMP, ATP, 
and glucose 6-phosphate. In resting muscle, nearly all the enzyme is in the 
inactive b form. 

Unlike the enzyme in muscle, the liver phosphorylase is insensitive to 
regulation by AMP because the liver does not undergo the dramatic changes 
in energy charge seen in a contracting muscle. We see here a clear example 
of the use of isozymes to establish the tissue-specific biochemical properties 
of the liver and muscle. In human beings, liver phosphorylase and muscle 
phosphorylase are approximately 90% identical in amino acid sequence, yet 
the 10% difference results in subtle but important shifts in the stability of 
various forms of the enzyme. 
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Biochemical characteristics of muscle fiber types differ 


Not only do the biochemical needs of liver and muscle differ with respect 
to glycogen metabolism, but the biochemical needs of different muscle 
fiber types also vary. Skeletal muscle consists of three different fiber types: 
type I, or slow-twitch muscle; type IIb (also called type IIx), or fast-twitch 
fibers; and type Ila fibers that have properties intermediate between the 
other two fiber types. Type I fibers rely predominantly on cellular respira- 
tion to derive energy. These fibers are powered by fatty acid degradation 
and are rich in mitochondria, the site of fatty acid degradation and the citric 
acid cycle. As we will see in Chapter 22, fatty acids are an excellent energy 
storage form, but generating ATP from fatty acids is slower than from 
glycogen. Glycogen is not an important fuel for type I fibers, and conse- 
quently the amount of glycogen phosphorylase is low. Type I fibers power 
endurance activities. Type IIb fibers use glycogen as their main fuel. 
Consequently, the glycogen and glycogen phosphorylase are abundant. 
These fibers are also rich in glycolytic enzymes needed to process glucose 
quickly in the absence of oxygen and poor in mitochondria. Type IIb fibers 
power burst activities such as sprinting and weight lifting. No amount of 
training can interconvert type I fibers and type IIb fibers. However, there 
is some evidence that type IJa fibers are “trainable”; that is, endurance 
training enhances the oxidative capacity of type Ila fibers while burst 
activity training enhances the glycolytic capacity. Table 21.1 shows the 
biochemical profile of the fiber types. 
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TABLE 21.1 Biochemical characteristics of muscle fiber types 


Characteristic Type I Type Ila Type Ub 


Fatigue resistance High Intermediate Low 
Mitochondrial density High Intermediate Low 
Metabolic type Oxidative Oxidative/glycolytic Glycolytic 
Myoglobin content High Intermediate Low 
Glycogen content Low Intermediate High 
Triacylglycerol content High Intermediate Low 
Glycogen phosphorylase activity Low Intermediate High 
Phosphofructokinase activity Low Intermediate High 
Citrate synthase activity High Intermediate Low 


Phosphorylation promotes the conversion of phosphorylase 
b to phosphorylase a 


In both liver and muscle, phosphorylase b is converted into phosphorylase a 
by the phosphorylation of a single serine residue (serine 14) in each subunit. 
This conversion is initiated by hormones. Low blood levels of glucose lead 
to the secretion of the hormone glucagon. Fear or the excitement of exercise 
will cause levels of the hormone epinephrine to increase. The rise in gluca- 
gon and epinephrine levels results in phosphorylation of the enzyme to the 
phosphorylase a form in liver and muscle respectively. The regulatory 
enzyme phosphorylase kinase catalyzes this covalent modification. 

Comparison of the structures of phosphorylase a in the R state and 
phosphorylase b in the T state reveals that subtle structural changes at the 
subunit interfaces are transmitted to the active sites (Figure 21.10). The 
transition from the T state (the prevalent state of phosphorylase b) to the R 
state (the prevalent state of phosphorylase a) entails a 10-degree rotation 
around the twofold axis of the dimer. Most important, this transition is 
associated with structural changes in a helices that move a loop out of the 
active site of each subunit. Thus, the T state is less active because the cata- 
lytic site is partly blocked. In the R state, the catalytic site is more accessible 
and a binding site for orthophosphate is well organized. 


Phosphorylase kinase is activated by phosphorylation 
and calcium ions 


Phosphorylase kinase activates phosphorylase b by attaching a phosphoryl 
group. The subunit composition of phosphorylase kinase in skeletal muscle 
is (aBy5),, and the mass of this very large protein 1s 1300 kDa. The enzyme 
consists of two (aBy5), lobes that are joined by a B, bridge that is the core 
of the enzyme and serves as a scaffold for the remaining subunits. The y 
subunit contains the active site, while all of the remaining subunits (~90% 
by mass) play regulatory roles. The 6 subunit is the calcium binding pro- 
tein calmodulin, a calcium sensor that stimulates many enzymes in eukary- 
otes (Section 14.1). The a and B subunits are targets of protein kinase A. 
The 8 subunit is phosphorylated first, followed by the phosphorylation of 
the a subunit. 

Activation of phosphorylase kinase is initiated when Ca?* binds to the 8 
subunit. This mode of activation of the kinase is especially noteworthy in 
muscle, where contraction is triggered by the release of Ca?* from the sar- 
coplasmic reticulum (Figure 21.14). Maximal activation is achieved with 
the phosphorylation of the 8 and a subunits of the Ca”*-bound kinase. The 
stimulation of phosphorylase kinase is one step in a signal-transduction 
cascade initiated by signal molecules such as glucagon and epinephrine. 
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21.3 Epinephrine and Glucagon Signal the Need 
for Glycogen Breakdown 


Protein kinase A activates phosphorylase kinase, which in turn activates 
glycogen phosphorylase. What activates protein kinase A? What is the sig- 
nal that ultimately triggers an increase in glycogen breakdown? 


G proteins transmit the signal for the initiation of glycogen breakdown 


Several hormones greatly affect glycogen metabolism. Glucagon and epi- 
nephrine trigger the breakdown of glycogen. Muscular activity or its anti- 
cipation leads to the release of epinephrine (adrenaline), a catecholamine 
derived from tyrosine, from the adrenal medulla. Epinephrine markedly 
stimulates glycogen breakdown in muscle and, to a lesser extent, in the liver. 
The liver is more responsive to glucagon, a polypeptide hormone secreted 
by the a cells of the pancreas when the blood-sugar level is low. 
Physiologically, glucagon signifies the starved state (Figure 21.15). 

How do hormones trigger the breakdown of glycogen? They initiate a 
cyclic AMP signal-transduction cascade, already discussed in Section 14.1 
(Figure 21.16). 


1. The signal molecules epinephrine and glucagon bind to specific seven- 
transmembrane (7TM) receptors in the plasma membranes of target cells 
(Section 14.1). Epinephrine binds to the B-adrenergic receptor in muscle, 
whereas glucagon binds to the glucagon receptor in the liver. These binding 
events activate the G, protein. A specific external signal has been transmitted 
into the cell through structural changes, first in the receptor and then in the 
G protein. 
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FIGURE 21.14 Activation of phosphorylase 
kinase. Phosphorylase kinase, an (aB-y8), 
assembly, is partly activated by Ca2t 
binding to the 6 subunit. Activation is 
maximal when the 8 and a subunits are 
phosphorylated in response to hormonal 
signals. When active, the enzyme converts 
phosphorylase b into phosphorylase a. 
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FASTING: Low glucose EXERCISE 


Glucagon from pancreas 


Epinephrine from adrenal medulla 


FIGURE 21.15 PATHWAY INTEGRATION: 


Hormonal control of glycogen 
breakdown. Glucagon stimulates liver- 
glycogen breakdown when blood glucose 
is low. Epinephrine enhances glycogen 
breakdown in muscle and the liver to 
provide fuel for muscle contraction. 


FIGURE 21.16 Regulatory cascade for 
glycogen breakdown. Glycogen 
degradation is stimulated by hormone 
binding to 7TM receptors. Hormone 
binding initiates a G-protein-dependent 
signal-transduction pathway that results in 
the phosphorylation and activation of 
glycogen phosphorylase. 
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2. The GTP-bound subunit of G, activates the transmembrane protein 


adenylate cyclase. This enzyme catalyzes the formation of the second mes- 
senger cyclic AMP (cAMP) from ATP. 


3. The elevated cytoplasmic level of cAMP activates protein kinase A 
(Section 10.3). The binding of cAMP to inhibitory regulatory subunits 


Epinephrine (muscle) or 
glucagon (liver) 
Pa receptor 


POO0OOOY4) 
a pe DODPOOOOS XS 
PL yy OOK 
ma Ge 6 OOO 
Z > 


Adenylate 
cyclase 


Protein “Protein 


kinase A kinase A 


Phosphorylase ~~ —~y Phosphorylase 
kinase kinase 


| 


Phosphorylase ~~ ~~, Phosphorylase 
b a 


triggers their dissociation from the catalytic subunits. The free catalytic 
subunits are now active. 


4. Protein kinase A phosphorylates phosphorylase kinase, first on the 
B subunit and then on the a subunit, which activates glycogen 
phosphorylase. 


The cyclic AMP cascade highly amplifies the effects of hormones. The 
binding of a small number of hormone molecules to cell-surface receptors 
leads to the release of a very large number of sugar units. Indeed, much of 
the stored glycogen would be mobilized within seconds were it not for a 
counterregulatory system. 

The signal-transduction processes in the liver are more complex than 
those in muscle. Epinephrine can also elicit glycogen degradation in the 
liver. However, in addition to binding to the B-adrenergic receptor, it binds 
to the 7T'M a-adrenergic receptor, which then initiates the phosphoinositide 
cascade (Section 14.2) that induces the release of Ca?* from endoplasmic 
reticulum stores. Recall that the 6 subunit of phosphorylase kinase is the 
Ca?* sensor calmodulin. The binding of Ca?* to calmodulin leads to a par- 
tial activation of phosphorylase kinase. Stimulation by both glucagon and 
epinephrine leads to maximal mobilization of liver glycogen. 


Glycogen breakdown must be rapidly turned off when necessary 


It is crucial that the high-gain system of glycogen breakdown be terminated 
quickly to prevent the wasteful depletion of glycogen after energy needs 
have been met. When glucose needs have been satisfied, phosphorylase 
kinase and glycogen phosphorylase are dephosphorylated and inactivated. 
Simultaneously, glycogen synthesis is activated. 

The signal-transduction pathway leading to the activation of glycogen 
phosphorylase is shut down automatically when secretion of the initiating 
hormone ceases. The inherent GTPase activity of the G protein converts the 
bound GTP into inactive GDP, and phosphodiesterases always present in 
the cell convert cyclic AMP into AMP. Protein phosphatase 1 (PP1) removes 
the phosphoryl groups from phosphorylase kinase, thereby inactivating the 
enzyme. Finally, PP1 also removes the phosphoryl group from glycogen 
phosphorylase, converting the enzyme into the usually inactive b form. 


The regulation of glycogen phosphorylase became more 
sophisticated as the enzyme evolved 


aa Analyses of the primary structures of glycogen phosphorylase from 
human beings, rats, Dictyostelium (slime mold), yeast, potatoes, and 
E. coli have enabled inferences to be made about the evolution of this 
important enzyme. The 16 residues that come into contact with glucose at 
the active site are identical in nearly all the enzymes. There is more variation 
but still substantial conservation of the 15 residues at the pyridoxal phos- 
phate-binding site. Likewise, the glycogen-binding site is well conserved in 
all the enzymes. The high degree of similarity among these three sites shows 
that the catalytic mechanism has been maintained throughout evolution. 

Differences arise, however, when we compare the regulatory sites. The 
simplest type of regulation would be feedback inhibition by glucose 
6-phosphate. Indeed, the glucose 6-phosphate regulatory site is highly con- 
served among most of the phosphorylases. The crucial amino acid residues 
that participate in regulation by phosphorylation and nucleotide binding are 
well conserved only in the mammalian enzymes. Thus, this level of regula- 
tion was a later evolutionary acquisition. 
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21.4 Glycogen Is Synthesized and Degraded 
by Different Pathways 


As with glycolysis and gluconeogenesis, biosynthetic and degradative path- 
ways rarely operate by precisely the same reactions in the forward and 
reverse directions. Glycogen metabolism provided the first known example 
of this important principle. Separate pathways afford much greater flexibility, 
both in energetics and in control. 

Glycogen is synthesized by a pathway that utilizes uridine diphosphate 
glucose (UDP-glucose) as the activated glucose donor. 


Synthesis: Glycogen, + UDP-glucose —> glycogen,+;, + UDP 
Degradation: Glycogen, +; + P; —— glycogen, + glucose1-phosphate 


UDP-glucose is an activated form of glucose 


UDP-glucose, the glucose donor in the biosynthesis of glycogen, is an acti- 
vated form of glucose, just as ATP and acetyl CoA are activated forms of 
orthophosphate and acetate, respectively. The C-1 carbon atom of the glu- 
cosyl unit of UDP-glucose is activated because its hydroxyl group is esteri- 
fied to the diphosphate moiety of UDP. 

UDP-glucose is synthesized from glucose 1-phosphate and uridine tri- 
phosphate (UTP) in a reaction catalyzed by UDP-glucose pyrophosphory- 
lase. This reaction liberates the outer two phosphoryl residues of UTP as 
pyrophosphate. 
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UTP UDP-glucose 


This reaction is readily reversible. However, pyrophosphate is rapidly 
hydrolyzed in vivo to orthophosphate by an inorganic pyrophosphatase. 
The essentially irreversible hydrolysis of pyrophosphate drives the synthe- 
sis of UDP-glucose. 


Glucose 1-phosphate + UTP == UDP-glucose + PP. 
PPe +H, ——-2P 


Glucose 1-phosphate + UTP + H,O ——~> UDP-glucose + 2 P; 


The synthesis of UDP-glucose exemplifies another recurring theme in bio- 
chemistry: Many biosynthetic reactions are driven by the hydrolysis of 
pyrophosphate. 


Glycogen synthase catalyzes the transfer of glucose from 
UDP-glucose to a growing chain 


New glucosyl units are added to the nonreducing terminal residues of gly- 
cogen. The activated glucosyl unit of UDP-glucose is transferred to the 
hydroxyl group at C-4 of a terminal residue to form an a-1,4-glycosidic 
linkage. UDP is displaced by the terminal hydroxyl group of the growing 
glycogen molecule. This reaction is catalyzed by glycogen synthase, the key 
regulatory enzyme in glycogen synthesis. Humans have two isozymes of 


glycogen synthase: one is specific to the liver while the other is expressed in 631 


muscle and other tissues. 
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Glycogen synthase, a member of the large glycosyltransferase family 


(Section 11.3), can add glucosyl residues only 


to a polysaccharide chain 


already containing at least four residues. Thus, glycogen synthesis requires a 
primer. This priming function is carried out by glycogenin, a Mn** -requiring 
glycosyltransferase composed of two identical 37-kDa subunits. Each sub- 
unit of glycogenin catalyzes the formation of a-1,4-glucose polymers, 10 to 
20 glucosyl units in length, which are covalently attached to the phenolic 
hydroxyl group of a specific tyrosine residue in each glycogenin subunit. 
UDP-glucose is the donor in this autoglycosylation. At this point, glycogen 


synthase takes over to extend the glycogen mol- 
ecule. Thus, every glycogen molecule has a gly- 
cogenin molecule at its core (Figure 21.1). 


We Despite no detectable sequence similarity, 

structural studies have revealed that glyco- 
gen synthase is homologous to glycogen phos- 
phorylase. The binding site for UDP-glucose in 
glycogen synthase corresponds in position to the 
pyridoxal phosphate in glycogen phosphorylase. 


A branching enzyme forms a-1,6 linkages 


Glycogen synthase catalyzes only the synthesis 
of a-1,4 linkages. Another enzyme is required to 
form the a-1,6 linkages that make glycogen a 
branched polymer. Branching takes place after a 
number of glucosyl residues are joined in a-1,4 
linkages by glycogen synthase (Figure 21.17). A 
branch is created by the breaking of an a-1,4 
link and the formation of an a-1,6 link. A block 
of residues, typically 7 in number, is transferred 
toa more interior site. The branching enzyme that 
catalyzes this reaction requires that the block of 
7 or so residues must include the nonreducing 
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FIGURE 21.17 Branching reaction. The branching enzyme removes an 
oligosaccharide of approximately seven residues from the nonreducing end and 
creates an internal a-1,6 linkage. 


FIGURE 21.18 Cross section of a 
glycogen molecule. The component 
labeled G is glycogenin. 
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terminus, and must come from a chain at least 11 residues long. In addition, 
the new branch point must be at least 4 residues away from a preexisting one. 
Branching is important because it increases the solubility of glycogen. 
Furthermore, branching creates a large number of terminal residues, the 
sites of action of glycogen phosphorylase and synthase (Figure 21.18). 
Thus, branching increases the rate of glycogen synthesis and degradation. 


Glycogen synthase is the key regulatory enzyme in glycogen synthesis 


Glycogen synthase, like phosphorylase, exists in two forms: an active non- 
phosphorylated a form and a usually inactive phosphorylated b form. 
Again, like the phosphorylase, the interconversion of the two forms is regu- 
lated by covalent modification. However, the key means of regulating gly- 
cogen synthase is by allosteric regulation of the phosphorylated form of the 
enzyme, glycogen synthase b. Glucose 6-phosphate is a powerful activator 
of the enzyme, stabilizing the R state of the enzyme relative to the T state. 

The covalent modification of glycogen synthase appears to play more of 
a fine-tuning role. The synthase is phosphorylated at multiple sites by sev- 
eral protein kinases—notably, glycogen synthase kinase (GSK), which is 
under the control of insulin (p. 635), and protein kinase A. The function of 
the multiple phosphorylation sites is still under investigation. Note that 
phosphorylation has opposite effects on the enzymatic activities of glycogen 
synthase and glycogen phosphorylase. 


Glycogen is an efficient storage form of glucose 


What is the cost of converting glucose 6-phosphate into glycogen and back 
into glucose 6-phosphate? The pertinent reactions have already been 
described, except for reaction 5, which is the regeneration of UTP. ATP 
phosphorylates UDP in a reaction catalyzed by nucleoside diphosphokinase. 


Glucose 6-phosphate ——> glucose 1-phosphate (1 
Glucose 1-phosphate + UTP ——> UDP-glucose + PP; ( 
PP; + H,O —> 2P; ( 
UDP-glucose + glycogen, ——> glycogen,+; + UDP (4 
UDP + ATP —+ UTP + ADP ( 


Sum: Glucose 6-phosphate + ATP + glycogen, + H,O —~ 
glycogen,,, + ADP +2 P. 


Thus, 1 molecule of ATP is hydrolyzed to incorporate glucose 6-phosphate 
into glycogen. The energy yield from the breakdown of glycogen is highly 
efficient. About 90% of the residues are phosphorolytically cleaved to glu- 
cose 1-phosphate, which is converted at no cost into glucose 6-phosphate. 
The other residues are branch residues, which are hydrolytically cleaved. 
One molecule of ATP is then used to phosphorylate each of these glucose 
molecules to glucose 6-phosphate. The complete oxidation of glucose 
6-phosphate yields about 31 molecules of ATP, and storage consumes 
slightly more than 1 molecule of ATP per molecule of glucose 6-phosphate; 
so the overall efficiency of storage is nearly 97%. 


21.5 Glycogen Breakdown and Synthesis Are 
Reciprocally Regulated 


An important control mechanism prevents glycogen from being synthe- 
sized at the same time as it is being broken down. The same glucagon- and 
epinephrine-triggered cAMP cascades that initiate glycogen breakdown in the 
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liver and muscle, respectively, also shut off glycogen synthesis. Glucagon and 
epinephrine control both glycogen breakdown and glycogen synthesis through 
protein kinase A (Figure 21.19). Recall that protein kinase A adds a phos- 
phoryl group to phosphorylase kinase, activating that enzyme and initiating 
glycogen breakdown. Glycogen synthase kinase and protein kinase A add 
phosphoryl groups to glycogen synthase, but this phosphorylation leads to 
a decrease in enzymatic activity. In this way, glycogen breakdown and syn- 
thesis are reciprocally regulated. How is the enzymatic activity reversed so 
that glycogen breakdown halts and glycogen synthesis begins? 


Protein phosphatase 1 reverses the regulatory effects of 
kinases on glycogen metabolism 


After a bout of exercise, muscle must shift from a glycogen-degrading mode 
to one of glycogen replenishment. A first step in this metabolic task is to shut 
down the phosphorylated proteins that stimulate glycogen breakdown. This 
task is accomplished by protein phosphatases that catalyze the hydrolysis of 
phosphorylated serine and threonine residues in proteins. Protein phosphatase 
1 plays key voles in regulating glycogen metabolism (Figure 21.20). PP1 inacti- 
vates phosphorylase a and phosphorylase kinase by dephosphorylating them. 
PP1 decreases the rate of glycogen breakdown; it reverses the effects of the 
phosphorylation cascade. Moreover, PP1 also removes phosphoryl groups from 
glycogen synthase b to convert it into the more active glycogen synthase a form. 
Here, PP1 also accelerates glycogen synthesis. PP1 is yet another molecular 
device for coordinating carbohydrate storage. Interestingly, recent research 
suggests that glycogen phosphorylase is also regulated by acetylation (Chapter 
10.3). Acetylation not only inhbits the enzyme, but also enhances dephos- 
phorylation by promoting the interaction with PP1. Acetylation is stimulated 
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FIGURE 21.20 Regulation of glycogen 
synthesis by protein phosphatase 1. 
Protein phosphatase 1 stimulates 

glycogen synthesis while inhibiting glycogen 
breakdown. Active enzymes are shown in 
green and inactive enzymes in red. 


FIGURE 21.21 Regulation of protein 
phosphatase 1 (PP1) in muscle takes 
place in two steps. Phosphorylation of Gy, 
by protein kinase A dissociates the catalytic 
subunit from its substrates in the glycogen 
particle. Phosphorylation of the inhibitor 
subunit by protein kinase A inactivates the 
catalytic unit of PP1. 
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by glucose and insulin, but inhibited by glucagon. It is fascinating that gly- 
cogen phosphorylase, one of the first allosteric enzymes identified and one of 
the most studied enzymes, is still revealing secrets. 

The catalytic subunit of PP1 is a 37-kDa single-domain protein. This 
subunit is usually bound to one of a family of regulatory subunits with masses 
of approximately 120 kDa. In skeletal muscle and heart, the most prevalent 
regulatory subunit 1s called G,,, whereas in the liver, the most prevalent sub- 
unit is G, . These regulatory subunits have modular structures with domains 
that participate in interactions with glycogen, with the catalytic subunit, and 
with target enzymes. Thus, these regulatory subunits act as scaffolds, bringing 
together the phosphatase and its substrates on the glycogen particle. 

What prevents the phosphatase 
activity of PP1 from always inhibit- 
ing glycogen degradation? When 
| glycogen degradation is called for, 

epinephrine or glucagon initiate the 
cAMP cascade that activates pro- 
tein kinase A (Figure 21.21). Protein 
kinase A reduces the activity of PP1 
by two mechanisms. First, in mus- 
cle, G), is phosphorylated in the 
domain responsible for binding the 
catalytic subunit. The catalytic 
subunit is released from glycogen 
and from its substrates and dephos- 
phoryation is greatly reduced. 
Second, almost all tissues contain 
small proteins that, when phos- 
phorylated, bind to the catalytic 


Activated 


2 
nhibitor 


Inactive 


HHA DOQPOMOOONK 
! | | SAAN 0 OOO 
IRS 


IRS-® —, Protein kinases 


Glycogen Glycogen 


synthase synthase 
kinase kinase 


ff 


Glycogen Glycogen 
synthase synthase 


KD 
PPI 


subunit of PP1 and inhibit it. Thus, when glycogen degradation is 
switched on by cAMP, the accompanying phosphorylation of these 
inhibitors keeps phosphorylase in its active a form and glycogen synthase 
in its inactive b form. 


Insulin stimulates glycogen synthesis by inactivating glycogen 
synthase kinase 


After exercise, people often consume carbohydrate-rich foods to restock 
their glycogen stores. How is glycogen synthesis stimulated? When 
blood-glucose levels are high, insulin stimulates the synthesis of glycogen by 
inactivating glycogen synthase kinase, one of the enzymes that maintains 
glycogen synthase in its phosphorylated, inactive state (Figure 21.22). 
The first step in the action of insulin is its binding to its receptor, a tyro- 
sine kinase receptor in the plasma membrane (Section 14.2). The binding 
of insulin stimulates the tyrosine kinase activity of the receptor so that it 
phosphorylates insulin-receptor substrates (IRSs). These phosphorylated 
proteins trigger signal-transduction pathways that eventually lead to the 
activation of protein kinases that phosphorylate and inactivate glycogen 
synthase kinase. The inactive kinase can no longer maintain glycogen 
synthase in its phosphorylated, inactive state. Protein phosphatase 1 
dephosphorylates glycogen synthase, activating it, and restoring glycogen 
reserves. Recall that insulin increases the amount of glucose in the cell by 
increasing the number of glucose transporters in the membrane 
(Section 16.2). The net effect of insulin is thus the replenishment of gly- 
cogen stores. 


Glycogen metabolism in the liver regulates the blood-glucose level 


After a meal rich in carbohydrates, blood-glucose levels rise, and glycogen 
synthesis 1s stepped up in the liver. Although insulin is the primary signal 
for glycogen synthesis, another is the concentration of glucose in the blood, 
which normally ranges from about 4.4—6.7 mM. The liver senses the con- 
centration of glucose in the blood and takes up or releases glucose 


FIGURE 21.22 Insulin inactivates 


glycogen synthase kinase. Insulin triggers 
a cascade that leads to the phosphorylation 


and inactivation of glycogen synthase 


kinase and prevents the phosphorylation of 
glycogen synthase. Protein phosphatase 1 


(PP1) removes the phosphates from 


glycogen synthase, thereby activating the 
enzyme and allowing glycogen synthesis. 


IRS, insulin-receptor substrate. 
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FIGURE 21.23 Blood glucose regulates 
liver-glycogen metabolism. The infusion 
of glucose into the bloodstream leads to 
the inactivation of phosphorylase, followed 
by the activation of glycogen synthase, in 
the liver. [Data from W. Stalmans, H. De Wulf, 
L. Hue, and H.-G. Hers. Eur. J. Biochem. 
41:117-134, 1974.] 


FIGURE 21.24 Glucose regulation of 
liver-glycogen metabolism. Glucose 
binds to and inhibits glycogen 
phosphorylase a in the liver, facilitating the 
formation of the T state of phosphorylase a. 
The T state of phosphorylase a does not 
bind protein phosphate 1 (PP1), leading to 
the dissociation and activation of PP1 from 
glycogen phosphorylase a. The free PP1 
dephosphorylates glycogen phosphorylase 
a and glycogen synthase b, leading to the 
inactivation of glycogen breakdown and the 
activation of glycogen synthesis. 
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accordingly. The amount of liver phosphorylase a decreases rapidly when 
glucose is infused (Figure 21.23). After a lag period, the amount of glycogen 
synthase a increases, which results in glycogen synthesis. Phosphorylase a is 
the glucose sensor in liver cells, facilitating the switch from degradation to 
synthesis. How does phosphorylase perform its sensor function? 

Phosphorylase a and PP1 are localized to the glycogen particle by inter- 
actions with the G, subunit of PP1. The binding of glucose to phosphoryl- 
ase a shifts its allosteric equilibrium from the active R form to the inactive 
T form. This conformational change renders the phosphoryl group on serine 
14 a substrate for protein phosphatase 1. PP1 binds tightly to phosphorylase 
a only when the phosphorylase 1s in the R state but is inactive when bound. 
When glucose induces the transition to the T form, PP1 and the phosphor- 
ylase dissociate from each other and the glycogen particle, and PP1 becomes 
active, converting the phosphorylase a form to the b form. Recall that the 
R< T transition of muscle phosphorylase a is unaffected by glucose and is 
thus unaffected by the rise in blood-glucose levels (Section 21.2). 

How does glucose binding to glycogen phosphorylase stimulate glyco- 
gen synthesis? As mentioned above, the conversion of a into b is accompa- 
nied by the release of PP1, which is then free to activate glycogen synthase 
(Figure 21.24). The removal of the phosphoryl group of inactive glycogen 
synthase b converts it into the active a form. What accounts for the lag 
between termination of glycogen degradation and the beginnings of glyco- 
gen synthesis (Figure 21.23)? There are about 10 phosphorylase a molecules 
per molecule of phosphatase. Consequently, the activity of glycogen synthase 
begins to increase only after most of phosphorylase a is converted into b. The 
lag between the decrease in glycogen degradation and the increase in glyco- 
gen synthesis prevents the two pathways from operating simultaneously. 
This remarkable glucose-sensing system depends on three key elements: 
(1) communication between the allosteric site for glucose and the serine 
phosphate, (2) the use of PP1 to inactivate phosphorylase and activate gly- 
cogen synthase, and (3) the binding of the phosphatase to phosphorylase a 
to prevent the premature activation of glycogen synthase. 


e Efforts are underway to develop drugs that disrupt the interaction of 

liver phosphorylase with the G, subunit as a treatment for type 2 
diabetes (Section 27.2). One experimental drug binds at a unique site on the 
enzyme that, synergistically with glucose, stabilizes the T state of 
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phosphorylase and thereby enhances the transition to glycogen synthesis, as 
described earlier. Type 2 diabetes is characterized by excess blood glucose. 
Hence, disrupting the association of phosphorylase with the G,; would ren- 
der it a substrate for PP1. Glycogen degradation would cease, and glucose 
release into the blood would be inhibited. 


A biochemical understanding of glycogen-storage 
diseases is possible 


e Edgar von Gierke described the first glycogen-storage disease in 1929. 

A patient with this disease has a huge abdomen caused by a massive 
enlargement of the liver. There is a pronounced hypoglycemia between 
meals. Furthermore, the blood-glucose level does not rise on administration 
of epinephrine and glucagon. An infant with this glycogen-storage disease 
may have convulsions because of the low blood-glucose level. 

The enzymatic defect in von Gierke disease was elucidated in 1952 by 
Carl and Gerty Cori. They found that glucose 6-phosphatase is missing from 
the liver of a patient with this disease. This finding was the first demonstra- 
tion of an inherited deficiency of a liver enzyme. The glycogen in the liver is 
normal in structure but is present in abnormally large amounts. The absence 
of glucose 6-phosphatase in the liver causes hypoglycemia because glucose 
cannot be formed from glucose 6-phosphate. This phosphorylated sugar 
does not leave the liver, because it cannot cross the plasma membrane. The 
presence of excess glucose 6-phosphate triggers an increase in glycolysis in 
the liver, leading to a high level of lactate and pyruvate in the blood. Patients 
who have von Gierke disease also have an increased dependence on fat 
metabolism. This disease can also be produced by a mutation in the gene that 
encodes the glucose 6-phosphate transporter. Recall that glucose 6-phosphate 
must be transported into the lumen of the endoplasmic reticulum to be 
hydrolyzed by phosphatase (Section 16.3). Mutations in the other three 
essential proteins of this system can likewise lead to von Gierke disease. 

Seven other glycogen-storage diseases have been characterized 
(Table 21.2). In Pompe disease (type II), lysosomes become engorged with 


TABLE 21.2 Glycogen-storage diseases 


637 


21.5 Regulation of Glycogen Metabolism 


Glycogen 
Type Defective enzyme Organ affected in the affected organ Clinical features 
I Glucose 6-phosphatase Liver and kidney Increased amount; Massive enlargement of the liver; 
Von Gierke or transport system normal structure. failure to thrive; severe 
hypoglycemia, ketosis, 
hyperuricemia, hyperlipemia. 
Il a-1,4-Glucosidase All organs Massive increase in Cardiorespiratory failure 
Pompe (lysosomal) amount; normal structure. causes death, usually before age 2. 
Il a-1,6-Glucosidase Muscle and liver Increased amount; Like type I, but milder course. 
Cori (debranching enzyme) short outer branches. 
IV Branching enzyme Liver and spleen Normal amount; very long Progressive cirrhosis of the liver; 
Andersen (a-1,4 — a-1,6) outer branches. liver failure causes death, 
usually before age 2. 
V Phosphorylase Muscle Moderately increased Limited ability to perform strenuous 
McArdle amount; normal structure. exercise because of painful 
muscle cramps; otherwise patient 
is normal and well developed. 
VI Phosphorylase Liver Increased amount. Like type I, but milder 
Hers course. 
VII Phosphofructokinase Muscle Increased amount; Like type V. 
normal structure. 
VII Phosphorylase kinase Liver Increased amount; Mild liver enlargement; 


normal structure. 


Note: Types I through VII are inherited as autosomal recessives. Type VII] is sex linked. 


mild hypoglycemia. 


FIGURE 21.25 Electron micrograph of 
muscle cell from a patient with Pompe 
disease. Glycogen-engorged lysosomes 
are seen throughout the cell, including the 
myofibrils. As the disease progresses, 
lysosomes may rupture, releasing large 
amounts of glycogen into the cytoplasm. 
Such accumulations of cytoplasmic 
glycogen are called glycogen lakes. 
[Reproduced with permission of the author, 
from Thurberg, B. L. et. al. Characterization of 
pre- and post-treatment pathology after 
enyme replacement for Pompe disease. Lab. 
Invest. 86 (12):1208-1220, 2006.] 
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glycogen because they lack a-1,4-glucosidase, a hydrolytic enzyme con- 
fined to these organelles (Figure 21.25). Carl and Gerty Cori also eluci- 
dated the biochemical defect in another glycogen-storage disease (type 
IIT), which cannot be distinguished from von Gierke disease (type I) by 
physical examination alone. In type III disease, the structure of liver and 
muscle glycogen is abnormal and the amount is markedly increased. Most 
striking, the outer branches of the glycogen are very short. Patients having 
this type lack the debranching enzyme (a-1,6-glucosidase), and so only the 
outermost branches of glycogen can be effectively utilized. Thus, only a 
small fraction of this abnormal glycogen is functionally active as an acces- 
sible store of glucose. 

A defect in glycogen metabolism confined to muscle is found in 
McArdle disease (type V). Muscle phosphorylase activity is absent, and a 
patient’s capacity to perform strenuous exercise is limited because of painful 
muscle cramps. The patient is otherwise normal and well developed. Thus, 
effective utilization of muscle glycogen is not essential for life. 


SUMMARY 


Glycogen, a readily mobilized fuel store, is a branched polymer of 
glucose residues. Most of the glucose units in glycogen are linked by 
a-1,4-glycosidic bonds. At about every twelfth residue, a branch is 
created by an a-1,6-glycosidic bond. Glycogen is present in large 
amounts in muscle cells and in liver cells, where it is stored in the cyto- 
plasm in the form of hydrated granules. 


21.1 Glycogen Breakdown Requires the Interplay of Several Enzymes 

Most of the glycogen molecule is degraded to glucose 1-phosphate 
by the action of glycogen phosphorylase, the key enzyme in glycogen 
breakdown. The glycosidic linkage between C-1 of a terminal resi- 
due and C-4 of the adjacent one is split by orthophosphate to give 
glucose 1-phosphate, which can be reversibly converted into glucose 
6-phosphate. Branch points are degraded by the concerted action of 
an oligosaccharide transferase and an a-1,6-glucosidase. 


21.2 Phosphorylase Is Regulated by Allosteric Interactions and 

Reversible Phosphorylation 

Phosphorylase b, which is usually inactive, is converted into active 
phosphorylase a by the phosphorylation of a single serine residue in 
each subunit. This reaction is catalyzed by phosphorylase kinase. 
The a form in the liver is inhibited by glucose. In liver, phosphoryl- 
ase is activated to liberate glucose for export to other organs, such 
as skeletal muscle and the brain. The b form in muscle can be acti- 
vated by the binding of AMP, an effect counteracted by ATP and 
glucose 6-phosphate. In contrast to liver, muscle phosphorylase is 
activated to generate glucose for use inside the cell as a fuel for con- 
tractile activity. 


21.3 Epinephrine and Glucagon Signal the Need for Glycogen Breakdown 
Epinephrine and glucagon stimulate glycogen breakdown through 
specific 7ITM receptors. Muscle is the primary target of epinephrine, 
whereas the liver is responsive to glucagon. Both signal molecules ini- 
tiate a kinase cascade that leads to the activation of phosphorylase 
kinase, which in turn converts glycogen phosphorylase b to the phos- 
phorylated a form. 


21.4 


Glycogen Is Synthesized and Degraded by Different Pathways 

The pathway for glycogen synthesis differs from that for glycogen 
breakdown. UDP-glucose, the activated intermediate in glycogen 
synthesis, is formed from glucose 1-phosphate and UTP. Glycogen 
synthase catalyzes the transfer of glucose from UDP-glucose to the 
C-4 hydroxyl group of a terminal residue in the growing glycogen 
molecule. Synthesis is primed by glycogenin, an autoglycosylating 
protein that contains a covalently attached oligosaccharide unit on a 
specific tyrosine residue. A branching enzyme converts some of the 
a-1,4 linkages into a-1,6 linkages to increase the number of ends so 
that glycogen can be made and degraded more rapidly. 


Glycogen Breakdown and Synthesis Are Reciprocally Regulated 
Glycogen synthesis and degradation are coordinated by several ampli- 
fying reaction cascades. Epinephrine and glucagon stimulate glycogen 
breakdown and inhibit its synthesis by increasing the cytoplasmic 
level of cyclic AMP, which activates protein kinase A. Protein kinase 
A activates glycogen breakdown by attaching a phosphate to phos- 
phorylase kinase and inhibits glycogen synthesis by phosphorylating 
glycogen synthase. Glycogen synthase kinase also inhibits synthesis 
by phosphorylating the synthase. 

The glycogen-mobilizing actions of protein kinase A are reversed 
by protein phosphatase 1, which is regulated by several hormones. 
Epinephrine inhibits this phosphatase by blocking its attachment to 
glycogen molecules and by turning on an inhibitor. Insulin, in con- 
trast, triggers a cascade that phosphorylates and inactivates glycogen 
synthase kinase. Hence, glycogen synthesis is decreased by epineph- 
rine and increased by insulin. Glycogen synthase and phosphorylase 
are also regulated by noncovalent allosteric interactions. In fact, phos- 
phorylase is a key part of the glucose-sensing system of liver cells. 
Glycogen metabolism exemplifies the power and precision of revers- 
ible phosphorylation in regulating biological processes. 


KEY TERMS 
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Problems 


glycogen phosphorylase (p. 619) 
phosphorolysis (p. 619) 


pyridoxal phosphate (PLP) (p. 620) 


phosphorylase kinase (p. 626) 
calmodulin (p. 626) 


PROBLEMS 


epinephrine (adrenaline) (p. 627) 


glucagon (p. 627) 


protein kinase A (PKA) (p. 628) 


uridine diphosphate glucose 
(UDP-glucose) (p. 630) 


glycogen synthase (p. 630) 


glycogenin (p. 631) 


glycogen synthase kinase (p. 632) 
protein phosphatase 1 (PP1) (p. 633) 
insulin (p. 635) 


1. Step-by-step degradation. What are the three steps in 
glycogen degradation and what enzymes are required? 


2. Match ‘em. Match each term with its description. 


(a) Glycogen phosphorylase 


(b) Phosphorolysis 
(c) Transferase 
(d) a-1,6-Glucosidase 


1. 


2. 


3. 


4, 


Calcium-binding subunit 
of phosphorylase kinase 
Activates glycogen 
phosphorylase 

Removal of a glucose 
residue by the addition 
of phosphate 

Stimulates glycogen 
breakdown in muscle 


(e) Phosphoglucomutase 


(f) Phosphorylase kinase 


(g) Protein kinase A____ 
(h) Calmodulin 
(i) Epinephrine___ 

(j) Glucagon ____ 


5. 


6. 


7 


joe) 


Ne) 


10. 


Liberates a free glucose 
residue 

Shifts the location of 
several glucose residues 


. Stimulates glycogen 


breakdown in the liver 


. Catalyzes phosphorolytic 


cleavage 


. Prepares glucose 


1-phosphate for glycolysis 
Phosphorylates phos- 
phorylase kinase 
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3. Choice is good. Glycogen is not as reduced as fatty acids 
are and consequently not as energy rich. Why do animals 
store any energy as glycogen? Why not convert all excess 
fuel into fatty acids? 


4. If a little is good, a lot is better. a-Amylose is an 
unbranched glucose polymer. Why would this polymer not 
be as effective a storage form of glucose as glycogen? 


5. Telltale products. A sample of glycogen from a patient 
with liver disease is incubated with orthophosphate, phos- 
phorylase, the transferase, and the debranching enzyme 
(a-1,6-glucosidase). The ratio of glucose 1-phosphate to 
glucose formed in this mixture is 100. What is the most 
likely enzymatic deficiency in this patient? 


6. Dare to be different. Compare the allosteric regulation of 
phosphorylase in the liver and in muscle, and explain the 
significance of the difference. 


7. A thumb on the balance. The reaction catalyzed by phos- 
phorylase is readily reversible in vitro. At pH 6.8, the equi- 
librium ratio of orthophosphate to glucose 1-phosphate is 
3.6. The value of AG®’ for this reaction is small because a 
glycosidic bond is replaced by a phosphoryl ester bond that 
has a nearly equal transfer potential. However, phospho- 
rolysis proceeds far in the direction of glycogen breakdown 
in vivo. Suggest one means by which the reaction can be 
made irreversible in vivo. 


8. Excessive storage. Suggest an explanation for the fact that 
the amount of glycogen in type I glycogen-storage disease 
(von Gierke disease) is increased. 


9. Recouping an essential phosphoryl. The phosphoryl group 
on phosphoglucomutase is slowly lost by hydrolysis. 
Propose a mechanism that utilizes a known catalytic inter- 
mediate for restoring this essential phosphoryl group. How 
might this phosphoryl! donor be formed? 


10. Not all absences are equal. Hers disease results from 
an absence of liver glycogen phosphorylase and may 
result in serious illness. In McArdle disease, muscle 
glycogen phosphorylase is absent. Although exercise is 
difficult for patients suffering from McArdle disease, the 
disease is rarely life threatening. Account for the different 
manifestations of the absence of glycogen phosphorylase 
in the two tissues. What does the existence of these two 
different diseases indicate about the genetic nature of the 
phosphorylase? 


11. Hydrophobia. Why is water excluded from the active 
site of phosphorylase? Predict the effect of a mutation that 
allows water molecules to enter. 


12. Removing all traces. In human liver extracts, the 
catalytic activity of glycogenin was detectable only after 
treatment with a-amylase, an enzyme the hydrolyzes 


a-1,4-bonds in starch and glycogen. Why was a-amylase 
necessary to reveal the glycogenin activity? 


13. Two in one. A single polypeptide chain houses the 
transferase and debranching enzyme. Cite a potential 
advantage of this arrangement. 


14. How did they do that? A strain of mice has been devel- 
oped that lack the enzyme phosphorylase kinase. Yet, after 
strenuous exercise, the glycogen stores of a mouse of this 
strain are depleted. Explain how this depletion is possible. 


15. An appropriate inhibitor. What is the rationale for the 
inhibition of muscle glycogen phosphorylase by glucose 
6-phosphate when glucose 1-phosphate is the product of 
the phosphorylase reaction? 


16. Passing along the information. Outline the signal- 
transduction cascade for glycogen degradation in muscle. 


17. Slammin’ on the breaks. There must be a way to shut 
down glycogen breakdown quickly to prevent the wasteful 
depletion of glycogen after energy needs have been met. What 
mechanisms are employed to turn off glycogen breakdown? 


18. Diametrically opposed. Phosphorylation has opposite 
effects on glycogen synthesis and breakdown. What is the 
advantage of its having opposing effects? 


19. Feeling depleted. Glycogen depletion resulting from 
intense, extensive exercise can lead to exhaustion and the 
inability to continue exercising. Some people also experi- 
ence dizziness, an inability to concentrate, and a loss of 
muscle control. Account for these symptoms. 


20. Everyone had a job to do. What accounts for the fact 
that liver phosphorylase is a glucose sensor, whereas muscle 
phosphorylase is not? 


21. Yin and Yang. Match the terms on the left with the 
descriptions on the right. 


(a) UDP-glucose 
(b) UDP-glucose pyrophos- 


1. Glucose 1-phosphate is 
one of its substrates. 


phorylase 2. Potent activator of 
c) Glycogen synthase glycogen synthase b. 
d) Glycogenin. 3. Glucose sensor in 


e) Branching enzyme ____ the liver. 

f) Glucose 6-phosphate 4. Activated substrate for 

glycogen synthesis. 

(g) Glycogen synthase 5. Synthesizes a-1,4 

kinase linkages between 
glucose molecules. 

6. Leads to the inactivation 
of glycogen synthase 
kinase. 

7. Synthesizes a-1,6 
linkages between glucose 
molecules. 


FS pe RE Re 


8. Catalyzes the formation 
of glycogen synthase b. 
G) Insulin 9. Catalyzes the formation 
(j) Glycogen phosphorylase of glycogen synthase a. 
a 10. Provides the primer for 
glycogen synthesis. 


(h) Protein phosphatase 1 


22. Team effort. What enzymes are required for the synthe- 
sis of a glycogen particle starting from glucose 6-phosphate? 


23. Force it forward. The following reaction accounts for 
the synthesis of UDP-glucose. This reaction is readily 
reversible. How is it made irreversible in vivo? 


Glucose 1-phosphate + UTP == 
UDP-glucose + PP; 


24. If you insist. Why does activation of the phosphorylated 
b form of glycogen synthase by high concentrations of 
glucose 6-phosphate make good biochemical sense? 


25. An ATP saved is an ATP earned. The complete oxida- 
tion of glucose 6-phosphate derived from free glucose yields 
30 molecules ATP, whereas the complete oxidation of glu- 
cose 6-phosphate derived from glycogen yields 31 mole- 
cules of ATP. Account for this difference. 


26. Dual roles. Phosphoglucomutase is crucial for glycogen 
breakdown as well as for glycogen synthesis. Explain the 
role of this enzyme in each of the two processes. 


27. Working at cross-purposes. Write a balanced equation 
showing the effect of simultaneous activation of glycogen 
phosphorylase and glycogen synthase. Include the reac- 
tions catalyzed by phosphoglucomutase and UDP-glucose 
pyrophosphorylase. 


28. Achieving immortality. Glycogen synthase requires a 
primer. A primer was formerly thought to be provided 
when the existing glycogen granules are divided between 
the daughter cells produced by cell division. In other words, 
parts of the original glycogen molecule were simply passed 
from generation to generation. Would this strategy have 
been successful in passing glycogen stores from generation 
to generation? How are new glycogen molecules now known 
to be synthesized? 


29. Synthesis signal. How does insulin stimulate glycogen 
synthesis? 


Mechanism Problem 


30. Family resemblance. Enzymes of the a-amylase family 
(problem 12) catalyze a reaction by forming a covalent 
intermediate to a conserved aspartate residue. Propose 
mechanisms for the two enzymes catalyzing steps in glyco- 
gen debranching on the basis of their potential membership 
in the a-amylase family. 


641 
Problems 


Chapter Integration Problems 


31. Double activation. What pathway in addition to the 
cAMP-induced signal transduction pathway is used in the 
liver to maximize glycogen breakdown? 


32. Carbohydrate conversion. Write a balanced equation for 
the formation of glycogen from galactose. 


33. Working together. What enzymes are required for the 
liver to release glucose into the blood when an organism is 
asleep and fasting? 


34. A shattering experience. Crystals of phosphorylase a 
grown in the presence of glucose shatter when a substrate 
such as glucose 1-phosphate is added. Why? 


35. I know I’ve seen that face before. UDP-glucose is the 
activated form of glucose used in glycogen synthesis. 
However, we have previously met other similar activated 


forms of carbohydrate in our consideration of metabolism. 
Where else have we seen UDP-carbohydrate? 


36. Same symptoms, different cause. Suggest another muta- 
tion in glucose metabolism that causes symptoms similar to 
those of von Gierke disease. 


Data Interpretation Problems 


37. An authentic replica. Experiments were performed in 
which serine (5S) 14 of glycogen phosphorylase was replaced 
by glutamate (E). The V,_.,, of the mutant enzyme was then 
compared with the wild type phosphorylase in both the a 
and the b form. 


Vinax Hmoles of glucose 1-PO, released min™! mg™! 


Wild type phosphorylase b 25+ 0.4 
Wild type phosphorylase a 100 + 5 
S to E mutant 60 +3 


(a) Explain the results obtained with the mutant. 


(b) Predict the effect of substituting aspartic acid for 
the serine. 


38. Glycogen isolation 1. The liver is a major storage site for 
glycogen. Purified from two samples of human liver, glyco- 
gen was either treated or not treated with a-amylase and 
subsequently analyzed by SDS-PAGE and western blotting 
with the use of antibodies to glycogenin. The results are 
presented in the adjoining illustration on the next page. 


(a) Why are no proteins visible in the lanes without amy- 
lase treatment? 


(b) What is the effect of treating the samples with a-amylase? 
Explain the results. 


(c) List other proteins that you might expect to be associ- 
ated with glycogen. Why are other proteins not visible? 
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39. Glycogen isolation 2. The gene for glycogenin was 
transfected into a cell line that normally stores only small 
amounts of glycogen. The cells were then manipulated 
according to the following protocol, and glycogen was 
isolated and analyzed by SDS-PAGE and western blotting 
by using an antibody to glycogenin with and without 
a-amylase treatment. The results are presented in the 
adjoining illustration. 
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a-Amylase 


[Courtesy of Dr. Peter J. Roach, Indiana University School of Medicine.] 


The protocol: Cells cultured in growth medium and 
25 mM glucose (lane 1) were switched to medium contain- 
ing no glucose for 24 hours (lane 2). Glucose-starved cells 
were refed with medium containing 25 mM glucose for 
1 hour (lane 3) or 3 hours (lane 4). Samples (12 wg of pro- 
tein) were either treated or not treated with a-amylase, as 
indicated, before being loaded on the gel. 


(a) Why did the western analysis produce a “smear’’—that 
is, the high-molecular-weight staining in lane 1(—)? 

(b) What is the significance of the decrease in high- 
molecular-weight staining in lane 2(—)? 

(c) What is the significance of the difference between lanes 
2(—) and 3(—)? 

(d) Suggest a plausible reason why there is essentially no 
difference between lanes 3(—) and 4(—). 


(e) Why are the bands at 66 kDa the same in the lanes 
treated with amylase, despite the fact that the cells were 


treated differently? 


Fatty Acid Metabolism 


Fats provide efficient means for storing energy for later use. (Right) The processes 
of fatty acid synthesis (preparation for energy storage) and fatty acid degradation 
(preparation for energy use) are, in many ways, the reverse of each other. (Above) 
Studies of mice are revealing the interplay between these pathways and the 
biochemical bases of appetite and weight control. [Photograph Oak Ridge National 
Laboratory/U.S. Department of Energy/Science Photo Library.] 


e turn now from the metabolism of carbohydrates to that of fatty acids. 

A fatty acid contains a long hydrocarbon chain and a terminal carbox- 
ylate group. Fatty acids have four major physiological roles. First, fatty acids 
are fuel molecules. They are stored as triacylglycerols (also called neutral fats 
or triglycerides), which are uncharged esters of fatty acids with glycerol. 
Triacylglycerols are stored in adipose tissue, composed of cells called adipo- 
cytes (Figure 22.1). Fatty acids mobilized from triacylglycerols are oxidized to 
meet the energy needs of a cell or organism. During rest or moderate exercise, 
such as walking, fatty acids are our primary source of energy. Second, fatty 
acids are building blocks of phospholipids and glycolipids. These amphipathic 
molecules are important components of biological membranes, as discussed 
in Chapter 12. Third, many proteins are modified by the covalent attachment 
of fatty acids, which targets the proteins to membrane locations (Section 12.4). 
Fourth, fatty acid derivatives serve as hormones and intracellular messengers. In 
this chapter, we focus on the degradation and synthesis of fatty acids. 
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FIGURE 22.1 Electron micrograph of an 
adipocyte. A small band of cytoplasm 
surrounds the large deposit of triacylglycerols. 
[Biophoto Associates/Photo Researchers.] 
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Fatty acid degradation and synthesis mirror each other in 
their chemical reactions 


Fatty acid degradation and synthesis consist of four steps that are the 
reverse of each other in their basic chemistry. Degradation is an oxidative 
process that converts a fatty acid into a set of activated acetyl units (acetyl 
CoA) that can be processed by the citric acid cycle (Figure 22.2). An acti- 
vated fatty acid is oxidized to introduce a double bond; the double bond is 
hydrated to introduce a hydroxyl group; the alcohol is oxidized to a ketone; 
and, finally, the fatty acid is cleaved by coenzyme A to yield acetyl CoA and 
a fatty acid chain two carbons shorter. If the fatty acid has an even number 
of carbon atoms and is saturated, the process is simply repeated until the 
fatty acid is completely converted into acetyl CoA units. 

Fatty acid synthesis is essentially the reverse of this process. The process 
starts with the individual units to be assembled—1in this case, with an acti- 
vated acyl group (most simply, an acetyl unit) and a malonyl unit 
(Figure 22.2). The malonyl unit condenses with the acetyl unit to form a 
four-carbon fragment. To produce the required hydrocarbon chain, the 
carbonyl group is reduced to a methylene group in three steps: a reduction, 


FATTY ACID DEGRADATION FATTY ACID SYNTHESIS 
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FIGURE 22.2 Steps in fatty acid degradation and synthesis. The two processes are in 
many ways mirror images of each other. 


a dehydration, and another reduction, exactly the opposite of degradation. 
The product of the reduction is butyryl CoA. Another activated malonyl 
group condenses with the butyryl unit, and the process is repeated until a 
C,,, or shorter fatty acid is synthesized. 


22.1 Triacylglycerols Are Highly Concentrated 
Energy Stores 


Triacylglycerols are highly concentrated stores of metabolic energy because 
they are reduced and anhydrous. The yield from the complete oxidation of 
fatty acids is about 38 kJ g~! (9 kcal g~!), in contrast with about 17 kJ g~! 
(4 kcal g~!) for carbohydrates and proteins. The basis of this large differ- 
ence in caloric yield is that fatty acids are much more reduced than carbohy- 
drates or proteins. Furthermore, triacylglycerols are nonpolar, and so they 
are stored in a nearly anhydrous form, whereas much more polar carbohy- 
drates are more highly hydrated. In fact, 1 g of dry glycogen binds about 2 g 
of water. Consequently, a gram of nearly anhydrous fat stores 6.75 times as 
much energy as a gram of hydrated glycogen, which is likely the reason that 
triacylglycerols rather than glycogen were selected in evolution as the major 
energy reservoir. Consider a typical 70-kg man, who has fuel reserves of 
420,000 kJ (100,000 kcal) in triacylglycerols, 100,000 kJ (24,000 kcal) in 
protein (mostly in muscle), 2500 kJ (600 kcal) in glycogen, and 170 kJ 
(40 kcal) in glucose. Triacylglycerols constitute about 11 kg of his total body 
weight. If this amount of energy were stored in glycogen, his total body 
weight would be 64 kg greater. The glycogen and glucose stores provide 
enough energy to sustain physiological function for about 24 hours, whereas 
the triacylglycerol stores allow survival for several weeks. 

In mammals, the major site of triacylglycerol accumulation is the cyto- 
plasm of adipose cells (fat cells). This fuel-rich tissue 1s located throughout 
the body, notably under the skin (subcutaneous fat) and surrounding the 
internal organs (visceral fat). Droplets of triacylglycerol coalesce to form a 
large globule, called a lipid droplet, which may occupy most of the cell vol- 
ume (Figure 22.1). The lipid droplet is surrounded by a monolayer of 
phospholipids and numerous proteins required for triacylglycerol metabo- 
lism. Originally believed to be inert-lipid deposits, lipid droplets are now 
understood to be dynamic organelles essential for the regulation of lipid 
metabolism. Adipose cells are specialized for the synthesis and storage of 
triacylglycerols and for their mobilization into fuel molecules that are trans- 
ported to other tissues by the blood. Muscle also stores triacylglycerols for 
its own energy needs. Indeed, triacylglycerols are evident as the “marbling” 
of expensive cuts of beef. 

The utility of triacylglycerols as an energy source is dramatically illus- 
trated by the abilities of migratory birds, which can fly great distances with- 
out eating after having stored energy as triacylglycerols. Examples are the 
American golden plover and the ruby-throated hummingbird. The golden 
plover flies from Alaska to the southern tip of South America; a large segment 
of the flight (3800 km, or 2400 miles) is over open ocean, where the birds can- 
not feed. The ruby-throated hummingbird flies nonstop across the Gulf of 
Mexico. Fatty acids provide the energy source for both these prodigious feats. 


Dietary lipids are digested by pancreatic lipases 


Most lipids are ingested in the form of triacylglycerols and must be degraded 
to fatty acids for absorption across the intestinal epithelium. Intestinal 
enzymes called lipases, secreted by the pancreas, degrade triacylglycerols to 
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Triacylglycerols fuel the long migration 
flights of the American golden plover 
(Pluvialis dominica). [Source: Jim Zipp/ 
Science Source.] 
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FIGURE 22.4 Glycocholate. Bile acids, 
such as glycocholate, facilitate lipid 
digestion in the intestine. 


FIGURE 22.5 Chylomicron formation. 
Free fatty acids and monoacylglycerols are 
absorbed by intestinal epithelial cells. 
Triacylglycerols are resynthesized and 
packaged with other lipids and apolipoprotein 
B-48 to form chylomicrons, which are then 
released into the lymph system. 
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FIGURE 22.3 Action of pancreatic lipases. Lipases secreted by the pancreas convert 
triacylglycerols into fatty acids and monoacylglycerol for absorption into the intestine. 


free fatty acids and monoacylglycerol (Figure 22.3). Lipids 

present a special problem because, unlike carbohydrates and 

proteins, these molecules are not soluble in water. They exit 

N~ “coo- _ the stomach as an emulsion, particles with a triacylglycerol 

core surrounded by cholesterol and cholesterol esters. How are 

the lipids made accessible to the lipases, which are in aqueous 

solution? First, the particles are coated with bile acids (Figure 22.4), 

amphipathic molecules synthesized from cholesterol in the liver and secreted 

from the gallbladder. The ester bonds of each lipid are oriented toward the 

surface of the bile salt-coated particle, rendering the bond more accessible to 

digestion by lipases in aqueous solution. However, in this form, the particles 

are still not substrates for digestion. The protein colipase (another pancreatic 

secretory product) must bind the lipase to the particle to permit lipid degrada- 

tion. The final digestion products are carried in micelles to the intestinal epi- 

thelium where they are transported across the plasma membrane (Figure 22.5). 

If the production of bile salts is inadequate due to liver disease, large amounts 

of fats (as much as 30 g day~!) are excreted in the feces. This condition is 
referred to as steatorrhea, after stearic acid, a common fatty acid. 


Dietary lipids are transported in chylomicrons 


In the intestinal mucosal cells, the triacylglycerols are resynthesized from 
fatty acids and monoacylglycerols and then packaged into lipoprotein trans- 
port particles called chylomicrons, stable particles approximately 2000 A 
(200 nm) in diameter (Figure 22.5). These particles are composed mainly of 
triacylglycerols, with apoliprotein B-48 (apo B-48) as the main protein com- 
ponent. Protein constituents of lipoprotein particles are called apolipopro- 
teins. Chylomicrons also transport fat-soluble vitamins and cholesterol. 
The chylomicrons are released into the lymph system and then into the 
blood. These particles bind to membrane-bound lipases, primarily at adipose 
tissue and muscle, where the triacylglycerols are once again degraded into 
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free fatty acids and monoacylglycerol for transport into the tissue. The triac- 
ylglycerols are then resynthesized inside the cell and stored. In the muscle, 
they can be oxidized to provide energy. 


22.2 The Use of Fatty Acids as Fuel Requires Three 
Stages of Processing 


Tissues throughout the body gain access to the lipid energy reserves stored 
in adipose tissue through three stages of processing. First, the lipids must be 
mobilized. In this process, triacylglycerols are degraded to fatty acids and 
glycerol, which are released from the adipose tissue and transported to the 
energy-requiring tissues. Second, at these tissues, the fatty acids must be 
activated and transported into mitochondria for degradation. Third, the 
fatty acids are broken down in a step-by-step fashion into acetyl CoA, 
which is then processed in the citric acid cycle. 


Triacylglycerols are hydrolyzed by hormone-stimulated lipases 


Consider someone who has just awakened from a night’s sleep and begins a 
bout of exercise. Glycogen stores will be low, but lipids are readily available. 
How are these lipid stores mobilized? 

Before fats can be used as fuels, the triacylglycerol storage form must be 
hydrolyzed to yield isolated fatty acids. This reaction is catalyzed by a hormon- 
ally controlled lipase. Under the physiological conditions facing an early- 
morning runner, glucagon and epinephrine will be present. In adipose tissue, 
these hormones trigger 7 TM receptors that activate adenylate cyclase 
(Section 14.1). The increased level of cyclic AMP then stimulates protein 
kinase A, which phosphorylates two key proteins: pevilipin, a fat-droplet- 
associated protein, and hormone-sensitive lipase (Figure 22.6). The phospho- 
rylation of perilipin has two crucial effects. First, it restructures the fat droplet 
so that the triacylglycerols are more accessible to the mobilization. Second, the 
phosphorylation of perilipin triggers the release of a coactivator for adipose tri- 
glyceride lipase (ATGL). Once bound to the coactivator, ATGL initiates the 
mobilization of triacylglycerols by releasing a fatty acid from triacylglycerol, 
forming diacylglycerol. Diacylglycerol is converted into a free fatty acid and 
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FIGURE 22.6 Mobilization of 
triacylglycerols. Triacylglycerols in 
adipose tissue are converted into free 
fatty acids in response to hormonal signals. 
1. The hormones activate protein kinase 
A through the cAMP cascade. 2. Protein 
kinase A phosphorylates perilipin, 

resulting in the restructuring of the lipid 
droplet and release of the coactivator of 
ATGL, thereby activating ATGL. 3. ATGL 
converts triacylglycerol into diacylglycerol. 
4. Hormone-sensitive lipase releases a 
fatty acid from diacylglycerol, generating 
monoacylglycerol. 5. Monoacylglycerol 
lipase completes the mobilization process. 
Abbreviations: 7TM, seven transmembrane 
receptor; ATGL, adipose triglyceride 
lipase; CA, coactivator; HS lipase, 
hormone-sensitive lipase; MAG lipase, 
monoacylglycerol lipase; DAG, 
diacylglycerol; TAG, triacylglycerol. 
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ATP 


monoacylglycerol by the hormone-sensitive lipase. Finally, a monoacylglycerol 
lipase completes the mobilization of fatty acids with the production of a free 
fatty acid and glycerol. Thus, epinephrine and glucagon induce lipolysis. Although 
their role in muscle is not as firmly established, these hormones probably also 
regulate the use of triacylglycerol stores in that tissue. 


e If the coactivator required by ATGL is missing or defective, a rare 

condition (incidence unknown) called Chanarin-Dorfman syndrome 
results. Fats cae throughout the body because they cannot be 
released by ATGL. Other symptoms include dry skin (ichthyosis), enlarged 
liver, muscle weakness, and mild cognitive disability. 


Free fatty acids and glycerol are released into the blood 


Fatty acids are not soluble in aqueous solutions. In order to reach tissues 
that require fatty acids, the released fatty acids bind to the blood protein 
albumin, which delivers them to tissues in need of fuel. 

Glycerol formed by lipolysis is absorbed by the liver and phosphory]l- 
ated. It is then oxidized to dihydroxyacetone phosphate, which is isomer- 
ized to glyceraldehyde 3-phosphate. This molecule is an intermediate in 
both the glycolytic and the gluconeogenic pathways. 


ADP NAD* NADH + Ht 


0 
rs 
oo a. /prn0H hia 
> HO—C-H > O=—=C H—C—OH 


CH20OH 
Glycerol 


ia \ 
HO—C—H 


Glycerol Triose 
CH 0PO;2- phosphate CH,OPO037- phosphate CH,OPO;2- 
dehydrogenase isomerase 
L-Glycerol Dihydroxyacetone p-Glyceraldehyde 
3-phosphate phosphate 3-phosphate 


Hence, glycerol can be converted into pyruvate or glucose in the liver, 
which contains the appropriate enzymes (Figure 22.7). The reverse process 
can take place by the reduction of dihydroxyacetone phosphate to glycerol 
3-phosphate. Hydrolysis by a phosphatase then gives glycerol. Thus, glyc- 
erol and glycolytic intermediates are readily interconvertible. 
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FIGURE 22.7 Lipolysis generates fatty g 
acids and glycerol. The fatty acids are 
used as fuel by many tissues. The liver 
processes glycerol by either the glycolytic 
or the gluconeogenic pathway, depending 
on its metabolic circumstances. CO, + HO 
Abbreviation: CAC, citric acid cycle. X e 


Fatty acids are linked to coenzyme A before they are oxidized 


Fatty acids separate from the albumin in the blood stream and diffuse across 
the cell membrane with the assistance of transport proteins. In the cell, fatty 
acids are shuttled about in association with fatty-acid-binding-proteins. 


Fatty acid oxidation occurs in the mitochondrial matrix, but in order to 
enter the mitochondria, the fatty acids are first activated through the forma- 
tion of a thioester linkage to coenzyme A. Adenosine triphosphate (ATP) 
drives the formation of the thioester linkage between the carboxyl group of a 
fatty acid and the sulfhydryl group of coenzyme A. This activation reaction 
takes place on the outer mitochondrial membrane, where it is catalyzed by 
acyl CoA synthetase (also called fatty acid thiokinase). 


ATP = AMP. + _- PP; 
0 


ae 1 
ee + HS—CoA pe CoA 
0 R q 


Acyl CoA synthetase accomplishes the activation of a fatty acid in two 
steps. First, the fatty acid reacts with ATP to form an acyl adenylate. In this 
mixed anhydride, the carboxyl group of a fatty acid is bonded to the phos- 
phoryl group of AMP. The other two phosphoryl] groups of the ATP sub- 
strate are released as pyrophosphate. In the second step, the sulfhydryl 
group of coenzyme A attacks the acyl adenylate, which is tightly bound to 
the enzyme, to form acyl CoA and AMP. 


R 


O O 
Je + ATP Jk + PP; (1) 
R fo) R AMP 
Fatty acid Acyl adenylate 
O O 
Jk + HS—CoA Jk _7OOA + AMP (2) 
R AMP R S 
Acyl CoA 


These partial reactions are freely reversible. In fact, the equilibrium con- 
stant for the sum of these reactions 1s close to 1. One high-transfer-potential 
compound is cleaved (between PP. and AMP) and one high-transfer- 
potential compound is formed (the thioester acyl CoA). How is the overall 
reaction driven forward? The answer is that pyrophosphate is rapidly 
hydrolyzed by a pyrophosphatase. The complete reaction is 


RCOO™ + Cod + ATP + H,0-—> 
RCO-CoA + AMP + 2P.+2H* 


This reaction is quite favorable because the equivalent of two molecules of 
ATP is hydrolyzed, whereas only one high-transfer-potential compound is 
formed. We see here another example of a recurring theme in biochemistry: 
Many biosynthetic reactions are made irreversible by the hydrolysis of inorganic 
pyrophosphate. 


Ne Another motif recurs in this activation reaction. The enzyme-bound 
acyl adenylate intermediate is not unique to the synthesis of acyl 
CoA. Acyl adenylates are frequently formed when carboxyl groups are acti- 
vated in biochemical reactions. Amino acids are activated for protein synthe- 
sis by a similar mechanism (Section 30.2), although the enzymes that 
catalyze this process are not homologous to acyl CoA synthetase. Thus, 
activation by adenylation recurs in part because of convergent evolution. 


Carnitine carries long-chain activated fatty acids into the 
mitochondrial matrix 


Fatty acids are activated on the outer mitochondrial membrane, whereas 
they are oxidized in the mitochondrial matrix. A special transport 
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FIGURE 22.8 Acyl carnitine translocase. 
The entry of acyl carnitine into the 
mitochondrial matrix is mediated by a 
translocase. Carnitine returns to the 
cytoplasmic side of the inner mitochondrial 
membrane in exchange for acyl carnitine. 


O 
R-= § QO 
2 Hp 5 

B oa 


mechanism is needed to carry activated long-chain fatty acids across the 
inner mitochondrial membrane. These fatty acids must be conjugated to 
carnitine, an alcohol with both a positive and a negative charge (a zwitter- 
ion). The acyl group is transferred from the sulfur atom of coenzyme A to 
the hydroxyl group of carnitine to form acyl carnitine. This reaction is cata- 
lyzed by carnitine acyltransferase I, also called carnitine palmitoyl transferase I, 
which is bound to the outer mitochondrial membrane. 


O 
pe CoA + 
R s~ Hz 


H3C 
Acyl CoA Carnitine Acyl carnitine 


Acyl carnitine is then shuttled across the inner mitochondrial membrane by a 
translocase (Figure 22.8). The acyl group is transferred back to coenzyme A on 
the matrix side of the membrane. This reaction, which is catalyzed by carnitine 
acyltransferase II (carnitine palmitoyl transferase II), is simply the reverse of 
the reaction that takes place in the cytoplasm. The reaction is thermodynami- 
cally feasible because of the zwitterionic nature of carnitine. The O-acy] link in 
carnitine has a high group-transfer potential, apparently because, being zwit- 
terions, carnitine and its esters are solvated differently from most other alco- 
holsand their esters. Finally, the translocase returns carnitine to the cytoplasmic 
side in exchange for an incoming acy] carnitine. 


e A number of diseases have been traced to a deficiency of carnitine, 
the transferase, or the translocase. Inability to synthesize carnitine 
may be a contributing factor to the development of autism in males. The 
symptoms of carnitine deficiency range from mild muscle cramping to 
severe weakness and even death. In general, muscle, kidney, and heart are 
the tissues primarily impaired. Muscle weakness during prolonged exercise 
is a symptom of a deficiency of carnitine acyltransferases because muscle 
relies on fatty acids as a long-term source of energy. Medium-chain (C,.— 
C,.) fatty acids are oxidized normally in these patients because these fatty 
acids can enter the mitochondria, to some degree, in the absence of carni- 
tine. These diseases illustrate that the impaired flow of a metabolite from one 
compartment of a cell to another can lead to a pathological condition. 


Acetyl CoA, NADH, and FADH, are generated in each round 
of fatty acid oxidation 


A saturated acyl CoA is degraded by a recurring sequence of four reactions: 
oxidation by flavin adenine dinucleotide (FAD), hydration, oxidation by 
NAD*, and thiolysis by coenzyme A (Figure 22.9). The fatty acid chain is 
shortened by two carbon atoms as a result of these reactions and FADH,, 
NADH, and acetyl CoA are generated. Because oxidation takes place at the 
8 carbon atom, this series of reactions is called the B-oxidation pathway. 

The first reaction in each round of degradation is the oxidation of acyl 
CoA by an acyl CoA dehydrogenase to give an enoyl CoA with a trans double 
bond between C-2 and C-3. 


Acyl CoA + E-FAD —- trans-A?-enoyl CoA + E-FADH, 


As in the dehydrogenation of succinate in the citric acid cycle, FAD rather 
than NAD* is the electron acceptor because the AG for this reaction is 


insufficient to drive the reduction of NAD*. Electrons from the FADH, 
prosthetic group of the reduced acyl CoA dehydrogenase are transferred to 
a second flavoprotein called electron-transferring flavoprotein (ETF). In 
turn, ETF donates electrons to ETF: ubiquinone reductase, an iron-sulfur 
protein. Ubiquinone is thereby reduced to ubiquinol, which delivers its 
high-potential electrons to the second proton-pumping site of the respira- 
tory chain (Section 18.3). Consequently, 1.5 molecules of ATP are gener- 
ated per molecule of FADH, formed in this dehydrogenation step, as in the 
oxidation of succinate to fumarate. 


R—CH,—CH>—R' Y E-FAD pial all ais Gmail (QH>) 
R—CH=CH—R' nee ¢ ETF-FAD Fe-S (reduced) “\ Ubiquinone (Q) 


The next step is the hydration of the double bond between C-2 and C-3 
by enoyl CoA hydratase. 


trans-A?-Enoyl CoA + H,O —-> L-3-hydroxyacyl CoA 


The hydration of enoyl CoA is stereospecific. Only the L isomer of 
3-hydroxyacyl CoA is formed when the trans-A? double bond is hydrated. 
The enzyme also hydrates a cis-A* double bond, but the product then is the 
D isomer. We shall return to this point shortly in considering how unsatu- 
rated fatty acids are oxidized. 

The hydration of enoyl CoA isa prelude to the second oxidation reaction, 
which converts the hydroxyl group at C-3 into a keto group and generates 
NADH. This oxidation is catalyzed by L-3-hydroxyacyl CoA dehydrogenase, 
which is specific for the L isomer of the hydroxyacyl substrate. 


L-3-Hydroxyacyl CoA + NAD* == 3-ketoacyl CoA + NADH + H* 


The preceding reactions have oxidized the methylene group at C-3 toa 
keto group. The final step is the cleavage of 3-ketoacyl CoA by the thiol 
group of a second molecule of coenzyme A, which yields acetyl CoA and an 
acyl CoA shortened by two carbon atoms. This thiolytic cleavage is cata- 
lyzed by B-ketothiolase. 


3-Ketoacyl CoA + HS-CoA = — acetyl CoA + acyl CoA 
(n carbons) (n — 2 carbons) 


Table 22.1 summarizes the reactions in fatty acid oxidation. 
The shortened acyl CoA then undergoes another cycle of oxidation, 
starting with the reaction catalyzed by acyl CoA dehydrogenase 


TABLE 22.1 Principal reactions in fatty acid oxidation 


0) 
— 
Ned ge es 


CoA 


trans-A?-Enoyl CoA 


H,0 
= Hydration 


HH 
L-3-Hydroxyacyl CoA 
NADt 
Oxidation 
H* + NADH 


i | 

R Cc C 
aa ee 
2 


CoA 


s\ 
HH 
3-Ketoacyl CoA 


HS——CoA ecieas 
Thiolysis 


O 
Lc 
FO a C CoA 
C s 0; 
Hi, re? i oi 
Acyl CoA Acetyl CoA 
(shortened by 
two carbon atoms) 


Be. 


FIGURE 22.9 Reaction sequence for the 
degradation of fatty acids. Fatty acids are 
degraded by the repetition of a four- 
reaction sequence consisting of oxidation, 
hydration, oxidation, and thiolysis. 


Step Reaction Enzyme 

1 Fatty acid + CoA + ATP == acyl CoA + AMP + PP; Acyl CoA synthetase (also called fatty 
acid thiokinase and fatty acid:CoA ligase)* 

2 Carnitine + acyl CoA = = acyl carnitine + CoA Carnitine acyltransferase (also called 
carnitine palmitoyl transferase) 

3 Acyl CoA + E-FAD —> trans-A?-enoyl CoA + E-FADH, Acyl CoA dehydrogenases (several isozymes having 
different chain-length specificity) 

4 trans-A?-Enoyl CoA + H,O == L-3-hydroxyacyl CoA Enoyl CoA hydratase (also called crotonase or 
3-hydroxyacyl CoA hydrolyase) 

5 L-3-Hydroxyacyl CoA + NAD* == 3-ketoacyl CoA + NADH + H* L-3-Hydroxyacyl CoA dehydrogenase 

6 3-Ketoacyl CoA + CoA == acetyl CoA + acyl CoA (shortened by C,) 8-Ketothiolase (also called thiolase) 


*An AMP-forming ligase. 
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. . ’ e) (Figure 22.10). Fatty acid chains containing from 12 to 18 carbon 
2 2 2 atoms are oxidized by the long-chain acyl CoA dehydrogenase. The 
a aan < ae a Sage at medium-chain acyl CoA dehydrogenase oxidizes fatty acid chains 
? . ' . having from 14 to 4 carbons, whereas the short-chain acyl CoA dehy- 
i drogenase acts only on 4- and 6-carbon fatty acid chains. In contrast, 
EA __COA 8-ketothiolase, hydroxyacyl dehydrogenase, and enoyl CoA hydra- 
gle 3 tase act on fatty acid molecules of almost any length. 
H H | The complete oxidation of palmitate yields 106 molecules of ATP 
ee Had Oagt gen Ly a We can now calculate the energy yield derived from the oxidation of 
a) rd nla a fatty acid. In each reaction cycle, an acyl CoA is shortened by two 
i carbon atoms, and one molecule each of FADH,, NADH, and acetyl 
AoA CoA are formed. 
H3C S + 
C,-acyl CoA + FAD + NAD* + H,O + CoA —> 
C,,_,-acyl CoA + FADH, + NADH + acetyl CoA + Ht 
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Hp H2 


FIGURE 22.10 First three rounds in the 
degradation of palmitate. Two-carbon 
units are sequentially removed from the 
carboxyl end of the fatty acid. 
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FIGURE 22.11 The degradation of a 
monounsaturated fatty acid. Cis-A°-Enoy! 
CoA isomerase allows continued B-oxidation 
of fatty acids with a single double bond. 
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The degradation of palmitoyl CoA (C,,-acyl CoA) requires seven 
reaction cycles. In the seventh cycle, the C,-ketoacyl CoA is thio- 
lyzed to two molecules of acetyl CoA. Hence, the stoichiometry of the 
oxidation of palmitoyl CoA is 


Palmitoyl CoA + 7 FAD + 7 NAD* + 7 CoA + 7H,O —> 
8 acetyl CoA + 7 FADH, + 7 NADH + 7 Ht 


Approximately 2.5 molecules of ATP are generated when the respi- 
ratory chain oxidizes each of these NADH molecules, whereas 
1.5 molecules of ATP are formed for each FADH, because their 
electrons enter the chain at the level of ubiquinol. Recall that the 
oxidation of acetyl CoA by the citric acid cycle yields 10 molecules of ATP. 
Hence, the number of ATP molecules formed in the oxidation of palmitoyl 
CoA is 10.5 from the seven FADH,, 17.5 from the seven NADH, and 80 
from the eight acetyl CoA molecules, which gives a total of 108. The equiv- 
alent of 2 molecules of ATP is consumed in the activation of palmitate, in 
which ATP is split into AMP and 2 molecules of orthophosphate. Thus, the 
complete oxidation of a molecule of palmitate yields 106 molecules of ATP. 


22.3 Unsaturated and Odd-Chain Fatty Acids Require 
Additional Steps for Degradation 


The B-oxidation pathway accomplishes the complete degradation of satu- 
rated fatty acids having an even number of carbon atoms. Most fatty acids 
have such structures because of their mode of synthesis (to be addressed 
later in this chapter). However, not all fatty acids are so simple. The oxida- 
tion of fatty acids containing double bonds requires additional steps, as does 
the oxidation of fatty acids containing an odd number of carbon atoms. 


An isomerase and a reductase are required for the oxidation of 
unsaturated fatty acids 


The oxidation of unsaturated fatty acids presents some difficulties, yet 
many such fatty acids are available in the diet. Most of the reactions are the 
same as those for saturated fatty acids. In fact, only two additional 
enzymes—an isomerase and a reductase—are needed to degrade a wide 
range of unsaturated fatty acids. 

Consider the oxidation of palmitoleate (Figure 22.11). This C,, unsat- 
urated fatty acid, which has one double bond between C-9 and C-10, 
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is activated and transported across the inner mitochondrial membrane 
in the same way as saturated fatty acids are. Palmitoleoyl] CoA then 
undergoes three cycles of degradation, which are carried out by the same 
enzymes as those in the oxidation of saturated fatty acids. However, the 
cis-A?-enoyl CoA formed in the third round is not a substrate for acyl 
CoA dehydrogenase. The presence of a double bond between C-3 and C-4 
prevents the formation of another double bond between C-2 and C-3. 
This impasse is resolved by a new reaction that shifts the position and 
configuration of the cis-A? double bond. cis-A?-Enoyl CoA isomerase con- 
verts this double bond into a trans-A? double bond. The double bond is now 
between C-2 and C-3. The subsequent reactions are those of the satu- 
rated fatty acid oxidation pathway, in which the trans-A?-enoyl CoA is a 
regular substrate. 

Human beings require polyunsaturated fatty acids, which have multi- 
ple double bonds, as important precursors for signal molecules, but excess 
polyunsaturated fatty acids are degraded by B oxidation. However, 
another problem arises with the oxidation of polyunsaturated fatty acids. 
Consider linoleate, a C,, polyunsaturated fatty acid with cis-A? and cis- 
A? double bonds (Figure 22.12). The cis-A% double bond (between car- 
bons 3 and 4) formed after three rounds of B-oxidation is converted into a 
trans-A? double bond (between carbons 2 and 3) by the aforementioned 
isomerase. The acyl CoA produced by another round of B-oxidation con- 
tains a cis-A* (between carbons 4 and 5) double bond. Dehydrogenation of 
this species by acyl CoA dehydrogenase yields a 2,4-dienoyl intermediate 
(double bond between carbons 2 and 3 and carbons 4 and 5), which is not 
a substrate for the next enzyme in the B-oxidation pathway. This impasse 
is circumvented by 2,4-dienoyl CoA reductase, an enzyme that uses 
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FIGURE 22.12 Oxidation of linoleoyl CoA. 
The complete oxidation of the diunsaturated 
fatty acid linoleate is facilitated by the 
activity of enoyl CoA isomerase and 
2,4-dienoyl CoA reductase. 
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FIGURE 22.13 Conversion of propionyl 
CoA into succinyl CoA. Propionyl CoA, 
generated from fatty acids with an odd 
number of carbons as well as some amino 
acids, is converted into the citric acid cycle 
intermediate succinyl CoA. 


NADPH to reduce the 2,4-dienoyl intermediate to trans-A?-enoyl CoA. 
cis-A-Enoyl CoA isomerase then converts trans-A3-enoyl CoA into the 
trans-A* form, a customary intermediate in the B-oxidation pathway. 
These catalytic strategies are elegant and economical. Only two extra 
enzymes are needed for the oxidation of any polyunsaturated fatty acid. 
Odd-numbered double bonds are handled by the isomerase, and even-numbered 
ones by the reductase and the isomerase. 


Odd-chain fatty acids yield propionyl CoA in the final thiolysis step 


Fatty acids having an odd number of carbon atoms are minor species. They 
are oxidized in the same way as fatty acids having an even number, except 
that propionyl CoA and acetyl CoA, rather than two molecules of acetyl 
CoA, are produced in the final round of degradation. The activated three- 
carbon unit in propionyl CoA enters the citric acid cycle after it has been 
converted into succinyl CoA. 

The pathway from propionyl CoA to succinyl CoA is especially inter- 
esting because it entails a rearrangement that requires vitamin B,, (also 
known as cobalamin). Propionyl CoA is carboxylated at the expense of the 
hydrolysis of a molecule of ATP to yield the D isomer of methylmalonyl 
CoA (Figure 22.13). This carboxylation reaction is catalyzed by propionyl 
CoA carboxylase, a biotin enzyme that has a catalytic mechanism like that of 
the homologous enzyme pyruvate carboxylase (Section 16.3). The D isomer 
of methylmalonyl CoA is racemized to the L isomer, the substrate for a 
mutase that converts it into succinyl CoA by an intramolecular rearrangement. 
The —CO—S—CoA group migrates from C-2 to a methyl group in 
exchange for a hydrogen atom. This very unusual isomerization is catalyzed 
by methylmalonyl CoA mutase, which contains a derivative of cobalamin as 
its coenzyme. 
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Vitamin B,, contains a corrin ring and a cobalt atom 


Cobalamin enzymes, which are present in most organisms, catalyze three 
types of reactions: (1) intramolecular rearrangements; (2) methylations, as in 
the synthesis of methionine; and (3) the reduction of ribonucleotides to deoxy- 
ribonucleotides (Section 25.3). In mammals, only two reactions are known to 
require coenzyme B,,. The conversion of L-methylmalonyl CoA into suc- 
cinyl CoA is one, and the formation of methionine by the methylation of 
homocysteine is the other (Section 24.2). The latter reaction is especially 
important because methionine is required for the generation of coenzymes 
that participate in the synthesis of purines and thymine, which are needed 
for nucleic acid synthesis. 

The core of cobalamin consists of a corrin ring with a central cobalt atom 
(Figure 22.14). The corrin ring, like a porphyrin, has four pyrrole units. Two 
of them are directly bonded to each other, whereas the others are joined by 
methine bridges, as in porphyrins. The corrin ring is more reduced than that 
of porphyrins and the substituents are different. A cobalt atom is bonded to 
the four pyrrole nitrogens. The fifth substituent linked to the cobalt atom is 
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a derivative of dimethylbenzimidazole that contains ribose 3-phosphate and 
aminoisopropanol. One of the nitrogen atoms of dimethylbenzimidazole is 
linked to the cobalt atom. In coenzyme B, > the sixth substituent linked to the 
cobalt atom is a 5’-deoxyadenosyl unit or a methyl group. This position can 
also be occupied by a cyano group. Cyanocobalamin is the form of the coen- 
zyme administered to treat B,, deficiency. In all of these compounds, the 
cobalt is in the +3 oxidation state. 


Mechanism: Methylmalonyl CoA mutase catalyzes a 
rearrangement to form succinyl CoA 


The rearrangement reactions catalyzed by coenzyme B,, are exchanges of 
two groups attached to adjacent carbon atoms of the substrate (Figure 22.15). 
A hydrogen atom migrates from one carbon atom to the next, and an R 
group (such as the —CO—S—CoA group of methylmalonyl CoA) con- 
comitantly moves in the reverse direction. The first step in these intramo- 
lecular rearrangements is the cleavage of the carbon—cobalt bond of 
5'-deoxyadenosylcobalamin to generate the Co** form of the coenzyme 
and a 5’-deoxyadenosyl radical, —CH,: (Figure 22.16). In this homolytic 
cleavage reaction, one electron of the Co—C bond stays with Co (reducing it 
from the +3 to the +2 oxidation state), whereas the other electron stays 
with the carbon atom, generating a free radical. In contrast, nearly all other 
cleavage reactions in biological systems are heterolytic: an electron pair is 
transferred to one of the two atoms that were bonded together. 

What is the role of this very unusual —CH,,: radical? This highly reac- 
tive species abstracts a hydrogen atom from the substrate to form 
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FIGURE 22.15 Rearrangement reaction 
catalyzed by cobalamin enzymes. The R 
group can be an amino group, a hydroxyl 
group, or a substituted carbon. 


FIGURE 22.16 Formation of a 
5'-deoxyadenosyl radical. The 
methylmalonyl CoA mutase reaction begins 
with the homolytic cleavage of the bond 
joining Co%* of coenzyme B,, to a carbon 
atom of the ribose of the adenosine moiety 
of the enzyme. The cleavage generates a 
5'-deoxyadenosyl radical and leads to the 
reduction of Co?+ to Co2*. The letter R 
represents the 5'-deoxyadenosyl component 
of the coenzyme, and the green oval 
represents the remainder of the coenzyme. 


L-Methylmalonyl CoA 


Succinyl CoA 


5'-Deoxyadenosyl 
radical 


FIGURE 22.17 Formation of succinyl 
CoA by a rearrangement reaction. A free 
radical abstracts a hydrogen atom in the 
rearrangement of methylmalonyl CoA to 
succinyl CoA. 
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5’-deoxyadenosine and a substrate radical (Figure 22.17). This substrate 
radical spontaneously rearranges: the carbonyl CoA group migrates to the 
position formerly occupied by H on the neighboring carbon atom to pro- 
duce a different radical. This product radical abstracts a hydrogen atom 
from the methyl group of 5’-deoxyadenosine to complete the rearrangement 
and return the deoxyadenosy] unit to the radical form. The role of coenzyme 
B,, in such intramolecular migrations is to serve as a source of free radicals for 
the abstraction of hydrogen atoms. 

Anessential property of coenzyme B, , is the weakness of its cobalt-carbon 
bond, which is readily cleaved to generate a radical. To facilitate the cleavage 
of this bond, enzymes such as methylmalonyl CoA mutase displace the benz- 
imidazole group from the cobalamin and bind to the cobalt atom through a 
histidine residue (Figure 22.18). The steric crowding around the cobalt— 
carbon bond within the corrin ring system contributes to the bond weakness. 


Fatty acids are also oxidized in peroxisomes 


Although most fatty acid oxidation takes place in mitochondria, oxidation 
of long chain and branched fatty acids takes place in cellular organelles 


kK FIGURE 22.18 Active site of 
methylmalonyl CoA mutase. Notice that a 
histidine residue from the enzyme binds to 
cobalt in place of benzimidazole. This 
arrangement of substrate and coenzyme in 
the active site facilitates the cleavage of the 
cobalt-carbon bond and the subsequent 
abstraction of a hydrogen atom from the 
substrate. [Drawn from 4REQ.pdb.] 
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called peroxisomes (Figure 22.19) (problems 17 and 44). These organelles are 
small membrane-bounded compartments that are present in the cells of 
most eukaryotes. Fatty acid oxidation in these organelles, which halts at 
octanoyl CoA, serves to shorten very long chains (C,,,) to make them better 
substrates of B oxidation in mitochondria. Peroxisomal oxidation differs 
from B oxidation in the initial dehydrogenation reaction (Figure 22.20). In 
peroxisomes, acyl CoA dehydrogenase, a flavoprotein, transfers electrons 
from the substrate to FADH, and then to O, to yield H,O, instead of cap- 
turing high-energy electrons as FADH, for use in the electron-transport 
chain, as in mitochondrial B oxidation. Peroxisomes contain high concen- 
trations of the enzyme catalase to degrade H,O, into water and O,. 
Subsequent steps are identical with those of their mitochondrial counter- 
parts, although they are carried out by different isoforms of the enzymes. 


e Peroxisomes do not function in patients with Zellweger syndrome. 

Liver, kidney, and muscle abnormalities usually lead to death by 
age 6. The syndrome is caused by a defect in the import of enzymes into the 
peroxisomes. Here we see a pathological condition resulting from an inap- 
propriate cellular distribution of enzymes. 


O, H202 H20 + 0 
Catalase 
saa CoA 
S Ss 
Acyl CoA Acyl CoA o 
a dehydrogenase dehydrogenase Further 
CHa (ox, FAD) (red, FADH2) i; 
f / 
HG HEC 
Yen Ye ) 
2)n >)n 
4 / 
BES HC 


Ketone bodies are formed from acetyl CoA when fat 
breakdown predominates 


The acetyl CoA formed in fatty acid oxidation enters the citric acid cycle 
only if fat and carbohydrate degradation are appropriately balanced. Acetyl 
CoA must combine with oxaloacetate to gain entry to the citric acid cycle. 
The availability of oxaloacetate, however, depends on an adequate supply of 
carbohydrate. Recall that oxaloacetate is normally formed from pyruvate, 
the product of glucose degradation in glycolysis, by pyruvate carboxylase 
(Section 16.3). If carbohydrate is unavailable or improperly utilized, the 
concentration of oxaloacetate is lowered and acetyl CoA cannot enter the 
citric acid cycle. This dependency is the molecular basis of the adage that 
fats burn in the flame of carbohydrates. 

In fasting or diabetes, oxaloacetate is consumed to form glucose by the 
gluconeogenic pathway (Section 16.3) and hence is unavailable for conden- 
sation with acetyl CoA. Under these conditions, acetyl CoA is diverted to 
the formation of acetoacetate and D-3-hydroxybutyrate. Acetoacetate, 
D-3-hydroxybutyrate, and acetone are often referred to as ketone bodies. 
Abnormally high levels of ketone bodies are present in the blood of 
untreated diabetics. 

Acetoacetate is formed from acetyl CoA in three steps (Figure 22.21). 
Two molecules of acetyl CoA condense to form acetoacetyl CoA. This 
reaction, which is catalyzed by thiolase, is the reverse of the thiolysis step 
in the oxidation of fatty acids. Acetoacetyl CoA then reacts with acetyl 


FIGURE 22.19 Electron micrograph of 

a peroxisome in a liver cell. A crystal of 
urate oxidase is present inside the organelle, 
which is bounded by a single bilayer 
membrane. The dark granular structures 
outside the peroxisome are glycogen 
particles. [Courtesy of Dr. George Palade.] 


FIGURE 22.20 Initiation of peroxisomal 
fatty acid degradation. The first 
dehydrogenation in the degradation of fatty 
acids in peroxisomes requires a flavoprotein 
dehydrogenase that transfers electrons 
from its FADH, moiety to O, to yield H,O,. 
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FIGURE 22.21 Formation of ketone bodies. The ketone bodies—acetoacetate, D-3- 
hydroxybutyrate, and acetone—are formed from acetyl CoA primarily in the liver. Enzymes 
catalyzing these reactions are (1) 3-ketothiolase, (2) hydroxymethylglutaryl CoA synthase, 

(8) hydroxymethylglutaryl CoA cleavage enzyme, and (4) D-3-hydroxybutyrate dehydrogenase. 
Acetoacetate spontaneously decarboxylates to form acetone. 


CoA and water to give 3-hydroxy-3-methylglutaryl CoA (HMG-CoA) 
and CoA. This condensation resembles the one catalyzed by citrate syn- 
thase (Section 17.2). This reaction, which has a favorable equilibrium 
owing to the hydrolysis of a thioester linkage, compensates for the unfavor- 
able equilibrium in the formation of acetoacetyl CoA. 3-Hydroxy-3- 
methylglutaryl CoA is then cleaved to acetyl CoA and acetoacetate. The 
sum of these reactions is 


2 Acetyl CoA + H»O —> acetoacetate + 2CoA + H* 


D-3-Hydroxybutyrate is formed by the reduction of acetoacetate in the 
mitochondrial matrix by D-3-hydroxybutyrate dehydrogenase. The ratio of 
hydroxybutyrate to acetoacetate depends on the NADH/NAD* ratio 
inside mitochondria. 

Because it is a B-ketoacid, acetoacetate also undergoes a slow, spon- 
taneous decarboxylation to acetone. The odor of acetone may be detected 
in the breath of a person who has a high level of acetoacetate in the 
blood. Under starvation conditions, the acetone may be captured to syn- 
thesize glucose. 


Ketone bodies are a major fuel in some tissues 


The major site of the production of acetoacetate and 3-hydroxybutyrate 
is the liver. These molecules diffuse from the liver mitochondria into the 
blood and are transported to other tissues such as heart and kidney 
(Figure 22.22). Acetoacetate and 3-hydroxybutyrate are normal fuels of 
respiration and are quantitatively important as sources of energy. Indeed, 
heart muscle and the renal cortex use acetoacetate in preference to glu- 
cose. In contrast, glucose is the major fuel for the brain and red blood 
cells in well-nourished people on a balanced diet. However, the brain 
adapts to the utilization of acetoacetate during starvation and diabetes. 
In prolonged starvation, 75% of the fuel needs of the brain are met by 
ketone bodies. 

Acetoacetate is converted into acetyl CoA in two steps. First, acetoacetate 
is activated by the transfer of CoA from succinyl CoA in a reaction catalyzed 
by a specific CoA transferase. Second, acetoacetyl CoA is cleaved by thiolase 
to yield two molecules of acetyl CoA, which can then enter the citric acid 
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INTEGRATION: Liver supplies 
ketone bodies to the peripheral 
tissues. During fasting or in 
untreated diabetics, the liver 
converts fatty acids into ketone 
bodies, which are a fuel source for 
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a number of tissues. Ketone-body 
production is especially important 
during starvation, when ketone 
bodies are the predominant fuel. 
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requires an additional step to yield acetyl CoA. It is first oxidized to produce ae . 
acetoacetate, which is processed as heretofore described, and NADH for use teanieeiase 
in oxidative phosphorylation. Succinate 


p-3-Hydroxybutyrate Acetoacetate 

Ketone bodies can be regarded as a water-soluble, transportable form of 
acetyl units. Fatty acids are released by adipose tissue and converted into 
acetyl units by the liver, which then exports them as acetoacetate. As might 
be expected, acetoacetate also has a regulatory role. High levels of acetoace- 
tate in the blood signify an abundance of acetyl units and lead to a decrease in 


the rate of lipolysis in adipose tissue. 

e High blood levels of ketone bodies, the result of certain pathological 
conditions, can be life threatening. The most common of these condi- 

tions is diabetic ketosis in patients with insulin-dependent diabetes 
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FIGURE 22.23 Utilization of acetoacetate 
as a fuel. Acetoacetate can be converted 
into two molecules of acetyl CoA, which 
then enter the citric acid cycle. 
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FIGURE 22.24 Diabetic ketosis results 
when insulin is absent. In the absence of 
insulin, fats are released from adipose 
tissue, and glucose cannot be absorbed by 
the liver or adipose tissue. The liver 
degrades the fatty acids by B oxidation but 
cannot process the acetyl CoA, because of 
a lack of glucose-derived oxaloacetate 
(OAA). Excess ketone bodies are formed 
and released into the blood. 
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mellitus. These patients are unable to produce insulin. As stated earlier, this 
hormone, normally released after meals, signals tissues to take up glucose. 
In addition, it curtails fatty acid mobilization by adipose tissue. The absence 
of insulin has two major biochemical consequences (Figure 22.24). First, 
the liver cannot absorb glucose and consequently cannot provide oxaloace- 
tate to process fatty acid-derived acetyl CoA. Second, adipose cells continue 
to release fatty acids into the blood stream; these fatty acids are then taken 
up by the liver and converted into ketone bodies. The liver thus produces 
large amounts of ketone bodies, which are moderately strong acids. The 
result is severe acidosis. The decrease in pH impairs tissue function, most 
importantly in the central nervous system. 


e Interestingly, diets that promote ketone-body formation, called keto- 

genic diets, are frequently used as a therapeutic option for children 
with drug-resistant epilepsy. Ketogenic diets are rich in fats and low in 
carbohydrates, with adequate amounts of protein. In essence, the body is 
forced into starvation mode, where fats and ketone bodies become the main 
fuel source (Section 27.5). How such diets reduce the seizures suffered by 
the children is currently unknown. 


Animals cannot convert fatty acids into glucose 


A typical human being has far greater fat stores than glycogen stores. 
However, glycogen is necessary to fuel very active muscle, as well as the 
brain, which normally uses only glucose as a fuel. When glycogen stores are 
low, why can’t the body make use of fat stores and convert fatty acids into 
glucose? Because animals are unable to effect the net synthesis of glucose from 
fatty acids. Specifically, acetyl CoA cannot be converted into pyruvate or 
oxaloacetate in animals. Recall that the reaction that generates acetyl CoA 
from pyruvate is irreversible (Section 17.1). The two carbon atoms of the 
acetyl group of acetyl CoA enter the citric acid cycle, but two carbon atoms 
leave the cycle in the decarboxylations catalyzed by isocitrate dehydroge- 
nase and a-ketoglutarate dehydrogenase. Consequently, oxaloacetate is 
regenerated, but it is not formed de novo when the acetyl unit of acetyl CoA 
is oxidized by the citric acid cycle. In essence, two carbon atoms enter the 
cycle as an acetyl group, but two carbons leave the cycle as CO, before oxa- 
loacetate is generated. As a result, no net synthesis of oxaloacetate is possi- 
ble. In contrast, plants have two additional enzymes enabling them to 
convert the carbon atoms of acetyl CoA into oxaloacetate (Section 17.5). 


Some fatty acids may contribute to the development of 
pathological conditions 


e As we will see shortly (Section 22.5), certain polyunsaturated fatty 

acids are essential for life, serving as precursors to various signal mol- 
ecules. Vegetable oils, used commonly in food preparation, are rich in poly- 
unsaturated fatty acids. However, polyunsaturated fatty acids are unstable 
and are readily oxidized. This tendency to become rancid reduces their shelf 
life and renders them undesirable for cooking. To circumvent this problem, 
polyunsaturated fatty acids are hydrogenated, converting them to saturated 
and trans unsaturated fatty acids (popularly known as “trans fat”), a variety 
of fat that is rare in nature. Epidemiological evidence suggests that con- 
sumption of large amounts of saturated fatty acids and trans fat promotes 
obesity, type 2 diabetes, and atherosclerosis. The mechanism by which 
these fats exert these effects is under active investigation. Some evidence 
suggests that they promote an inflammatory response and may mute the 
action of insulin and other hormones (Section 27.3). 


22.4 Fatty Acids Are Synthesized by Fatty Acid Synthase 


Fatty acids are synthesized by a complex of enzymes that together are called 
fatty acid synthase. Because eating a typical Western diet meets our physio- 
logical needs for fats and lipids, adult human beings have little need for de 
novo fatty acid synthesis. However, many tissues, such as liver and adipose 
tissue, are capable of synthesizing fatty acids, and this synthesis is required 
under certain physiological conditions. For instance, fatty acid synthesis is 
necessary during embryonic development and during lactation in mam- 
mary glands. Inappropriate fatty acid synthesis in the liver of alcoholics 
contributes to liver failure. 

Acetyl CoA, the end product of fatty acid degradation, is the precur- 
sor for virtually all fatty acids. The biochemical challenge is to link the 
two carbon units together and reduce the carbons to produce palmitate, a 
C,,, fatty acid. Palmitate then serves as a precursor for the variety of other 
fatty acids. 


Fatty acids are synthesized and degraded by different pathways 


Although fatty acid synthesis is the reversal of the degradative pathway in 
regard to basic chemical reactions, the synthetic and degradative pathways 
are different mechanistically, again exemplifying the principle that synthetic 
and degradative pathways are almost always distinct. Some important differ- 
ences between the pathways are as follows: 


1. Synthesis takes place in the cytoplasm, in contrast with degradation, 
which takes place primarily in the mitochondrial matrix. 


2. Intermediates in fatty acid synthesis are covalently linked to the sulfhydryl 
groups of an acyl carrier protein (ACP), whereas intermediates in fatty acid 
breakdown are covalently attached to the sulfhydryl group of coenzyme A. 


3. The enzymes of fatty acid synthesis in higher organisms are joined in a 
single polypeptide chain called fatty acid synthase. In contrast, the degrada- 
tive enzymes are not linked covalently. 


4. The growing fatty acid chain is elongated by the sequential addition of two- 
carbon units derived from acetyl CoA. The activated donor of two-carbon 
units in the elongation step is malonyl ACP. The elongation reaction is driven 
by the release of CO,. 
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FIGURE 22.25 Phosphopantetheine. 
Both acyl carrier protein and coenzyme 
A include phosphopantetheine as their 
reactive units. [Drawn from 1ACP.pdb.] 


5. The reductant in fatty acid synthesis is NADPH, whereas the oxidants 
in fatty acid degradation are NAD* and FAD. 


6. The isomeric form of the hydroxyacyl intermediate in degradation is L, 
while the D form is used in synthesis. 


The formation of malonyl CoA is the committed step in 
fatty acid synthesis 


Fatty acid synthesis starts with the carboxylation of acetyl CoA to malonyl 
CoA. This irreversible reaction is the committed step in fatty acid synthesis. 

The synthesis of malonyl CoA is catalyzed by the cytoplasmic enzyme 
acetyl CoA carboxylase 1, which contains a biotin prosthetic group. The 
carboxyl group of biotin is covalently attached to the € amino group of a 
lysine residue, as in pyruvate carboxylase (Figure 16.24) and propionyl 
CoA carboxylase (p. 654). As with these other enzymes, a carboxybiotin 
intermediate is formed at the expense of the hydrolysis of a molecule of 
ATP. The activated CO, group in this intermediate is then transferred to 
acetyl CoA to form malonyl CoA. 


Biotin-enzyme + ATP + HCO,” == 
CO,-biotin-enzyme + ADP + P; 


CO,-biotin-enzyme + acetyl CoA ——> malonyl CoA + biotin-enzyme 


Acetyl CoA carboxylase 2, an isozyme of carboxylase 1 located in the mito- 
chondria, is the essential regulatory enzyme for fatty acid metabolism 
(Section 22.5). 


Intermediates in fatty acid synthesis are attached to an 
acyl carrier protein 


The intermediates in fatty acid synthesis are linked to an acyl carrier pro- 
tein. Specifically, they are linked to the sulfhydryl terminus of a phospho- 
pantetheine group. In the degradation of fatty acids, this unit is present as 
part of coenzyme A, whereas, in their synthesis, it is attached to a serine 
residue of the acyl carrier protein (Figure 22.25). Thus, ACP, a single poly- 
peptide chain of 77 residues, can be regarded as a giant prosthetic group, a 
“macro CoA.” 


Fatty acid synthesis consists of a series of condensation, reduction, 
dehydration, and reduction reactions 

The enzyme system that catalyzes the synthesis of saturated long-chain 
fatty acids from acetyl CoA, malonyl CoA, and NADPH is called fatty acid 
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TABLE 22.2 Principal reactions in fatty acid synthesis in bacteria 


Step Reaction Enzyme 
1 Acetyl CoA + HCO; + ATP —> malonyl CoA + ADP + P, + Ht Acetyl CoA carboxylase 
2 Acetyl CoA + ACP = = acetyl ACP + CoA Acetyl transacylase 
3 Malonyl CoA + ACP = = malonyl ACP + CoA Malony] transacylase 
4 Acetyl ACP + malonyl ACP —> acetoacetyl ACP + ACP + CO, 8-Ketoacyl synthase 
5 Acetoacetyl ACP + NADPH + H* == p-3-hydroxybutyryl ACP + NADPt 8-Ketoacyl reductase 
6 D-3-Hydroxybutyryl ACP ——= crotonyl ACP + H,O 3-Hydroxyacyl dehydratase 
7 Crotonyl ACP + NADPH + Ht —+ butyryl ACP + NADPt Enoyl reductase 


synthase. The synthase is actually a complex of distinct enzymes. The fatty 
acid synthase complex in bacteria is readily dissociated into individual 
enzymes when the cells are broken apart. The availability of these isolated 
enzymes has helped biochemists elucidate the steps in fatty acid synthesis 
(Table 22.2). In fact, the reactions leading to fatty acid synthesis in higher 
organisms are very much like those of bacteria. 

The elongation phase of fatty acid synthesis starts with the formation of 
acetyl ACP and malonyl ACP. Acetyl transacylase and malonyl transacylase 
catalyze these reactions. 


Acetyl CoA + ACP = — acetyl ACP + CoA 
Malonyl CoA + ACP = = malonyl ACP + CoA 


Malonyl transacylase is highly specific, whereas acetyl transacylase can 
transfer acyl groups other than the acetyl unit, though at a much slower rate. 
The synthesis of fatty acids with an odd number of carbon atoms starts 
with propionyl ACP, which is formed from propionyl CoA by acetyl 
transacylase. 

Acetyl ACP and malonyl ACP react to form acetoacetyl ACP 
(Figure 22.26). The B-ketoacyl synthase, also called the condensing enzyme, 
catalyzes this condensation reaction. 


Acetyl ACP + malonyl ACP —— acetoacetyl ACP + ACP + CO, 


In the condensation reaction, a four-carbon unit is formed from a two- 
carbon unit and a three-carbon unit, and CO, is released. Why is the four- 
carbon unit not formed from two 2-carbon units—say, two molecules of 
acetyl ACP? The answer is that the equilibrium for the synthesis of aceto- 
acetyl ACP from two molecules of acetyl ACP is highly unfavorable. In 
contrast, the equilibrium is favorable if malonyl ACP is a reactant because its 
decarboxylation contributes a substantial decrease in free energy. In effect, 
ATP drives the condensation reaction, though ATP does not directly par- 
ticipate in the condensation reaction. Instead, ATP is used to carboxylate 
acetyl CoA to malonyl CoA. The free energy thus stored in malonyl CoA is 
released in the decarboxylation accompanying the formation of acetoacetyl 
ACP. Although HCO3_ is required for fatty acid synthesis, its carbon atom 
does not appear in the product. Rather, all the carbon atoms of fatty acids 
containing an even number of carbon atoms are derived from acetyl CoA. 

The next three steps in fatty acid synthesis reduce the keto group at C-3 to 
a methylene group (Figure 22.26). First, acetoacetyl ACP is reduced to 
D-3-hydroxybutyryl ACP by B-ketoacyl reductase. This reaction differs from 
the corresponding one in fatty acid degradation in two respects: (1) the 
D rather than the L isomer is formed; and (2) NADPH is the reducing agent, 
whereas NAD7™ is the oxidizing agent in B oxidation. This difference exem- 
plifies the general principle that NADPH is consumed in biosynthetic reactions, 
whereas NADH is generated in energy-yielding reactions. Then, D-3- 
hydroxybutyryl ACP is dehydrated to form crotonyl ACP, which is a 
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FIGURE 22.26 The steps of fatty acid 
synthesis. Fatty acid synthesis begins with 
the condensation of malonyl ACP and 
acetyl ACP to form acetoacetyl ACP. 
Acetoacetyl ACP is then reduced, 
dehydrated, and reduced again to form 
butyryl ACP. Another cycle begins with the 
condensation of butyryl ACP and malonyl 
ACP. The sequence of reactions is repeated 
until the final product palmitate is formed. 
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trans-A?-enoyl ACP by 3-hydroxyacyl dehydratase. The final step in the 
cycle reduces crotonyl ACP to butyryl ACP. NADPH is again the reductant, 
whereas FAD is the oxidant in the corresponding reaction in B oxidation. The 
bacterial enzyme that catalyzes this step, enoyl reductase, can be inhibited by 
triclosan, a broad-spectrum antibacterial agent that is added to a variety of 
products such as toothpaste, soaps, and skin creams. These last three reac- 
tions—a reduction, a dehydration, and a second reduction—convert acetoace- 
tyl ACP into butyryl ACP, which completes the first elongation cycle. 

In the second round of fatty acid synthesis, butyryl ACP condenses with 
malonyl ACP to form a C,-8-ketoacyl ACP. This reaction is like the one in 
the first round, in which acetyl ACP condenses with malonyl ACP to form 
a C,-B-ketoacyl ACP. Reduction, dehydration, and a second reduction 
convert the C,-B-ketoacyl ACP into a C,-acyl ACP, which is ready for a 
third round of elongation. The elongation cycles continue until C,,-acyl 
ACP is formed. This intermediate is a good substrate for a thioesterase that 
hydrolyzes C,,-acyl ACP to yield palmitate and ACP. The thioesterase acts 
as a ruler to determine fatty acid chain length. The synthesis of longer-chain 
fatty acids is discussed in Section 22.6. 


Fatty acids are synthesized by a multifunctional enzyme 
complex in animals 


Although the basic biochemical reactions in fatty acid synthesis are very 
similar in E. coli and eukaryotes, the structure of the synthase varies consid- 
erably. The component enzymes of animal fatty acid synthases, in contrast 
with those of E. coli and plants, are linked in a large polypeptide chain. 

The structure of a large part of the mammalian fatty acid synthase has 
recently been determined, with the acyl carrier protein and thioesterase 
remaining to be resolved. The enzyme is a dimer of identical 270-kd sub- 
units. Each chain contains all of the active sites required for activity, as well 
as an acyl carrier protein tethered to the complex (Figure 22.27A). Despite 
the fact that each chain possesses all of the enzymes required for fatty acid 
synthesis, the monomers are not active. A dimer is required. 

The two component chains interact such that the enzyme activities are 
partitioned into two distinct compartments (Figure 22.27B). The select- 
ing and condensing compartment binds the acetyl and malonyl substrates 
and condenses them to form the growing chain. Interestingly, the mam- 
malian fatty acid synthase has one active site, malonyl/acetyl transacyl- 
ase, that adds both acetyl CoA and malonyl CoA. In contrast, most other 
fatty acid synthases have two separate enzyme activities, one for acetyl 
CoA and one for malonyl CoA. The modification compartment is respon- 
sible for the reduction and dehydration activities that result in the satu- 
rated fatty acid product. 

Let us consider one catalytic cycle of the fatty acid synthase complex 
(Figure 22.28). An elongation cycle begins when malonyl/acetyl transacylase 
(MAT) moves an acetyl unit from coenzyme A to the acyl carrier protein 
(ACP). B-Ketosynthase (8-KS) accepts the acetyl unit, which forms a thioes- 
ter with a cysteine residue at the B-KS active site. The vacant ACP is reloaded 
by MAT, this time with a malonyl moiety. Malonyl ACP visits the active site 
of B- KS where the condensation of the two 2-carbon fragments takes place on 
the ACP with the concomitant release of CO,,. The selecting and condensing 
process concludes with the B-ketoacyl product attached to the ACP. 

The loaded ACP then sequentially visits the active sites of the modifica- 
tion compartment of the enzyme, where the B-keto group of the substrate is 
reduced to — OH, dehydrated, and finally reduced to yield the saturated acyl 
product, still attached to the ACP. With the completion of the modification 


(A) FIGURE 22.27 The structure of the 
mammalian fatty acid synthase. (A) The 


KS MAT DH1 DH2 MT ¥KR ER KR ACP TE present in a single polypeptide chain. (B) A 
cartoon of the dimer based on an x-ray 


crystallographic result. The Y-MT and 
(B) W-KR are inactive domains similar to methyl 
transferase and ketoreductase sequences. 
Although there are two domains for DH, 
only one is active. The inactive domains are 
presented in faded colors. Dotted lines 
outline domains for which the structure has 
not yet been determined. Abbreviations: 
KS, ketosynthase; MAT, malonylacety! 
transferase; DH, dehydratase; Y-MT, 
methyl transferase (inactive); ¥-KR, 
ketoreductase (inactive); ER, enoyl 
reductase; KR, ketoreductase; ACP, 
acyl carrier protein; TE, thioesterase. 


Modification 
compartment 


\- 
Selecting and condensing compartment 


process, the reduced product is transferred to the B-KS while the ACP 
accepts another malonyl unit. Condensation takes place and is followed by 
another modification cycle. The process is repeated until the thioesterase 
releases the final C,, palmitic acid product. 


@.@2@:@2@: @2 


FIGURE 22.28 A catalytic cycle of mammalian fatty acid synthase. 
The cycle begins when MAT (not shown) attaches an acetyl unit to ACP. 


(1) ACP delivers the acetyl unit to KS, and MAT then attaches a malonyl 

unit to ACP. (2) ACP visits KS again, which condenses the acetyl and H, H, (0) 

malonyl units to form the B-ketoacyl product, attached to the ACP. 9 2 oC CCH 5 | He C He Ae CH 
(3) ACP delivers the B-ketoacyl product to the KR enzyme, which reduces | Hy . ° 


the keto group to an alcohol. (4) The B-hydroxyl product then visits the meee O) 

DH, which introduces a double bond with the loss of water. (5) The enoyl Ca y~o — (Ce) (1) 
product is delivered to the ER enzyme, where the double bond is ACP ACP 

reduced. (6) ACP hands the reduced product to KS and is recharged with 

malonyl CoA by MAT. (7) KS condenses the two molecules on ACP, which (on) (on) 


is now ready to begin another cycle. See Figure 22.27 for abbreviations. 
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Lyases 

Enzymes catalyzing the cleavage of C—C, 
C—O, or C—N bonds by elimination. A 
double bond is formed in these reactions. 


FIGURE 22.29 Transfer of acetyl CoA to 
the cytoplasm. Acetyl CoA is transferred 
from mitochondria to the cytoplasm, and 
the reducing potential of NADH is 
concomitantly converted into that of 
NADPH by this series of reactions. 


Many eukaryotic multienzyme complexes are multifunctional proteins 
in which different enzymes are linked covalently. Multifunctional enzymes 
such as fatty acid synthase seem likely to have arisen in eukaryotic evolution 
by fusion of the individual genes of evolutionary ancestors. 


The synthesis of palmitate requires 8 molecules of acetyl CoA, 
14 molecules of NADPH, and 7 molecules of ATP 


The stoichiometry of the synthesis of palmitate is 


Acetyl CoA + 7 malonyl CoA + 14 NADPH + 20 Ht ——> 
palmitate + 7CO, + 14 NADP* + 8 CoA + 6H,O 


The equation for the synthesis of the malonyl CoA used in the preceding 
reaction is 


7 Acetyl CoA + 7 CO, + 7 ATP —> 
7 malonyl CoA + 7 ADP +7 P. + 14H* 


Hence, the overall stoichiometry for the synthesis of palmitate is 


8 Acetyl CoA + 7 ATP + 14 NADPH + 6 H* —~> 
palmitate + 14 NADP* + 8CoA + 6H,O +7 ADP +7 P. 


Citrate carries acetyl groups from mitochondria to the cytoplasm 
for fatty acid synthesis 


Fatty acids are synthesized in the cytoplasm, whereas acetyl CoA is formed 
from pyruvate in mitochondria. Hence, acetyl CoA must be transferred 
from mitochondria to the cytoplasm for fatty acid synthesis. Mitochondria, 
however, are not readily permeable to acetyl CoA. Recall that carnitine car- 
ries only long-chain fatty acids. The barrier to acetyl CoA is bypassed by 
citrate, which carries acetyl groups across the inner mitochondrial membrane. 
Citrate is formed in the mitochondrial matrix by the condensation of acetyl 
CoA with oxaloacetate (Figure 22.29). When present at high levels, citrate 
is transported to the cytoplasm, where it is cleaved by ATP-citrate lyase. 


Citrate + ATP + CoA + H,O —> 
acetyl CoA + ADP + P. + oxaloacetate 


This reaction occurs in three steps: (1) The formation of a phospho-enzyme 
with the donation of a phosphoryl group from ATP; (2) binding of citrate 
and CoA followed by the formation of citroyl CoA and release of the phos- 
phate; (3) cleavage of citroyl CoA to yield acetyl CoA and oxaloacetate. As 
we will see shortly (Section 22.6), citrate stimulates acetyl CoA carboxylase, 
the enzyme that regulates fatty acid metabolism. Recall also that the 
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presence of citrate in the cytoplasm inhibits phosphofructokinase, the 
enzyme that controls the glycolytic pathway. 

ATP-citrate lyase is stimulated by insulin, which initiates a signal trans- 
duction pathway that ultimately results in the phosphorylation and activa- 
tion of the lyase by protein kinase B (also called Akt). 


Several sources supply NADPH for fatty acid synthesis 


Oxaloacetate formed in the transfer of acetyl groups to the cytoplasm must 
now be returned to the mitochondria. The inner mitochondrial membrane 
is impermeable to oxaloacetate. Hence, a series of bypass reactions are 
needed. These reactions also generate much of the NADPH needed for fatty 
acid synthesis. First, oxaloacetate is reduced to malate by NADH. This 
reaction is catalyzed by a malate dehydrogenase in the cytoplasm. 


Oxaloacetate + NADH + H* = — malate + NAD* 


Second, malate is oxidatively decarboxylated by an NADP* -linked malate 
enzyme (also called malic enzyme). 


Malate + NADP* —-> pyruvate + CO, + NADPH 


The pyruvate formed in this reaction readily enters mitochondria, where it 
is carboxylated to oxaloacetate by pyruvate carboxylase. 


Pyruvate + CO, + ATP + H,O —— oxaloacetate + ADP + P, + 2 H* 


The sum of these three reactions is 


NADP? ++ NADE + ATP + H,O-—> 
NADPH + NAD* + ADP +P. + Ei 


Thus, one molecule of NADPH is generated for each molecule of acetyl CoA that 
is transferred from mitochondria to the cytoplasm. Hence, eight molecules of 
NADPH are formed when eight molecules of acetyl CoA are transferred to the 
cytoplasm for the synthesis of palmitate. The additional six molecules of NADPH 
required for this process come from the pentose phosphate pathway (Section 20.3). 

The accumulation of the precursors for fatty acid synthesis is a wonder- 
ful example of the coordinated use of multiple pathways. The citric acid 
cycle, transport of oxaloacetate from the mitochondria, and pentose phos- 
phate pathway provide the carbon atoms and reducing power, whereas 
glycolysis and oxidative phosphorylation provide the ATP to meet the 
needs for fatty acid synthesis (Figure 22.30). 


Fatty acid metabolism is altered 
in tumor cells 


e We have previously seen that 
cancer cells alter glucose metab- 


Glucose 
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FIGURE 22.30 PATHWAY INTEGRATION: 
Fatty acid synthesis. Fatty acid synthesis 
requires the cooperation of various 
metabolic pathways located in different 
cellular compartments. 


MITOCHONDRION 


olism to meet the needs of rapid cell 
growth. Cancer cells must also increase 
fatty acid synthesis for use as signal 
molecules as well as for incorporation 
into membrane phospholipids. Many 
of the enzymes of fatty acid synthesis 
are overexpressed in most human can- 
cers, and this expression is correlated 
with tumor malignancy. Recall that 
normal cells do little de novo fatty acid 
synthesis, relying instead on dietary 
intake to meet their fatty needs. 


t Glycolysis 
(Fatty Acid] Pyruvate — -——77——=> Pyruvate 


Glucose 
: NADPH 
Pentose | a 
phosphate ee?” 
nite ae Malate Oxaloacetate ea 
Ribulose I t J 
5-phosphate Acetyl CoA Oxaloacetate 
Citrate <—-————— Citrate 


668 
CHAPTER 22 Fatty Acid Metabolism 


FIGURE 22.31 Electron-transport chain 
in the desaturation of fatty acids. 


The dependence of de novo fatty acid synthesis provides possible thera- 
peutic targets to inhibit cancer cell growth. Inhibition of B-ketoacyl ACP 
synthase, the enzyme that catalyzes the condensation step of fatty acid syn- 
thesis, does indeed inhibit phospholipid synthesis and subsequent cell 
growth in some cancers, apparently by inducing apoptosis. However, 
another startling observation was made: mice treated with inhibitors of the 
B-ketoacyl ACP synthase showed remarkable weight loss because they ate less. 
Thus, fatty acid synthase inhibitors are exciting candidates both as antitu- 
mor and as antiobesity drugs. 

Acetyl CoA carboxylase is also being investigated as a possible target for 
inhibiting cancer cell growth. Inhibition of the carboxylase in prostate and 
breast cancer cell lines induces apoptosis in the cancer cells, and yet is with- 
out effect in normal cells (problem 54). Understanding the alteration of 
fatty acid metabolism in cancer cells is a developing area of research that 
holds promise of generating new cancer therapies. 


22.5 The Elongation and Unsaturation of Fatty Acids Are 
Accomplished by Accessory Enzyme Systems 


The major product of the fatty acid synthase is palmitate. In eukaryotes, 
longer fatty acids are formed by elongation reactions catalyzed by enzymes 
on the cytoplasmic face of the endoplasmic reticulum membrane. These reac- 
tions add two-carbon units sequentially to the carboxyl ends of both satu- 
rated and unsaturated fatty acyl CoA substrates. Malonyl CoA is the 
two-carbon donor in the elongation of fatty acyl CoAs. Again, condensation 
is driven by the decarboxylation of malonyl CoA. 


Membrane-bound enzymes generate unsaturated fatty acids 


Endoplasmic reticulum systems also introduce double bonds into long- 
chain acyl CoAs. For example, in the conversion of stearoyl CoA into oleoyl 
CoA, a cis-A° double bond is inserted by an oxidase that employs molecular 
oxygen and NADH (or NADPH). 


Stearoyl CoA + NADH + H* + 0, —> 
oleoyl CoA + NAD* + 2H,O0 


This reaction is catalyzed by a complex of three membrane-bound proteins: 
NADH -cytochrome b, reductase, cytochrome b., and stearoyl CoA desaturase 
(Figure 22.31). First, electrons are transferred from NADH to the FAD 
moiety of NADH-cytochrome b, reductase. The heme iron atom of cyto- 
chrome b, is then reduced to the Fe2+ state. The nonheme iron atom of the 
desaturase: is subsequently converted into the Fe?* state, which enables it to 
interact with O, and the saturated fatty acyl CoA substrate. A double bond 
is formed and G0 molecules of H,O are released. Two electrons come from 
NADH and two from the single bond of the fatty acyl substrate. 


Oleoyl CoA 
+ 
H*+NADH NAD* StearoylCoA+O, 2H,O 
Cytoplasm 
Endoplasmic aun 
reticulum > E- ae 


Lumen mapieanneindie”  ejulitoias- " eremadle CoA 
b, reductase b, desaturase 


A variety of unsaturated fatty acids can be formed from oleate by a com- 
bination of elongation and desaturation reactions. For example, oleate can 
be elongated to a 20:1 cis-A!! fatty acid. Alternatively, a second double bond 
can be inserted to yield an 18:2 cis-A°, A® fatty acid. Similarly, palmitate 
(16:0) can be oxidized to palmitoleate (16:1 cis-A°), which can then be elon- 
gated to cis-vaccenate (18:1 cis-A!"). 

Unsaturated fatty acids in mammals are derived from either palmitole- 
ate (16:1), oleate (18:1), linoleate (18:2), or linolenate (18:3). The number of 
carbon atoms from the w end of a derived unsaturated fatty acid to the near- 
est double bond identifies its precursor. 

Mammals lack the enzymes to introduce double bonds at carbon atoms 
beyond C-9 in the fatty acid chain. Hence, mammals cannot synthesize 
linoleate (18:2 cis-A°®, A‘) and linolenate (18:3 cis-A°, A!?, A!5). Linoleate 
and linolenate are the two essential fatty acids. The term essential means that 
they must be supplied in the diet because they are required by an organism 
and cannot be synthesized by the organism itself. Linoleate and linolenate 
furnished by the diet are the starting points for the synthesis of a variety of 
other unsaturated fatty acids. 


Eicosanoid hormones are derived from polyunsaturated fatty acids 


Arachidonate, a 20:4 fatty acid derived from linoleate, is the major precursor 
of several classes of signal molecules: prostaglandins, prostacyclins, throm- 
boxanes, and leukotrienes (Figure 22.32). 

A prostaglandin is a 20-carbon fatty acid containing a 5-carbon ring 
(Figure 22.33). This basic compound is modified by reductases and isomer- 
ases to yield nine major classes of prostaglandins, designated PGA through 
PGI; a subscript denotes the number of carbon—carbon double bonds out- 
side the ring. Prostaglandins with two double bonds, such as PGE,, are 
derived from arachidonate; the other two double bonds of this precursor are 
lost in forming a 5-membered ring. Prostacyclin and thromboxanes are 
related compounds that arise from a nascent prostaglandin. They are gener- 
ated by prostacyclin synthase and thromboxane synthase, respectively. 
Alternatively, arachidonate can be converted into leukotrienes by the action 
of lipoxygenase. Leukotrienes, first found in leukocytes, contain three con- 
jugated double bonds—hence the name. Prostaglandins, prostacyclin, 
thromboxanes, and leukotrienes are called eicosanoids (from the Greek 
eikosi, “twenty” ) because they contain 20 carbon atoms. 

Prostaglandins and other eicosanoids are local hormones 
because they are short-lived. They alter the activities both of 
the cells in which they are synthesized and of adjoining cells 
by binding to 7TM receptors. Their effects may vary from 


. . . . . PLA — = — 7 
one cell type to another, in contrast with the more-uniform Phospholipids —,- | Arachidonate 


actions of global hormones such as insulin and glucagon. 
Prostaglandins stimulate inflammation, regulate blood flow 
to particular organs, control ion transport across mem- 
branes, modulate synaptic transmission, and induce sleep. 


Prostacyclin 


cz Recall that aspirin blocks access to the active site of synthase 


the enzyme that converts arachidonate into prosta- 
glandin H, (Section 12.3). Because arachidonate is the 
precursor of other prostaglandins, prostacyclin, and throm- 
boxanes, blocking this step interferes with many signaling 
pathways. Aspirin’s ability to obstruct these pathways 
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22.5 Elongation and Unsaturation 
of Fatty Acids 


Precursor Formula 

Linolenate (w-3) © CH,—(CH,),=CH—R 
Linoleate (w-6) CH;—(CH,),=CH—R 
Palmitoleate (w-7) CH,—(CH,),=CH—R 
Oleate (w-9) CH,—(CH,),=CH—R 


Leukotrienes 


Lipoxygenases 
DG lipase 
<= Diacylglycerols 


Prostaglandin 
synthase 


Prostaglandin H, 


(PGH,) 
. Thromboxane 
+ synthases 
Other Thromboxanes 


prostaglandins 


FIGURE 22.32 Arachidonate is the major precursor of 
eicosanoid hormones. Prostaglandin synthase catalyzes the first 
step in a pathway leading to prostaglandins, prostacyclins, and 


accounts for its wide-ranging effects on inflammation, thromboxanes. Lipoxygenase catalyzes the initial step in a pathway 


fever, pain, and blood clotting. leading to leukotrienes. 
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FIGURE 22.33 Structures of several eicosanoids. 


Variations on a theme: Polyketide and nonribosomal peptide 
synthetases resemble fatty acid synthase 


a The mammalian multifunctional fatty acid synthase is a member of a 

large family of complex enzymes termed megasynthases that partici- 
pate in step-by-step synthetic pathways. Two important classes of com- 
pounds that are synthesized by such enzymes are the polyketides and the 
nonribosomal peptides. These classes of compounds provide a variety of use- 
ful drugs, including antibiotics, immunosuppressants, antifungal agents 
and anticancer drugs. The antibiotic erythromycin is an example of a 
polyketide, whereas penicillin (Section 8.5) is a nonribosomal peptide. 


Cysteine 
H,N* 4 
“0oc O 
o-Adipic acid ? 
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Research is underway to learn how to manipulate the pathways and the 
enzymes of polyketide and nonribosomal peptide synthesis in order to gen- 
erate new therapeutics. 


22.6 Acetyl CoA Carboxylase Plays a Key Role in 
Controlling Fatty Acid Metabolism 


Fatty acid metabolism is stringently controlled so that synthesis and degrada- 
tion are highly responsive to physiological needs. Fatty acid synthesis is maxi- 
mal when carbohydrates and energy are plentiful and when fatty acids are 


scarce. Acetyl CoA carboxylase 1 and 2 play essential roles in regulat- 
ing fatty acid synthesis and degradation. Recall that this enzyme 
catalyzes the committed step in fatty acid synthesis: the production 
of malonyl CoA (the activated two-carbon donor). This important 
enzyme is subject to both local and hormonal regulation. We will 
examine each of these levels of regulation in turn. 


Acetyl CoA carboxylase is regulated by conditions in the cell 


Acetyl CoA carboxylase responds to changes in its immediate 


ATP ADP 


AMP-activated _@ 
>» protein kinase 
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carboxylase 
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environment. Acetyl CoA carboxylase is switched off by phos- FIGURE 22.34 Control of acetyl CoA carboxylase. Acetyl 
phorylation and activated by dephosphorylation (Figure 22.34). | CoA carboxylase is inhibited by phosphorylation. 


AMP. activated protein kinase (AMPK) converts the carboxyl- 
ase into an inactive form by modifying three serine residues. AMPK is 
essentially a fuel gauge; it is activated by AMP and inhibited by ATP. 

The carboxylase is also allosterically stimulated by citrate. The level of 
citrate is high when both acetyl CoA and ATP are abundant, signifying that 
raw materials and energy are available for fatty acid synthesis. Citrate acts 
in an unusual manner on inactive acetyl CoA carboxylase, which exists as 
isolated inactive dimers. Citrate facilitates the polymerization of the inac- 
tive dimers into active filaments (Figure 22.35). However, polymerization 
by citrate alone requires supraphysiological concentrations. In the cell, 
citrate-induced polymerization 1s facilitated by the protein MIG12, which 
greatly reduces the amount of citrate required. Polymerization can partly 
reverse the inhibition produced by phosphorylation (Figure 22.36). The 
stimulatory effect of citrate on the carboxylase is counteracted by palmitoyl 
CoA, which is abundant when there is an excess of fatty acids. Palmitoyl 
CoA causes the filaments to disassemble into the inactive subunits. 
Palmitoyl CoA also inhibits the translocase that transports citrate from 
mitochondria to the cytoplasm, as well as glucose 6-phosphate dehydroge- 
nase, which generates NADPH in the pentose phosphate pathway. 

The isozyme acetyl CoA carboxylase 2, located in the mitochondria, 
plays a role in the regulation of fatty acid degradation. Malonyl CoA, the 
product of the carboxylase reaction, is present at a high level when fuel mol- 
ecules are abundant. Malonyl CoA inhibits carnitine acyltransferase I, pre- 
venting the entry of fatty acyl CoAs into the mitochondrial matrix in times of 
plenty. Malonyl CoA is an especially effective inhibitor of carnitine acyl- 
transferase I in heart and muscle, tissues that have little fatty acid synthesis 
capacity of their own. In these tissues, acetyl CoA carboxylase may be a 
purely regulatory enzyme. 


Acetyl CoA carboxylase is regulated by a variety of hormones 


Acetyl CoA carboxylase is controlled by the hormones glucagon, epineph- 
rine, and insulin, which denote the overall energy status of the organism. 
Insulin stimulates fatty acid synthesis by activating the carboxylase, whereas 
glucagon and epinephrine have the reverse effect. 


Regulation by glucagon and epinephrine. Consider, again, a person who 
has just awakened from a night’s sleep and begins a bout of exercise. As 
mentioned, glycogen stores will be low, but lipids are readily available for 
mobilization. 

As stated earlier, the hormones glucagon and epinephrine, present 
under conditions of fasting and exercise, will stimulate the release of fatty 
acids from triacylglycerols in fat cells, which will be released into the blood, 
and probably from muscle cells, where they will be used immediately as 
fuel. These same hormones will inhibit fatty acid synthesis by inhibiting 
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FIGURE 22.35 Filaments of acetyl CoA 
carboxylase. The electron micrograph 
shows the enzymatically active filamentous 
form of acetyl CoA carboxylase from chicken 
liver. The inactive form is a dimer of 265-kDa 
subunits. [Courtesy of Dr. M. Daniel Lane.] 
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FIGURE 22.36 Dependence of the 
catalytic activity of acetyl CoA 
carboxylase on the concentration of 
citrate. (A) Citrate can partly activate the 
phosphorylated carboxylase. (B) The 
dephosphorylated form of the carboxylase 
is highly active even when citrate is absent. 
Citrate partly overcomes the inhibition 
produced by phosphorylation. [Information 
from G. M. Mabrouk, |. M. Helmy, 

K. G. Thampy, and S. J. Wakil. J. Biol. 
Chem. 265:6330-6338, 1990.] 


672 


acetyl CoA carboxylase. Although the exact mechanism by which these 
hormones exert their effects is not known, the net result is to augment the 
inhibition by the AMP-activated kinase. This result makes sound physi- 
ological sense: when the energy level of the cell is low, as signified by a 
high concentration of AMP, and the energy level of the organism is low, 
as signaled by glucagon, fats should not be synthesized. Epinephrine, 
which signals the need for immediate energy, enhances this effect. 
Hence, these catabolic hormones switch off fatty acid synthesis by keeping 
the carboxylase in the inactive phosphorylated state. 


Regulation by insulin. Now consider the situation after the exercise has 
ended and the runner has had a meal. In this case, the hormone insulin 
inhibits the mobilization of fatty acids and stimulates their accumulation 
as triacylglycerols by muscle and adipose tissue. Insulin also stimulates 
fatty acid synthesis by activating acetyl CoA carboxylase. Insulin acti- 
vates the carboxylase by enhancing the phosphorylation of AMPK by 
protein kinase B, which inhibits AMPK, as well as by stimulating the 
activity of a protein phosphatase that dephosphorylates and activates ace- 
tyl CoA carboxylase. Thus, the signal molecules glucagon, epinephrine, 
and insulin act in concert on triacylglycerol metabolism and acetyl CoA 
carboxylase to carefully regulate the utilization and storage of fatty acids. 


Response to diet. Long-term control is mediated by changes in the rates of 
synthesis and degradation of the enzymes participating in fatty acid synthesis. 
Animals that have fasted and are then fed high-carbohydrate, low-fat diets 
show marked increases in their amounts of acetyl CoA carboxylase and fatty 
acid synthase within a few days. This type of regulation is known as adaptive 
control. This regulation, which is mediated both by insulin and by glucose, 
is at the level of gene transcription. 


| SUMMARY 


22.1 Triacylglycerols Are Highly Concentrated Energy Stores 
Fatty acids are physiologically important as (1) fuel molecules, 
(2) components of phospholipids and glycolipids, (3) hydrophobic 
modifiers of proteins, and (4) hormones and intracellular messengers. 
They are stored in adipose tissue as triacylglycerols (neutral fat). 


22.2 The Use of Fatty Acids as Fuel Requires Three Stages of Processing 
Triacylglycerols can be mobilized by the hydrolytic action of lipases that 
are under hormonal control. Glucagon and epinephrine stimulate triac- 
ylglycerol breakdown by activating the lipase. Insulin, in contrast, 
inhibits lipolysis. Fatty acids are activated to acyl CoAs, transported 
across the inner mitochondrial membrane by carnitine, and degraded in 
the mitochondrial matrix by a recurring sequence of four reactions: oxi- 
dation by FAD, hydration, oxidation by NAD*, and thiolysis by coen- 
zyme A. The FADH, and NADH formed in the oxidation steps trans- 
fer their electrons to O, by means of the respiratory chain, whereas the 
acetyl CoA formed in the thiolysis step normally enters the citric acid 
cycle by condensing with oxaloacetate. 


22.3 Unsaturated and Odd-Chain Fatty Acids Require Additional 
Steps for Degradation 
Fatty acids that contain double bonds or odd numbers of carbon atoms 
require ancillary steps to be degraded. An isomerase and a reductase are 
required for the oxidation of unsaturated fatty acids, whereas propionyl 
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CoA derived from chains with odd numbers of carbon atoms requires a 
vitamin B,,-dependent enzyme to be converted into succinyl CoA. 
Ketone bodies, formed from acetyl CoA, are an important fuel source 
for some tissues. Mammals are unable to convert fatty acids into glu- 
cose, because they lack a pathway for the net production of oxaloace- 
tate, pyruvate, or other gluconeogenic intermediates from acetyl CoA. 


Fatty Acids Are Synthesized by Fatty Acid Synthase 

Fatty acids are synthesized in the cytoplasm by a different pathway from 
that of B oxidation. Fatty acid synthase is the enzyme complex responsible 
for fatty acid synthase. Synthesis starts with the carboxylation of acetyl 
CoA to malonyl CoA, the committed step. This ATP-driven reaction is 
catalyzed by acetyl CoA carboxylase, a biotin enzyme. The intermediates 
in fatty acid synthesis are linked to an acyl carrier protein. Acetyl ACP is 
formed from acetyl CoA, and malonyl ACP is formed from malonyl 
CoA. Acetyl ACP and malonyl ACP condense to form acetoacetyl ACP, 
a reaction driven by the release of CO, from the activated malonyl unit. A 
reduction, a dehydration, and a second reduction follow. NADPH is the 
reductant in these steps. The butyryl ACP formed in this way is ready for 
a second round of elongation, starting with the addition of a two-carbon 
unit from malonyl ACP. Seven rounds of elongation yield palmitoyl ACP, 
which is hydrolyzed to palmitate. In higher organisms, the enzymes cata- 
lyzing fatty acid synthesis are covalently linked in a multifunctional 
enzyme complex. A reaction cycle based on the formation and cleavage of 
citrate carries acetyl groups from mitochondria to the cytoplasm. NADPH 
needed for synthesis is generated in the transfer of reducing equivalents 
from mitochondria by the combined action of cytoplasmic malate dehy- 
drogenase and malic enzyme and by the pentose phosphate pathway. 


The Elongation and Unsaturation of Fatty Acids Are Accomplished 

by Accessory Enzyme Systems 

Fatty acids are elongated and desaturated by enzyme systems in the 
endoplasmic reticulum membrane. Desaturation requires NADH and 
O, and is carried out by a complex consisting of a flavoprotein, a cyto- 
chrome, and a nonheme iron protein. Mammals lack the enzymes to 
introduce double bonds distal to C-9, and so they require linoleate and 
linolenate in their diets. 

Arachidonate, an essential precursor of prostaglandins and other 
signal molecules, is derived from linoleate. This 20:4 polyunsaturated 
fatty acid is the precursor of several classes of signal molecules— 
prostaglandins, prostacyclins, thromboxanes, and leukotrienes—that 
act as messengers and local hormones because of their transience. 
They are called eicosanoids because they contain 20 carbon atoms. 
Aspirin (acetylsalicylate), an anti-inflammatory and antithrombotic 
drug, irreversibly blocks the synthesis of these eicosanoids. 


Acetyl CoA Carboxylase Plays a Key Role in Controlling 

Fatty Acid Metabolism 

Fatty acid synthesis and degradation are reciprocally regulated so that 
both are not simultaneously active. Acetyl CoA carboxylase, the essen- 
tial control site, is phosphorylated and inactivated by AMP-activated 
kinase. The phosphorylation is reversed by a protein phosphatase. 
Citrate, which signals an abundance of building blocks and energy, 
partly reverses the inhibition by phosphorylation. Carboxylase activity 
is stimulated by insulin and inhibited by glucagon and epinephrine. In 
times of plenty, fatty acyl CoAs do not enter the mitochondrial matrix, 
because malonyl CoA inhibits carnitine acyltransferase I. 
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KEY TERMS 


triacylglycerol (neutral fat, triglyceride) 
(p. 643) 


lipid droplet (p. 645) 

acyl adenylate (p. 649) 

carnitine (p. 649) 

B-oxidation pathway (p. 650) 
vitamin B,, (cobalamin) (p. 654) 


peroxisome (p. 656) 

ketone body (p. 657) 

acyl carrier protein (ACP) (p. 661) 
fatty acid synthase (p. 661) 
malonyl CoA (p. 661) 

acetyl CoA carboxylase (p. 662) 
ATP-citrate lyase (p. 666) 


arachidonate (p. 669) 
prostaglandin (p. 669) 
eicosanoid (p. 669) 
polyketide (p. 670) 


AMP-.-activated protein kinase (AMPK) 
(p. 671) 


PROBLEMS 


1. Match’em. Match each term with its description. 


a 


(a) Triacylglycerol . The enzyme that initiates 
(b) Perilipin lipid degradation 
(c) Adipose triglyceride . Activates fatty acids for 


lipase degradation 


i) 


(d) Glucagon 3. Converts a cis-A? double 

(e) Acyl CoA synthetase bond into a trans- A? 
a double bond 

(f) Carnitine 4, Reduces 2,4-dienoyl 


intermediate to trans-A?- 
enoyl CoA 
. Storage form of fats 
6. Required for entry into 
mitochondria 


(g) B-Oxidation pathway 


on 


(h) Enoyl CoA isomerase 


(i) 2,4-Dienoyl CoA reduc- 


tase 7. Requires vitamin B,, 
(j) Methylmalonyl CoA 8. Acetoacetate 

mutase 9. Means by which fatty 
(k) Ketone body acids are degraded 


10. Stimulates lipolysis 
11. Lipid-droplet-associated 
protein 


2. After lipolysis. Write a balanced equation for the conver- 
sion of glycerol into pyruvate. Which enzymes are required 
in addition to those of the glycolytic pathway? 


3. Forms of energy. The partial reactions leading to the syn- 
thesis of acyl CoA (equations 1 and 2, p. 649) are freely 
reversible. The equilibrium constant for the sum of these 
reactions is close to 1, meaning that the energy levels of the 
reactants and products are about equal, even though a mol- 
ecule of ATP has been hydrolyzed. Explain why these reac- 
tions are readily reversible. 


4, Activation fee. The reaction for the activation of fatty 
acids before degradation is 


(e) 


Je + ATP + HS—CoA ———> 
ze) fe) 
Jk 7 
CoA + AMP + 2P; 
R 5 


Acyl CoA 


R 
Fatty acid 


This reaction is quite favorable because the equivalent 
of two molecules of ATP is hydrolyzed. Explain why, from a 
biochemical bookkeeping point of view, the equivalent of 
two molecules of ATP is used despite the fact that the left 
side of the equation has only one molecule of ATP. 


5. Proper sequence. Place the following list of reactions or 
relevant locations in the B oxidation of fatty acids in the 
proper order. 


(a) Reaction with carnitine 

(b) Fatty acid in the cytoplasm 

(c) Activation of fatty acid by joining to CoA 
(d) Hydration 

(e) NAD*-linked oxidation 

(f) Thiolysis 

(g) Acyl CoA in mitochondrion 

(h) FAD-linked oxidation. 


6. Remembrance of reactions past. We have encountered 
reactions similar to the oxidation, hydration, and oxidation 
reactions of fatty acid degradation earlier in our study 
of biochemistry. What other pathway employs this set of 
reactions? 


7. A phantom acetyl CoA? In the equation for fatty acid 
degradation shown here, only seven molecules of CoA are 
required to yield eight molecules of acetyl CoA. How is this 
difference possible? 


Palmitoyl CoA + 7 FAD + 7 NAD* 
+7CoA+7H,O—> 
8 acetyl CoA + 7 FADH, + 7 NADH + 7 H* 


8. Comparing yields. Compare the ATP yields from pal- 
mitic acid and palmitoleic acid. 


9. Counting ATPs 1. What is the ATP yield for the com- 
plete oxidation of C,, (heptadecanoic) fatty acid? Assume 
that the propionyl CoA ultimately yields oxaloacetate in the 
citric acid cycle. 


10. Sweet temptation. Stearic acid is a C,. fatty acid compo- 
nent of chocolate. Suppose you had a depressing day and 
decided to settle matters by gorging on chocolate. How 
much ATP would you derive from the complete oxidation 
of stearic acid to CO,’ 


11. The best storage form. Compare the ATP yield from 
the complete oxidation of glucose, a six-carbon carbohy- 
drate, and hexanoic acid, a six-carbon fatty acid. 
Hexanoic acid is also called caprioic acid and is respon- 
sible for the “aroma” of goats. Why are fats better fuels 
than carbohydrates? 


12. From fatty acid to ketone body. Write a balanced equa- 
tion for the conversion of stearate into acetoacetate. 


13. Generous, but not to a fault. Liver is the primary site of 
ketone-body synthesis. However, ketone bodies are not 
used by the liver but are released for other tissues to use. 
The liver does gain energy in the process of synthesizing 
and releasing ketone bodies. Calculate the number of mol- 
ecules of ATP generated by the liver in the conversion of 
palmitate, a C,, fatty acid, into acetoacetate. 


14. Counting ATPs 2. How much energy is attained with 
the complete oxidation of the ketone body D-3- 
hydroxybutyrate? 


15. Another view. Why might someone argue that the 
answer to problem 14 is wrong? 


16. An accurate adage. An old biochemistry adage is that 
fats burn in the flame of carbohydrates. What is the molecular 
basis of this adage? 


17. Refsum disease. Phytanic acid is a branched-chain fatty 
acid component of chlorophyll and is a significant compo- 
nent of milk. In susceptible people, phytanic acid can accu- 
mulate, leading to neurological problems. This syndrome is 
called Refsum disease or phytanic acid storage disease. 


CH; CH; CH; CH; 


H,C — CH— (CH,);—- CH— (CH); CH— (CH), — CH— CH —COO- 


Phytanic acid 


(a) Why does phytanic acid accumulate? 


(b) What enzyme activity would you invent to prevent its 
accumulation? 


18. A hot diet. Tritium is a radioactive isotope of hydrogen 
and can be readily detected. A fully tritiated, six-carbon 
saturated fatty acid is administered to a rat, and a muscle 
biopsy of the rat is taken by concerned, sensitive, and dis- 
crete technical assistants. These assistants carefully isolate 
all of the acetyl CoA generated from the B oxidation of the 
radioactive fatty acid and remove the CoA to form acetate. 
What will be the overall trittum-to-carbon ratio of the iso- 
lated acetate? 
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19. Finding triacylglycerols in all the wrong places. Insulin- 
dependent diabetes is often accompanied by high levels 
of triacylglycerols in the blood. Suggest a biochemical 
explanation. 


20. Match ’em. Match the terms with the descriptions. 


(a) ATP-citratelyase__—S—— 1. Helps to generate 

(b) Malic enzyme ____ NADPH from NADH 
(c) Malonyl CoA _____ 2. Inactivates acetyl CoA 
(d) Acetyl CoA carboxylase carboxylase 


3. Molecule on which fatty 
acids are synthesized 


(e) Acyl carrier protein 


(f) B-Ketoacyl synthase 4. A precursor of prosta- 
glandins 

(g) Palmitate 5. Activated acetyl CoA 

(h) Eicosanoids___ 6. The end product of fatty 

(i) Arachidonate acid synthase 


) 
(j) AMP-activated protein 7. Fatty acids containing 
kinase_ 20 carbon atoms 
8. Catalyzes the committed 
step in fatty acid synthesis 
9. Catalyzes the reaction of 
acetyl ACP and malonyl 
ACP 
10. Generates cytoplasmic 
acetyl CoA 
21. Counterpoint. Compare and contrast fatty acid oxida- 
tion and synthesis with respect to 


a) site of the process 

b) acyl carrier 

c) reductants and oxidants 

d) stereochemistry of the intermediates 


e) direction of synthesis or degradation 


( 
( 
( 
( 
( 
( 


f) organization of the enzyme system 


22. The fizz in fatty acid synthesis. During the initial in vitro 
studies on fatty acid synthesis, experiments were performed 
to determine which buffer would result in optimal activity. 
Bicarbonate buffer proved far superior to phosphate buffer. 
The researchers could not initially account for this result. 
From your perspective of many decades later, explain why 
the result is not surprising. 


23. A supple synthesis. Myristate, a saturated C,, fatty acid, 
is used as an emollient for cosmetics and topical medicinal 
preparations. Write a balanced equation for the synthesis of 
myristate. 


24. The cost of cleanliness. Lauric acid is a 12-carbon fatty 
acid with no double bonds. The sodium salt of lauric acid 
(sodium laurate) is a common detergent used in a variety of 
products, including laundry detergent, shampoo, and 
toothpaste. How many molecules of ATP and NADPH are 
required to synthesize lauric acid? 
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25. Proper organization. Arrange the following steps in 
fatty acid synthesis in their proper order. 


(a) Dehydration 
(b) Condensation 


(c) Release of a Cj, 
fatty acid 


(d) Reduction of a carbonyl group 
(e) Formation of malonyl ACP 


26. No access to assets. What would be the effect on fatty 
acid synthesis of a mutation in ATP-citrate lyase that 
reduces the enzyme’s activity? Explain. 


27. The truth and nothing but. True or False. If false, explain. 
(a) Biotin is required for fatty acid synthase activity. 


(b) The condensation reaction in fatty acid synthesis is 
powered by the decarboxylation of malonyl CoA. 


(c) Fatty acid synthesis does not depend on ATP. 
(d) Palmitate is the end product of fatty acid synthase. 


(e) All of the enzyme activities required for fatty acid synthe- 
sis in mammals are contained in a single polypeptide chain. 


(f) Fatty acid synthase in mammals is active as a monomer. 


(g) The fatty acid arachidonate is a precursor for signal 
molecules. 


(h) Acetyl CoA carboxylase is inhibited by citrate. 


28. Odd fat out. Suggest how fatty acids with odd numbers 
of carbons are synthesized. 


29. Labels. Suppose that you had an in vitro fatty acid- 
synthesizing system that had all of the enzymes and cofactors 
required for fatty acid synthesis except for acetyl CoA. To 
this system, you added acetyl CoA that contained radioactive 
hydrogen (7H, tritium) and carbon 14 (!4C) as shown here. 


The ratio of 7H/'4C is 3. What would the 93H/C ratio be 
after the synthesis of palmitic acid (C,,) with the use of the 
radioactive acetyl CoA? 


30. A tight embrace. Avidin, a glycoprotein found in eggs, 
has a high affinity for biotin. Avidin can bind biotin and 
prevent its use by the body. How might a diet rich in raw 
eggs affect fatty acid synthesis? What will be the effect on 
fatty acid synthesis of a diet rich in cooked eggs? Explain. 


31. Alpha or omega? Only one acetyl CoA molecule is used 
directly in fatty acid synthesis. Identify the carbon atoms in 
palmitic acid that were donated by acetyl CoA. 


32. Now you see it, now you don’t. Although HCO; is 
required for fatty acid synthesis, its carbon atom does not 
appear in the product. Explain how this omission 1s possible. 


33. It is all about communication. Why is citrate an appro- 
priate inhibitor of phosphofructokinase? 


34. Tracing carbon atoms. Consider a cell extract that actively 
synthesizes palmitate. Suppose that a fatty acid synthase in 
this preparation forms one molecule of palmitate in about 
5 minutes. A large amount of malonyl CoA labeled with !*C 
in each carbon atom of its malonyl unit is suddenly added to 
this system, and fatty acid synthesis is stopped a minute later 
by altering the pH. The fatty acids are analyzed for radioac- 
tivity. Which carbon atom of the palmitate formed by this 
system is more radioactive—C-1 or C-14? 


35. An unaccepting mutant. The serine residues in acetyl 
CoA carboxylase that are the target of the AMP-activated 
protein kinase are mutated to alanine. What is a likely con- 
sequence of this mutation? 


36. Sources. For each of the following unsaturated fatty 
acids, indicate whether the biosynthetic precursor in ani- 
mals is palmitoleate, oleate, linoleate, or linolenate. 


(a) 18:1 cis-A!! (d) 20:3 cis-A5, A8, Al! 
(b) 18:3 cis-A®, A?, Al? (e) 22:1 cis-Al8 


(c) 20:2 cis-Al!, Al (f) 22:6 cis-A*, A’, A? Al3, 
Al6 A19 


37. Driven by decarboxylation. What is the role of decarbox- 
ylation in fatty acid synthesis? Name another key reaction in 
a metabolic pathway that employs this mechanistic motif. 


38. Kinase surfeit. Suppose that a promoter mutation leads to 
the overproduction of protein kinase A in adipose cells. How 
might fatty acid metabolism be altered by this mutation? 


39. Blocked assets. The presence of a fuel molecule in the 
cytoplasm does not ensure that the fuel molecule can be 
effectively used. Give two examples of how impaired trans- 
port of metabolites between compartments leads to disease. 


40. Elegant inversion. Peroxisomes have an alternative path- 
way for oxidizing polyunsaturated fatty acids. They contain 
a hydratase that converts D-3-hydroxyacyl CoA into trans- 
A?-enoyl CoA. How can this enzyme be used to oxidize 
CoAs containing a cis double bond at an even-numbered 
carbon atom (e.g., the cis-A!? double bond of linoleate)? 


41. Arm in arm. Many eukaryotic enzymes, such as fatty 
acid synthase, are multifunctional proteins with multiple 
active sites covalently linked. What is the advantage of this 
arrangement? 


42 Covalent catastrophe. What is a potential disadvantage 
of having many catalytic sites together on one very long 
polypeptide chain? 


43. Missing acyl CoA dehydrogenases. A number of genetic 
deficiencies in acyl CoA dehydrogenases have been described. 
This deficiency presents early in life after a period of fasting. 
Symptoms include vomiting, lethargy, and sometimes coma. 
Not only are blood levels of glucose low (hypoglycemia), but 
starvation-induced ketosis is absent. Provide a biochemical 
explanation for these last two observations. 


44, Effects of clofibrate. High blood levels of triacylglycer- 
ides are associated with heart attacks and strokes. Clofibrate, 
a drug that increases the activity of peroxisomes, is some- 
times used to treat patients with such a condition. What is 
the biochemical basis for this treatment? 


45. A different kind of enzyme. Figure 22.36 shows the 
response of acetyl CoA carboxylase to varying amounts of 
citrate. Explain this effect in light of the allosteric effects 
that citrate has on the enzyme. Predict the effects of increas- 
ing concentrations of palmitoyl CoA. 


Mechanism Problems 


46. Variation on a theme. Thiolase is homologous in structure 
to the condensing enzyme. On the basis of this observation, 
propose a mechanism for the cleavage of 3-ketoacyl CoA 


by CoA. 


47. Two plus three to make four. Propose a reaction mecha- 
nism for the condensation of an acetyl unit with a malonyl 
unit to form an acetoacety! unit in fatty acid synthesis. 


Chapter Integration Problems 


48. Ill-advised diet. Suppose that, for some bizarre reason, 
you decided to exist on a diet of whale and seal blubber, 
exclusively. 


(a) How would lack of carbohydrates affect your ability to 
utilize fats? 


(b) What would your breath smell like? 


(c) One of your best friends, after trying unsuccessfully to 
convince you to abandon this diet, makes you promise to 
consume a healthy dose of odd-chain fatty acids. Does your 
friend have your best interests at heart? Explain. 


49. Fats to glycogen. An animal is fed stearic acid that is 
radioactively labeled with [!*C]carbon. A liver biopsy 
reveals the presence of !*C-labeled glycogen. How is this 
finding possible in light of the fact that animals cannot con- 
vert fats into carbohydrates? 


50. Enzyme deficit. Pyruvate dehydrogenase deficiency, a 
severe condition with a poor prognosis, results in neurological 
dysfunction and unremitting lactic acidosis. One treatment is 
placing the patients on a low carbohydrate/high fat diet. 


(a) Explain why the deficiency would be characterized by 
neurological dysfunction. 


(b) Why does lactic acidosis result? 


(c) Explain the rationale for the low carbohydrate/high 
fat diet. 


51. Right enzymes. Wrong location. Zwellweger syndrome 
is caused by a defect in enzyme importation into the peroxi- 
somes. What disease of carbohydrate metabolism is caused 
by inappropriate cellular distribution of enzymes? 
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Data Interpretation Problem 


52. Mutant enzyme. Carnitine palmitoyl transferase I 
(CPTI) catalyzes the conversion of long-chain acyl CoA 
into acyl carnitine, a prerequisite for transport into mito- 
chondria and subsequent degradation. A mutant enzyme 
was constructed with a single amino acid change at position 
3 of glutamic acid for alanine. Graphs A through C show 
data from studies performed to identify the effect of the 
mutation. 


(a) What is the effect of the mutation on enzyme activity 
when the concentration of carnitine is varied (Graph A)? 
What are the K,, and V,,,, values for the wild-type and 
mutant enzymes? 


15 - 


Wild type 


| | | | J 
0 250 500 750 1000 1250 


(A) [Carnitine], »M 


CPTI activity (nmol mg~! min~') 


(b) What is the effect when the experiment is repeated with 
varying concentrations of palmitoyl CoA (Graph B)? What are 
the K,, and V,,,. values for the wild-type and mutant enzymes? 


50 - 


40 
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30 
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(c) Graph C shows the inhibitory effect of malonyl CoA on 
the wild-type and mutant enzymes. Which enzyme is more 
sensitive to malonyl CoA inhibition? 


100 


© 
[o) 


a 
fo) 


Mutant 


CPTI activity (% of control) 


40 
20 
| | | | J 
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((@) [Malonyl CoA], »M 
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(d) Suppose that the concentration of palmitoyl CoA = 
100 wM, that of carnitine = 100 pM, and that of malonyl 
CoA = 10 wM. Under these conditions, what is the most 
prominent effect of the mutation on the properties of the 
enzyme? 


(e) What can you conclude about the role of glutamate 3 in 
CPTI function? 


[Data from J. Shi, H. Zhu, D. N. Arvidson, and G. J. Woldegiorgis. J. Biol. 
Chem. 274:9421-9426, 1999.] 


53. Partners. Colipase binds to lipid particles in the intestine 
and facilitates the subsequent binding and activation of pan- 
creatic lipase. The lipase then hydrolyzes the lipids so that 
they can be absorbed by intestinal cells. Experiments were 
undertaken using site-directed mutagenesis (Section 5.2) to 
determine which amino acids in the colipase facilitated lipase 
binding and which activated the lipase. A portion of these 
experiments is presented below. 


Figure A shows the results of two different amino acid sub- 
stitutions on the ability of colipase to facilitate the binding 
of lipase to the lipid particles. In both panels, the blue cir- 
cles show the control (no amino acid changes) data and the 
red circles the data with the altered amino acid. The first 
letter represents the normal amino acid, the number its 
position in the amino acid sequence, and the second letter 
the replacement amino acid. 


(a) What is the amino acid alteration in the upper panel? 
(A) 
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60 
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(b) How does this mutation change the ability of colipase to 
bind the lipase to the lipid particle? 


(c) What is the amino acid substitution in the lower panel? 


(d) What is the effect of this substitution on the ability of 
the colipase to bind the lipase to the lipid particle? 


Next, the activity of the bound lipase was examined, as 
shown in Figure B. 


(B) 
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[Data from: Ross, L. E., Xiao, X., and Lowe, M. E. 2018. Identification of 
amino acids in human colipase that mediate adsorption to lipid emulsions 
and mixed micelles. Biochim. Biophys. Acta 1831: 1052-1059.] 


(e) What was the effect of the L16A mutation on lipase 
activity? 

(f) What does this suggest about the lipase binding and 
activation properties of the colipase? 


(g) What was the effect of the YS55A mutation on lipase 
activity? 


54. Nip it in the bud. Soraphen A is anatural antifungal agent 
isolated from a myxobacterium. Soraphen A is also a potent 
inhibitor of acetyl CoA carboxylase, the regulatory enzyme 
that initiates fatty acid synthesis. As discussed earlier (p. 667), 
cancer cells must produce large amount of fatty acids to gen- 
erate phospholipids for membrane synthesis. Thus, acetyl 
CoA carboxylase might be a target for anti-cancer drugs. 


Following are the results of experiments testing this hypoth- 
esis on a cancer line. 


Figure A shows the effect of various amounts of soraphen A 
on fatty acid synthesis as measured by the incorporation of 
radioactive acetate (!4C) into fatty acids. 


(B) 
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= 
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(a) How did soraphen A alter fatty acid synthesis? How 


was synthesis affected by increasing concentrations of sora- 
phen A? 


Figure B shows the results obtained when fatty acid oxida- 
tion was measured, as the release of radioactive CO, from 


added radioactive (!*C) palmitate, in the presence of sora- 
phen A. 


(b) How did soraphen A alter fatty acid oxidation? 


(c) Explain the results obtained in B in light of the fact that 
soraphen A inhibits acetyl CoA carboxylase. 


More experiments were undertaken to assess whether car- 
boxylase inhibition did indeed prevent phospholipid syn- 
thesis. Again, cells were grown in the presence of radioac- 
tive acetate and phospholipids were subsequently isolated 
and the amount of '*C incorporated into the phospholipid 
was determined. The effect of soraphen A on phospholipid 
synthesis is shown in Figure C. 
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[Data from: Beckers, A., Organe, S., Timmermans, L., Scheys, 

K., Peeters, A., Brusselmans, K., Verhoeven, G., and Swinnen, J. V. 2007. 
Chemical inhibition of acetyl-CoA carboxylase induces growth arrest and 
cytotoxicity selectively in cancer cells. Cancer Res. 67: 8180-8187,] 


(d) Did the inhibition of carboxylase by the drug alter 
phospholipid synthesis? 

(e) How might phospholipid synthesis affect cell viability? 
Finally, as shown in Figure D, experiments were performed 
to determine if the drug inhibits cancer growth in vitro by 


determining the number of cancer cells surviving upon 
exposure to soraphen A. 


(f) How did soraphen A affect cancer cell viability? 


Protein Turnover and Amino 
Acid Catabolism 


Argininosuccinate 


\ 


he digestion of dietary proteins in the intestine and the degradation of 

proteins within the cell provide a steady supply of amino acids to the 
cell. Many cellular proteins are constantly degraded and resynthesized in 
response to changing metabolic demands. Others are misfolded or become 
damaged and they, too, must be degraded. Unneeded or damaged proteins 
are marked for destruction by the covalent attachment of chains of a small 
protein called ubiquitin and then degraded by a large, ATP-dependent com- 
plex called the proteasome. The primary use of amino acids provided through 
degradation or digestion is as building blocks for the synthesis of proteins and 
other nitrogenous compounds such as nucleotide bases. 

Amino acids in excess of those needed for biosynthesis can neither be 
stored, in contrast with fatty acids and glucose, nor excreted. Rather, sur- 
plus amino acids are used as metabolic fuel. The a-amino group is removed, 
and the resulting carbon skeleton is converted into a major metabolic intermedi- 
ate. Most of the amino groups harvested from surplus amino acids are con- 
verted into urea through the urea cycle, and their carbon skeletons are 
transformed into acetyl CoA, acetoacetyl CoA, pyruvate, or one of the 
intermediates of the citric acid cycle. The carbon skeletons are converted 
into glucose, glycogen, and fats. 


f Arginine 


Citrulline 


CHAPTER 


Ornithine 
Carbamoyl 
phosphate 


This fourteenth-century hand-colored woodcut from Germany 
depicts a wheel that classifies urine samples according to their color 
and consistency. In the middle of the wheel, a doctor inspects a 
patient’s urine by sight, smell, and taste. The vials on the wheel 
aided physicians in diagnosing diseases. A key component of urine 
is urea, which is formed from the amino groups released during the 
metabolism of amino acids. [(Left) Rosenwald Collection. Rare Book 
and Special Collections Division, Library of Congress (128.2).] 
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TABLE 23.1 Essential amino acids in 
human beings 

Histidine 

Isoleucine 

Leucine 

Lysine 

Methionine 

Phenylalanine 

Threonine 

Tryptophan 

Valine 


Several coenzymes play key roles in amino acid degradation; foremost 
among them is pyridoxal phosphate. This coenzyme forms Schiff-base inter- 
mediates, which are a type of aldimine, that allow a-amino groups to be 
shuttled between amino acids and ketoacids. We will consider several 
genetic errors of amino acid degradation that lead to brain damage and men- 
tal retardation unless remedial action is initiated soon after birth. The study 
of amino acid metabolism is especially rewarding because it is rich in con- 
nections between basic biochemistry and clinical medicine. 


23.1 Proteins Are Degraded to Amino Acids 


Dietary protein is a vital source of amino acids. Especially important dietary 
proteins are those containing the essential amino acids—amino acids that 
cannot be synthesized and must be acquired in the diet (Table 23.1). 
Proteins ingested in the diet are digested into amino acids or small peptides 
that can be absorbed by the intestine and transported in the blood. Another 
crucial source of amino acids is the degradation of cellular proteins. 


The digestion of dietary proteins begins in the stomach and is 
completed in the intestine 


e Protein digestion begins in the stomach, where the acidic environ- 

ment denatures proteins into random coils. Denatured proteins are 
more accessible as substrates for proteolysis than are native proteins. The 
primary proteolytic enzyme of the stomach is pepsin, a nonspecific protease 
that, remarkably, is maximally active at pH 2. Thus, pepsin can function in 
the highly acidic environment of the stomach that disables other proteins. 

The partly digested proteins then move from the acidic environment of 
the stomach to the beginning of the small intestine. The low pH of the food 
as well as the polypeptide products of pepsin digestion stimulate the release 
of hormones that promote the secretion from the pancreas of sodium bicar- 
bonate (NaHCO,), which neutralizes the pH of the food, and a variety of 
pancreatic proteolytic enzymes. Recall that these enzymes are secreted as 
inactive zymogens that are then converted into active enzymes (Sections 9.1 
and 10.4). The battery of enzymes displays a wide array of specificity, and 
so the substrates are degraded into free amino acids as well as di- and tri- 
peptides. Digestion is further enhanced by proteolytic enzymes, such as 
aminopeptidase N, that are located in the plasma membrane of the intesti- 
nal cells. Aminopeptidases digest proteins from the amino-terminal end. 
Single amino acids, as well as di- and tripeptides, are transported into the 
intestinal cells. 

At least seven different transporters exist, each specific to a different 
group of amino acids. A number of inherited disorders result from muta- 
tions in these transporters. For example, Hartnup disease, a rare disorder 
characterized by rashes, ataxia (lack of muscle control), delayed mental 
development, and diarrhea, results from a defect in the transporter for tryp- 
tophan and other nonpolar amino acids. The absorbed amino acids are 
subsequently released into the blood by a number of Nat—amino acid anti- 
porters for use by other tissues (Figure 23.1). 


Cellular proteins are degraded at different rates 


Protein turnover—the degradation and resynthesis of proteins—takes place 
constantly in cells. Although some proteins are very stable, many proteins 
are short lived, particularly those that participate in metabolic regulation. 
These proteins can be quickly degraded to activate or shut down a signaling 
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FIGURE 23.1 Digestion and absorption of proteins. Protein digestion is primarily a result 
of the activity of enzymes secreted by the pancreas. Aminopeptidases associated with the 
intestinal epithelium further digest proteins. The amino acids and di- and tripeptides are 
absorbed into the intestinal cells by specific transporters (green and orange ovals). Free amino 
acids are then released into the blood by transporters (red oval) for use by other tissues. 


pathway. In addition, cells must eliminate damaged proteins. A significant 
proportion of newly synthesized protein molecules are defective because of 
errors in translation or misfolding. Even proteins that are normal when first 
synthesized may undergo oxidative damage or be altered in other ways with 
the passage of time. These proteins must be removed before they accumu- 
late and aggregate. Indeed, a number of pathological conditions, such as 
certain forms of Parkinson disease and Huntington disease, are associated 
with protein aggregation. 

The half-lives of proteins range over several orders of magnitude. Ornithine 
decarboxylase, at approximately 11 minutes, has one of the shortest half-lives 
of any mammalian protein. This enzyme participates in the synthesis of poly- 
amines, which are cellular cations essential for growth and differentiation. The 
life of hemoglobin, however, is limited only by the life of the red blood cell, 
and the lens protein crystallin is limited by the life of the organism. 


23.2 Protein Turnover Is Tightly Regulated 


How can a cell distinguish proteins that should be degraded? Ubiquitin 
(Ub), a small (76 amino acids) protein present in all eukaryotic cells, is a tag 
that marks proteins for destruction (Figure 23.2). Ubiquitin is the cellular 
equivalent of the “black spot” of Robert Louis Stevenson’s Treasure 
Island—the signal for death. 


Ubiquitin tags proteins for destruction 


Ubiquitin is highly conserved in eukaryotes: yeast and human ubiquitin 
differ at only 3 of 76 residues. The carboxyl-terminal glycine residue of 
ubiquitin becomes covalently attached to the e-amino groups of several 
lysine residues on a protein destined to be degraded. The energy for the 
formation of these isopeptide bonds (iso because e- rather than a-amino 
groups are targeted) comes from ATP hydrolysis. 

Three enzymes participate in the attachment of ubiquitin to a protein 
(Figure 23.3): ubiquitin-activating enzyme, or E1; ubiquitin-conjugating 
enzyme, or E2; and ubiquitin—protein ligase, or E3. First, the C-terminal 
carboxylate group of ubiquitin becomes linked to a sulfhydryl group of E1 
by a thioester bond. This ATP-driven reaction is reminiscent of fatty acid 
activation (Section 22.2). In this reaction, an acyl adenylate is formed at the 
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hk FIGURE 23.2 Structure of ubiquitin. 
Notice that ubiquitin has an extended 
carboxyl terminus, which is activated and 
linked to proteins targeted for destruction. 
Lysine residues, including lysine 48, the 
major site for linking additional ubiquitin 
molecules, are shown as ball-and-stick 
models. [Drawn from 1UBI.pdb.] 
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FIGURE 23.3 Ubiquitin conjugation. The 
ubiquitin-activating enzyme E1 adenylates 
ubiquitin (Ub) (1) and transfers the ubiquitin 
© one of its own cysteine residues (2). 
Ubiquitin is then transferred to a cysteine 
residue in the ubiquitin-conjugating enzyme 
E2 by the E2 enzyme. (8). Finally, the 
ubiquitin—protein ligase E3 transfers the 
ubiquitin to a lysine residue on the target 
protein (4a and 4b). 


FIGURE 23.4 Structure of tetraubiquitin. 
Four ubiquitin molecules are linked by 
isopeptide bonds. Notice that each 
isopeptide bond is formed by the linkage 
of the carboxylate group at the end of the 
extended C terminus with the e-amino 
group of a lysine residue. Dashed lines 
indicate the positions of the extended 
C-termini that were not observed in the 
crystal structure. This unit is the primary 
signal for degradation when linked to a 
target protein. [Drawn from 1TBE. pdb.] 


Isopeptide bonds 


C-terminal carboxylate of ubiquitin with the release of pyrophosphate, and 
the ubiquitin is subsequently transferred to a sulfhydryl group of a key cys- 
teine residue in E1. The activated ubiquitin is then shuttled to a sulfhydryl 
group of E2, a reaction catalyzed by E2 itself. Finally, E3 catalyzes the 
transfer of ubiquitin from E2 to an e-amino group on the target protein. 
The ubiquitination reaction is processive: E3 remains bound to the target 
proteins and generates a chain of ubiquitin molecules by linking the 
€-amino group of lysine residue 48 of one ubiquitin molecule to the termi- 
nal carboxylate of another. A chain of four or more ubiquitin molecules is 
especially effective in signaling the need for degradation (Figure 23.4). 

What determines whether a protein becomes ubiquitinated? A specific 
sequence of amino acids, termed a degron, indicates that a protein should 
be degraded. One such signal appears to be quite simple. The half-life of a 
cytoplasmic protein is determined to a large extent by its amino-terminal residue 
(Table 23.2). This dependency is referred to as the N-terminal rule or the 
N-terminal degron. A yeast protein with methionine at its N terminus typi- 
cally has a half-life of more than 20 hours, whereas one with arginine at this 
position has a half-life of about 2 minutes. A highly destabilizing N-terminal 
residue such as arginine or leucine favors rapid ubiquitination, whereas a 
stabilizing residue such as methionine or proline does not. Why is the 
N-terminal rule only apparently simple? Sometimes the N-terminal degron is 
exposed only after the protein is proteolytically cleaved. 
Such degrons are called pro-N-terminal degrons, in analogy 
to proenzymes (Section 10.4), because the protein must be 
cleaved to expose the signal. In other cases, the destabilizing 
amino acid is added to the protein after the protein is synthe- 
sized. Other modifications, notably N-terminal acetylation, 
can also activate a degron. Additional degrons thought to 
identify proteins for degradation include cyclin destruction 
boxes, which are amino acid sequences that mark cell-cycle 
proteins for destruction, and PEST sequences, which contain 
the amino acid sequence proline (P, single-letter abbrevia- 
tion), glutamic acid (E), serine (S), and threonine (T). 

E3 enzymes are the readers of N-terminal residues. 
Although most eukaryotes have only one or a small number 
of distinct E1 enzymes, all eukaryotes have many distinct 
E2 and E3 enzymes. Moreover, there appears to be only a 
single family of evolutionarily related E2 proteins but three 
distinct families of E3 proteins, all together consisting of 


To target 
protein 


hundreds of members. Indeed, the E3 family is one of the largest gene fami- 
lies in human beings. The diversity of target proteins that must be tagged 
for destruction requires a large number of E3 proteins as readers. 


e Three examples demonstrate the importance of E3 proteins to nor- 

mal cell function. Proteins that are not broken down owing to a defec- 
tive E3 may accumulate to create a disease of protein aggregation such as 
juvenile or early-onset Parkinson disease. A defect in another member of 
the E3 family causes Angelman syndrome, a severe neurological disorder 
characterized by mental retardation, absence of speech, uncoordinated 
movement, and hyperactivity. Conversely, uncontrolled protein turnover 
also can create dangerous pathological conditions. For example, human 
papillomavirus (HPV) encodes a protein that activates a specific E3 enzyme. 
The enzyme ubiquitinates the tumor suppressor p53 and other proteins that 
control DNA repair, which are then destroyed. The activation of this E3 
enzyme is observed in more than 90% of cervical carcinomas. Thus, the 
inappropriate marking of key regulatory proteins for destruction can trigger 
further events, leading to tumor formation. 

It is important to note that the role of ubiquitin is much broader than 
merely marking proteins for destruction. Although we have focused on pro- 
tein degradation, monoubiquitination also regulates 
proteins involved in DNA repair, chromatin remodel- 
ing, and protein kinase activation, among other bio- 
chemical processes. 


a-type subunits 


B-type subunits 


The proteasome digests the ubiquitin-tagged Piype SubHaits 


roteins 
P a-type subunits 


If ubiquitin is the mark of death, what is the execu- 
tioner? A large protease complex called the proteasome or 
the 26S proteasome digests the ubiquitinated proteins. 
This ATP-driven multisubunit protease spares ubiqui- 


TABLE 23.2 Dependence of the 
half-lives of cytoplasmic 
yeast proteins on the 
identity of their 
amino-terminal residues 


Highly stabilizing residues 

(ty; > 20 hours) 

Ala Cys Gly Met 
Pro Ser Thr Val 


Intrinsically destabilizing residues 

(t,,. = 2 to 30 minutes) 

Arg His lle Leu 
Lys Phe Trp Tyr 


Destabilizing residues after chemical 
modification (t,,, = 3 to 30 minutes) 


Asn Asp Gln Glu 


Data from J. W. Tobias, T. E. Schrader, G. Rocap, 
and A. Varshavsky. Science 254(1991):1374-1377. 


N-terminal 
threonine 
nucleophile 


tin, which is then recycled. The 265 proteasome is a 
complex of two components: a 20S catalytic unit and a 
195 regulatory unit. 


FIGURE 23.5 20S proteasome. The 20S proteasome comprises 
28 homologous subunits (a-type, red; B-type, blue), arranged in four rings 
of 7 subunits each. Some of the B-type subunits (right) include protease 


The 205 unit is composed of 28 subunits, encoded 
by 14 genes, arranged into 4 heteroheptomeric rings 
stacked to form a structure resembling a barrel (Figure 23.5). The outer two 
rings of the barrel are made up of a-type subunits and the inner two rings 
of B-type subunits. The 205 catalytic core is a sealed barrel. Access to its 
interior is controlled by a 19S regulatory unit, itselfa 700-kDa complex made 
up of 19 subunits. Two such 19S complexes bind to the 20S proteasome 
core, one at each end, to form the complete 265 proteasome (Figure 23.6). 
The 195 regulatory unit has three functions. First, components of the 195 
unit are ubiquitin receptors that bind specifically to polyubiquitin chains, 
thereby ensuring that only ubiquitinated proteins are degraded. Second, an 
isopeptidase in the 195 unit cleaves off intact ubiquitin molecules from the 
proteins so that they can be reused. Finally, the doomed protein is unfolded 
and directed into the catalytic core. Key components of the 195 complex are 
six ATPases of a type called the AAA class (ATPase associated with vari- 
ous cellular activities). ATP hydrolysis assists the 19S complex to unfold 
the substrate and induce conformational changes in the 205 catalytic core 
so that the substrate can be passed into the center of the complex. 

The proteolytic active sites are sequestered in the interior of the barrel to 
protect potential substrates until they are directed into the barrel. There are 
three types of active sites in the B subunits, each with a different specificity, 


active sites at their amino termini. [Subunit drawn from 1RYP.pdb.] 
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FIGURE 23.6 26S proteasome. A 19S 
regulatory subunit is attached to each end of 
the 20S catalytic unit. [From W. Baumeister, 

J. Walz, F. Zuhl, and E. Seemuller. Cel/ 
92(1998):367-380; courtesy of Dr. Wolfgang 
Baumeister.] 
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Archaeal proteasome Eukaryotic proteasome 


FIGURE 23.8 Proteasome evolution. The 
archaeal proteasome consists of 14 identical 
a subunits and 14 identical B subunits. In the 
eukaryotic proteasome, gene duplication and 
specialization has led to 7 distinct subunits of 
each type. The overall architecture of the 
proteasome is conserved. 


FIGURE 23.7 The proteasome and other 
proteases generate free amino acids. 
Ubiquitinated proteins are processed to 
peptide fragments from which the ubiquitin is 
subsequently removed and recycled. The 
peptide fragments are further digested to 
yield free amino acids, which can be used for 
biosynthetic reactions, most notably protein | 
synthesis. Alternatively, the amino group can 

be removed and processed to urea (p. 693) 
and the carbon skeleton can be used to 
synthesize carbohydrate or fats or used 
directly as a fuel for cellular respiration. 
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but all employ an N-terminal threonine. The hydroxyl group of the threo- 
nine residue is converted into a nucleophile that attacks the carbonyl groups 
of peptide bonds to form acyl-enzyme intermediates. Substrates are 
degraded in a processive manner without the release of degradation inter- 
mediates, until the substrate is reduced to peptides ranging in length from 
seven to nine residues. These peptide products are released from the prote- 
asome and further degraded by other cellular proteases to yield individual 
amino acids. Thus, the ubiquitination pathway and the proteasome cooper- 
ate to degrade unwanted proteins. Figure 23.7 presents an overview of the 
fates of amino acids following proteasomal digestion. 


The ubiquitin pathway and the proteasome have 
prokaryotic counterparts 


Ae Both the ubiquitin pathway and the proteasome appear to be present 

in all eukaryotes. Homologs of the proteasome are also found in some 
prokaryotes. The proteasomes of some archaea are quite similar in overall 
structure to their eukaryotic counterparts and similarly have 28 subunits 
(Figure 23.8). In the archaeal proteasome, however, all a outer-ring sub- 
units and all 8 inner-ring subunits are identical; in eukaryotes, each a or B 
subunit is one of seven different isoforms. This specialization provides 
distinct substrate specificity. 


Protein degradation can be used to regulate biological function 


e Table 23.3 lists a number of physiological processes that are con- 

trolled at least in part by protein degradation through the ubiquitin— 
proteasome pathway. In each case, the proteins being degraded are 
regulatory proteins. Consider, for example, control of the inflammatory 
response. A transcription factor called NF-«B (NF for nuclear factor) initi- 
ates the expression of a number of the genes that take part in this response. 
This factor is itself activated by the degradation of an attached inhibitory 
protein, I-«B (I for inhibitor). In response to inflammatory signals that 
bind to membrane-bound receptors, I-«B is phosphorylated at two serine 
residues, creating an E3 binding site. The binding of E3 leads to the ubiq- 
uitination and degradation of I-«B, unleashing NF-«B. The liberated 
transcription factor migrates from the cytoplasm to the nucleus to stimulate 
the transcription of the target genes. The NF-«B-—I-x«B system illustrates the 
interplay of several key regulatory motifs: receptor-mediated signal trans- 
duction, phosphorylation, compartmentalization, controlled and specific 
degradation, and selective gene expression. The importance of the ubiquitin— 
proteasome system for the regulation of gene expression is highlighted by 
the use of bortezomib (Velcade), a potent inhibitor of the proteasome, as a 
therapy for multiple myeloma. Bortezomib is a dipeptidyl boronic acid 
inhibitor of the proteasome. 


e The evolutionary studies of proteasomes described previously have 

also yielded potential clinical benefits. The bacterial pathogen 
Mycobacterium tuberculosis, the cause of tuberculosis, harbors a proteasome 
that is very similar to the human counterpart. Nevertheless, recent work has 
shown that it is possible to exploit the differences between the human and 
the bacterial proteasomes to develop specific inhibitors of the M. tuberculo- 
sis complex. Oxathiazol-2-one compounds such as HT1171 are suicide 
inhibitors (Section 8.5) of the proteolytic activity of the M. tuberculosis pro- 
teasome, but have no effect on the proteasomes of the human host. This is 
especially exciting because these drugs kill the nonreplicating form of 
M. tuberculosis, and thus may not require the prolonged treatment required 
with conventional drugs, thereby reducing the likelihood of drug resistance 
due to interruption of the treatment regime. 


23.3 The First Step in Amino Acid Degradation Is 
the Removal of Nitrogen 


What is the fate of amino acids released on protein digestion or turnover? 
The first call is for use as building blocks for biosynthetic reactions. However, 
any not needed as building blocks are degraded to compounds able to enter 
the metabolic mainstream. The amino group is first removed, and then the 
remaining carbon skeleton is metabolized to glucose, one of several citric acid 
cycle intermediates, or to acetyl CoA. The major site of amino acid degrada- 
tion in mammals is the liver, although muscles readily degrade the branched- 
chain amino acids (Leu, Ile, and Val). The fate of the a-amino group will be 
considered first, followed by that of the carbon skeleton (Section 23.5). 


Alpha-amino groups are converted into ammonium ions by the 
oxidative deamination of glutamate 


The a-amino group of many amino acids is transferred to a-ketoglutarate 
to form glutamate, which is then oxidatively deaminated to yield ammo- 
nium ion (NH,*). 


TABLE 23.3 Processes regulated by 
protein degradation 


Gene transcription 
Cell-cycle progression 
Organ formation 
Circadian rhythms 
Inflammatory response 
‘Tumor suppression 
Cholesterol metabolism 
Antigen processing 
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Aminotransferases catalyze the transfer of an a-amino group from an 
a-amino acid to an a-ketoacid. These enzymes, also called transaminases, gen- 
erally funnel a-amino groups from a variety of amino acids to a-ketoglutarate 
for conversion into NH,*. 


+H3N fe) .e) +H3N 
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S + ——— + x 
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Aspartate aminotransferase, one of the most important of these enzymes, 
catalyzes the transfer of the amino group of aspartate to a-ketoglutarate. 


Aspartate + a-ketoglutarate == oxaloacetate + glutamate 


Alanine aminotransferase catalyzes the transfer of the amino group of 
alanine to a-ketoglutarate. 


Alanine + a-ketoglutarate == pyruvate + glutamate 


‘Transamination reactions are reversible and can thus be used to synthesize 
amino acids from a-ketoacids, as we shall see in Chapter 24. 

The nitrogen atom in glutamate is converted into free ammonium ion by 
oxidative deamination, a reaction catalyzed by glutamate dehydrogenase. 
This enzyme is unusual in being able to utilize either NAD* or NADP*, at 
least in some species. The reaction proceeds by dehydrogenation of the 
C—N bond, followed by hydrolysis of the resulting ketimine. 


NADH + Ht 


(or NADP*) (or NADPH + H*) H,0 NH,* 


NH,t | Oo 
~ eo ee es 


Ketimine a-Ketoglutarate 
intermediate 


This reaction equilibrium constant is close to 1 in the liver, so the direction 
of the reaction is determined by the concentrations of reactants and prod- 
ucts. Normally, the reaction is driven forward by the rapid removal of 
ammonium ion. Glutamate dehydrogenase, essentially a liver-specific 
enzyme, is located in mitochondria, as are some of the other enzymes 
required for the production of urea. This compartmentalization sequesters 
free ammonium ion, which is toxic. 

In mammals, but not in other organisms, glutamate dehydrogenase is 
allosterically inhibited by GTP and stimulated by ADP. These nucleotides 
exert their regulatory effects in a unique manner. An abortive complex is 
formed on the enzyme when a product is replaced by substrate before the 
reaction is complete. For instance, the enzyme bound to glutamate and 
NAD(P)H is an abortive complex. GTP facilitates the formation of such 
complexes while ADP destabilizes the complexes. 

The sum of the reactions catalyzed by aminotransferases and glutamate 
dehydrogenase is 


a-Amino acid + NAD* + H,O == a-ketoacid + NH,* + NADH + H* 689 
(or NADP*) (or NADPH) 23.3 Nitrogen Removal 


In most terrestrial vertebrates, NH,* is converted into urea, which is 
excreted. 


O 
a-Amino acid a-Ketoglutarate NADH + NH,* ——~> ee 
H2 NH> 
Urea 
a-Ketoacid Glutamate NAD* + H20 
Mechanism: Pyridoxal phosphate forms Schiff-base intermediates 
in aminotransferases 


All aminotransferases contain the prosthetic group pyridoxal phosphate 
(PLP), which is derived from pyridoxine (vitamin B,). Pyridoxal phosphate 
includes a pyridine ring that is slightly basic to which is attached an OH 


group that is slightly acidic. Thus, pyridoxal phosphate derivatives can ice 
form a stable tautomeric form in which the pyridine nitrogen atom is pro- Z OH 
tonated and, hence, positively charged, while the OH group loses a proton Ho 
and hence is negatively charged, forming a phenolate. ~— 
N CHz 
0. H (0) H Pyridoxine 
2- i 2- f (Vitamin Bg) 
sep OH cee. o- 
ory A oy a 
of fo) if fo) _ O. OH 
~ f 
N CH; SK CH; P| OH 
H on) Sy we 
PLP (protonated) PLP (phenolate) 0 
The most important functional group on PLP is the aldehyde. This N CH3 
group forms covalent Schiff-base intermediates with amino acid substrates. a fe 


Indeed, even in the absence of substrate, the aldehyde group of PLP usually 
forms a Schiff-base linkage with the e-amino group of a specific lysine resi- 
due at the enzyme’s active site. A new Schiff-base linkage is formed on addi- 
tion of an amino acid substrate. 


Schiff-base Lysine 
linkage 
‘ (protonated) 
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The a-amino group of the amino acid substrate displaces the e-amino group 
of the active-site lysine residue. In other words, an internal aldimine becomes 
an external aldimine. The amino acid—PLP Schiff base that is formed 
remains tightly bound to the enzyme by multiple noncovalent interactions. 
The Schiff-base linkage often accepts a proton at the N, with the positive 
charge stabilized by interaction with the negatively charged phenolate 
group of PLP. 


Aldimine 


FIGURE 23.9 Transamination 
mechanism. (1) The external aldimine loses 
a proton to form a quinonoid intermediate. 
(2) Reprotonation of this intermediate at the 
aldehyde carbon atom yields a ketimine. 

(8) This intermediate is hydrolyzed to 
generate the a-ketoacid product and 
pyridoxamine phosphate. 


Pyridoxamine phosphate 
(PMP) 


FIGURE 23.10 Aspartate 
aminotransferase. The active site of 

this prototypical PLP-dependent enzyme 
includes pyridoxal phosphate attached to 
the enzyme by a Schiff-base linkage with 
sine 258. An arginine residue in the active 
site helps orient substrates by binding to 
heir a-carboxylate groups. Only one of the 
enzyme’s two subunits is shown. [Drawn 
rom 1AAW.pdb.] 
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The Schiff base between the amino acid substrate and PLP, the external 
aldimine, loses a proton from the a-carbon atom of the amino acid to form a 
quinonoid intermediate (Figure 23.9). Reprotonation of this intermediate at 
the aldehyde carbon atom yields a ketimine. The ketimine is then hydrolyzed 
to an a-ketoacid and pyridoxamine phosphate (PMP). These steps constitute 
half of the transamination reaction. 


Amino acid; + E-PLP = —  a-ketoacid; + E-PMP 


The second half takes place by the reverse of the preceding pathway. A 
second a-ketoacid reacts with the enzyme—pyridoxamine phosphate com- 
plex (E-PMP) to yield a second amino acid and regenerate the enzyme— 
pyridoxal phosphate complex (E-PLP). 


a-Ketoacid, + E-PMP == aminoacid, + E-PLP 
The sum of these partial reactions is 


Amino acid, + a-ketoacid? === amino acid? + a-ketoacid, 


Aspartate aminotransferase is an archetypal 
pyridoxal-dependent transaminase 


The mitochondrial enzyme aspartate aminotransferase provides an espe- 
cially well studied example of PLP as a coenzyme for transamination reac- 
tions (Figure 23.10). X-ray crystallographic studies provided detailed views 
of how PLP and substrates are bound and confirmed much of the proposed 
catalytic mechanism. Each of the identical 45-kDa subunits of this dimer 
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consists of a large domain and a small one. PLP is bound to the large 
domain, in a pocket near the domain interface. In the absence of substrate, 
the aldehyde group of PLP is in a Schiff-base linkage with lysine 258, as 
expected. Adjacent to the coenzyme’s binding site is a conserved arginine 
residue that interacts with the a-carboxylate group of the amino acid sub- 
strate, helping to orient the substrate appropriately in the active site. A base 
is necessary to remove a proton from the a-carbon group of the amino acid 
and to transfer it to the aldehyde carbon atom of PLP (Figure 23.9, steps 
1 and 2). The lysine amino group that was initially in Schiff-base linkage 
with PLP appears to serve as the proton donor and acceptor. 


Blood levels of aminotransferases serve a diagnostic function 


cz The presence of alanine and aspartate aminotransferase in the blood is 

an indication of liver damage. Liver damage can occur for a number of 
reasons, including viral hepatitis, long-term excessive alcohol consumption, 
and reaction to drugs such as acetaminophen (Section 36.1). Under these 
conditions, liver cell membranes are damaged and some cellular proteins, 
including the aminotransferases, leak into the blood. Normal blood values for 
alanine and aspartate aminotransferase activity are 5-30 units/] and 40-125 
units/1, respectively. Depending on the extent of liver damage, the values will 
reach 200-300 units/1. 


Pyridoxal phosphate enzymes catalyze a wide array of reactions 


Transamination is just one of a wide range of amino acid transformations 
that are catalyzed by PLP enzymes. The other reactions catalyzed by PLP 
enzymes at the a-carbon atom of amino acids are decarboxylations, deami- 
nations, racemizations, and aldol cleavages (Figure 23.11). In addition, PLP 
enzymes catalyze elimination and replacement reactions at the B-carbon 
atom (e.g., tryptophan synthetase in the synthesis of tryptophan) and the 
y-carbon atom (e.g., cystathionine B-synthase in the synthesis of cysteine) 
of amino acid substrates. Three common features of PLP catalysis underlie 
these diverse reactions. 


1. A Schiff base is formed by the amino acid substrate (the amine compo- 
nent) and PLP (the carbonyl component). 


2. The protonated form of PLP acts as an electron sink to stabilize catalytic 
intermediates that are negatively charged. Electrons from these intermedi- 
ates are attracted to the positive charge on the ring nitrogen atom. In other 
words, PLP is an electrophilic catalyst. 


3. The product Schiff base is cleaved at the completion of the reaction. 


How does an enzyme selectively break a particular one of three bonds at 
the a-carbon atom of an amino acid substrate? An important principle is 
that the bond being broken must be perpendicular to the 7m orbitals of the elec- 
tron sink (Figure 23.12). An aminotransferase, for example, binds the amino 
acid substrate so that the C,—H bond is perpendicular to the PLP ring 


Bond most nearly FIGURE 23.12 Stereoelectronic effects. The 
Lge cnarpapactone orientation about the N—C, bond determines the 
most favored reaction catalyzed by a pyridoxal 
phosphate enzyme. The bond that is most nearly 
V perpendicular to the plane of delocalized 7 orbitals 
N , (represented by dashed lines) of the pyridoxal 
Rz phosphate electron sink is most easily cleaved. 


t orbitals 
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FIGURE 23.11 Bond cleavage by PLP 
enzymes. Pyridoxal phosphate enzymes 
labilize one of three bonds at the a-carbon 
atom of an amino acid substrate. For 
example, bond a is labilized by 
aminotransferases, bond b by 
decarboxylases, and bond c by aldolases 
(such as threonine aldolases). PLP 
enzymes also catalyze reactions at the 

B- and y-carbon atoms of amino acids. 
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FIGURE 23.13 Reaction choice. In aspartate 
aminotransferase, the C,—H bond is most 
nearly perpendicular to the z-orbital system and Bond most nearly 
is cleaved. In serine hydroxymethyltransferase, a perpendicular 
small rotation about the N—C, bond places the to x orbitals 


C,—Cp bond perpendicular to the m system, 
favoring its cleavage. 


(Figure 23.13). In serine hydroxymethyltransferase, the enzyme that con- 
verts serine into glycine, the N—C, bond is rotated so that the C a Cp 
bond is most nearly perpendicular to the plane of the PLP ring, favoring its 
cleavage. This means of choosing one of several possible catalytic outcomes 
is called stereoelectronic control. 


n/a Many of the PLP enzymes that catalyze amino acid transformations, 

such as serine hydroxymethyltransferase, have a similar structure and 
are clearly related by divergent evolution. Others, such as tryptophan synthe- 
tase, have quite different overall structures. Nonetheless, the active sites of 
these enzymes are remarkably similar to that of aspartate aminotransferase, 
revealing the effects of convergent evolution. 


Serine and threonine can be directly deaminated 


The a-amino groups of serine and threonine can be directly converted into 
NH,* without first being transferred to a-ketoglutarate. These direct 
deaminations are catalyzed by serine dehydratase and threonine dehydratase, 
in which PLP is the prosthetic group. 


Serine —> pyruvate + NH," 
Threonine —> a-ketobutyrate + NH,* 


These enzymes are called dehydratases because dehydration precedes 
deamination. Serine loses a hydrogen ion from its a-carbon atom and a 
hydroxide ion group from its B-carbon atom to yield aminoacrylate. This 
unstable compound reacts with H,O to give pyruvate and NH,*. Thus, the 
presence of a hydroxyl group attached to the B-carbon atom in each of these 
amino acids permits the direct deamination. 


Peripheral tissues transport nitrogen to the liver 


Although most amino acid degradation takes place in the liver, other tissues 
can degrade amino acids. For instance, muscle uses branched-chain amino 
acids as a source of fuel during prolonged exercise and fasting. How is the 
nitrogen processed in these other tissues? As in the liver, the first step is 
the removal of the nitrogen from the amino acid. However, muscle lacks the 
enzymes of the urea cycle, and so the nitrogen must be released in a non- 
toxic form that can be absorbed by the liver and converted into urea. 
Nitrogen is transported from muscle to the liver in two principal trans- 
port forms. Glutamate is formed by transamination reactions, but the nitro- 
gen is then transferred to pyruvate to form alanine, which is released into 
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Active pathways: 


. Glycogen breakdown, Chapter 21 
Glycolysis, Chapter 16 

Citric acid cycle, Chapter 17 

Oxidative phosphorylation, Chapter 18 
Gluconeogenesis, Chapter 16 

Urea cycle, Chapter 23 
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the blood (Figure 23.14). The liver takes up the alanine and converts it back 
into pyruvate by transamination. The pyruvate can be used for gluconeo- 


genesis and the amino group eventually appears as urea. Thi 


s transport is 


referred to as the glucose—alanine cycle. It is reminiscent of the Cori cycle 
discussed earlier (Figure 16.33). However, in contrast with the Cori cycle, 
pyruvate is not reduced to lactate by NADH, and thus more high-energy 
electrons are available for oxidative phosphorylation in the muscle. 
Glutamine is also a key transport form of nitrogen. Glutamine synthe- 
tase catalyzes the synthesis of glutamine from glutamate and NH,” in an 


ATP-dependent reaction: 
NH,* + glutamate + ATP 


Glutamine synthetase 


glutamine + 


ADP + P. 


The nitrogens of glutamine can be converted into urea in the liver. 


23.4 Ammonium lon Is Converted into Urea in Most 


Terrestrial Vertebrates 


Some of the NH,* formed in the breakdown of amino acids is 
consumed in the biosynthesis of nitrogen compounds. In most 
terrestrial vertebrates, the excess NH 7 is converted into urea 
and then excreted. Such organisms are referred to as ureotelic. 
In terrestrial vertebrates, urea is synthesized by the urea 
cycle (Figure 23.15). One of the nitrogen atoms of urea is trans- 
ferred from an amino acid, aspartate. The other nitrogen atom 
is derived directly from free NH,~, and the carbon atom comes 
from HCO,” (derived by the hydration of CO,,; Section 9.2). 


fe) 
NHq HN NH> 


Urea 


The urea cycle begins with the formation of 
carbamoyl phosphate 


The urea cycle begins with the coupling of free NH, with 
HCO, to form carbamoyl phosphate, the committed reaction 
of the urea cycle, which is catalyzed by carbamoyl phosphate 


Fumarate 


Argininosuccinate 


Aspartate 
R—NH) 


CYTOPLASM 


FIGURE 23.14 PATHWAY INTEGRATION: 
The glucose-alanine cycle. During 
prolonged exercise and fasting, muscle uses 
branched-chain amino acids as fuel. The 
nitrogen removed is transferred (through 
glutamate) to alanine, which is released into 
the blood stream. In the liver, alanine is taken 
up and converted into pyruvate for the 
subsequent synthesis of glucose. 
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FIGURE 23.15 The urea cycle. 
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synthetase I. Carbamoyl phosphate is a simple molecule, but its synthesis is 
complex, requiring three steps. 


| ADP ; 0 O2 NH; P ATP ADP 2~ i ; 
ane, \/ LA os Wy 
X yf a yee ps, yeh 
HO re) O HO NH ie) (o) NH> 
Carboxyphosphate Carbamic acid Carbamoyl phosphate 


Note that NH,, because it is a strong base, normally exists as NH,” in aque- 
ous solution. However, carbamoyl phosphate synthetase uses only NH, as a 
substrate. The reaction begins with the phosphorylation of HCO,” to form 
carboxyphosphate, which then reacts with NH, to form carbamic acid. Finally, 
a second molecule of ATP phosphorylates carbamic acid to form carbamoyl 
phosphate. The structure and mechanism of the enzyme that catalyzes these 
reactions will be presented in Chapter 25. The consumption of two molecules 
of ATP makes this synthesis of carbamoyl phosphate essentially irreversible. 


Carbamoyl phosphate synthetase is the key regulatory 
enzyme for urea synthesis 


Carbamoyl phosphate synthetase is regulated both allosterically and by 
covalent modification so that it is maximally active when amino acids are 
being metabolized for fuel use. The allosteric regulator N-acetylglutamate 
(NAG) is required for synthetase activity. This molecule is synthesized by 
N-acetylglutamate synthase. 


melele -OOC 


7 : H N-Acetylglutamate 
= synthase 
_Z0OA + > 
H3C Ss *H3N CoO- CoO- 
Acetyl CoA Glutamate N-Acetylglutamate 


N-acetylglutamate synthase is itself activated by arginine. Thus, NAG is 
synthesized when amino acids, as represented by arginine and glutamate, 
are readily available, and carbamoyl phosphate synthetase is then activated 
to process the generated ammonia. When ammonia is not being generated, 
the synthetase is inhibited by acetylation. A rise in the mitochondrial 
NAD*%, indicative of an energy-poor state, stimulates a deacetylase that 
removes the acetyl group, activating the synthetase and readying the 
enzyme for processing ammonia from protein degradation. The control of 
the acetylation of the synthetase is not yet clear. 


Carbamoyl phosphate reacts with ornithine to begin the urea cycle 


The carbamoyl group of carbamoyl phosphate has a high transfer potential 
because of its anhydride linkage. The carbamoyl group is transferred to orni- 
thine to form citrulline, in a reaction catalyzed by ornithine transcarbamoylase. 


Vi 


H2N——C 
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+ Cc - =a 
+HSN COO- HN ~o~ 2 _Omiithine +HsN coo- 
oO transcarbamoylase 
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Ornithine and citrulline are amino acids, but they are not used as 
building blocks of proteins. The formation of NH,* by glutamate dehy- 
drogenase, its incorporation into carbamoyl phosphate as NH,, and the 
subsequent synthesis of citrulline take place in the mitochondrial matrix. 
In contrast, the next three reactions of the urea cycle, which lead to the 
formation of urea, take place in the cytoplasm. 

Citrulline is transported to the cytoplasm, where it condenses with 
aspartate, the donor of the second amino group of urea. This synthesis of 
argininosuccinate, catalyzed by argininosuccinate synthetase, is driven by the 
cleavage of ATP into AMP and pyrophosphate and by the subsequent 
hydrolysis of pyrophosphate. 


~OOC 
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Argininosuccinase (also called argininosuccinate lyase) cleaves arginino- 
succinate into arginine and fumarate. Thus, the carbon skeleton of aspartate 
is preserved in the form of fumarate. 


H 
Argininosuccinase a L 
SEER 
+ - Oe" 
H3N COO 
Argininosuccinate Arginine Fumarate 


Finally, arginine is hydrolyzed to generate urea and ornithine in a reac- 
tion catalyzed by arginase. Ornithine is then transported back into the mito- 
chondrion to begin another cycle. The urea is excreted. Indeed, human 
beings excrete about 10 kg (22 pounds) of urea per year. 


NH 
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695 
23.4 The Urea Cycle 


In ancient Rome, urine was a valuable 
commodity. Vessels were placed on street 
corners for passersby to urinate into. 
Bacteria would degrade the urea, releasing 
ammonium ion, which would act as a bleach 
to brighten togas. 
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FIGURE 23.16 Metabolic integration of 
nitrogen metabolism. The urea cycle, 
gluconeogenesis, and the transamination 
of oxaloacetate are linked by fumarate 
and aspartate. 


& FIGURE 23.17 Homologous 
enzymes. The structure of the catalytic 
subunit of ornithine transcarbamoylase 
(blue) is quite similar to that of the catalytic 
subunit of aspartate transcarbamoylase 
(red), indicating that these two enzymes 
are homologs. [Drawn from 1AKM.pdb 
and 1RAI.pdb.] 


The urea cycle is linked to gluconeogenesis 


The stoichiometry of urea synthesis is 


CO, + NH,* + 3 ATP + aspartate + 2H,O —> 
urea + 2 ADP + P; + AMP + PP; + fumarate 


Pyrophosphate is rapidly hydrolyzed, and so the equivalent of four mol- 
ecules of ATP are consumed in these reactions to synthesize one mole- 
cule of urea. The synthesis of fumarate by the urea cycle is important 
because it is a precursor for glucose synthesis (Figure 23.16). Fumarate is 
hydrated to malate, which is in turn oxidized to oxaloacetate. Oxaloacetate 
can be converted into glucose by gluconeogenesis or transaminated 
to aspartate. 
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Urea-cycle enzymes are evolutionarily related to enzymes 
in other metabolic pathways 


Se Carbamoyl phosphate synthetase generates carbamoyl phosphate 
for both the urea cycle and the first step in pyrimidine biosynthesis 
(Section 25.1). In mammals, two distinct isozymes of the enzyme are pres- 
ent. Carbamoy] phosphate synthetase II, used in pyrimidine biosynthesis, 
differs in two important ways from its urea-cycle counterpart. First, this 
enzyme utilizes glutamine as a nitrogen source rather than NH,. The side- 
chain amide of glutamine is hydrolyzed within one domain of the enzyme, 
and the ammonia generated moves through a tunnel in the enzyme to a 
second active site, where it reacts with carboxyphosphate. Second, this 
enzyme is part of a large complex that catalyzes several steps in pyrimi- 
dine biosynthesis (Section 25.1). Interestingly, the domain in which 
glutamine hydrolysis takes place is largely preserved in the urea-cycle 
enzyme, although that domain is catalytically inactive. This site binds 
N-acetylglutamate, an allosteric activator of the enzyme. A catalytic site in 
one isozyme has been adapted to act as an allosteric site in another isozyme 
having a different physiological role. 

Can we find homologs for the other enzymes in the urea cycle? Ornithine 
transcarbamoylase is homologous to aspartate transcarbamoylase, which 
catalyzes the first step in pyrimidine biosynthesis, and the structures of 
their catalytic subunits are quite similar (Figure 23.17). Thus, two consecu- 
tive steps in the pyrimidine biosynthetic pathway were adapted for urea 
synthesis. The next step in the urea cycle is the addition of aspartate to 
citrulline to form argininosuccinate, and the subsequent step is the removal 
of fumarate. These two steps together accomplish the net addition of an 
amino group to citrulline to form arginine. Remarkably, these steps are 
analogous to two consecutive steps in the purine biosynthetic pathway 
(Section 25.2). 


Aspartate 


Fumarate 
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P-ribose P-ribose 


The enzymes that catalyze these steps are homologous to argininosuc- 
cinate synthetase and argininosuccinase, respectively. Thus, four of the five 
enzymes in the urea cycle were adapted from enzymes taking part in nucleo- 
tide biosynthesis. The remaining enzyme, arginase, appears to be an ancient 
enzyme found in all domains of life. 


Inherited defects of the urea cycle cause hyperammonemia 
and can lead to brain damage 


e The synthesis of urea in the liver is the major route for the removal of 

NH,*. Urea cycle disorders occur with a prevalence of about 1 in 
15,000. A blockage of carbamoyl phosphate synthesis or of any of the four 
steps of the urea cycle has devastating consequences because there is no 
alternative pathway for the synthesis of urea. All defects in the urea cycle lead 
to an elevated level of NH,* in the blood (hyperammonemia). Some of these 
genetic defects become evident a day or two after birth, when the afflicted 
infant becomes lethargic and vomits periodically. Coma and irreversible 
brain damage may soon follow, a condition called hepatic encephalopathy. 
Why are high levels of NH,* toxic? The answer to this question is not yet 
known. Recent work, however, suggests that NH,* may inappropriately 
activate a sodium-potassium-chloride cotransporter. This activation dis- 
rupts the osmotic balance of the nerve cell, causing swelling that damages 
the cell and results in neurological disorders. 

Ingenious strategies for coping with deficiencies in urea synthesis have 
been devised on the basis of a thorough understanding of the underlying 
biochemistry. Consider, for example, argininosuccinase deficiency. This 
defect can be partly bypassed by providing a surplus of arginine in the diet 
and restricting the total protein intake. In the liver, arginine is split into 
urea and ornithine, which then reacts with carbamoyl phosphate to form 
citrulline (Figure 23.18). This urea-cycle intermediate condenses with 
aspartate to yield argininosuccinate, which is then excreted. Note that two 
nitrogen atoms—one from carbamoyl phosphate and the other from 
aspartate—are eliminated from the body per molecule of arginine pro- 
vided in the diet. In essence, argininosuccinate substitutes for urea in carry- 
ing nitrogen out of the body. 

The treatment of carbamoyl phosphate synthetase deficiency or ornithine 
transcarbamoylase deficiency illustrates a different strategy for circumvent- 
ing a metabolic block. Citrulline and argininosuccinate cannot be used to 
dispose of nitrogen atoms because their formation is impaired. Under these 
conditions, excess nitrogen accumulates in glycine and glutamine. The 
challenge then is to rid the body of the nitrogen accumulating in these two 
amino acids. That goal is accomplished by supplementing a protein- 
restricted diet with large amounts of benzoate and phenylacetate. Benzoate is 
activated to benzoyl CoA, which reacts with glycine to form hippurate, 
which is excreted (Figure 23.19). Likewise, phenylacetate is activated to 
phenylacetyl CoA, which reacts with glutamine to form phenylacetylgluta- 
mine, which is also excreted. These conjugates substitute for urea in the 
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FIGURE 23.18 Treatment of 
argininosuccinase deficiency. 
Argininosuccinase deficiency can be 
managed by supplementing the diet with 
arginine. Nitrogen is excreted in the form 
of argininosuccinate. 
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FIGURE 23.19 Treatment of carbamoyl 
phosphate synthetase and ornithine 
transcarbamoylase deficiencies. Both 
deficiencies can be treated by 
supplementing the diet with benzoate and 
phenylacetate. Nitrogen is excreted in the 


form of hippurate and phenylacetylglutamine. 
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disposal of nitrogen. Thus, latent biochemical pathways can be activated to 
partly bypass a genetic defect. 


Urea is not the only means of disposing of excess nitrogen 


As stated earlier, most terrestrial vertebrates are ureotelic; they excrete excess 
nitrogen as urea. However, urea is not the only excretable form of nitrogen. 
Ammoniotelic organisms, such as aquatic vertebrates and invertebrates, release 
nitrogen as NH,* and rely on the aqueous environment to dilute this toxic 
substance. Interestingly, lungfish, which are normally ammoniotelic, become 
ureotelic in time of drought, when they live out of the water. 

Both ureotelic and ammoniotelic organisms depend on sufficient water, 
to varying degrees, for nitrogen excretion. In contrast, uricotelic organisms, 
such as birds and reptiles, secrete nitrogen as the purine uric acid. Uric acid is 
secreted as an almost solid slurry requiring little water. The secretion of uric 
acid also has the advantage of removing four atoms of nitrogen per mole- 
cule. The pathway for nitrogen excretion developed in the course of evolu- 
tion clearly depends on the habitat of the organism. 


23.5 Carbon Atoms of Degraded Amino Acids Emerge 
as Major Metabolic Intermediates 


We now turn to the fates of the carbon skeletons of amino acids after the 
removal of the a-amino group. The strategy of amino acid degradation is to 
transform the carbon skeletons into major metabolic intermediates that can be 
converted into glucose or oxidized by the citric acid cycle. The conversion 
pathways range from extremely simple to quite complex. The carbon skel- 
etons of the diverse set of 20 fundamental amino acids are funneled into 
only seven molecules: pyruvate, acetyl CoA, acetoacetyl CoA, a-ketoglutarate, 
succinyl CoA, fumarate, and oxaloacetate. \We see here an example of the 
remarkable economy of metabolic conversions. 

Amino acids that are degraded to acetyl CoA or acetoacetyl CoA are 
termed ketogenic amino acids because they can give rise to ketone bodies or 
fatty acids. Amino acids that are degraded to pyruvate, a-ketoglutarate, 
succinyl CoA, fumarate, or oxaloacetate are termed glucogenic amino acids. 
Oxaloacetate, generated from pyruvate and other citric acid cycle interme- 
diates, can be converted into phosphoenolpyruvate and then into glucose 
(Section 16.3). Recall that mammals lack a pathway for the net synthesis of 
glucose from acetyl CoA or acetoacetyl CoA. 
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Of the basic set of 20 amino acids, only leucine and lysine are 
solely ketogenic (Figure 23.20). Isoleucine, phenylalanine, tryptophan, 
and tyrosine are both ketogenic and glucogenic. Some of their carbon 
atoms emerge in acetyl CoA or acetoacetyl CoA, whereas others appear in 
potential precursors of glucose. The other 14 amino acids are classed as 
solely glucogenic. We will identify the degradation pathways by the entry 
point into metabolism. 


Pyruvate is an entry point into metabolism for a number 
of amino acids 


Pyruvate is the entry point of the three-carbon amino acids—alanine, 
serine, and cysteine—into the metabolic mainstream (Figure 23.21). The 
transamination of alanine directly yields pyruvate. 


Alanine + a-ketoglutarate == pyruvate + glutamate 
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FIGURE 23.21 Pyruvate formation from amino acids. Pyruvate is the point of entry for 
alanine, serine, cysteine, glycine, threonine, and tryptophan. 
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FIGURE 23.22 a-Ketoglutarate formation 
from amino acids. a-Ketoglutarate is the 


point of entry 


of several five-carbon amino 


acids that are first converted into glutamate. 
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As mentioned earlier in the chapter, glutamate is then oxidatively deami- 
nated, yielding NH,* and regenerating a-ketoglutarate. The sum of these 
reactions is 


Alanine + NAD(P)* +H,O <=> pyruvate + NH,* + NAD(P)H + Ht 


Another simple reaction in the degradation of amino acids is the deami- 
nation of serine to pyruvate by serine dehydratase (p. 692). 


Serine —> pyruvate + NH,~ 


Cysteine can be converted into pyruvate by several pathways, with its sulfur 
atom emerging in H,S, SCN”, or sO 

The carbon atoms of three other amino acids can be converted into pyru- 
vate. Glycine can be converted into serine by the enzymatic addition of a 
hydroxymethyl group or it can be cleaved to give CO,, NH,*, and an acti- 
vated one-carbon unit. Threonine can give rise to pyruvate through the 
intermediate 2-amino-3-ketobutyrate. Three carbon atoms of tryptophan 
can emerge in alanine, which can be converted into pyruvate. 


Oxaloacetate is an entry point into metabolism for 
aspartate and asparagine 


Aspartate and asparagine are converted into oxaloacetate, a citric acid cycle 
intermediate. Aspartate, a four-carbon amino acid, is directly transaminated 
to oxaloacetate. 


Aspartate + a-ketoglutarate == oxaloacetate + glutamate 


Asparagine is hydrolyzed by asparaginase to NH,* and aspartate, which is 
then transaminated. 

Recall that aspartate can also be converted into fumarate by the urea 
cycle (Figure 23.16). Fumarate is a point of entry for half the carbon atoms 
of tyrosine and phenylalanine, as will be discussed shortly. 


Alpha-ketoglutarate is an entry point into 
metabolism for five-carbon amino acids 


The carbon skeletons of several five-carbon amino 
O acids enter the citric acid cycle at a-ketoglutarate. 


These amino acids are first converted into gluta- 
=60e coo- ‘mate, which is then oxidatively deaminated by glu- 


tamate dehydrogenase to yield a-ketoglutarate 
(Figure 23.22). 

Histidine is converted into 4-imidazolone 5-propionate (Figure 23.23). 
The amide bond in the ring of this intermediate is hydrolyzed to the 
N-formimino derivative of glutamate, which is then converted into gluta- 
mate by the transfer of its formimino group to tetrahydrofolate, a carrier of 
activated one-carbon units (Figure 24.9). 
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FIGURE 23.23 Histidine degradation. Conversion of histidine into glutamate. 
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FIGURE 23.24 Proline and arginine 
degradation. Conversion of proline and 
arginine into glutamate. 


Arginine Ornithine 


Glutamine is hydrolyzed to glutamate and NH,” by glutaminase. Proline 
and arginine are each converted into glutamate y-semialdehyde, which is 
then oxidized to glutamate (Figure 23.24). 


Succinyl coenzyme A is a point of entry for several 
nonpolar amino acids 


Succinyl CoA is a point of entry for some of the carbon atoms of methio- 
nine, isoleucine, and valine. Propionyl CoA and then methylmalonyl CoA 
are intermediates in the breakdown of these three nonpolar amino acids 
(Figure 23.25). This pathway from propionyl CoA to succinyl CoA is also 
used in the oxidation of fatty acids that have an odd number of carbon 
atoms. The mechanism for the interconversion of propionyl CoA and meth- 
ylmalonyl CoA was presented in Section 22.3. 
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H> formation. Conversion of 
O O 0 methionine, isoleucine, and valine 


Propionyl CoA Methylmalonyl CoA Succinyl CoA into succinyl CoA. 


Methionine degradation requires the formation of a key methyl 
donor, S-adenosylmethionine 


Methionine is converted into succinyl CoA in nine steps (Figure 23.26). 
The first step is the adenylation of methionine to form S-adenosylmethionine 
(SAM), a common methyl donor in the cell (Section 24.2). Loss of the 
methyl and adenosyl groups yields homocysteine, which is eventually pro- 
cessed to a-ketobutyrate. This a-ketoacid is oxidatively decarboxylated by 
the a-ketoacid dehydrogenase complex to propionyl CoA, which is pro- 
cessed to succinyl CoA, as described in Section 22.3. 


The branched-chain amino acids yield acetyl CoA, acetoacetate, 
or propionyl CoA 


The branched-chain amino acids are degraded by reactions that we have 
already encountered in the citric acid cycle and fatty acid oxidation. Leucine 
is transaminated to the corresponding a-ketoacid, a-ketoisocaproate. 
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FIGURE 23.26 Methionine metabolism. 
The pathway for the conversion of 
methionine into succinyl CoA. 
S-Adenosylmethionine, formed along this 
pathway, is an important methyl donor. 


Propionyl CoA a-Ketobutyrate Cystathionine 


This a-ketoacid is oxidatively decarboxylated to isovaleryl CoA by the 
branched-chain a-ketoacid dehydrogenase complex. 
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The a-ketoacids of valine and isoleucine, the other two branched-chain 
aliphatic amino acids, also are substrates (as is a-ketobutyrate derived from 
methionine). The oxidative decarboxylation of these w-ketoacids is analogous 
to that of pyruvate to acetyl CoA and of a-ketoglutarate to succinyl CoA. The 
branched-chain a-ketoacid dehydrogenase, a multienzyme complex, is a 
homolog of pyruvate dehydrogenase (Section 17.1) and a-ketoglutarate 
dehydrogenase (Section 17.2). Indeed, the E3 components of these enzymes, 
which regenerate the oxidized form of lipoamide, are identical. 

The isovaleryl CoA derived from leucine is dehydrogenated to yield 
B-methylcrotonyl CoA. This oxidation is catalyzed by isovaleryl CoA dehydrog- 
enase. The hydrogen acceptor is FAD, as in the analogous reaction in fatty acid 
oxidation that is catalyzed by acyl CoA dehydrogenase. B-Methylglutaconyl 
CoA is then formed by the carboxylation of B-methylcrotonyl CoA at the 
expense of the hydrolysis of a molecule of ATP. As might be expected, the 
carboxylation mechanism of B-methylcrotonyl CoA carboxylase is similar 
to that of pyruvate carboxylase and acetyl CoA carboxylase. 
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B-Methylcrotonyl CoA B-Methylglutaconyl CoA 

B-Methylglutaconyl CoA is then hydrated to form 3-hydroxy-3-methyl- 
glutaryl CoA, which is cleaved into acetyl CoA and acetoacetate. This reac- 
tion has already been discussed in regard to the formation of ketone bodies 
from fatty acids (Section 22.3). 
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_00- 
H3C Cc 
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Acetoacetate 


The degradative pathways of valine and isoleucine resemble that of leucine. 
After transamination and oxidative decarboxylation to yield a CoA deriva- 
tive, the subsequent reactions are like those of fatty acid oxidation. 
Isoleucine yields acetyl CoA and propionyl CoA, whereas valine yields 
CO, and propionyl CoA. The degradation of leucine, valine, and isoleucine 
validate a point made earlier (Chapter 15): the number of reactions in 
metabolism is large, but the number of kinds of reactions is relatively small. 
The degradation of leucine, valine, and isoleucine provides a striking illus- 
tration of the underlying simplicity and elegance of metabolism. 


Oxygenases are required for the degradation of aromatic amino acids 


The degradation of the aromatic amino acids yields the common intermedi- 
ates acetoacetate, fumarate, and pyruvate. The degradation pathway is not 
as straightforward as that of the amino acids previously discussed. For the 
aromatic amino acids, molecular oxygen is used to break the aromatic ring. 

The degradation of phenylalanine begins with its hydroxylation to tyro- 
sine, a reaction catalyzed by phenylalanine hydroxylase. This enzyme is 
called a monooxygenase (or mixed-function oxygenase) because one atom of O, 
appears in the product and the other in H,O. 


OH 


Phenylalanine 
hydroxylase 


+ O> + tetrahydrobiopterin 
+H3N CcOO- +H3N CcoO- 


Phenylalanine Tyrosine 


The reductant here is tetrahydrobiopterin, an electron carrier that has not 
been previously discussed and is derived from the cofactor biopterin. 
Because biopterin is synthesized in the body, it is not a vitamin. The quino- 
noid form of dihydrobiopterin is produced in the hydroxylation of phenyl- 
alanine. It is reduced back to tetrahydrobiopterin by NADPH in a reaction 
catalyzed by dihydropteridine reductase. 
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+ 
Ht NADP* 


> 
Dihydropteridine 
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FIGURE 23.27 Phenylalanine and 
tyrosine degradation. The pathway 
for the conversion of phenylalanine into 
acetoacetate and fumarate. 


FIGURE 23.28 Tryptophan degradation. 


The pathway for the conversion of 
tryptophan into alanine and acetoacetate. 
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The sum of the reactions catalyzed by phenylalanine hydroxylase and dihy- 
dropteridine reductase is 


Phenylalanine + O, + NADPH + H* ——-> tyrosine + NADP* + H,O 


Note that these reactions can also be used to synthesize tyrosine from 
phenylalanine. 

The next step in the degradation of phenylalanine and tyrosine is the 
transamination of tyrosine to p-hydroxyphenylpyruvate (Figure 23.27). This 
a-ketoacid then reacts with O, to form homogentisate. The enzyme catalyz- 
ing this complex reaction, p-hydroxyphenylpyruvate hydroxylase, is called a 
dioxygenase because both atoms of O, become incorporated into the product, 
one on the ring and one in the carboxyl group. The aromatic ring of homo- 
gentisate is then cleaved by O,, which yields 4-maleylacetoacetate. This 
reaction is catalyzed by homogentisate oxidase, another dioxygenase. 
4-Maleylacetoacetate is then isomerized to 4-fumarylacetoacetate by an 
enzyme that uses glutathione as a cofactor. Finally, 4-fumarylacetoacetate is 
hydrolyzed to fumarate and acetoacetate. 

Tryptophan degradation requires several oxygenases (Figure 23.28). 
Tryptophan 2,3-dioxygenase cleaves the pyrrole ring, and kynureinine 
3-monooxygenase hydroxylates the remaining benzene ring, a reaction similar 
to the hydroxylation of phenylalanine to form tyrosine. Alanine is removed 
and the 3-hydroxyanthranilate is cleaved by another dioxygenase and 
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subsequently processed to acetoacetyl CoA. Nearly all cleavages of aromatic 
rings in biological systems are catalyzed by dioxygenases. The active sites of these 
enzymes contain iron that is not part of heme or an iron-sulfur cluster. 


23.6 Inborn Errors of Metabolism Can Disrupt 
Amino Acid Degradation 


e Errors in amino acid metabolism provided some of the first examples 

of biochemical defects linked to pathological conditions. For instance, 
alcaptonuria, an inherited metabolic disorder caused by the absence of 
homogentisate oxidase, was described in 1902. Homogentisate, a normal 
intermediate in the degradation of phenylalanine and tyrosine (Figure 23.27), 
accumulates in alcaptonuria because its degradationis blocked. Homogentisate 
is excreted in the urine, which turns dark on standing as homogentisate is 
oxidized and polymerized to a melanin-like substance. 

Although alcaptonuria is a relatively harmless condition, such is not the case 
with other errors in amino acid metabolism. In maple syrup urine disease, the 
oxidative decarboxylation of a-ketoacids derived from valine, isoleucine, and 
leucine is blocked because the branched-chain dehydrogenase is missing or 
defective. Hence, the levels of these a-ketoacids and the branched-chain amino 
acids that give rise to them are markedly elevated in both blood and urine. The 
urine of patients has the odor of maple syrup—hence the name of the disease 
(also called branched-chain ketoaciduria). Maple syrup urine disease usually 
leads to mental and physical retardation unless the patient is placed on a diet low 
in valine, isoleucine, and leucine early in life. The disease can be readily detected 
in newborns by screening urine samples with 2,4-dinitrophenylhydrazine, 
which reacts with a-ketoacids to form 2,4-dinitrophenylhydrazone derivatives. 
A definitive diagnosis can be made by mass spectrometry. Table 23.4 lists some 
other diseases of amino acid metabolism. 


NO, 
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H3C CH; H3C CH 
a-Ketoacid 2,4-Dinitrophenylhydrazone 
derivative 
TABLE 23.4 Inborn errors of amino acid metabolism 
Disease Enzyme deficiency Symptoms 
Citrullinema Arginosuccinate lyase Lethargy, seizures, reduced 
muscle tension 
Tyrosinemia Various enzymes of tyrosine Weakness, liver damage, 
degradation mental retardation 
Albinism Tyrosinase Absence of pigmentation 
Homocystinuria Cystathionine B-synthase Scoliosis, muscle weakness, mental 
retardation, thin blond hair 
Hyperlysinemia a-Aminoadipic semialdehyde Seizures, mental retardation, 


dehydrogenase lack of muscle tone, ataxia 
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Phenylketonuria is one of the most common 
metabolic disorders 


Phenylketonuria is perhaps the best known of the diseases of amino acid 
metabolism. Phenylketonuria, which occurs with a prevalence of 1 in 
10,000 births, is caused by an absence or deficiency of phenylalanine hydrox- 
ylase or, more rarely, of its tetrahydrobiopterin cofactor. Phenylalanine 
accumulates in all body fluids because it cannot be converted into tyrosine. 
Normally, three-quarters of phenylalanine molecules are converted into 
tyrosine, and the other quarter become incorporated into proteins. Because 
the major outflow pathway is blocked in phenylketonuria, the blood level 
of phenylalanine is typically at least 20-fold as high as in normal people. 
Minor fates of phenylalanine in normal people, such as the formation of 
phenylpyruvate, become major fates in phenylketonurics. Indeed, the ini- 
tial description of phenylketonuria in 1934 was made by observing the 
reaction of phenylpyruvate in the urine of phenylketonurics with FeCl, 
which turns the urine olive green. 

Almost all untreated phenylketonurics are severely mentally retarded. The 
brain weight of these people is below normal, myelination of their nerves 
is defective, and their reflexes are hyperactive. The life expectancy of 
untreated phenylketonurics is drastically shortened. Half die by age 20 and 
three-quarters by age 30. Phenylketonurics appear normal at birth but are 
severely defective by age 1 if untreated. The therapy for phenylketonuria 
is a low-phenylalanine diet supplemented with tyrosine, because tyrosine 
is normally synthesized from phenylalanine. The aim is to provide just 
enough phenylalanine to meet the needs for growth and replacement. 
Proteins that have a low content of phenylalanine, such as casein from 
milk, are hydrolyzed and phenylalanine is removed by adsorption. A low- 
phenylalanine diet must be started very soon after birth to prevent irre- 
versible brain damage. In one study, the average IQ of phenylketonurics 
treated within a few weeks after birth was 93; a control group treated start- 
ing at age 1 had an average IQ of 53. 

Early diagnosis of phenylketonuria is essential and has been accom- 
plished by mass screening programs of all babies born in the United States 
and Canada. The phenylalanine level in the blood is the preferred diagnostic 
criterion because it is more sensitive and reliable than the FeCl, test. 
Prenatal diagnosis of phenylketonuria with DNA probes has become feasi- 
ble because the gene has been cloned and the exact locations of many muta- 
tions have been discovered in the protein. 


Determining the basis of the neurological symptoms of 
phenylketonuria is an active area of research 


The biochemical basis of retardation is not firmly established, but one 
hypothesis suggests that the lack of hydroxylase reduces the amount of 
tyrosine, an important precursor to neurotransmitters such as dopamine. 
Moreover, high concentrations of phenylalanine prevent amino acid trans- 
port of any tyrosine present as well as tryptophan, a precursor to the neu- 
rotransmitter serotonin, into the brain. Because all three of the amino acids 
are transported by the same carrier, phenylalanine will saturate the carrier, 
preventing access to tyrosine and tryptophan. Finally, high blood levels of 
phenylalanine result in higher levels of phenylalanine in the brain, and 
evidence suggests this elevated concentration inhibits glycolysis at pyru- 
vate kinase, disrupts myelination of nerve fibers and reduces the synthesis 
of several neurotransmitters. 


SUMMARY 


23.1 


23.2 


23.3 


23.4 


23.5 


Proteins Are Degraded to Amino Acids 

Dietary protein is digested in the intestine, producing amino acids that 
are transported throughout the body. Cellular proteins are degraded at 
widely variable rates, ranging from minutes to the life of the organism. 


Protein Turnover Is Tightly Regulated 

The turnover of cellular proteins is a regulated process requiring com- 
plex enzyme systems. Proteins to be degraded are conjugated with 
ubiquitin, a small conserved protein, in a reaction driven by ATP 
hydrolysis. The ubiquitin-conjugating system is composed of three 
distinct enzymes. A large, barrel-shaped complex called the protea- 
some digests the ubiquitinated proteins. The proteasome also requires 
ATP hydrolysis to function. The resulting amino acids provide a 
source of precursors for protein, nucleotide bases, and other nitroge- 
nous compounds. 


The First Step in Amino Acid Degradation Is the Removal of Nitrogen 
Surplus amino acids are used as building blocks and as metabolic fuel. 
The first step in their degradation is the removal of their a-amino 
groups by transamination to a-ketoacids. Pyridoxal phosphate is the 
coenzyme in all aminotransferases and in many other enzymes cata- 
lyzing amino acid transformations. The a-amino group funnels into 
a-ketoglutarate to form glutamate, which is then oxidatively deami- 
nated by glutamate dehydrogenase to give NH,* and a-ketoglutarate. 
NAD* or NADP?* is the electron acceptor in this reaction. 


Ammonium lon Is Converted into Urea in Most Terrestrial Vertebrates 
The first step in the synthesis of urea is the formation of carbamoyl 
phosphate, which is synthesized from HCO, , NH, and two mole- 
cules of ATP by carbamoyl phosphate synthetase. Ornithine is then 
carbamoylated to citrulline by ornithine transcarbamoylase. These 
two reactions take place in mitochondria. Citrulline leaves the mito- 
chondrion and condenses with aspartate to form argininosuccinate, 
which is cleaved into arginine and fumarate. The other nitrogen atom 
of urea comes from aspartate. Urea is formed by the hydrolysis of argi- 
nine, which also regenerates ornithine. 


Carbon Atoms of Degraded Amino Acids Emerge as 

Major Metabolic Intermediates 

The carbon atoms of degraded amino acids are converted into pyru- 
vate, acetyl CoA, acetoacetate, or an intermediate of the citric acid 
cycle. Most amino acids are solely glucogenic, two are solely keto- 
genic, and a few are both ketogenic and glucogenic. Alanine, serine, 
cysteine, glycine, threonine, and tryptophan are degraded to pyru- 
vate. Asparagine and aspartate are converted into oxaloacetate. 
a-Ketoglutarate is the point of entry for glutamate and four amino 
acids (glutamine, histidine, proline, and arginine) that can be con- 
verted into glutamate. Succinyl CoA is the point of entry for some of 
the carbon atoms of three amino acids (methionine, isoleucine, and 
valine) that are degraded through the intermediate methylmalonyl 
CoA. Leucine is degraded to acetoacetate and acetyl CoA. The break- 
down of valine and isoleucine is like that of leucine. Their a-ketoacid 
derivatives are oxidatively decarboxylated by the branched-chain 
a-ketoacid dehydrogenase. 
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The rings of aromatic amino acids are degraded by oxygenases. 
Phenylalanine hydroxylase, a monooxygenase, uses tetrahydrobiop- 
terin as the reductant. One of the oxygen atoms of O, emerges in 
tyrosine and the other in water. Subsequent steps in the degradation of 
these aromatic amino acids are catalyzed by dioxygenases, which cata- 
lyze the insertion of both atoms of O, into organic products. Four of 
the carbon atoms of phenylalanine and tyrosine are converted into 
fumarate, and four emerge in acetoacetate. 


Inborn Errors of Metabolism Can Disrupt Amino Acid Degradation 

Errors in amino acid metabolism were sources of some of the first 
insights into the correlation between pathology and biochemistry. 
Phenylketonuria is the best known of the many hereditary errors of 
amino acid metabolism. This condition results from the accumulation 
of high levels of phenylalanine in the body fluids. This accumulation 
leads to mental retardation unless the afflicted are placed on low- 


phenylalanine diets immediately after birth. 


KEY TERMS 


ubiquitin (p. 683) 

degron (p. 684) 

proteasome (p. 685) 

aminotransferase (transaminase) (p. 688) urea cycle (p. 693) 


glutamate dehydrogenase (p. 688) 


PROBLEMS 


pyridoxal phosphate (PLP) (p. 689) 
pyridoxamine phosphate (PMP) (p. 690) 
glucose—alanine cycle (p. 693) 


carbamoyl phosphate synthetase (p. 693) 


N-acetylglutamate (p. 694) 
ketogenic amino acid (p. 698) 
glucogenic amino acid (p. 698) 
biopterin (p. 703) 
phenylketonuria (p. 706) 


1. Getting exposure. Proteins are denatured by acid in the 
stomach. This denaturation makes them better substrates 
for proteolysis. Explain why this is the case. 


2. Targeting for destruction. What are the steps required to 
attach ubiquitin to a target protein? 


3. Not the dating service. Match the description on the right 
with the term on the left. 


(a) Pepsin 1. Requires an adenylate 
(b) N-terminal rule intermediate 

(c) Ubiquitin 2. Marks a protein for 
(d) PEST sequence destruction 

(e) Threonine nucleophiles 3.19S regulatory subunit 


4, Determines half-life of a 
(f) ATP-dependent protein protein 
unfolding 5. 205 core 


(g) Proteasome 6. Substrate for ligase 
(h) Ubiquitin-activating 7. Stomach proteolytic 
enzyme enzyme 
(i) Ubiquitin-conjugating 8. Recognizes protein to be 
enzyme degraded 
(j) Ubiquitin-ligase 9. Protein degrading 
machine 


10. Pro-Glu-Ser-Thr 
4. Wasted energy? Protein hydrolysis is an exergonic pro- 
cess, yet the 26S proteasome is dependent on ATP hydroly- 
sis for activity. 


(a) Explain why ATP hydrolysis is required by the 268 
proteasome. 


(b) Small peptides can be hydrolyzed without the expendi- 
ture of ATP. How does this information concur with your 
answer to part a? 


5. Keto counterparts. Name the a-ketoacid that is formed 
by the transamination of each of the following amino acids: 


(a) Alanine (d) Leucine 
(b) Aspartate (e) Phenylalanine 


(c) Glutamate (f) Tyrosine 


6. A versatile building block. 


(a) Write a balanced equation for the conversion of aspar- 
tate into glucose through the intermediate oxaloacetate. 
Which coenzymes participate in this transformation? (b) 
Write a balanced equation for the conversion of aspartate 
into oxaloacetate through the intermediate fumarate. 


7. The benefits of specialization. The archaeal proteasome 
contains 14 identical active B subunits, whereas the eukary- 
otic proteasome has 7 distinct B subunits. What are the 
potential benefits of having several distinct active subunits? 


8. Propose a structure. The 195 subunit of the proteasome 
contains six subunits that are members of the AAA ATPase 
family. Other members of this large family are associated 
into homohexamers with sixfold symmetry. Propose a 


structure for the AAA ATPases within the 19S proteasome. 
How might you test and refine your prediction? 


9. Effective electron sinks. Pyridoxal phosphate stabi- 
lizes carbanionic intermediates by serving as an electron 
sink. Which other prosthetic group catalyzes reactions in 
this way? 


10. Cooperation. How do aminotransferases and glutamate 
dehydrogenase cooperate in the metabolism of the amino 
group of amino acids? 


11. Taking away the nitrogen. What amino acids yield citric 
acid cycle components and glycolysis intermediates when 
deaminated? 


12. One reaction only. What amino acids can be deaminated 
directly? 


13. Useful products. What are the common features of the 
breakdown products of the carbon skeletons of amino acids? 


14. Helping hand. Propose a role for the positively charged 
guanidinium nitrogen atom in the cleavage of argininosuc- 
cinate into arginine and fumarate. 


15. Nitrogen sources. What are the immediate biochemical 
sources for the two nitrogen atoms in urea? 


16. Counterparts. Match the biochemical on the right with 
the property on the left. 


(a) Formed from NH,* 1. Aspartate 

2. Urea 

(b) Hydrolyzed to yield urea 3. Ornithine 

4, Carbamoyl phosphate 
5. Arginine 

6. Citrulline 

7. Arginosuccinate 


(c) A second source of 
nitrogen 
(d) Reacts with aspartate 


(e) Cleavage yields fumarate 


(f) Accepts the first nitrogen 


(g) Final product 


17. Line up. Identify structures A—D, and place them in the 
order that they appear in the urea cycle. 


fp 


H,N——C 


NH 


709 
Problems 


“OOC 


(9) (D) 


18. Completing the cycle. Four high-transfer-potential 
phosphoryl groups are consumed in the synthesis of urea 
according to the stoichiometry given on page 696. In this 
reaction, aspartate is converted into fumarate. Suppose that 
fumarate is converted into oxaloacetate. What is the result- 
ing stoichiometry of urea synthesis? How many high-trans- 
fer-potential phosphoryl groups are spent? 


19. A good bet. A friend bets you a bazillion dollars that you 
can’t prove that the urea cycle is linked to the citric acid 
cycle and other metabolic pathways. Can you collect? 


20. Inhibitor design. Compound A has been synthesized 
as a potential inhibitor of a urea-cycle enzyme. Which 
enzyme do you think compound A might inhibit? Explain 
your answer. 


2- 


9 a0 
*HsN VV 
xO c So 
H H 
~OOC 
Compound A 


21. Ammonia toxicity. Glutamate is an important neu- 
rotransmitter whose levels must be carefully regulated in 
the brain. Explain how a high concentration of ammonia 
might disrupt this regulation. How might a high concentra- 
tion of ammonia alter the citric acid cycle? 


22. A precise diagnosis. The urine of an infant gives a posi- 
tive reaction with 2,4-dinitrophenylhydrazine. Mass spec- 
trometry shows abnormally high blood levels of pyruvate, 
a-ketoglutarate, and the a-ketoacids of valine, isoleucine, 
and leucine. Identify a likely molecular defect and propose 
a definitive test of your diagnosis. 


23. Therapeutic design. How would you treat an infant who 
is deficient in argininosuccinate synthetase? Which mole- 
cules would carry nitrogen out of the body? 


24. Damaged liver. As we will see later (Chapter 27), liver 
damage (cirrhosis) often results in ammonia poisoning. 
Explain why this is the case. 
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25. Argininosuccinic aciduria. Argininosuccinic aciduria is 
a condition that results when the urea-cycle enzyme 
argininosuccinase is deficient. Argininosuccinate is present 
in the blood and urine. Suggest how this condition might be 
treated while still removing nitrogen from the body. 


26. Sweet hazard. Why should phenylketonurics avoid 
using aspartame, an artificial sweetener? (Hint: Aspartame 
is L-aspartyl-L-phenylalanine methy] ester.) 


27. Déja vu. N-Acetylglutamate is required as a cofactor 
in the synthesis of carbamoyl phosphate. How is N- 
acetylglutamate synthesized from glutamate? 


28. Negative nitrogen balance. A deficiency of even one 
amino acid results in a negative nitrogen balance. In this 
state, more protein is degraded than is synthesized, and so 
more nitrogen is excreted than is ingested. Why would pro- 
tein be degraded if one amino acid were missing? 


29. Precursors. Differentiate between ketogenic amino 
acids and glucogenic amino acids. 


30. A slight of hand. The end products of tryptophan deg- 
radation are acetyl CoA and acetoacetyl CoA, yet trypto- 
phan is a gluconeogenic amino acid in animals. Explain. 


31. Closely related. Pyruvate dehydrogenase complex and 
a-ketoglutarate dehydrogenase complex are huge enzymes 
consisting of three discrete enzymatic activities. Which 
amino acids require a related enzyme complex, and what is 
the name of the enzyme? 


32. Supply lines. The carbon skeletons of the 20 common 
amino acids can be degraded into a limited number of end 
products. What are the end products and in what metabolic 
pathway are they commonly found? 


Mechanism Problems 


33. Serine dehydratase. Write out a complete mechanism 
for the conversion of serine into aminoacrylate catalyzed by 
serine dehydratase. 


34. Serine racemase. The nervous system contains a sub- 
stantial amount of D-serine, which is generated from 
L-serine by serine racemase, a PLP-dependent enzyme. 
Propose a mechanism for this reaction. What is the equilib- 
rium constant for the reaction L-serine = D-serine? 


Chapter Integration Problems 


35. Multiple substrates. In Chapter 8, we learned that there 
are two types of bisubstrate reactions, sequential and double- 
displacement. Which type characterizes the action of ami- 
notransferases? Explain your answer. 


36. Double duty. Degradation signals are commonly located in 
protein regions that also facilitate protein-protein interactions. 
Explain why this coexistence of two functions in the same 
domain might be useful. 


37. Fuel choice. Within a few days after a fast begins, nitro- 
gen excretion accelerates to a higher-than-normal level. After 
a few weeks, the rate of nitrogen excretion falls to a lower level 
and continues at this low rate. However, after the fat stores 
have been depleted, nitrogen excretion rises to a high level. 


(a) What events trigger the initial surge of nitrogen excretion? 


(b) Why does nitrogen excretion fall after several weeks 
of fasting? 


(c) Explain the increase in nitrogen excretion when the 
lipid stores have been depleted. 


38. A serious situation. Pyruvate carboxylase deficiency is a 
fatal disorder. Patients with pyruvate carboxylase defi- 
ciency sometimes display some or all of the following symp- 
toms: lactic acidosis, hyperammonemia (excess NH, * in the 
blood), hypoglycemia, and demyelination of the regions of 
the brain due to insufficient lipid synthesis. Provide a pos- 
sible biochemical rationale for each of these observations. 


39. Isoleucine degradation. Isoleucine is degraded to acetyl 
CoA and succinyl CoA. Suggest a plausible reaction 
sequence, based on reactions discussed in the text, for this 
degradation pathway. 


40. Many voles. Pyridoxal phosphate is an important 
coenzyme in transamination reactions. We have seen this 
coenzyme before, in glycogen metabolism. Which enzyme 
in glycogen metabolism requires pyridoxal phosphate and 
what role does the coenzyme play in this enzyme? 


41. Enough cycles to have a race. The glucose—alanine cycle 
is reminiscent of the Cori cycle, but the glucose—alanine 
cycle can be said to be more energy efficient. Explain why 
this is so. 


Data Interpretation Problem 


42. Another helping hand. In eukaryotes, the 20S proteasome 
component in conjunction with the 195 component degrades 
ubiquitinated proteins with the hydrolysis of a molecule of 
ATP. Archaea lack ubiquitin and the 265 proteasome but do 
contain a 20S proteasome. Some archaea also contain an 
ATPase that is homologous to the ATPases of the eukaryotic 
19S component. This archaeal ATPase activity was isolated 
as a 650-kDa complex (called PAN) from the archaeon 
Thermoplasma, and experiments were performed to deter- 
mine if PAN could enhance the activity of the 20S protea- 
some from Thermoplasma as well as other 20S proteasomes. 


Protein degradation was measured as a function of time 
and in the presence of various combinations of components. 
Graph A shows the results. 
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(a) What is the effect of PAN on archaeal proteasome 
activity in the absence of nucleotides? 


(b) What is the nucleotide requirement for protein digestion? 


(c) What evidence suggests that ATP hydrolysis, and not 
just the presence of ATP, is required for digestion? 


A similar experiment was performed with a small peptide as 
a substrate for the proteasome instead of a protein. The 
results obtained are shown in graph B. 
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(d) How do the requirements for peptide digestion differ 
from those of protein digestion? 


(e) Suggest some reasons for the difference. 


The ability of PAN from the archaeon Thermoplasma to 
support protein degradation by the 20S proteasomes from 
the archaeon Methanosarcina and rabbit muscle was then 
examined. 


Percentage of digestion of protein substrate 
(20S proteasome source) 


Additions Thermoplasma Methanosarcina Rabbit muscle 
None 11 10 10 
PAN 8 8 8 
PAN + ATP 100 40 30 
PAN + ADP 12 9 10 


[Data from P. Zwickl, D. Ng, K. M. Woo, H.-P. Klenk, and A. L. Goldberg. 
An archaebacterial ATPase, homologous to ATPase in the eukaryotic 26S 


proteasome, activates protein breakdown by 20S proteasomes. J. Biol. Chem. 
274(1999): 26008-26014. ] 


(f) Can the Thermoplasma PAN augment protein digestion 
by the proteasomes from other organisms? 


(g) What is the significance of the stimulation of rabbit 
muscle proteasome by Thermoplasma PAN? 


The Biosynthesis of Amino Acids 


CHAPTER 


NHz 


Glutamate 


Nitrogen is a key component of amino acids. The atmosphere is rich 
in nitrogen gas (N,), a very unreactive molecule. Certain organisms, 
such as bacteria that live in the root nodules of yellow clover, can 
convert nitrogen gas into ammonia (NH.), which can then be used to 
synthesize, first, glutamate and then other amino acids. [(Left) Hugh 


Spencer/Science Source.] 


he assembly of biological molecules, including proteins and nucleic 

acids, requires the generation of appropriate starting materials. We have 
already considered the assembly of carbohydrates in discussions of the 
Calvin cycle and the pentose phosphate pathway (Chapter 20). The present 
chapter and the next two examine the assembly of the other important 
building blocks—namely, amino acids, nucleotides, and lipids. 

The pathways for the biosynthesis of these molecules are extremely 
ancient, going back to the last common ancestor of all living things. Indeed, 
these pathways probably predate many of the pathways of energy transduc- 
tion discussed in Part II and may have provided key selective advantages in 
early evolution. Many of the intermediates in energy-yielding pathways 
play a role in biosynthesis as well. These common intermediates allow effi- 
cient interplay between energy-yielding (catabolic) and energy-requiring 
biosynthetic (anabolic) pathways. Thus, cells are able to balance the degra- 
dation of compounds for energy mobilization and the synthesis of starting 
materials for macromolecular construction. 

We begin our consideration of biosynthesis with amino acids—the 
building blocks of proteins and the nitrogen source for many other impor- 
tant molecules, including nucleotides, neurotransmitters, and prosthetic 
groups such as porphyrins. Amino acid biosynthesis is intimately connected 
with nutrition because many higher organisms, including human beings, 
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have lost the ability to synthesize some amino acids and must therefore 
obtain adequate quantities of these essential amino acids in their diets. 
Furthermore, because some amino acid biosynthetic enzymes are absent in 
mammals but present in plants and microorganisms, they are useful targets 
for herbicides and antibiotics. 


Amino acid synthesis requires solutions to three key 
biochemical problems 


Nitrogen is an essential component of amino acids. Earth has an abundant 
supply of nitrogen, but it is primarily in the form of atmospheric nitrogen 
gas (N,), a remarkably inert molecule. Thus, a fundamental problem for 
biological systems is to obtain nitrogen in a more usable form. This prob- 
lem is solved by certain microorganisms capable of reducing the inert 
N=N molecule of nitrogen gas to two molecules of ammonia in one of the 
most amazing reactions in biochemistry. Nitrogen in the form of ammonia 
is the source of nitrogen for all the amino acids. The carbon backbones 
come from the glycolytic pathway, the pentose phosphate pathway, or the 
citric acid cycle. 

In amino acid production, we encounter an important problem in bio- 
synthesis—namely, stereochemical control. Because all amino acids except 
glycine are chiral, biosynthetic pathways must generate the correct isomer 
with high fidelity. In each of the 19 pathways for the generation of chiral 
amino acids, the stereochemistry at the a-carbon atom is established by a 
transamination reaction that includes pyridoxal phosphate (PLP). Almost 
all the aminotransferases that catalyze these reactions descend from a com- 
mon ancestor, illustrating once again that effective solutions to biochemical 
problems are retained throughout evolution. 

Biosynthetic pathways are often highly regulated such that building 
blocks are synthesized only when supplies are low. Very often, a high con- 
centration of the final product of a pathway inhibits the activity of allosteric 
enzymes that function early in the pathway to control the committed step. 
These enzymes are similar in functional properties to aspartate transcar- 
bamoylase and its regulators (Section 10.1). Feedback and allosteric mecha- 
nisms ensure that all 20 amino acids are maintained in sufficient amounts 
for protein synthesis and other processes. 


24.1 Nitrogen Fixation: Microorganisms Use ATP and 
a Powerful Reductant to Reduce Atmospheric 
Nitrogen to Ammonia 


The nitrogen in amino acids, purines, pyrimidines, and other biomole- 
cules ultimately comes from atmospheric nitrogen, N ,. The biosynthetic 
process starts with the reduction of N, to NH, (ammonia), a process 
called nitrogen fixation. The extremely strong N = N bond, which has a 
bond energy of 940 kJ mol~! (225 kcal mol~'), is highly resistant to 
chemical attack. Indeed, Antoine Lavoisier named nitrogen gas “azote,” 
from Greek words meaning “without life,” because it is so unreactive. 
Nevertheless, the conversion of nitrogen and hydrogen to form ammonia 
is thermodynamically favorable; the reaction is difficult kinetically 
because of the activation energy required to form intermediates along the 
reaction pathway. 

Although higher organisms are unable to fix nitrogen, this conversion is 
carried out by some bacteria and archaea. Symbiotic Rhizobium bacteria 
invade the roots of leguminous plants and form root nodules in which they 


fix nitrogen, supplying both the bacteria and the plants. The importance of 
nitrogen fixation by diazotrophic (nitrogen-fixing) microorganisms to the 
metabolism of all higher eukaryotes cannot be overstated: the amount of N, 
fixed by these species has been estimated to be 10!! kilograms per year, 
about 60% of Earth’s newly fixed nitrogen. Lightning and ultraviolet radia- 
tion fix another 15%; the other 25% is fixed by industrial processes. The 
industrial process for nitrogen fixation devised by Fritz Haber in 1910 is 
still being used in fertilizer factories. 


N. + 3Hp = 2NH3 


The fixation of N, is typically carried out by mixing __ Electrons from 
with H, gas over an iron catalyst at about 500°C and a Rita ts 
pressure of 300 atmospheres. eo 

To meet the kinetic challenge, the biological process 
of nitrogen fixation requires a complex enzyme with 
multiple redox centers. The nitrogenase complex, which 
carries out this fundamental transformation, consists of 
two proteins: a reductase (also called the iron protein or 


ing power, and nitrogenase (also called the molybde- 
num-iron protein or MoFe protein), which uses these 


: : : . . Reductase 
Fe protein), which provides electrons with high reduc- (Fe protein) 
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ADP 


ATP 


Nitrogenase 
(MoFe protein) 


FIGURE 24.1 Nitrogen fixation. Electrons flow from ferredoxin to 
the reductase (iron protein, or Fe protein) to nitrogenase 


electrons to reduce N, to NH. The transfer of electrons (molypdenum-iron protein, or MoFe protein) to reduce nitrogen to 
from the reductase to the nitrogenase component is — ammonia. ATP hydrolysis within the reductase drives conformational 
coupled to the hydrolysis of ATP by the reductase changes necessary for the efficient transfer of electrons. 


(Figure 24.1). 
In principle, the reduction of N,, to NH; is a six-electron process. 


N,+6e +6H* —>+2NH;, 


However, the biological reaction always generates at least 1 mol of H, in 
addition to 2 mol of NH, for each mol of N = N. Hence, an input of two 
additional electrons is required. 


N.+8e +8H* —>2NH;3 + H, 


In most nitrogen-fixing microorganisms, the eight high-potential electrons 
come from reduced ferredoxin, generated by oxidative processes. Two mole- 
cules of ATP are hydrolyzed for each electron transferred. Thus, at least 16 
molecules of ATP are hydrolyzed for each molecule of N, reduced. 


Nyt Se +80" + 16ATP + 16,0 —> 
2NH, + H+ 16ADP + 16P, 


Note that O, is required for oxidative phosphorylation to generate the 
ATP necessary for nitrogen fixation. However, the nitrogenase complex is 
exquisitely sensitive to inactivation by O,. To allow ATP synthesis and 
nitrogenase to function simultaneously, leguminous plants maintain a very 
low concentration of free O, in their root nodules, the location of the nitro- 
genase. This is accomplished by binding O, to leghemoglobin, a homolog of 
hemoglobin (Section 6.3). 


The iron-molybdenum cofactor of nitrogenase binds and reduces 
atmospheric nitrogen 


Both the reductase and the nitrogenase components of the complex are iron— 
sulfur proteins, in which iron is bonded to the sulfur atom of a cysteine resi- 
due and to inorganic sulfide. Recall that iron-sulfur clusters act as electron 
carriers (Section 18.3). The reductase is a dimer of identical 30-kDa subunits 
bridged by a 4Fe—45 cluster (Figure 24.2). 


ATP 


FIGURE 24.2 Fe Protein. This protein 
is a dimer composed of two polypeptide 
chains linked by a 4Fe-4S cluster. Notice 
that each monomer is a member of the 
P-loop NTPase family and contains an 
ATP-binding site. [Drawn from 1N2C.pdb.] 
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The role of the reductase is to transfer electrons from a suitable 

donor, such as reduced ferredoxin, to the nitrogenase component. 
The 4Fe—45 cluster carries the electrons, one at a time, to nitrogenase. The 
binding and hydrolysis of ATP triggers a conformational change that moves 
the reductase closer to the nitrogenase component, whence it is able to 
transfer its electron to the center of nitrogen reduction. The structure of the 
ATP-binding region reveals it to be a member of the P-loop NTPase family 
(Section 9.4) that is clearly related to the nucleotide-binding regions found 
in G proteins and related proteins. Thus, we see another example of how 
this domain has been recruited in evolution because of its ability to couple 
nucleoside triphosphate hydrolysis to conformational changes. 

The nitrogenase component is an a8, tetramer (240 kDa), in which the 
a and B subunits are homologous to each other and structurally quite similar 
(Figure 24.3). The nitrogenase requires the FeMo cofactor, which consists of 
[Fe,-S,] and [Mo—Fe,—S,] subclusters joined by three disulfide bonds. 
A carbon atom (the interstitial carbon), donated by S-adenosylmethionine 
(p. 724), sits at the interstices of the iron atoms of the FeMo cofactor. The 
FeMo cofactor is also coordinated to a homocitrate moiety and to the a sub- 
unit through one histidine residue and one cysteinate residue. 

Electrons from the reductase enter at the P clusters, which are located at 
the a—B interface. The role of the P clusters is to store electrons until they 
can be used productively to reduce nitrogen at the FeMo cofactor. The 
FeMo cofactor is the site of nitrogen fixation. One face of the FeMo cofactor 
is likely to be the site of nitrogen reduction. The electron-transfer reactions 
from the P cluster take place in concert with the binding of hydrogen ions to 
nitrogen as it is reduced. Further studies are under way to elucidate the 
mechanism of this remarkable reaction. 


P cluster FeMo cofactor 


Interstitial 
carbon atom 


FIGURE 24.3 MoFe protein. This protein is a heterotetramer composed of two a subunits 
(red) and two B subunits (blue). Notice that the protein contains two copies each of two types 
of clusters: P clusters and FeMo cofactors. Each P cluster contains eight iron atoms (green) 
and seven sulfides linked to the protein by six cysteinate residues. Each FeMo cofactor 
contains one molybdenum atom, seven iron atoms, nine sulfides, the interstitial carbon atom, 
and a homocitrate, and is linked to the protein by one cysteinate residue and one histidine 
residue. [Drawn from 1M1N.pdb.] 


Ammonium ion is assimilated into an amino acid 
through glutamate and glutamine 


The next step in the assimilation of nitrogen into biomolecules is the entry 
of NH,* into amino acids. The amino acids glutamate and glutamine play 
pivotal roles in this regard, acting as nitrogen donors for most amino acids. 
The a-amino group of most amino acids comes from the a-amino group of 
glutamate by transamination (Section 23.3). Glutamine, the other major 
nitrogen donor, contributes its side-chain nitrogen atom in the biosynthesis 
of a wide range of important compounds, including the amino acids trypto- 
phan and histidine. 

Glutamate is synthesized from NH," and a-ketoglutarate, a citric acid 
cycle intermediate, by the action of glutamate dehydrogenase. We have 
already encountered this enzyme in the degradation of amino acids 
(Section 23.3). Recall that NAD* is the oxidant in catabolism, whereas 
NADPH is the reductant in biosyntheses. Glutamate dehydrogenase is 
unusual in that it does not discriminate between NADH and NADPH, at 
least in some species. 


Cc 
NH,t + -ooc~ ~~ coo- + NAD(P)H + Ht —— 


a-Ketoglutarate Glutamate 


The reaction proceeds in two steps. First, a Schiff base forms between 
ammonia and a-ketoglutarate. The formation of a Schiff base between an 
amine and a carbonyl compound is a key reaction that takes place at many 
stages of amino acid biosynthesis and degradation. 


Rs H ES Rz 
fe) N~ a ne 
| \ 
Cc + R;—NH, === Cc + HO Cc 
~~ v oy 
Ri Os Ri R2 Ht Ri Ro 
Carbonyl Amino Schiff base Protonated 
compound donor Schiff base 


Schiff bases are easily protonated. In the second step, the protonated Schiff 
base is reduced by the transfer of a hydride ion from NAD(P)H to form 
glutamate. 


Ht 
+ 
HH NAD(P)H NAD(P)* 


a-Ketoglutarate 


This reaction is crucial because it establishes the stereochemistry of the 
a-carbon atom (S absolute configuration) in glutamate. The enzyme binds 
the a-ketoglutarate substrate in such a way that hydride transferred from 
NAD(P)H is added to form the L isomer of glutamate (Figure 24.4). As we 
shall see, this stereochemistry is established for other amino acids by trans- 
amination reactions that rely on pyridoxal phosphate. 

A second ammonium ion is incorporated into glutamate to form glutamine 
by the action of glutamine synthetase. This amidation is driven by the hydroly- 
sis of ATP. ATP participates directly in the reaction by phosphorylating the 


aP 
Cc 
-00c~ ~~~ coo- + NAD()* + H,0 


*H3N 


K 
-ooc~ 


Glutamate 
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FIGURE 24.4 Establishment of 
chirality. In the active site of glutamate 
dehydrogenase, hydride transfer (green) 
from NAD(P)H to a specific face of the 
achiral protonated Schiff base of 
a-ketoglutarate establishes the L 
configuration of glutamate. 
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side chain of glutamate to form an acylphosphate intermediate, which then 
reacts with ammonia to form glutamine. 


+H; +H5N 


H 
NH> 


Acyl-phosphate intermediate Glutamine 


A high-affinity ammonia-binding site is formed in the enzyme only 
after the formation of the acylphosphate intermediate. A specific site for 
ammonia binding is required to prevent attack by water from hydrolyzing 
the intermediate and wasting a molecule of ATP. The regulation of gluta- 
mine synthetase plays a critical role in controlling nitrogen metabolism 
(Section 24.3). 

Glutamate dehydrogenase and glutamine synthetase are present in all 
organisms. Most prokaryotes also contain an evolutionarily unrelated 
enzyme, glutamate synthase, which catalyzes the reductive amination of 
a-ketoglutarate to glutamate. Glutamine is the nitrogen donor. 


a-Ketoglutarate + glutamine + NADPH + H* == 
2 glutamate + NADP* 


The side-chain amide of glutamine is hydrolyzed to generate ammonia 
within the enzyme, a recurring theme throughout nitrogen metabolism. 
When NH," is limiting, most of the glutamate is made by the sequential 
action of glutamine synthetase and glutamate synthase. The sum of these 
reactions is 


NH,* + a-ketoglutarate + NADPH + ATP —> 
glutamate + NADP* + ADP + P. 


Note that this stoichiometry differs from that of the glutamate dehydroge- 
nase reaction in that ATP is hydrolyzed. Why do prokaryotes sometimes 
use this more expensive pathway? The answer is that the value of K,, of 
glutamate dehydrogenase for NH,” is high (~1 mM), and so this enzyme 
is not saturated when NH," is limiting. In contrast, glutamine synthetase 
has very low K,, for NH,*. Thus, ATP hydrolysis is required to capture 
ammonia when it is scarce. 


24.2 Amino Acids Are Made from Intermediates of the 
Citric Acid Cycle and Other Major Pathways 


Thus far, we have considered the conversion of N, into NE and the 
assimilation of NH,* into glutamate and glutamine. We turn now to the 
biosynthesis of the other amino acids, the majority of which obtain their 
nitrogen from glutamate or glutamine. The pathways for the biosynthesis of 
amino acids are diverse. However, they have an important common feature: 
their carbon skeletons come from intermediates of glycolysis, the pentose phos- 
phate pathway, or the citric acid cycle. On the basis of these starting materi- 
als, amino acids can be grouped into six biosynthetic families (Figure 24.5). 
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FIGURE 24.5 Biosynthetic families of amino acids in bacteria and plants. Major 
metabolic precursors are shaded blue. Amino acids that give rise to other amino acids are 
shaded yellow. Essential amino acids are in boldface type. 


Human beings can synthesize some amino acids but 
must obtain others from their diet 


Most microorganisms, such as E. coli, can synthesize the entire basic set of 20 
amino acids, whereas human beings cannot make 9 of them. The amino acids 
that must be supplied in the diet are called essential amino acids, whereas the 
others, which can be synthesized if dietary content is insufficient, are termed 
nonessential amino acids (Table 24.1). These designations refer to the needs 
of an organism under a particular set of conditions. For example, enough 
arginine is synthesized by the urea cycle to meet the needs of an adult, but 
perhaps not those of a growing child. A deficiency of even one amino acid 
results in a negative nitrogen balance. In this state, more protein is degraded 
than is synthesized, and so more nitrogen is excreted than is ingested. 

The nonessential amino acids are synthesized by quite simple reactions, 
whereas the pathways for the formation of the essential amino acids are 
quite complex. For example, the nonessential amino acids alanine and 
aspartate are synthesized in a single step from pyruvate and oxaloacetate, 
respectively. In contrast, the pathways for the essential amino acids require 
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TABLE 24.1 Basic set of 20 

amino acids 
Nonessential Essential 
Alanine Histidine 
Arginine Isoleucine 
Asparagine Leucine 
Aspartate Lysine 
Cysteine Methionine 
Glutamate Phenylalanine 
Glutamine Threonine 
Glycine Tryptophan 
Proline Valine 
Serine 
Tyrosine 


i Nonessential 
i Essential 


Number of amino acids 


1234567 8 9 10111213141516 
Number of steps in pathway 


FIGURE 24.6 Essential and nonessential 
amino acids. Some amino acids are 
nonessential to human beings because they 
can be biosynthesized in a small number of 
steps. Those amino acids requiring a large 
number of steps for their synthesis are 
essential in the diet because some of the 
enzymes for these steps have been lost in 
the course of evolution. 


FIGURE 24.7 Amino acid biosynthesis 
by transamination. (1) Within a 
aminotransferase, the internal aldimine is 
converted into pyridoxamine phosphate 
(PMP) by reaction with glutamate in a 
multistep process not shown. (2) PMP then 
reacts with an a-ketoacid to generate a 
ketimine. (3) This intermediate is converted 
into a quinonoid intermediate (4), which in 
turn yields an external aldimine. (6) The 
aldimine is cleaved to release the newly 
formed amino acid to complete the cycle. 
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from 5 to 16 steps (Figure 24.6). The sole exception to this pattern 1s argi- 
nine, inasmuch as the synthesis of this nonessential amino acid de novo 
requires 10 steps. Typically, however, it is made in only 3 steps from orni- 
thine as part of the urea cycle. Tyrosine, classified as a nonessential amino 
acid because it can be synthesized in 1 step from phenylalanine, requires 
10 steps to be synthesized from scratch and is essential if phenylalanine is 
not abundant. We begin with the biosynthesis of nonessential amino acids. 


Aspartate, alanine, and glutamate are formed by the 
addition of an amino group to an alpha-ketoacid 


Three a-ketoacids—a-ketoglutarate, oxaloacetate, and pyruvate—can be 
converted into amino acids in one step through the addition of an amino 
group. We have seen that a-ketoglutarate can be converted into glutamate 
by reductive amination (Section 23.3). The amino group from glutamate 
can be transferred to other a-ketoacids by transamination reactions. Thus, 
aspartate and alanine can be made from the addition of an amino group to 
oxaloacetate and pyruvate, respectively. 


Oxaloacetate + glutamate == aspartate + a-ketoglutarate 
Pyruvate + glutamate —— alanine + a-ketoglutarate 


These reactions are carried out by pyridoxal phosphate-dependent amino- 
transferases. Transamination reactions are required for the synthesis of most 
amino acids. 

In Section 23.3, we considered the mechanism of aminotransferases 
as applied to the metabolism of amino acids. Let us review the aminotransfer- 
ase mechanism as it operates in the biosynthesis of amino acids (Figure 23.11). 
The reaction pathway begins with pyridoxal phosphate in a Schiff-base linkage 
with lysine at the aminotransferase active site, forming an internal aldimine 
(Figure 24.7). An amino group is transferred from glutamate to form pyri- 
doxamine phosphate (PMP), the actual amino donor, in a multistep process. 
PMP then reacts with an incoming a-ketoacid to form a ketimine. Proton loss 
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forms a quinonoid intermediate that then accepts a proton at a different site 
to form an external aldimine. The newly formed amino acid is released with 
the concomitant formation of the internal aldimine. 


A common step determines the chirality of all amino acids 


Wa Aspartate aminotransferase is the prototype of a large family of PLP- 
dependent enzymes. Comparisons of amino acid sequences as well as 
several three-dimensional structures reveal that almost all amino- 
transferases having roles in amino acid biosynthesis are related to 
aspartate aminotransferase by divergent evolution. An examina- 
tion of the aligned amino acid sequences reveals that two residues 
are completely conserved. These residues are the lysine residue 
that forms the Schiff base with the PLP cofactor (lysine 258 in 
aspartate aminotransferase) and an arginine residue that interacts 


« 


with the a-carboxylate group of the ketoacid (Figure 23.12). errr a wig 
An essential step in the transamination reaction is the transferred 


protonation of the quinonoid intermediate to form the exter- 

nal aldimine. The chirality of the amino acid formed is deter- 

mined by the direction from which this proton is added to the quinonoid form 
(Figure 24.8). The interaction between the conserved arginine residue 
and the a-carboxylate group helps orient the substrate so that the lysine 
residue transfers a proton to the bottom face of the quinonoid intermedi- 
ate, generating an aldimine with an L configuration at the C, center. 


The formation of asparagine from aspartate requires an 

adenylated intermediate 

The formation of asparagine from aspartate is chemically analogous to the for- 
mation of glutamine from glutamate. Both transformations are amidation reac- 
tions and both are driven by the hydrolysis of ATP. The actual reactions are 
different, however. In bacteria, the reaction for the asparagine synthesis is 


Aspartate + NH,* + ATP —-> asparagine + AMP + PP; + H* 


Thus, the products of ATP hydrolysis are AMP and PP. rather than 
ADP and P.. Aspartate is activated by adenylation rather than by 
phosphorylation. 
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We have encountered this mode of activation in fatty acid degradation and 
will see it again in lipid and protein synthesis. 

In mammals, the nitrogen donor for asparagine is glutamine rather than 
ammonia as in bacteria. Ammonia is generated by hydrolysis of the side 
chain of glutamine and directly transferred to activated aspartate, bound in 
the active site. An advantage is that the cell is not directly exposed to NH,*, 
which is toxic at high levels to human beings and other mammals. The use 
of glutamine hydrolysis as a mechanism for generating ammonia for use within 
the same enzyme is a motif common throughout biosynthetic pathways. 
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Arginine 


Lysine 


FIGURE 24.8 Stereochemistry of proton 
addition. In an aminotransferase active 
site, the addition of a proton from the lysine 
residue to the bottom face of the quinonoid 
intermediate determines the L configuration 
of the amino acid product. The conserved 
arginine residue interacts with the 
a-carboxylate group and helps establish 
the appropriate geometry of the quinonoid 
intermediate. 


Asparagine 


722 Glutamate is the precursor of glutamine, proline, and arginine 


CHAPTER 24 The Biosynthesis of 
Amino Acids 


The synthesis of glutamate by the reductive amination of a-ketoglutarate 
has already been discussed, as has the conversion of glutamate into gluta- 
mine (p. 717). Glutamate is the precursor of two other nonessential amino 
acids: proline and arginine. First, the y-carboxyl group of glutamate reacts 
with ATP to form an acyl phosphate. This mixed anhydride is then reduced 


by NADPH to an aldehyde. 
ae a i 0 
WwW a: / 
oC ATP = ADP’ (§$§-d; OO NADPH — NADP* HC 


+H; coo- +H3N 


Glutamate Acyl-phosphate Glutamic y-semialdehyde 
intermediate 


Glutamic ‘y-semialdehyde cyclizes with a loss of H,O in a nonenzymatic 
process to give A!-pyrroline 5-carboxylate, which is reduced by NADPH 
to proline. Alternatively, the semialdehyde can be transaminated to ornithine, 
which is converted in several steps into arginine in the urea cycle (Figure 23.17). 


NH;* 


ADP fe) a-Keto- 
‘ /) glutarate 
ATP P; ¢ 
as H— Glutamate 


+ 
NADPH NADP* 


COoO- *HZN CcOoO- 7 
Glutamate Glutamic A Se 
‘y-semialdehyde H yt 
H 


Ornithine 


Ht 
+ 
NADPH  NADPt 


4 Mt 
H Y 
H N° 
H 


A'-Pyrroline Proline 
5-carboxylate 


H,0 COO" COO" 


3-Phosphoglycerate is the precursor of serine, cysteine, and glycine 


Serine is synthesized from 3-phosphoglycerate, an intermediate in glycolysis. 
The first step is an oxidation to 3-phosphohydroxypyruvate. This a-ketoacid 
is transaminated to 3-phosphoserine, which is then hydrolyzed to serine. 


2- 2- 
Q 0 QO O 
oe Nye” 
ae a-Keto- am 
Glutamate — glutarate 
> 
H 
COO- 
re) *Hs COO- 
3-Phospho- 3-Phosphohydroxy- 3-Phospho- Serine 
glycerate pyruvate serine 


Serine is the precursor of cysteine and glycine. As we shall see, the conver- 
sion of serine into cysteine requires the substitution of a sulfur atom derived 
from methionine for the side-chain oxygen atom. In the formation of glycine, 


H 723 
ie 24.2 Synthesis of Amino Acids 
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OH HN10 


Pteridine Q-OOC y 


FIGURE 24.9 Tetrahydrofolate. This 
cofactor includes three components: a 

e) ao ee pteridine ring, p-aminobenzoate, and one 
p-Aminobenzoate Glutamate or more glutamate residues. 


the side-chain methylene group of serine is transferred to tetrahydrofolate, a 
carrier of one-carbon units that will be discussed shortly (Figure 24.9). 


OH N N 
Sy “Sy 
CH, Jen et 
5 Uy, “ny, 
*H5N coo- HN. H>C—Na 
Serine Tetrahydrofolate N°,N!°-Methylene- Glycine 


tetrahydrofolate 


This interconversion is catalyzed by serine hydroxymethyltransferase, 
a PLP enzyme that is homologous to aspartate aminotransferase. The for- 
mation of the Schiff base of serine renders the bond between its a- and 
B-carbon atoms susceptible to cleavage, enabling the transfer of the B-carbon 
to tetrahydrofolate and producing the Schiff base of glycine. 


Tetrahydrofolate carries activated one-carbon units at 
several oxidation levels 


Tetrahydrofolate (also called tetrahydropteroylglutamate) is a highly versatile 
carrier of activated one-carbon units. This cofactor consists of three groups: 
a substituted pteridine, p-aminobenzoate, and a chain of one or more gluta- 
mate residues (Figure 24.9). Mammals can synthesize the 
pteridine ring, but they are unable to conjugate it to the other TABLE 24.2 One-carbon groups carried by tetrahydrofolate 
two units. They obtain tetrahydrofolate from their diets or ~ =  ©)3©3©[—.— YM 
from microorganisms in their intestinal tracts. 

The one-carbon group carried by tetrahydrofolate is 
bonded to its N-5 or N-10 nitrogen atom (denoted as N° and = Most reduced —CH, Methyl 
N!°) or to both. This unit can exist in three oxidation states _ (= methanol) 


Group 


Oxidation state Formula Name 


(Table 24.2). The most-reduced form carries a methyl group, irene de) —CH,— Methylene 
whereas the intermediate form carries a_methylene STOUP. — \fogt oxidized —cHO Fanaa 
More-oxidized forms carry a formyl, formimino, or methenyl (= formic acid) —CHNH Formimino 
group. The fully oxidized one-carbon unit, CO,, is carried —CH= Methenyl 


by biotin rather than by tetrahydrofolate. 

The one-carbon units carried by tetrahydrofolate are interconvertible 
(Figure 24.10). N°,N!°-Methylenetetrahydrofolate can be reduced to N°- 
methyltetrahydrofolate or oxidized to N°,N!°-methenyltetrahydrofolate. 
N°,N!°-Methenyltetrahydrofolate can be converted into N?- 
formiminotetrahydrofolate or N!°-formyltetrahydrofolate, both of which are 
at the same oxidation level. N!°-Formyltetrahydrofolate can also be synthe- 
sized from tetrahydrofolate, formate, and ATP. N>-Formyltetrahydrofolate 
can be reversibly isomerized to N!?-formyltetrahydrofolate or it can be con- 
verted into N*,N!°-methenyltetrahydrofolate. 
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FIGURE 24.10 Conversions of one-carbon 
units attached to tetrahydrofolate. 
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Tetrahydrofolate N3,N'9-Methylene- N5-Methyl- 
tetrahydrofolate tetrahydrofolate 
Formate + ATP NADPt 
ADP + P; NADPH 
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HC——-N CH HN 
Ke * HN~ Ma, 
N' te a N°,N'°-Methenyl- N>-Formimino- 
tetrahydrofolate tetrahydrofolate tetrahydrofolate 
ADP + P; 
ATP 


N°-Formyl- 
tetrahydrofolate 


These tetrahydyofolate derivatives serve as donors of one-carbon units in a 
variety of biosyntheses. Methionine is regenerated from homocysteine by trans- 
fer of the methyl group of N°-methyltetrahydrofolate, as will be discussed 
shortly. We shall see in Chapter 25 that some of the carbon atoms of purines 
are acquired from derivatives of N!°-formyltetrahydrofolate. The methyl 
group of thymine, a pyrimidine, comes from N°, N!°-methylenetetra- 
hydrofolate. This tetrahydrofolate derivative can also donate a one-carbon 
unit in an alternative synthesis of glycine that starts with CO, and NH ae 
reaction catalyzed by glycine synthase (called the glycine cleavage enzyme when 
it operates in the reverse direction). 


CO, + NH,* + N°,N!°-methylenetetrahydrofolate + NADH == 
glycine + tetrahydrofolate + NADT 


Thus, one-carbon units at each of the three oxidation levels are utilized in 
biosyntheses. Furthermore, tetrahydrofolate serves as an acceptor of one- 
carbon units in degradative reactions. The major source of one-carbon units 
is the facile conversion of serine into glycine by serine hydroxymethyltrans- 
ferase (p. 723), which yields N°,N!°-methylenetetrahydrofolate. Serine can 
be derived from 3-phosphoglycerate, and so this pathway enables one-carbon 
units to be formed de novo from carbohydrates. 


S-Adenosylmethionine is the major donor of methyl groups 


Tetrahydrofolate can carry a methyl group on its N-5 atom, but its transfer 
potential is not sufficiently high for most biosynthetic methylations. 


Rather, the activated methyl donor is usually S-adenosylmethionine (SAM), 725 
which 1s synthesized by the transfer of an adenosyl group from ATP to the 24.2 Synthesis of Amino Acids 
sulfur atom of methionine. 


~OOC H3t NH» 
J Sn 
+ ATP ——> ¢ | es 
N ae 
fe) N 
H3C 
HO OH 
Methionine S-Adenosylmethionine (SAM) 


The methyl group of the methionine unit 1s activated by the positive charge 
on the adjacent sulfur atom, which makes the molecule much more reactive 
than N°-methyltetrahydrofolate. The synthesis of S-adenosylmethionine 
is unusual in that the triphosphate group of ATP is split into pyrophos- 
phate and orthophosphate; the pyrophosphate is subsequently hydro- 
lyzed to two molecules of P.. S-Adenosylhomocysteine is formed when the 
methyl group of S-adenosylmethionine is transferred to an acceptor. S- 
Adenosylhomocysteine is then hydrolyzed to homocysteine and adenosine. 


Hs* NH> 
Va San -00C 
¢ | H,0 Adenosine 
N ee 
fe) N oe 2 


HO OH HO OH 
S-Adenosylmethionine (SAM) S-Adenosylhomocysteine Homocysteine 


melele N 


Methionine can be regenerated by the transfer of a methyl group to homo- 
cysteine from N°-methyltetrahydrofolate, a reaction catalyzed by methio- 
nine synthase (also known as homocysteine methyltransferase). 


-OOC H é H 
N 00c H 
i eco Gia 
: a » es 
Ne es. 
bccn 
3 Te HN. 
HzC S-Adenosyl- 
ATP methionine 
Homocysteine N5-Methyl- Methionine Tetrahydrofolate 
tetrahydrofolate 
; ‘ : Methionine S-Adenosyl- 
The coenzyme that mediates this transfer of a methyl group is methylcobala- hemonyteine 
min, derived from vitamin B,,. In fact, this reaction and the rearrangement 
2 


of L-methylmalonyl CoA to succinyl CoA (p. 654), catalyzed by a homolo- 

gous enzyme, are the only two B, ,-dependent reactions known to take place ~ Homocysteine 

in mammals. Another enzyme that converts homocysteine into methionine G= 

without vitamin B,, also is present in many organisms. FIGURE 24.11 Activated methyl cycle. 
These reactions constitute the activated methyl cycle (Figure 24.11). The methyl group of methionine is activated 

Methyl groups enter the cycle in the conversion of homocysteine into by the formation of S-adenosylmethionine. 
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methionine and are then made highly reactive by the addition of an 
adenosyl group, which makes the sulfur atoms positively charged and 
the methyl groups much more electrophilic. The high transfer potential 
of the S-methyl group enables it to be transferred to a wide variety 
of acceptors. Among the acceptors modified by S-adenosylmethionine 
are specific bases in bacterial DNA. For instance, the methylation of 
DNA protects bacterial DNA from cleavage by restriction enzymes 
(Section 9.3). Methylation is also important for the synthesis of phos- 
pholipids (Section 26.1). 

S-Adenosylmethionine is also the precursor of ethylene, a gaseous plant 
hormone that induces the ripening of fruit. S-Adenosylmethionine is 
cyclized to a cyclopropane derivative that is then oxidized to form ethylene. 
The Greek philosopher Theophrastus recognized more than 2000 years ago 
that sycamore figs do not ripen unless they are scraped with an iron claw. 
The reason is now known: Wounding triggers ethylene production, which in 
turn induces ripening. 


H2C——CH2 
eee ACC synthase ACC oxidase 
S-Adenosylmethionine. —————> x > Hj»C—=CH2 
+H3N COO- Ethylene 


1-Aminocyclopropane- 
1-carboxylate (ACC) 


Cysteine is synthesized from serine and homocysteine 


In addition to being a precursor of methionine in the activated methyl cycle, 
homocysteine is an intermediate in the synthesis of cysteine. Serine and 
homocysteine condense to form cystathionine. This reaction is catalyzed by 
cystathionine B-synthase. Cystathionine is then deaminated and cleaved to 
cysteine and a-ketobutyrate by cystathionine y-lyase or cystathionase. Both 
of these enzymes utilize PLP and are homologous to aspartate aminotrans- 
ferase. The net reaction is 


Homocysteine + serine == cysteine + a-ketobutyrate + NH," 


Note that the sulfur atom of cysteine is derived from homocysteine, whereas the 
carbon skeleton comes from serine. 


High homocysteine levels correlate with vascular disease 


=e People with elevated serum levels of homocysteine (homocystein- 

emia) or the disulfide-linked dimer homocystine (homocystinuria) 
have an unusually high risk for coronary heart disease and arteriosclerosis. 
The most common genetic cause of high homocysteine levels is a mutation 
within the gene encoding cystathionine B-synthase. High levels of homo- 
cysteine appear to damage cells lining blood vessels and to increase the 
growth of vascular smooth muscle. The amino acid raises oxidative stress 
as well and has also been implicated in the development of type 2 diabetes 
(Section 27.3). The molecular basis of homocysteine’s action has not been 
clearly identified, but may result from stimulation of the inflammatory 
response. Vitamin treatments are sometimes effective in reducing homo- 
cysteine levels in some people. Treatment with vitamins maximizes the 
activity of the two major metabolic pathways processing homocysteine. 
Pyridoxal phosphate, a vitamin B, derivative, is necessary for the activity 
of cystathionine B-synthase, which converts homocysteine into cystathi- 
one; tetrahydrofolate, as well as vitamin B,,, supports the methylation of 
homocysteine to methionine. 


Shikimate and chorismate are intermediates in 
the biosynthesis of aromatic amino acids 


We turn now to the biosynthesis of essential amino acids. These amino acids 
are synthesized by plants and microorganisms, and those in the human diet 
are ultimately derived primarily from plants. The essential amino acids are 
formed by much more complex routes than are the nonessential amino 
acids. The pathways for the synthesis of aromatic amino acids in bacteria 
have been selected for discussion here because they are well understood and 
exemplify recurring mechanistic motifs. 

Phenylalanine, tyrosine, and tryptophan are synthesized by a common 
pathway in E. coli (Figure 24.12). The initial step is the condensation of 
phosphoenolpyruvate (a glycolytic intermediate) with erythrose 4-phos- 
phate (a pentose phosphate pathway intermediate). The resulting seven- 
carbon open-chain sugar is oxidized, loses its phosphoryl group, and 
cyclizes to 3-dehydroquinate. Dehydration then yields 3-dehydroshikimate, 
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Chorismate 
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24.2 Synthesis of Amino Acids 


FIGURE 24.12 Pathway to chorismate. 


Chorismate is an intermediate in the 
biosynthesis of phenylalanine, tyrosine, 


and tryptophan. 
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Gluta-  a-Keto- NH3* 
CO2 mate — glutarate 


+ 
OH™ \ A 


XY Phenylpyruvate Phenylalanine 

3 COO- COO 
HO oH NADt ‘ 

Chorismate Prephenate Gluta- —__«-Keto- 


NADH mate _glutarate 


+ 
CO, \ / 


FIGURE 24.13 Synthesis of phenylalanine 
and tyrosine. Chorismate can be converted 
into prephenate, which is subsequently OH OH 

converted into phenylalanine and tyrosine. p-Hydroxyphenylpyruvate Tyrosine 


which is reduced by NADPH to shikimate. The phosphorylation of shiki- 
mate by ATP gives shikimate 3-phosphate, which condenses with a second 
molecule of phosphoenolpyruvate. The resulting 5-enolpyruvyl intermedi- 
ate loses its phosphoryl group, yielding chorismate, the common precursor 
of all three aromatic amino acids. The importance of this pathway is revealed 
by the effectiveness of glyphosate (commercially known as Roundup), a 
broad-spectrum herbicide. This compound is an uncompetitive inhibitor of 
the enzyme that produces 5-enolpyruvylshikimate 3-phosphate. It blocks 
aromatic amino acid biosynthesis in plants but is fairly nontoxic in animals 
because they lack the enzyme. 


2 f\ 

O 
Glyphosate 
(Roundup) 


The pathway bifurcates at chorismate. Let us first follow the prephenate 
branch (Figure 24.13). A mutase converts chorismate into prephenate, the 
immediate precursor of the aromatic ring of phenylalanine and tyrosine. 
This fascinating conversion is a rare example of an electrocyclic reaction in 
biochemistry, mechanistically similar to the well-known Diels—Alder reac- 
tion in organic chemistry. Dehydration and decarboxylation yield phenylpy- 
ruvate. Alternatively, prephenate can be oxidatively decarboxylated to 
p-hydroxyphenylpyruvate. These a-ketoacids are then transaminated to form 
phenylalanine and tyrosine. 

The branch starting with anthranilate leads to the synthesis of trypto- 
phan (Figure 24.14). Chorismate acquires an amino group derived from 
the hydrolysis of the side chain of glutamine and releases pyruvate to form 
anthranilate. Anthranilate then condenses with 5-phosphoribosyl-1- 
pyrophosphate (PRPP), an activated form of ribose phosphate. PRPP is also 

OPO;P033- an important intermediate in the synthesis of histidine, pyrimidine 
Ho oH nucleotides, and purine nucleotides (Sections 25.1 and 25.2). The C-1 

Ss Phosihamiosi ip rphaishiels atom of ribose 5-phosphate becomes bonded to the nitrogen atom of 
(PRPP) anthranilate in a reaction that is driven by the release and hydrolysis of 
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H H 
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phosphate 
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pyrophosphate. The ribose moiety of phosphoribosylanthranilate under- 
goes rearrangement to yield 1-(0-carboxyphenylamino)-1-deoxyribulose 
5-phosphate. This intermediate is dehydrated and then decarboxylated to 
indole-3-glycerol phosphate. Tryptophan synthase completes the synthe- 
sis of tryptophan with the removal of the side chain of indole-3-glycerol 
phosphate, yielding glyceraldehyde 3-phosphate, and its replacement 
with the carbon skeleton of serine. 


Tryptophan synthase illustrates substrate channeling 
in enzymatic catalysis 


The E. coli enzyme tryptophan synthase, an a8, tetramer, can be dis- 
sociated into two a subunits and a B, dimer (Figure 24.15). The a sub- 
unit catalyzes the formation of indole from indole-3-glycerol phosphate, 
whereas each B subunit has a PLP-containing active site that catalyzes 
the condensation of indole and serine to form tryptophan. Serine forms 
a Schiff base with this PLP, which is then dehydrated to give the Schiff 
base of aminoacrylate. This reactive intermediate is attacked by indole to 
give tryptophan. 

The synthesis of tryptophan poses a challenge. Indole, a hydrophobic 
molecule, readily traverses membranes and would be lost from the cell if 
it were allowed to diffuse away from the enzyme. This problem is solved 
in an ingenious way. A 25-A-long channel connects the active site of 
the a subunit with that of the adjacent B subunit in the a,B, tetramer 


~OOC 


2-O5P oO. 


HO OH 


N-(5'-Phosphoribosyl)- 
anthranilate 


1-(o-Carboxyphenylamino)- 
1-deoxyribulose 5-phosphate 


FIGURE 24.14 Synthesis of tryptophan. 
Chorismate can be converted into 
anthranilate, which is subsequently 
converted into tryptophan. 


Indole 


Schiff base of aminoacrylate 
(derived from serine) 
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FIGURE 24.16 Substrate 
channeling. A 25-A tunnel runs 
from the active site of the a subunit 
of tryptophan synthase (yellow) to 
the PLP cofactor (red) in the active 
site of the 8 subunit (blue). 


Dimeric 
regulatory domain 


Catalytic 
domain 


hk FIGURE 24.17 Structure of 
3-phosphoglycerate dehydrogenase. 
This enzyme, which catalyzes the committed 
step in the serine biosynthetic pathway, is 
inhibited by serine. Notice the two serine- 
binding dimeric regulatory domains—one at 
the top and the other at the bottom of the 
structure. [Drawn from 1PSD.pdb.] 


Kh FIGURE 24.15 Structure of 
tryptophan synthase. The structure 
of the complex formed by one a 
subunit (yellow) and one 8 subunit 
(blue). Notice that pyridoxal phosphate 
(PLP) is bound deeply inside the B 
subunit, a considerable distance from : 
the a subunit. [Drawn from 1BKS.pdb.] B subunit 


(Figure 24.16). Thus, indole can diffuse from one active site to the other 
without being released into bulk solvent. Isotopic-labeling experiments 
showed that indole formed by the a subunit does not leave the enzyme 
when serine is present. Furthermore, the two partial reactions are coordi- 
nated. Indole is not formed by the a subunit until the highly reactive 
aminoacrylate is ready and waiting in the B subunit. We see here a clear- 
cut example of substrate channeling in catalysis by a multienzyme complex. 
Channeling substantially increases the catalytic rate. Furthermore, a 
deleterious side reaction—in this case, the potential loss of an intermedi- 
ate—is prevented. We shall encounter other examples of substrate chan- 
neling in Chapter 25. 


24.3 Feedback Inhibition Regulates Amino 
Acid Biosynthesis 


The rate of synthesis of amino acids depends mainly on the amounts of the 
biosynthetic enzymes and on their activities. We now consider the control of 
enzymatic activity. The regulation of enzyme synthesis will be discussed in 
Chapter 31. 

In a biosynthetic pathway, the first irreversible reaction, called the 
committed step, is usually an important regulatory site. The final product 
of the pathway (Z) often inhibits the enzyme that catalyzes the committed 
step (A > B). 


Inhibited 
by Z 


A=| => 2 —<> C <>) <=> — mp 7 


This kind of control is essential for the conservation of building blocks and 
metabolic energy. Consider the biosynthesis of serine (p. 722). The commit- 
ted step in this pathway is the oxidation of 3-phosphoglycerate, catalyzed 
by the enzyme 3-phosphoglycerate dehydrogenase. The E. coli enzyme is a 
tetramer of four identical subunits, each comprising a catalytic domain and 
a serine-binding regulatory domain (Figure 24.17). The binding of serine 


to a regulatory site reduces the value of V..,, for the 
enzyme; an enzyme bound to four molecules of serine is 
essentially inactive. Thus, if serine is abundant in the cell, 
the enzyme activity is inhibited, and so 3-phosphoglycer- 


Threonine 


Threonine 
deaminase 


Hydroxyethyl-TPP 


ate, a key building block that can be used for other pro- Pyruvate o-Ketobutyrate 

cesses, 1s not wasted. “y 4 

Branched pathways require sophisticated regulation 

The regulation of branched pathways is more complicated 4 4 oll le 
because the concentration of two products must be { 2| |2 
accounted for. In fact, several intricate feedback mecha- $ 2 = 
nisms have been found in branched biosynthetic pathways. | | =i T= 
Feedback inhibition and activation. Two pathways with esa salen 

a common initial step may each be inhibited by its own | | | 


product and activated by the product of the other path- Leucine 

way. Consider, for example, the biosynthesis of the 

branched chain amino acids valine, leucine, and isoleu- 

cine. A common intermediate, hydroxyethyl thiamine 

pyrophosphate (hydroxyethyl-TPP; Section 17.1), initiates the pathways 
leading to all three of these amino acids. Hydroxyethyl-TPP reacts with 
a-ketobutyrate in the initial step for the synthesis of isoleucine. Alternatively, 
hydroxyethyl-TPP reacts with pyruvate in the committed step for the path- 
ways leading to valine and leucine. Thus, the relative concentrations of 
a-ketobutyrate and pyruvate determine how much isoleucine is produced 
compared with valine and leucine. Threonine deaminase, the PLP enzyme 
that catalyzes the formation of a-ketobutyrate, is allosterically inhibited by 
isoleucine (Figure 24.18). This enzyme is also allosterically activated by 
valine. Thus, this enzyme is inhibited by the end product of the pathway 
that it initiates and is activated by the end product of a competitive path- 
way. This mechanism balances the amounts of different amino acids that 
are synthesized. 


a 4 The regulatory domain in threonine deaminase is very similar in 
structure to the regulatory domain in 3-phosphoglycerate dehydroge- 
nase (Figure 24.19). In the latter enzyme, regulatory domains of two sub- 
units interact to form a dimeric serine-binding regulatory unit, and so the 
tetrameric enzyme contains two such regulatory units. Each unit is capable 
of binding two serine molecules. In threonine deaminase, the two regulatory 


Amino acid-binding sites 


Dimeric regulatory domain of Single-chain regulatory domain of 
3-phosphoglycerate dehydrogenase threonine deaminase 


Valine 


Isoleucine 


FIGURE 24.18 Regulation of threonine 
deaminase. Threonine is converted into 
a-ketobutyrate in the committed step, leading 
to the synthesis of isoleucine. The enzyme 
that catalyzes this step, threonine deaminase, 
is inhibited by isoleucine and activated by 
valine, the product of a parallel pathway. 


Kk FIGURE 24.19 A recurring regulatory 
domain. The regulatory domain formed by 
two subunits of 3-phosphoglycerate 
dehydrogenase is structurally related to the 
single-chain regulatory domain of threonine 
deaminase. Notice that both structures 
have four a helices and eight 8 strands in 
similar locations. Sequence analyses have 
revealed this amino acid-binding regulatory 
domain to be present in other enzymes as 
well. [Drawn from 1PSD and 1TDu.pdb.] 
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FIGURE 24.20 Domain structures of 


three aspartokinases. Each catalyzes the Aspartokinase 

committed step in the biosynthesis of a domain 

different amino acid: (top) methionine, SS 
(middle) threonine, and (bottom) lysine. Unregulated 


They have a catalytic domain in common 

(red) but differ in their regulatory domains CO 
(yellow and orange). The blue domain Threonine sensitive 

represents another enzyme (homoserine 
dehydrogenase) involved in aspartate 
metabolism. Thus, the top two Lysine sensitive 
aspartokinases are bifunctional enzymes. 


domains are fused into a single unit with two differentiated amino acid- 
binding sites, one for isoleucine and the other for valine. Sequence analysis 
shows that similar regulatory domains are present in other amino acid bio- 
synthetic enzymes. The similarities suggest that feedback-inhibition processes 
may have evolved by the linkage of specific regulatory domains to the catalytic 
domains of biosynthetic enzymes. 


Enzyme multiplicity. The committed step can be catalyzed by two or more 
isozymes, enzymes with essentially identical catalytic mechanisms, but dif- 
ferent regulatory properties (Section 10.2). For example, the phosphoryla- 
tion of aspartate is the committed step in the biosynthesis of threonine, 
methionine, and lysine. Three distinct aspartokinases, which evolved by 
gene duplication, catalyze this reaction in E. coli (Figure 24.20). The cata- 
lytic domains of these enzymes show approximately 30% sequence identity. 
Although the mechanisms of catalysis are the same, their activities are regu- 
lated differently: one enzyme is not subject to feedback inhibition, another 
is inhibited by threonine, and the third is inhibited by lysine. 


Cumulative feedback inhibition. A common step is partly inhibited by 
each of the final products, acting independently. The regulation of gluta- 
mine synthetase in E. coli is a striking example of cumulative feedback 
inhibition. Recall that glutamine is synthesized from glutamate, NH and 
ATP. Glutamine synthetase consists of 12 identical 50-kDa subunits arranged 
in two hexagonal rings that face each other. This enzyme regulates the flow 
of nitrogen and hence plays a key role in controlling bacterial metabolism. 
The amide group of glutamine 1s a source of nitrogen in the biosyntheses of 
a variety of compounds, such as tryptophan, histidine, carbamoyl phos- 
phate, glucosamine 6-phosphate, cytidine triphosphate, and adenosine 
monophosphate. Glutamine synthetase is cumulatively inhibited by each of 
these final products of glutamine metabolism, as well as by alanine and 
glycine. In cumulative inhibition, each inhibitor can reduce the activity of the 
enzyme, even when other inhibitors are bound at saturating levels. The enzy- 
matic activity of glutamine synthetase is switched off almost completely 
when all final products are bound to the enzyme. 


The sensitivity of glutamine synthetase to allosteric regulation 
is altered by covalent modification 


The activity of glutamine synthetase is also controlled by reversible covalent 
modification—the attachment of an AMP unit by a phosphodiester linkage 
to the hydroxyl group of a specific tyrosine residue in each subunit. This 
adenylylated enzyme is less active and more susceptible to cumulative feedback 
inhibition than is the deadenylylated form. The covalently attached AMP unit 
is removed from the adenylylated enzyme by phosphorolysis. 


The adenylylation and phosphorolysis reactions are catalyzed 
by the same enzyme, adenylyl transferase. The adenylyl transfer- 
ase is composed of two homologous halves, one half catalyzing 
the adenylylation reaction and the other half the phosphorolytic 
deadenylylation reaction. What determines whether an AMP 
unit is added or removed? The specificity of adenylyl transferase 
is controlled by a regulatory protein P,,, a trimeric protein that can 
exist in two forms, unmodified (P};) or covalently bound to UMP 
(P,;-UMP). The complex of P,, and adenylyl transferase cata- 
lyzes the attachment of an AMP unit to glutamine synthetase, 
which reduces its activity. Conversely, the complex of P;;,-UMP 
and adenylyl transferase removes AMP from the adenylylated 
enzyme (Figure 24.21). 

This scheme of regulation immediately raises the question, 
how is the modification of P,, controlled? P|, is converted into P,,- 
UMP by the attachment of uridine monophosphate to a specific 
tyrosine residue (Figure 24.21). This reaction, which is catalyzed by uridy- 
lyl transferase, is stimulated by ATP and a-ketoglutarate, whereas it is 
inhibited by glutamine. In turn, the UMP units on P,, are removed 
by hydrolysis, a reaction promoted by glutamine and inhibited by 
a-ketoglutarate. These opposing catalytic activities are present on a single 
polypeptide chain and are controlled so that the enzyme does not simulta- 
neously catalyze uridylylation and hydrolysis. In essence, if glutamine is 
present, the covalent modification system favors adenylylation and inacti- 
vation of glutamine synthetase. If glutamine is absent, as indicated by the 
presence of its precursors a-ketoglutarate and ATP, the control system 
results in the deadenylylation and activation of the synthetase. 

The integration of nitrogen metabolism in a cell requires that a large 
number of input signals be detected and processed. In addition, the regula- 
tory protein P,, also participates in regulating the transcription of genes for 
glutamine synthetase and other enzymes taking part in nitrogen metabo- 
lism. The evolution of covalent regulation superimposed on feedback inhi- 
bition provided many more regulatory sites and made possible a finer tun- 
ing of the flow of nitrogen in the cell. We have previously seen such a dual 
regulatory format in the regulation of glycogen metabolism (Section 21.5). 
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FIGURE 24.21 Covalent regulation 

of glutamine synthetase. Adenylyl 
transferase (AT), in association with the 
regulatory protein P,,, adenylylates and 
inactivates the synthetase. When 
associated with P;, bound to UMP, AT 
deadenylylates the synthetase, thereby 
activating the enzyme. Uridylyl transferase 
(UT), the enzyme that modifies P,,, is 
allosterically regulated by a-ketoglutarate, 
ATP, and glutamine. [Information from 

D. L. Nelson and M. M. Cox, Lehninger 
Principles of Biochemistry 6th ed. 

(W. H. Freeman and Company, 2013), 

Fig. 22.91] 
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24.4 Amino Acids Are Precursors of Many Biomolecules 


In addition to being the building blocks of proteins and peptides, amino 
acids serve as precursors of many kinds of small molecules that have impor- 
tant and diverse biological roles. Let us briefly survey some of the biomol- 
ecules that are derived from amino acids (Figure 24.22). 


Adenine Cytosine 


HO I 


Thyroxine 
(Tetraiodothyronine) 


FIGURE 24.22 Selected biomolecules 
derived from amino acids. The atoms 
contributed by amino acids are shown 
in blue. 
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Purines and pyrimidines are derived largely from amino acids. The bio- 
synthesis of these precursors of DNA, RNA, and numerous coenzymes will 
be discussed in detail in Chapter 25. The reactive terminus of sphingosine, 
an intermediate in the synthesis of sphingolipids, comes from serine. 
Histamine, a potent vasodilator, is derived from histidine by decarboxyl- 
ation. Tyrosine is a precursor of thyroxine (tetraiodothyronine, a hormone 
that modulates metabolism), epinephrine (adrenaline), and melanin (a com- 
plex polymeric molecule responsible for skin pigmentation). The neu- 
rotransmitter serotonin (5-hydroxytryptamine) and the nicotinamide ring of 
NAD* are synthesized from tryptophan. Let us now consider in more 
detail three particularly important biochemicals derived from amino acids. 


Glutathione, a gamma-glutamyl peptide, serves as a sulfhydryl 
buffer and an antioxidant 


Glutathione, a tripeptide containing a sulfhydryl group, is a highly distinc- 
tive amino acid derivative with several important roles (Figure 24.23). For 
example, glutathione, present at high levels (~5 mM) in animal cells, 
protects red blood cells from oxidative damage by serving as a sulfhy- 
dryl buffer (Section 20.5). It cycles between a reduced thiol form 
(GSH) and an oxidized form (GSSG) in which two tripeptides are 


C00" linked by a disulfide bond. 


2GSH + RO—OH = — G8SG + H,O + ROH 


y-Glutamate Cysteine Glycine 


FIGURE 24.23 Glutathione. This tripeptide consists 
of a cysteine residue flanked by a glycine residue and 
a glutamate residue that is linked to cysteine by an 
isopeptide bond between glutamate’s side-chain 
carboxylate group and cysteine’s amino group. 


GSSG is reduced to GSH by glutathione reductase, a flavoprotein 
that uses NADPH as the electron source. The ratio of GSH to GSSG 
in most cells is greater than 500. Glutathione plays a key role in detoxi- 
fication by reacting with hydrogen peroxide and organic peroxides, the 
harmful by-products of aerobic life. 


Selenocysteine 


GSSG E-Se- (ROOH 
+Ht Selenolate 
GSH ROH 
E-Se-S-G E-SeOH 


Selenosulfide Selenenic acid 
H,O0 GSH 


FIGURE 24.25 Catalytic cycle of 
glutathione peroxidase. [Information from 
O. Epp, R. Ladenstein, and A. Wendel. Eur. J. 
Biochem. 133(1983):51-69.] 


K FIGURE 24.24 Structure of 
glutathione peroxidase. This enzyme, 
which has a role in peroxide detoxification, 
contains a selenocysteine residue in its 
active site. [Drawn from 1GP1.pdb.] 


Glutathione peroxidase, the enzyme catalyzing this reaction, is 
remarkable in having a modified amino acid containing a selenium (Se) 
atom (Figure 24.24). Specifically, its active site contains the selenium 
analog of cysteine, in which selenium has replaced sulfur. The selenolate 
(E-Se~) form of this residue reduces the peroxide substrate to an alcohol 
and is in turn oxidized to selenenic acid (E-SeOH). Glutathione then 
comes into action by forming a selenosulfide adduct (E-Se-5-G). A sec- 
ond molecule of glutathione then regenerates the active form of the 
enzyme by attacking the selenosulfide to form oxidized glutathione 
(Figure 24.25). 


Nitric oxide, a short-lived signal molecule, is formed from arginine 


Nitric oxide (NO) is an important messenger in many vertebrate signal- 
transduction processes, identified first as a relaxing factor in the cardiovas- 
cular system. It is now known to have a variety of roles not only in the 
cardiovascular system, but also in the immune and nervous systems. NO 
has also been shown to stimulate mitochondrial biogenesis. This free-radi- 
cal gas is produced endogenously from arginine in a complex reaction that is 
catalyzed by nitric oxide synthase. NADPH and O, are required for the 
synthesis of nitric oxide (Figure 24.26). Nitric oxide acts by binding to and 
activating soluble guanylate cyclase, an important enzyme in signal trans- 
duction (Section 32.3). This enzyme is homologous to adenylate cyclase but 
includes a heme-containing domain that binds NO. 
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15N labeling: A pioneer’s account 
“Myself as a Guinea Pig 

...in 1944, | undertook, together with 
David Rittenberg, an investigation on the 
turnover of blood proteins of man. To this 
end | synthesized 66 g of glycine labeled with 
35 percent !5N at a cost of $1000 for the 1N. 
On 12 February 1945, | started the ingestion 
of the labeled glycine. Since we did not know 
the effect of relatively large doses of the stable 
isotope of nitrogen and since we believed that 
the maximum incorporation into the proteins 
could be achieved by the administration of 
glycine in some continual manner, | ingested 
1 g samples of glycine at hourly intervals for the 
next 66 hours... . At stated intervals, blood 
was withdrawn and after proper preparation 
the '®N concentrations of different blood 
proteins were determined.” 


—David Shemin 
Bioessays 10(1989):30 


FIGURE 24.26 
Formation of nitric 
oxide. NO is generated 
by the oxidation of 
arginine. 
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FIGURE 24.27 Heme labeling. The origins 
of atoms in heme revealed by the results of 
isotopic labeling studies. 
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Porphyrins are synthesized from glycine and succinyl coenzyme A 


The participation of an amino acid in the biosynthesis of the porphyrin 
rings of hemes and chlorophylls was first revealed by isotope-labeling 
experiments carried out by David Shemin and his colleagues. In 1945, they 
showed that the nitrogen atoms of heme were labeled after the feeding of 
[!5N]glycine to human subjects, whereas the ingestion of [!°N]glutamate 
resulted in very little labeling. 

Experiments using 4C, which had just become available, revealed that 
8 of the carbon atoms of heme in nucleated duck erythrocytes are derived 
from the a-carbon atom of glycine and none from the carboxyl carbon atom. 
Subsequent studies demonstrated that the other 26 carbon atoms of heme 
can arise from acetate. Moreover, the '*C in methyl-labeled acetate emerged 
in 24 of these carbon atoms, whereas the '4C in carboxyl-labeled acetate 
appeared only in the other 2 (Figure 24.27). 

This highly distinctive labeling pattern suggested that acetate is converted 
to succinyl-CoA through enzymes from the citric acid cycle (Section 17.2) and 
that a heme precursor is formed by the condensation of glycine with succinyl- 
CoA. Indeed, the first step in the biosynthesis of porphyrins in mammals is the 
condensation of glycine and succinyl CoA to form 6-aminolevulinate. 


~00C CoA ~OOC. 
+ 
H* co, 
. oe ae 
coA + -00c~ SNH5* NH;* 
O O 
Succinyl CoA Glycine 5-Aminolevulinate 


This reaction is catalyzed by 6-aminolevulinate synthase, a PLP enzyme 
present in mitochondria. Consistent with the labeling studies described 
earlier, the carbon atom from the carboxyl group of glycine is lost as carbon 
dioxide, while the a-carbon remains in 6-aminolevulinate. 

Two molecules of 6-aminolevulinate condense to form porphobilinogen, 
the next intermediate. Four molecules of porphobilinogen then condense 
head to tail to form a linear tetrapyrrole in a reaction catalyzed by porphobi- 
linogen deaminase. The enzyme-bound linear tetrapyrrole then cyclizes to 
form uroporphyrinogen III, which has an asymmetric arrangement of side 
chains. This reaction requires a cosynthase. In the presence of synthase 
alone, uroporphyrinogen I, the nonphysiological symmetric isomer, is pro- 
duced. Uroporphyrinogen III is also a key intermediate in the synthesis of 
vitamin B,, by bacteria and that of chlorophyll by bacteria and plants 
(Figure 24.28). 

The porphyrin skeleton is now formed. Subsequent reactions alter the 
side chains and the degree of saturation of the porphyrin ring (Figure 24.28). 
Coproporphyrinogen III is formed by the decarboxylation of the acetate side 
chains. The desaturation of the porphyrin ring and the conversion of two of 
the propionate side chains into vinyl groups yield protoporphyrin LX. The 
chelation of iron finally gives heme, the prosthetic group of proteins such as 
myoglobin, hemoglobin, catalase, peroxidase, and cytochrome c. The inser- 
tion of the ferrous form of iron is catalyzed by ferrochelatase. Iron is trans- 
ported in the plasma by transferrin, a protein that binds two ferric ions, and 
is stored in tissues inside molecules of ferritin (Section 32.4). 

Studies with !SN-labeled glycine revealed that the normal human eryth- 
rocyte has a life span of about 120 days. The first step in the degradation of 
the heme group is the cleavage of its a-methine bridge to form the green 
pigment biliverdin, a linear tetrapyrrole. The central methine bridge 
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Iron FIGURE 24.28 Heme biosynthetic pathway. The pathway 
for the formation of heme starts with eight molecules of 
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Heme 


of biliverdin is then reduced by biliverdin reductase to form bilirubin, a red 
pigment (Figure 24.29). The changing color of a bruise is a highly graphic 
indicator of these degradative reactions. 


Porphyrins accumulate in some inherited disorders of 
porphyrin metabolism 


e Porphyrias are inherited or acquired disorders caused by a defi- 
ciency of enzymes in the heme biosynthetic pathway. Porphyrin is 
synthesized in both the erythroblasts and the liver, and either one may be 
the site of a disorder. Congenital erythropoietic porphyria, for example, 
prematurely destroys erythrocytes. This disease results from insufficient 
cosynthase. In this porphyria, the synthesis of the required amount of 
uroporphyrinogen III is accompanied by the formation of very large quan- 
tities of uroporphyrinogen I, the useless symmetric isomer. Uroporphyrin 
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FIGURE 24.29 Heme degradation. The 
formation of the heme-degradation 
products biliverdin and bilirubin is 
responsible for the color of bruises. 
Abbreviations: M, methy!; V, vinyl. 
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I, coproporphyrin I, and other symmetric derivatives also accumulate. The 
urine of patients having this disease is red because of the excretion of large 
amounts of uroporphyrin I. Their teeth exhibit a strong red fluorescence 
under ultraviolet light because of the deposition of porphyrins. 
Furthermore, their skin is usually very sensitive to light because photoex- 
cited porphyrins are quite reactive. Acute intermittent porphyria is the most 
prevalent of the porphyrias affecting the liver. This porphyria is character- 
ized by the overproduction of porphobilinogen and 6-aminolevulinate, 
which results in severe abdominal pain and neurological dysfunction. The 
“madness” of George III, king of England during the American Revolution, 
may have been due to this porphyria. 


SUMMARY 


24.1 Nitrogen Fixation: Microorganisms Use ATP and a Powerful 

Reductant to Reduce Atmospheric Nitrogen to Ammonia 

Microorganisms use ATP and reduced ferredoxin, a powerful reduc- 
tant, to reduce N,, to NH,. An iron-molybdenum cluster in nitroge- 
nase deftly catalyzes the fixation of N,, a very inert molecule. Higher 
organisms consume the fixed nitrogen to synthesize amino acids, 
nucleotides, and other nitrogen-containing biomolecules. The major 
points of entry of NH,* into metabolism are glutamine or glutamate. 


24.2 Amino Acids Are Made from Intermediates of the Citric Acid Cycle 
and Other Major Pathways 
Human beings can synthesize 11 of the basic set of 20 amino 
acids. These amino acids are called nonessential, in contrast with the 


24.3 


24.4 


essential amino acids, which must be supplied in the diet. The path- 
ways for the synthesis of nonessential amino acids are quite simple. 
Glutamate dehydrogenase catalyzes the reductive amination of 
a-ketoglutarate to glutamate. A transamination reaction takes place 
in the synthesis of most amino acids. At this step, the chirality of the 
amino acid is established. Alanine and aspartate are synthesized by 
the transamination of pyruvate and oxaloacetate, respectively. 
Glutamine is synthesized from NH,* and glutamate, and aspara- 
gine is synthesized similarly. Proline and arginine are derived from 
glutamate. Serine, formed from 3-phosphoglycerate, is the precursor 
of glycine and cysteine. Tyrosine is synthesized by the hydroxylation 
of phenylalanine, an essential amino acid. The pathways for the bio- 
synthesis of essential amino acids are much more complex than those 
for the nonessential ones. 

Tetrahydrofolate, a carrier of activated one-carbon units, plays an 
important role in the metabolism of amino acids and nucleotides. This 
coenzyme carries one-carbon units at three oxidation states, which are 
interconvertible: most reduced—methyl; intermediate—methylene; 
and most oxidized—formyl, formimino, and methenyl. The major 
donor of activated methyl groups is S-adenosylmethionine, which is 
synthesized by the transfer of an adenosyl group from ATP to the 
sulfur atom of methionine. S-Adenosylhomocysteine is formed when 
the activated methyl group is transferred to an acceptor. It is hydro- 
lyzed to adenosine and homocysteine, and the latter is then methylated 
to methionine to complete the activated methy] cycle. 


Feedback Inhibition Regulates Amino Acid Biosynthesis 

Most of the pathways of amino acid biosynthesis are regulated by 
feedback inhibition in which the committed step is allosterically 
inhibited by the final product. The regulation of branched pathways 
requires extensive interaction among the branches that includes both 
negative and positive regulation. The regulation of glutamine synthe- 
tase in E. coli is a striking demonstration of cumulative feedback inhi- 
bition and of control by a cascade of reversible covalent modifications. 


Amino Acids Are Precursors of Many Biomolecules 

Amino acids are precursors of a variety of biomolecules. Glutathione 
(y-Glu-Cys-Gly) serves as a sulfhydryl buffer and detoxifying agent. 
Glutathione peroxidase, a selenoenzyme, catalyzes the reduction of 
hydrogen peroxide and organic peroxides by glutathione. Nitric oxide, 
a short-lived messenger, is formed from arginine. Porphyrins are syn- 
thesized from glycine and succinyl CoA, which condense to give 
6-aminolevulinate. Two molecules of this intermediate become linked 
to form porphobilinogen. Four molecules of porphobilinogen combine 
to form a linear tetrapyrrole, which cyclizes to uroporphyrinogen III. 
Oxidation and side-chain modifications lead to the synthesis of proto- 
porphyrin [X, which acquires an iron atom to form heme. 
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KEY TERMS 
nitrogen fixation (p. 714) tetrahydrofolate (p. 723) enzyme multiplicity (p. 732) 
nitrogenase complex (p. 715) S-adenosylmethionine (SAM) (p. 724) cumulative feedback inhibition (p. 732) 
essential amino acids (p. 719) activated methyl cycle (p. 725) glutathione (p. 734) 
nonessential amino acids (p. 719) substrate channeling (p. 729) nitric oxide (NO) (p. 735) 


pyridoxal phosphate (p. 720) committed step (p. 730) 


porphyria (p. 737) 
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| PROBLEMS 


1. Out of thin air. Define nitrogen fixation. What organ- 
isms are capable of nitrogen fixation? 


2. Like Trinidad and Tobago. Match each term with its 
description. 


(a) Nitrogen fixation 1. Methylated to form 


(b) Nitrogenase complex methionine 
2. An important methyl 
(c) Glutamate_ donor 
(d) Essential amino acids 3. Coenzyme required by 
aminotransferases 
(e) Nonessential amino acids 4. Conversion of N, 
into NH, 
(f) Aminotransferase__——— 55. A carrier of various 


one-carbon units 

6. Amino acids that are 

(h) Tetrahydrofolate_ dietary requirements 

(i) S-Adenosylmethionine 7. Amino acids that are 

readily synthesized 

(j) Homocysteine 8. Responsible for nitrogen 
fixation 

9. Transfers amino groups 

between keto acids 

10. A common amino 
group donor 


(g) Pyridoxal phosphate 


3. Teamwork. Identify the two components of the nitroge- 
nase complex and describe their specific tasks. 


4. The fix is in. “The mechanistic complexity of nitroge- 
nase is necessary because nitrogen fixation is a thermody- 
namically unfavorable process.” True or false? Explain. 


5. Siphoning resources. Nitrogen-fixing bacteria on the 
roots of some plants can consume as much as 20% of the 
ATP produced by their host—consumption that does not 
seem very beneficial to the plant. Explain why this loss of 
valuable resources is tolerated and what the bacteria are 
doing with the ATP. 


6. From few, many. What are the seven precursors of the 20 
amino acids? 


7. Vital, in the truest sense. Why are certain amino acids 
defined as essential for human beings? 


8. From sugar to amino acid. Write a balanced equation for 
the synthesis of alanine from glucose. 


9. From air to blood. What are the intermediates in the flow 
of nitrogen from N,, to heme? 


10. Common component. What cofactor is required by all 
aminotransferases? 


11. Here, hold this. In this chapter, we considered three dif- 
ferent cofactors/cosubstrates that act as carriers of one- 
carbon units. Name them. 


12. One-carbon transfers. Which derivative of folate is a 
reactant in the conversion of (a) glycine into serine? (b) 
homocysteine into methionine? 


13. Telltale tag. In the reaction catalyzed by glutamine syn- 
thetase, an oxygen atom is transferred from the side chain of 
glutamate to orthophosphate, as shown by the results of 
18Q)-labeling studies. Account for this finding. 


14. Telltale tag, redux. In contrast to the production of glu- 
tamine by glutamine synthetase (problem 13), the genera- 
tion of asparagine from '°O-labeled aspartate does not 
result in the transfer of an 'SO atom to orthophosphate. In 
what molecule do you expect to find one of the !8O atoms? 


15. Therapeutic glycine. Isovaleric acidemia is an inherited 
disorder of leucine metabolism caused by a deficiency of 
isovaleryl CoA dehydrogenase. Many infants having this 
disease die in the first month of life. The administration of 
large amounts of glycine sometimes leads to marked clinical 
improvement. Propose a mechanism for the therapeutic 
action of glycine. 


16. Lending a hand. The atoms from tryptophan shaded 
below are derived from two other amino acids. Name them. 


Tryptophan 


17. Deprived bacteria. Blue-green algae (cyanobacteria) 
form heterocysts when deprived of ammonia and nitrate. In 
this form, the cyanobacteria lack nuclei and are attached to 
adjacent vegetative cells. Heterocysts have photosystem I 
activity but are entirely devoid of photosystem II activity. 
What is their role? 


18. Cysteine and cystine. Most cytoplasmic proteins lack 
disulfide bonds, whereas extracellular proteins usually con- 
tain them. Why? 


19. Through the looking-glass. Suppose that aspartate ami- 
notransferase were chemically synthesized with the use of 
D-amino acids only. What products would you expect if 
this mirror-image enzyme were treated with (a) L-aspartate 
and a-ketoglutarate; (b) D-aspartate and a-ketoglutarate? 


20. To and fro. The synthesis of 6-aminolevulinate takes 
place in the mitochondrial matrix, whereas the formation of 
porphobilinogen takes place in the cytoplasm. Propose a 
reason for the mitochondrial location of the first step in 
heme synthesis. 


21. Direct synthesis. Which of the 20 amino acids can be 
synthesized directly from a common metabolic intermedi- 
ate by a transamination reaction? 


22. Alternative route to proline. Certain species of bacteria 
possess an enzyme, ornithine cyclodeaminase, that can 
catalyze the conversion of L-ornithine into L-proline in a 
single catalytic cycle. 


NH;* 


NH,* r 
N COO- 


Hp 


Ornithine Proline 


*H;N COO- 


The enzyme lysine cyclodeaminase has also been identified. 
Predict the product of the reaction catalyzed by lysine 
cyclodeaminase. 


23. Lines of communication. For the following example of a 
branched pathway, propose a feedback inhibition scheme 
that would result in the production of equal amounts of Y 
and Z. 


D —— E —~ Y 


A 


B——-C 


G——-Z 


24. Cumulative feedback inhibition. Consider the branched 
pathway in Problem 23. The first common step (A —> B) is 
partly inhibited by both of the final products, each acting 
independently of the other. Suppose that a high level of Y 
alone decreased the rate of the A — B step from 100 to 60 s~! 
and that a high level of Z alone decreased the rate from 100 
to 40 s-'. What would the rate be in the presence of high 
levels of both Y and Z? 


25. Recovered activity. Free sulfhydryl groups can be 
alkylated with 2-bromoethylamine to the corresponding 
thioether. 


NH. HBr 
3 
Beg + i ae a 


2-Bromoethylamine 
NH;* 
ge ° 


Researchers prepared a mutant form of aspartate amino- 
transferase in which lysine 258 was replaced by cysteine 
(Lys258Cys). This mutant protein has no observable cata- 
lytic activity. However, treatment of Lys258Cys with 
2-bromoethylamine yielded a protein with ~7% activity 
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relative to the wild-type enzyme. Explain why alkylation 
recovered some enzyme activity. 


Mechanism Problems 


26. Ethylene formation. Propose a mechanism for the conver- 
sion of S-adenosylmethionine into 1-aminocyclopropane- 
1-carboxylate (ACC) by ACC synthase, a PLP enzyme. 
What is the other product? 


27. Mirror-image serine. Brain tissue contains substantial 
amounts of D-serine, which acts as a neurotransmitter. 
D-serine is generated from L-serine by serine racemase, a 
PLP enzyme. Propose a mechanism for the interconversion 
of L- and D-serine. What is the equilibrium constant for the 
reaction L-serine == D-serine? 


28. An unusual amino acid. Elongation factor-2 (eEF-2), a 
protein taking part in translation, contains a histidine resi- 
due that is modified posttranslationally in several steps to a 
complex side chain known as diphthamide. An intermedi- 
ate along this pathway is referred to as diphthine. 


(a) Labeling experiments indicate that the diphthine inter- 
mediate is formed by the modification of histidine with 
four molecules of S-adenosylmethionine (indicated by the 
four colors). Propose a mechanism for the formation of 


diphthine. 


(b) The final conversion of diphthine into diphthamide is 
known to be ATP dependent. Propose two possible mecha- 
nisms for the final amidation step. 


HTN 
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Histidine 


Diphthine 


Diphthamide 
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Chapter Integration Problems 


29. Here, hold this again. In this chapter, we considered 
three different cofactors/cosubstrates that act as carriers of 
one-carbon units (problem 11). Now, name another carrier 
of one carbon units that we have encountered previously. 


30. Connections. How might increased synthesis of aspar- 
tate and glutamate affect energy production in a cell? How 
would the cell respond to such an effect? 


31. Protection required. Suppose that a mutation in bacteria 
resulted in the diminished activity of methionine adenosyl- 
transferase, the enzyme responsible for the synthesis of 
SAM from methionine and ATP. Predict how this dimin- 
ished activity might affect the stability of the mutated 
bacteria’s DNA. 


32. Heme biosynthesis. Shemin and coworkers used acetate- 
labeling experiments to conclude that succinyl-CoA is a key 
intermediate in the biosynthesis of heme. Identify the inter- 
mediates in the conversion of acetate into succinyl-CoA. 


33. Comparing K,,. Glutamate dehydrogenase (p. 717) 
and glutamine synthetase (p. 717) are present in all organ- 
isms. Most prokaryotes also contain another enzyme, 
glutamate synthase, which catalyzes the reductive amina- 
tion of a-ketoglutarate with the use of glutamine as the 
nitrogen donor. 


a-Ketoglutarate + glutamine + NADPH + H* 


Glutamate 
synthase 


2 glutamate + NADPt 


The side-chain amide of glutamine is hydrolyzed to gener- 
ate ammonia within the enzyme. When NH," is limiting, 
most of the glutamate is made by the sequential action of 
glutamine synthetase and glutamate synthase. The sum of 
these reactions is 


NH," + a-ketoglutarate + NADPH + ATP —> 
glutamate + NADP* + ADP + P; 


Note that this stoichiometry differs from that of the gluta- 
mate dehydrogenase reaction in that ATP is hydrolyzed. 
Why do prokaryotes sometimes use this more-expensive 
pathway? (Hint: The K,, value for NH,* of glutamate 
dehydrogenase is higher than that of glutamine synthase.) 


Chapter Integration and Data Interpretation Problem 


34. Light effects. The adjoining graph shows the concentra- 
tion of several free amino acids in light- and dark-adapted 
plants. 
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Free amino acids (pmol x 10? per leaf) 


[Data from B.B. Buchanan, W. Gruissem, and R.L. Jones, 
Biochemistry and Molecular Biology of Plants (American 
Society of Plant Physiology, 2000), Fig. 8.3, p. 363.] 


(a) Of the amino acids shown, which are most affected by 
light-dark adaptation? 


(b) Suggest a plausible biochemical explanation for the dif- 
ference observed. 


(c) White asparagus, a culinary delicacy, is the result of 
growing asparagus plants in the dark. What chemical might 
you think enhances the taste of white asparagus? 


Nucleotide Biosynthesis 


CHAPTER 


Methotrexate 


Nucleotides are required for DNA replication and cell division. A key enzyme for the synthesis of one nucleotide is 
dihydrofolate reductase (right). Cells grown in the presence of methotrexate, a reductase inhibitor, respond by increasing the 
number of copies of the reductase gene. The bright yellow regions visible on three of the chromosomes in the fluorescence 
micrograph (left), which were grown in the presence of methotrexate, contain hundreds of copies of the reductase gene. 


[(Left) Courtesy of Dr. Barbara Trask and Dr. Joyce Hamlin.] 


ucleotides are key biomolecules required for a variety of life processes. 

First, nucleotides are the activated precursors of nucleic acids, necessary 
for the replication of the genome and the transcription of the genetic infor- 
mation into RNA. Second, an adenine nucleotide, ATP, is the universal 
currency of energy. A guanine nucleotide, GTP, also serves as an energy 
source for a more select group of biological processes. Third, nucleotide 
derivatives such as UDP-glucose participate in biosynthetic processes, for 
example, the formation of glycogen. Fourth, nucleotides are essential compo- 
nents of signal-transduction pathways. Cyclic nucleotides such as cyclic 
AMP and cyclic GMP are second messengers that transmit signals both 
within and between cells. Furthermore, ATP acts as the donor of phospho- 
ryl groups transferred by protein kinases in a variety of signaling pathways 
and, in some cases, ATP is secreted as a signal molecule. 

In this chapter, we continue along the path begun in Chapter 24, which 
described the incorporation of nitrogen into amino acids from inorganic 
sources such as nitrogen gas. The amino acids glycine and aspartate are the 
scaffolds on which the ring systems present in nucleotides are assembled. 
Furthermore, aspartate and the side chain of glutamine serve as sources of 
NH, groups in the formation of nucleotides. 


OUTLINE 


25.1 The Pyrimidine Ring Is Assembled 


25.2 


25.3 


25.4 


25.5 


de Novo or Recovered by 
Salvage Pathways 


Purine Bases Can Be 
Synthesized de Novo or Recycled 
by Salvage Pathways 


Deoxyribonucleotides Are 
Synthesized by the Reduction of 
Ribonucleotides Through a 
Radical Mechanism 


Key Steps in Nucleotide 
Biosynthesis Are Regulated by 
Feedback Inhibition 


Disruptions in Nucleotide 
Metabolism Can Cause 
Pathological Conditions 
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DE NOVO PATHWAY 


Activated ribose (PRPP) + amino acids 
+ ATP + CO) +--: 


| 


Nucleotide 


SALVAGE PATHWAY 


Activated ribose (PRPP) + base 


| 


Nucleotide 


FIGURE 25.1 De novo and salvage 
pathways. In de novo synthesis, the base 
itself is synthesized from simpler starting 
materials, including amino acids. ATP 
hydrolysis is required for de novo synthesis. 
In a salvage pathway, a base is reattached 
to a ribose, activated in the form of 


5-phosphoribosyl-1-pyrophosphate (PRPP). 


Bicarbonate 
+ 
NH; 


2a | 


Carbamoyl 


phosphate Aspartate 


Pyrimidine 
ring 
PRPP 
(a ribose 


{ phosphate) 


UTP == CTP to RNA 


v v 


TTP dCTP) to DNA 


FIGURE 25.2 De novo pathway for 
pyrimidine nucleotide synthesis. The C-2 
and N-3 atoms in the pyrimidine ring come 
from carbamoyl phosphate, whereas the 


other atoms of the ring come from aspartate. 


TABLE 25.1 Nomenclature of bases, nucleosides, and nucleotides 


RNA 
Ribonucleotide 

Base Ribonucleoside (5’-monophosphate) 
Adenine (A) Adenosine Adenylate (AMP) 
Guanine (G) Guanosine Guanylate (GMP) 
Uracil (U) Uridine Uridylate (UMP) 
Cytosine (C) Cytidine Cytidylate (CMP) 
DNA 

Deoxyribonucleotide 
Base Deoxyribonucleoside (5'-monophosphate) 
Adenine (A) Deoxyadenosine Deoxyadenylate (dAMP) 
Guanine (G) Deoxyguanosine Deoxyguanylate (dGMP) 
Thymine (T) Thymidine Thymidylate (TMP) 
Cytosine (C) Deoxycytidine Deoxycytidylate (dCMP) 


Nucleotide biosynthetic pathways are tremendously important as inter- 
vention points for therapeutic agents. Many of the most widely used drugs 
in the treatment of cancer block steps in nucleotide biosynthesis, particu- 
larly steps in the synthesis of DNA precursors. 


Nucleotides can be synthesized by de novo or salvage pathways 


The pathways for the biosynthesis of nucleotides fall into two classes: de 
novo pathways and salvage pathways (Figure 25.1). In de novo (from 
scratch) pathways, the nucleotide bases are assembled from simpler com- 
pounds. The framework for a pyrimidine base is assembled first and then 
attached to ribose. In contrast, the framework for a purine base is synthe- 
sized piece by piece directly onto a ribose-based structure. These pathways 
each comprise a small number of elementary reactions that are repeated 
with variations to generate different nucleotides, as might be expected for 
pathways that appeared very early in evolution. In salvage pathways, pre- 
formed bases are recovered and reconnected to a ribose unit. 

De novo pathways lead to the synthesis of ribonucleotides. However, 
DNA is built from deoxyribonucleotides. Consistent with the notion that 
RNA preceded DNA in the course of evolution, all deoxyribonucleotides 
are synthesized from the corresponding ribonucleotides. The deoxyribose 
sugar is generated by the reduction of ribose within a fully formed nucleo- 
tide. Furthermore, the methyl group that distinguishes the thymine of 
DNA from the uracil of RNA is added at the last step in the pathway. 

The nomenclature of nucleotides and their constituent units was pre- 
sented in Chapter 4. Recall that a nucleoside is a purine or pyrimidine base 
linked to a sugar and that a nucleotide is a phosphate ester of a nucleoside. 
The names of the major bases of RNA and DNA, and of their nucleoside 
and nucleotide derivatives, are given in Table 25.1. 


25.1 The Pyrimidine Ring Is Assembled de Novo or 
Recovered by Salvage Pathways 


In de novo synthesis of pyrimidines, the ring is synthesized first and 
then it is attached to a ribose phosphate to form a pyrimidine nucleotide 
(Figure 25.2). Pyrimidine rings are assembled from bicarbonate, aspartate, 
and ammonia. Although an ammonia molecule already present in solution 
can be used, the ammonia is usually produced from the hydrolysis of the 
side chain of glutamine. 


Bicarbonate and other oxygenated carbon compounds are 
activated by phosphorylation 


The first step in de novo pyrimidine biosynthesis is the synthesis of car- 
bamoyl phosphate from bicarbonate and ammonia in a multistep process, 
requiring the cleavage of two molecules of ATP. This reaction is catalyzed 
by carbamoyl phosphate synthetase II (CPS II). Recall that carbamoyl phos- 
phate synthetase I facilitates ammonia incorporation into urea (Section 
23.4). Carbamoyl phosphate synthetase II is a dimer composed of a small 
subunit that hydrolyzes glutamine to form NH, and a large subunit that 
completes the synthesis of carbamoyl phosphate. Analysis of the structure 
of the large subunit reveals two homologous domains, each of which cata- 
lyzes an ATP-dependent step (Figure 25.3). 

In the first step, bicarbonate is phosphorylated by ATP to form car- 
boxyphosphate and ADP. Ammonia then reacts with carboxyphosphate to 
form carbamic acid and inorganic phosphate. 


ee ———————————— C =O 
How (OO Ho ~o~ NG HO~ NH, 


Bicarbonate Carboxyphosphate Carbamic acid 


The active site for this reaction lies ina domain formed by the amino-terminal 
third of CPS. This domain forms a structure called an ATP-grasp fold, which 
surrounds ATP and holds it in an orientation suitable for nucleophilic attack 
at the y phosphoryl group. Proteins containing AT P-grasp folds catalyze the 
formation of carbon—nitrogen bonds through acylphosphate intermediates. 
Such ATP-grasp folds are widely used in nucleotide biosynthesis. 

In the second step catalyzed by carbamoyl phosphate synthetase II, 
carbamic acid is phosphorylated by another molecule of ATP to form 
carbamoyl phosphate. 


oF 
HO~ ~NH, g” 0 NH, 


Carbamic acid Carbamoyl phosphate 


This reaction takes place in a second ATP-grasp domain within the enzyme. 
The active sites leading to carbamic acid formation and carbamoyl! phos- 
phate formation are very similar, revealing that this enzyme evolved by a 
gene duplication event. Indeed, duplication of a gene encoding an ATP- 
grasp domain followed by specialization was central to the evolution of 
nucleotide biosynthetic processes (p. 749). 


The side chain of glutamine can be hydrolyzed to generate ammonia 


Glutamine is the primary source of ammonia for carbamoyl phosphate 
synthetase II. In this case, the small subunit of the enzyme hydrolyzes glu- 
tamine to form ammonia and glutamate. The active site of the glutamine- 
hydrolyzing component contains a catalytic dyad comprising a cysteine and 
a histidine residue. Such a catalytic dyad, reminiscent of the active site of 
cysteine proteases (Figure 9.16), is conserved in a family of amidotransfer- 
ases, including CTP synthetase and GMP synthetase. 


Intermediates can move between active sites by channeling 


Carbamoyl phosphate synthetase II contains three different active sites 
(Figure 25.3), separated from one another by a total of 80 A. Intermediates 


Glutamine 
hydrolysis 
site 


LAW, 
Va IDINCG 
ue ray 


Carbamic acid 
phosphorylation 
site 


k FIGURE 25.3 Structure of 
carbamoyl phosphate synthetase II. 
Notice that the enzyme contains sites for 
three reactions. This enzyme consists of 
two chains. The smaller chain (yellow) 
contains a site for glutamine hydrolysis to 
generate ammonia. The larger chain 
includes two ATP-grasp domains (blue and 
red). In one ATP-grasp domain (blue), 
bicarbonate is phosphorylated to 
carboxyphosphate, which then reacts with 
ammonia to generate carbamic acid. In the 
other ATP-grasp domain, the carbamic acid 
is phosphorylated to produce carbamoyl 
phosphate. [Drawn from 1JDB.pdb.] 
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Glutamine 
NH; 
Carbamic ¢ 
acid 
hk FIGURE 25.4 Substrate 
channeling. The three active sites of 
carbamoyl phosphate synthetase II are VAN p 
linked by a channel (yellow) through we 
which intermediates pass. Glutamine p= ¢ 
enters one active site, and carbamoyl TY Sax wi 
phosphate, which includes the nitrogen Carbamoyl 
atom from the glutamine side chain, phosphate oD TAKS 
leaves another 80 A away. [Drawn from Be 
1 JDB.pdb.] 


generated at one site move to the next without leaving the enzyme. These inter- 
mediates move within the enzyme by means of substrate channeling, similar to 
the process described for tryptophan synthetase (Figure 25.4; also Figure 24.16). 
The ammonia generated in the glutamine-hydrolysis active site travels 45 A 
through a channel within the enzyme to reach the site at which carboxyphos- 
phate has been generated. The carbamic acid generated at this site diffuses an 
additional 35 A through an extension of the channel to reach the site at which 
carbamoyl phosphate is generated. This channeling serves two roles: (1) inter- 
mediates generated at one active site are captured with no loss caused by diffu- 
sion and (2) labile intermediates, such as carboxyphosphate and carbamic acid 
(which decompose in less than 1 s at pH 7), are protected from hydrolysis. We 
will see additional examples of substrate channeling later in this chapter. 


Orotate acquires a ribose ring from PRPP to form a pyrimidine 
nucleotide and is converted into uridylate 


Carbamoyl phosphate reacts with aspartate to form carbamoylaspartate 
in a reaction catalyzed by aspartate transcarbamoylase (Section 10.1). 
Carbamoylaspartate then cyclizes to form dihydroorotate, which is then 
oxidized by NAD* to form orotate. 


fe) fe) NADH fe) 
I l it I 
2-6 Aspartate PP; Cc Ht H20 Cc NAD* Ht C 
j 9 ff HN NH, HN NH LS HN“ NH 
P. Cc ee SS. = 
O-/ Now hy, ooce_| = ooc_| | i | 
NH mS COO > “~s. Ss oa. 
O 2 H . a H s 7 So -00c ie 6 SS 6) 
H H HH H 
Carbamoyl phosphate Carbamoylaspartate Dihydroorotate Orotate 


In mammals, the enzymes that form orotate are part of single large polypep- 
tide chain called CAD, for carbamoyl] phosphate synthetase, aspartate trans- 
carbamoylase and dihydroorotase. 

At this stage, orotate couples to ribose, in the form of 5-phosphoribosyl- 
1-pyrophosphate (PRPP), a form of ribose activated to accept nucleotide 
bases. 5-Phosphoribosyl-1-pyrophosphate synthetase synthesizes PRPP by 
adding a pyrophosphate from ATP to ribose-5-phosphate, which is formed 
by the pentose phosphate pathway. 


2-03POH2C 


2-0;POH C ~ 
3 2 Qo090 090 
fe) ATP AMP O - y 
4 \Y2 
OH PRPP o~ ~“o~ ‘o 
synthetase 
HO OH HO OH 
Ribose 5-phosphate PRPP 


Orotate reacts with PRPP to form orotidylate, a pyrimidine nucleotide. This 
reaction is driven by the hydrolysis of pyrophosphate. The enzyme that 
catalyzes this addition, orotate phosphoribosyltransferase, is homologous to a 
number of other phosphoribosyltransferases that add different groups to 
PRPP to form the other nucleotides. Orotidylate is then decarboxylated 
to form uridylate (UMP), a major pyrimidine nucleotide that is a precursor 
to RNA. This reaction is catalyzed by orotidylate decarboxylase, also called 
orotidine-5’ phosphate decarboxylase. 


O O 
o HN 
+ 
of H CO, of 
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Orotidylate Uridylate 

Orotidylate decarboxylase is one of the most proficient enzymes known. 
In its absence, decarboxylation is extremely slow and is estimated to take 
place once every 78 million years; with the enzyme present, it takes place 
approximately once per second, a rate enhancement of 10!7-fold. The phos- 
phoribosyltransferase and decarboxylase activities are located on the same 
polypeptide chain, providing another example of a bifunctional enzyme. 
The bifunctional enzyme is called uridine monophosphate synthetase. 


Nucleotide mono-, di-, and triphosphates are interconvertible 


How is the other major pyrimidine ribonucleotide, cytidine, formed? It is 
synthesized from the uracil base of UMP, but the synthesis can take place 
only after UMP has been converted into UTP. Recall that the diphosphates 
and triphosphates are the active forms of nucleotides in biosynthesis and 
energy conversions. Nucleoside monophosphates are converted into nucle- 
oside triphosphates in stages. First, nucleoside monophosphates are con- 
verted into diphosphates by specific nucleoside monophosphate kinases that 
utilize ATP as the phosphoryl-group donor. For example, UMP is phos- 
phorylated to UDP by UMP kinase. 


UMP + ATP == UDP + ADP 


Nucleoside diphosphates and triphosphates are interconverted by nucle- 
oside diphosphate kinase, an enzyme that has broad specificity, in contrast 
with the monophosphate kinases. X and Y represent any of several ribonu- 
cleosides or even deoxyribonucleosides: 


XDP + YTP == XTP + YDP 


CTP is formed by amination of UTP 


After uridine triphosphate has been formed, it can be transformed into cyti- 
dine triphosphate by the replacement of a carbonyl group by an amino 
group, a reaction catalyzed by cytidine triphosphate synthetase. 
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FIGURE 25.5 De novo pathway for purine 
nucleotide synthesis. The origins of the 
atoms in the purine ring are indicated. 
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Like the synthesis of carbamoyl! phosphate, this reaction requires ATP and 
uses glutamine as the source of the amino group. The reaction proceeds 
through an analogous mechanism in which the O-4 atom is phosphorylated 
to form a reactive intermediate, and then the phosphate is displaced by 
ammonia, freed from glutamine by hydrolysis. CTP can then be used in 
many biochemical processes, including lipid and RNA synthesis. 


Salvage pathways recycle pyrimidine bases 


Pyrimidine bases can be recovered from the breakdown products of DNA 
and RNA by the use of salvage pathways. In these pathways, a preformed 
base is reincorporated into a nucleotide. We will consider the salvage for the 
pyrimidine base thymine. Thymine is found in DNA and base-pairs with 
adenine in the DNA double helix. Thymine released from degraded DNA 
is salvaged in two steps. First, thymine is converted into the nucleoside 
thymidine by thymidine phosphorylase. 


Thymine + deoxyribose-1-phosphate == thymidine + P; 
Thymidine is then converted into a nucleotide by thymidine kinase. 
Thymidine + ATP == TMP + ADP 


=e Viral thymidine kinase differs from the mammalian enzyme and thus 

provides a therapeutic target. For instance, herpes simplex infections 
are treated with acyclovir (acycloguanosine), which viral thymidine kinase 
converts into acyclovir monophosphate by adding a phosphate to the 
hydroxyl group of acyclovir. Viral thymidine kinase binds acyclovir more 
than 200 times as tightly as cellular thymidine kinase, accounting for its 
effect on infected cells only. Acyclovir monophosphate is phosphorylated 
by cellular enzymes to yield acyclovir triphosphate. Acyclovir triphosphate 
competes with dGTP for DNA polymerase. Once incorporated into viral 
DNA, it acts as a chain terminator since it lacks a 3’-hydroxyl required for 
chain extension. As we will see shortly, thymidine kinase also plays a role in 
the de novo synthesis of thymidylate. 


25.2 Purine Bases Can Be Synthesized de Novo or 
Recycled by Salvage Pathways 


Like pyrimidine nucleotides, purine nucleotides can be synthesized de novo 
or by a salvage pathway. When synthesized de novo, purine synthesis 
begins with simple starting materials such as amino acids and bicarbonate 
(Figure 25.5). Unlike the bases of pyrimidines, the purine bases are assem- 
bled already attached to the ribose ring. Alternatively, purine bases, released 
by the hydrolytic degradation of nucleic acids and nucleotides, can be sal- 
vaged and recycled. Purine salvage pathways are especially noted for the 
energy that they save and the remarkable effects of their absence (p. 761). 


The purine ring system is assembled on ribose phosphate 


De novo purine biosynthesis, like pyrimidine biosynthesis, requires PRPP 
but, for purines, PRPP provides the foundation on which the bases are con- 
structed step by step. The initial step is the displacement of pyrophosphate 
by ammonia, rather than by a preassembled base, to produce 5-phospho- 
ribosyl-1-amine, with the amine in the B configuration. 

Glutamine phosphoribosyl amidotransferase catalyzes this reaction, which is 
the committed step in purine regulation. This enzyme comprises two domains: 
the first is homologous to the phosphoribosyltransferases in purine salvage 
pathways (p. 752), whereas the second produces ammonia from glutamine by 
hydrolysis. However, this glutamine-hydrolysis domain is distinct from the 
domain that performs the same function in carbamoyl phosphate synthetase II. 
In glutamine phosphoribosyl amidotransferase, a cysteine residue located at the 
amino terminus facilitates glutamine hydrolysis. To prevent wasteful hydrolysis 
of either substrate, the amidotransferase assumes the active configuration only 
on binding of both PRPP and glutamine. As is the case with carbamoyl phos- 
phate synthetase II, the ammonia generated at the glutamine-hydrolysis active 
site passes through a channel to reach PRPP without being released into solution. 


The purine ring is assembled by successive steps of activation 
by phosphorylation followed by displacement 


na Nine additional steps are required to assemble the purine ring. 

Remarkably, the first six steps are analogous reactions. Most of these 
steps are catalyzed by enzymes with ATP-grasp domains that are homolo- 
gous to those in carbamoyl phosphate synthetase. Each step consists of the 
activation of a carbon-bound oxygen atom (typically a carbonyl oxygen atom) 
by phosphorylation, followed by the displacement of the phosphoryl group by 
ammonia or an amine group acting as a nucleophile (Nu). 
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De novo purine biosynthesis proceeds as shown in Figure 25.6. Table 25.2 
lists the enzymes that catalyze each step of the reaction. 


1. The carboxylate group of a glycine residue is activated by phosphoryla- 
tion and then coupled to the amino group of phosphoribosylamine. A new 
amide bond is formed, and the amino group of glycine is free to act as a 
nucleophile in the next step. 


2. N!°-formyltetrahydrofolate donates a formyl moiety to this amino group 
to form formylglycinamide ribonucleotide. 


TABLE 25.2 The enzymes of de novo purine synthesis 


Step Enzyme 


Glycinamide ribonucleotide (GAR) synthetase 

GAR transformylase 

Formylglycinamidine synthase 

Aminoimidazole ribonucleotide synthetase 
Carboxyaminoimidazole ribonucleotide synthetase 
Succinylaminoimidazole carboxamide ribonucleotide synthetase 
Adenylosuccinate lyase 

Aminoimidazole carboxamide ribonucleotide transformylase 


CO MmON DN FW NF 


Inosine monophosphate cyclohydrolyase 


2-0,POH,C O fone) 0 0 


HO OH 


Glu + NH; 


Gln + H20 


2-03POH2C 


HO OH 
5-Phosphoribosyl-1-amine 


749 


e) I 
I Cc 
ATP ADP - ADP HS, A 
+ + NH3t i H ac H + 
Gly P; H | THF~ -H THF 7 | ATP P; H | 
N CHa Me N CH uw 
P-ribose-NH> 2 Poiboses” ~~ P-ribose~ 2 2 pape { 
O O H;N HO ~H 
Phosphoribosyl- Glycinamide Formylglycinamide ae Formylglycinamidine 
amine ribonucleotide ribonucleotide Glu Gln ribonucleotide 
ATP 
® ADP 
ATP re 
+ P; 
0 
Bn a3 ADP ie CN 
+ Cx. a 
f\. FV. A Ho So tea 
N s~ a a Ke 2 
~— P-ribose~ oF \y — Paihose” T c7 ® puibose® i. 
NH> HN o NH> 
ATP “=! 
ae Carboxyaminoimidazole \ 5-Aminoimidazole 
Asp ribonucleotide ribonucleotide 
ADP © 
+ -, 
P, 0oc H 
- Oo (e) 
MeN Ox! H” coo- ACN I Ha 
| “4 Pp \ Fumarate ‘ Pp THF~ HTH | " y 
NN G CH2 - Ne gee oe N ZC 
Pihoee \ e P-ribose ~C \ P-ibose~ > oF \ 
Pct @ [Nha Ty Ni 
NH, 4H \ NH ee, 
aie 5 
GO: = O 
5-Aminoimidazole- 5-Aminoimidazole- 5-Formaminoimidazole- 
4-(N-succinylcarboxamide) 4-carboxamide 4-carboxamide 
ribonucleotide ribonucleotide ribonucleotide 
HO 
FIGURE 25.6 De novo purine biosynthesis. (1) Glycine is coupled to the amino group of H. 
phosphoribosylamine. (2) N'°-Formyltetrahydrofolate (THF) transfers a formyl group to the amino me 0 
group of the glycine residue. (8) The inner amide group is phosphorylated and converted into an , C. ff 
amidine by the addition of ammonia derived from glutamine. (4) An intramolecular coupling es wee el af 
reaction forms the five-membered imidazole ring. (6) Bicarbonate adds first to the exocyclic \ NH 
amino group and then to a carbon atom of the imidazole ring. (6) The imidazole carboxylate is Ne 
phosphorylated, and the phosphate is displaced by the amino group of aspartate. (7) Fumarate H 
is released. (8) A second formyl group is donated from N1°-formyltetrahydrofolate (THF). 
(9) Cyclization completes the synthesis of inosinate, a purine nucleotide. Inosinate (IMP) 


3. The inner carbonyl group is activated by phosphorylation and then con- 
verted into an amidine by the addition of ammonia derived from glutamine. 


4. The product of this reaction, formylglycinamidine ribonucleotide, 
cyclizes to form the five-membered imidazole ring found in purines. 
Although this cyclization is likely to be favorable thermodynamically, a 
molecule of ATP is consumed to ensure irreversibility. The familiar pattern 
is repeated: a phosphoryl group from the ATP molecule activates the car- 
bonyl group and is displaced by the nitrogen atom attached to the ribose 
molecule. Cyclization is thus an intramolecular reaction in which the 
nucleophile and phosphate-activated carbon atom are present within the 
same molecule. In higher eukaryotes, the enzymes catalyzing steps 1, 2, 
and 4 (Table 25.2) are components of a single polypeptide chain. 
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5. Bicarbonate is activated by phosphorylation and then attacked by the 
exocyclic amino group. The product of the reaction in step 5 rearranges to 
transfer the carboxylate group to the imidazole ring. Interestingly, mammals 
do not require ATP for this step; bicarbonate apparently attaches directly to 
the exocyclic amino group and is then transferred to the imidazole ring. 


6. The imidazole carboxylate group is phosphorylated again and the phos- 
phate group is displaced by the amino group of aspartate. Once again, in 
higher eukaryotes, the enzymes catalyzing steps 5 and 6 (Table 25.2) share 
a single polypeptide chain. 


7. Fumarate, an intermediate in the citric acid cycle, is eliminated, leaving the 
nitrogen atom from aspartate joined to the imidazole ring. The use of aspartate 
as an amino-group donor and the concomitant release of fumarate are reminis- 
cent of the conversion of citrulline into arginine in the urea cycle, and these 
steps are catalyzed by homologous enzymes in the two pathways (Section 23.4). 


8. A formyl group from N!°-formyltetrahydrofolate is added to this nitrogen 
atom to form a final 5-formaminoimidazole-4-carboxamide ribonucleotide. 


9. 5-Formaminoimidazole-4-carboxamide ribonucleotide cyclizes with the 
loss of water to form inosinate. 


Many of the intermediates in the de novo purine biosynthesis pathway 
degrade rapidly in water. Their instability in water suggests that the prod- 
uct of one enzyme must be channeled directly to the next enzyme along the 
pathway. Recent evidence shows that the enzymes do indeed form com- 
plexes when purine synthesis is required (p. 752). 


AMP and GMP are formed from IMP 


A few steps convert inosinate into either AMP or GMP (Figure 25.7). Adenylate 
is synthesized from inosinate by the substitution of an amino group for the 
carbonyl oxygen atom at C-6. Again, the addition of aspartate followed by the 
elimination of fumarate contributes the amino group. GTP, rather than ATP, is 
the phosphoryl-group donor in the synthesis of the adenylosuccinate interme- 
diate from inosinate and aspartate. In accord with the use of GTP, the enzyme 
that promotes this conversion, adenylosuccinate synthetase, is structurally related 
to the G-protein family and does not contain an ATP-grasp domain. 
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FIGURE 25.7 Generating AMP and GMP. 
Inosinate is the precursor of AMP and 
GMP. AMP is formed by the addition of 
aspartate followed by the release of 
fumarate. GMP is generated by the addition 
of water, dehydrogenation by NAD*, and 
the replacement of the carbonyl oxygen 
atom by—NH, derived by the hydrolysis 

of glutamine. 
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(A) 


FIGURE 25.8 Formation of purinosomes. 
A gene construct encoding a fusion protein 
consisting of formylglycinamidine synthase 
and GFP was transfected into and 
expressed in Hela cells, a human cell line. 
(A) In the presence of purines (the absence 
of purine synthesis), the GFP was seen as 
a diffuse stain throughout the cytoplasm. 
(B) When the cells were shifted to a purine- 
free medium, purinosomes formed, seen as 
cytoplasmic granules, and purine synthesis 
occurred. White scale bars, 10 zm 

[S. An, R. Kumar, E. D. Sheets, and S. J. 
Benkovic. Science 320:103-106. Figure 2, 

C and D, 2008.] 
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Guanylate is synthesized by the oxidation of inosinate to xanthylate 
(XMP), followed by the incorporation of an amino group at C-2. NAD* is 
the hydrogen acceptor in the oxidation of inosinate. Xanthylate is activated 
by the transfer of an AMP group (rather than a phosphoryl group) from 
ATP to the oxygen atom in the newly formed carbonyl group. Ammonia, 
generated by the hydrolysis of glutamine, then displaces the AMP group to 
form guanylate, in a reaction catalyzed by GMP synthetase. Note that the 
synthesis of adenylate requires GTP, whereas the synthesis of guanylate 
requires ATP. This reciprocal use of nucleotides by the pathways creates an 
important regulatory opportunity (Section 25.4). 


Enzymes of the purine synthesis pathway associate with 
one another in vivo 


Biochemists believe that the enzymes of many metabolic pathways, such as 
glycolysis and the citric acid cycle, are physically associated with one 
another. Such associations would increase the efficiency of pathways by 
facilitating the movement of the product of one enzyme to the active site of 
the next enzyme in the pathway. The evidence for such associations comes 
primarily from experiments in which one component of a pathway, carefully 
isolated from the cell, is found to be bound to other components of the path- 
way. However, these observations raise the question, do enzymes associate 
with one another in vivo or do they spuriously associate during the isolation 
procedure? Recent in vivo evidence shows that the enzymes of the purine 
synthesis pathway associate with one another when purine synthesis is 
required. Various enzymes of the pathway were fused with the green fluores- 
cent protein (Figure 2.65) and transfected into cells. When cells were grown 
in the presence of purine, the GFP was spread diffusely throughout the 
cytoplasm (Figure 25.8A). When the cells were switched to growth media 
without purines, purine synthesis began and the enzymes became associated 
with one another, forming complexes dubbed purinosomes (Figure 25.8B). 
The experiments were repeated with other enzymes of the purine synthesis 
pathway bearing the GFP, and the results were the same: purine synthesis 
occurs when the enzymes form the purinosomes. What actually causes com- 
plex formation? While the results are not yet established, it appears that 
several G-protein coupled receptors, including those responding to epi- 
nephrine as well as ATP and ADP (purinergic receptors), instigate complex 
formation, while human creatine kinase IT (hCK2), responding to the pres- 
ence of purines, causes disassembly of the purinosome. 


Salvage pathways economize intracellular energy expenditure 


As we have seen, the de novo synthesis of purines requires a substantial 
investment of ATP. Purine salvage pathways provide a more economical 
means of generating purines. Free purine bases, derived from the turnover of 
nucleotides or from the diet, can be attached to PRPP to form purine nucleo- 
side monophosphates, in a reaction analogous to the formation of orotidylate. 
Two salvage enzymes with different specificities recover purine bases. 
Adenine phosphoribosyltransferase catalyzes the formation of adenylate (A MP): 


Adenine + PRPP ——> adenylate + PP; 


whereas hypoxanthine-guanine phosphoribosyltransferase (HGPRT) cata- 
lyzes the formation of guanylate (GMP) as well as inosinate (inosine mono- 
phosphate, IMP), a precursor of guanylate and adenylate. 


Guanine + PRPP —> guanylate + PP; 
Hypoxanthine + PRPP —— inosinate + PP; 
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25.3 Deoxyribonucleotides Are Synthesized by the 
Reduction of Ribonucleotides Through a 
Radical Mechanism 


We turn now to the synthesis of deoxyribonucleotides. These precursors of ADP GDP CDP UDP 


DNA are formed by the reduction of ribonucleotides; specifically, the Products of 
2'-hydroxyl group on the ribose moiety is replaced by a hydrogen atom. The selina conte pedis ae. 
substrates are ribonucleoside diphosphates, and the ultimate reductant is } | 


dADP dGDP dCDP dUDP 


Lid 


dATP dGTP dCTP TIP 


NADPH. The enzyme ribonucleotide reductase is responsible for the reduc- 
tion reaction for all four ribonucleotides. The ribonucleotide reductases of 
different organisms are a remarkably diverse set of enzymes. Yet detailed 
studies have revealed that they have a common reaction mechanism, and 
their three-dimensional structural features indicate that these enzymes are 
homologous. We will focus on the best understood of these enzymes, that of 
E. coli living aerobically. 


Further processing 
yields dNTP 


Mechanism: A tyrosyl radical 
is critical to the action of 
ribonucleotide reductase 


R1 dimer | Cys 
The ribonucleotide reductase of E. coli 
consists of two subunits: R1 (an 
87-kDa dimer) and R2 (a 43-kDa 
dimer). The R1 subunit contains the 
active site as well as two allosteric con- 
trol sites (Section 25.4). This subunit 
includes three conserved cysteine resi- 
dues and a glutamate residue, all four 
of which participate in the reduction of 
ribose to deoxyribose (Figure 25.9). 
The R2 subunit’s role in catalysis is to 
generate a free radical in each of its two 
chains. Each R2 chain contains a tyro- 
syl radical with an unpaired electron 
delocalized onto its aromatic ring, gen- 
erated by a nearby iron center consist- 
ing of two ferric (Fe?*) ions bridged by 
an oxide (O*-) ion (Figure 25.10). This 


R2 dimer 


Tyrosyl-radical site 


FIGURE 25.9 Ribonucleotide reductase. Ribonucleotide reductase reduces ribonucleotides 
to deoxyribonucleotides in its active site, which contains three key cysteine residues and one 


very unusual free radical is remarkably 
stable, with a half-life of 4 days at 4°C. 
By contrast, free tyrosine radicals in 
solution have microsecond lifetimes. 


glutamate residue. Each R2 subunit contains a tyrosyl radical that accepts an electron from 
one of the cysteine residues in the active site to initiate the reduction reaction. Two R1 
subunits come together to form a dimer as do two R2 subunits. 


In the synthesis of a deoxyribonucleotide, the OH bonded to C-2’ of the 
ribose ring is replaced by H, with retention of the configuration at the C-2' 


carbon atom (Figure 25.11). 


9 
9 k FIGURE 25.10 Ribonucleotide 
BZ reductase R2 subunit. The R2 
= dl . subunit contains a stable free radical 
Tyrosine on a tyrosine residue. This radical is 
(radical site) ; 
generated by the reaction of oxygen 
(not shown) at a nearby site containing 
two iron atoms. Two R2 subunits 
come together to form a dimer. [Drawn 
from 1RIB.pdb.] 
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FIGURE 25.11 Ribonucleotide reductase mechanism. (1) An electron is transferred from a 
cysteine residue on R1 to a tyrosine radical on R2, generating a highly reactive cysteine thiy! 
radical. (2) This radical abstracts a hydrogen atom from C-3' of the ribose unit. (8) The radical 
at C-3' releases OH~ from the C-2' carbon atom. Combined with a proton from a second 
cysteine residue, the OH is eliminated as water. (4) A hydride ion is transferred from a third 
cysteine residue with the concomitant formation of a disulfide bond. (5) The C-3' radical 
recaptures the originally abstracted hydrogen atom. (6) An electron is transferred from R2 to 
reduce the thiyl radical, which also accepts a proton. The deoxyribonucleotide is free to leave 
R1. The disulfide formed in the active site must be reduced to begin another cycle. 


1. The reaction begins with the transfer of an electron from a cysteine resi- 
due on R1 to the tyrosyl radical on R2. The loss of an electron generates a 
highly reactive cysteine thiyl radical within the active site of R1. 


2. This radical then abstracts a hydrogen atom from C-3’ of the ribose 
unit, generating a radical at that carbon atom. 


3. The radical at C-3’ promotes the release of the OH™ from the C-2’ car- 
bon atom. Protonated by a second cysteine residue, the departing OH™ 
leaves as a water molecule. 


4. A hydride ion (a proton with two electrons) is then transferred from a 
third cysteine residue to complete the reduction of the position, form a 
disulfide bond, and re-form a radical. 


5. This C-3’ radical recaptures the same hydrogen atom originally 
abstracted by the first cysteine residue, and the deoxyribonucleotide is free 
to leave the enzyme. 


6. R2 provides an electron to reduce the thiyl radical. The disulfide bond 
generated in the enzyme’s active site must then be reduced to regenerate the 
active enzyme. 


The electrons for this reduction come from NADPH, but not directly. 
One carrier of reducing power linking NADPH with the reductase is thio- 
vedoxin, a 12-kDa protein with two exposed cysteine residues near each 
other. These sulfhydryls are oxidized to a disulfide in the reaction catalyzed 
by ribonucleotide reductase itself. In turn, reduced thioredoxin is regener- 
ated by electron flow from NADPH. This reaction is catalyzed by thiore- 
doxin reductase, a flavoprotein. Electrons flow from NADPH to bound 
FAD of the reductase, to the disulfide of oxidized thioredoxin, and then to 
ribonucleotide reductase and finally to the ribose unit. 
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Ribose 
NADP* FADH2 «(| 
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Thioredoxin reductase Thioredoxin Ribonucleotide 
(T) reductase (RR) 


Stable radicals other than tyrosyl radical are employed by other 
ribonucleotide reductases 


A Ribonucleotide reductases that do not contain tyrosyl radicals have 
been characterized in other organisms. Instead, these enzymes con- 
tain other stable radicals that are generated by other processes. For example, 
in one class of reductases, the coenzyme adenosylcobalamin (vitamin B,,) is 
the radical source (Section 22.3). Despite differences in the stable radical 
employed, the active sites of these enzymes are similar to that of the E. coli 
ribonucleotide reductase, and they appear to act by the same mechanism, 
based on the exceptional reactivity of cysteine radicals. Thus, these enzymes 
have a common ancestor but evolved a range of mechanisms for generating 
stable radical species that function well under different growth conditions. 
The primordial enzymes appear to have been inactivated by oxygen, whereas 
enzymes such as the E. coli enzyme make use of oxygen to generate the initial 
tyrosyl radical. Note that the reduction of ribonucleotides to deoxyribonu- 
cleotides is a difficult reaction chemically, likely to require a sophisticated 
catalyst. The existence of a common protein enzyme framework for this 
process strongly suggests that proteins joined the RNA world before the 
evolution of DNA asa stable storage form for genetic information. 


Thymidylate is formed by the methylation of deoxyuridylate 


Uracil, produced by the pyrimidine synthesis pathway, is not a component 
of DNA. Rather, DNA contains thymine, a methylated analog of uracil. 
Another step is required to generate thymidylate from uracil. Thymidylate 
synthase catalyzes this finishing touch: deoxyuridylate (dUMP) is methy]l- 
ated to thymidylate (TMP). Recall that thymidylate synthase also functions 
in the thymine salvage pathway (p. 748). As will be described in Chapter 28, 
the methylation of this nucleotide marks sites of DNA damage for repair 
and, hence, helps preserve the integrity of the genetic information stored in 
DNA. The methyl! donor in this reaction is N°, N!°-methylenetetrahydro- 
folate rather than S-adenosylmethionine (Section 24.2). 

The methyl group becomes attached to the C-5 atom within the aro- 
matic ring of (UMP, but this carbon atom is not a good nucleophile and 
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FIGURE 25.12 Thymidylate synthesis. 
Thymidylate synthase catalyzes the addition 
of a methyl group (derived from N®, N1°- 
methylenetetrahydrofolate) to (UMP to 
form TMP. The addition of a thiolate from 
the enzyme activates dUMP. Opening the 
five-membered ring of the THF derivative 
prepares the methylene group for 
nucleophilic attack by the activated dUMP. 
The reaction is completed by the transfer of 
a hydride ion to form dihydrofolate. 
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cannot itself attack the appropriate group on the methyl! donor. Thymidylate 
synthase promotes methylation by adding a thiolate from a cysteine side 
chain to this ring to generate a nucleophilic species that can attack the 
methylene group of N°, N!°-methylenetetrahydrofolate (Figure 25.12). 
This methylene group, in turn, is activated by distortions imposed by the 
enzyme that favor opening the five-membered ring. The activated (UMP’s 
attack on the methylene group forms the new carbon-carbon bond. The 
intermediate formed is then converted into product: a hydride ion is trans- 
ferred from the tetrahydrofolate ring to transform the methylene group into 
a methyl group, and a proton is abstracted from the carbon atom bearing the 
methyl group to eliminate the cysteine and regenerate the aromatic ring. 
The tetrahydrofolate derivative loses both its methylene group and a 
hydride ion and, hence, is oxidized to dihydrofolate. For the synthesis of 
more thymidylate, tetrahydrofolate must be regenerated. 


Dihydrofolate reductase catalyzes the regeneration of 
tetrahydrofolate, a one-carbon carrier 


Tetrahydrofolate is regenerated from the dihydrofolate that is produced in 
the synthesis of thymidylate. This regeneration is accomplished by dihydro- 
folate reductase with the use of NADPH as the reductant. 
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A hydride ion is directly transferred from the nicotinamide ring of NADPH 
to the pteridine ring of dihydrofolate. The bound dihydrofolate and 
NADPH are held in close proximity to facilitate the hydride transfer. 


Several valuable anticancer drugs block the synthesis of thymidylate 757 


e Rapidly dividing cells require an abundant supply of thymidylate for 
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the synthesis of DNA. The vulnerability of these cells to the inhibi- 


tion of TMP synthesis has been exploited in 
the treatment of cancer. Thymidylate syn- 
thase and dihydrofolate reductase are choice 
targets of chemotherapy (Figure 25.13). 

Fluorouracil, an anticancer drug, is con- 
verted in vivo into fluorodeoxyuridylate 
(F-dUMP). This analog of (UMP irrevers- 
ibly inhibits thymidylate synthase after act- 
ing as a normal substrate through part of the 
catalytic cycle. Recall that the formation of 
TMP requires the removal of a proton (H*) 
from C-5 of the bound nucleotide 
(Figure 25.12). However, the enzyme cannot 
abstract F* from F-dUMP, and so catalysis 
is blocked at the stage of the covalent com- 
plex formed by F-dUMP, methylenetetrahy- 
drofolate, and the sulfhydryl group of the 
enzyme (Figure 25.14). We see here an 
example of suicide inhibition, in which an 
enzyme converts a substrate into a reactive 
inhibitor that halts the enzyme’s catalytic 
activity (Section 8.5). 


The synthesis of TMP can also be blocked by inhibiting the regenera- 
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FIGURE 25.13 Anticancer drug 


tion of tetrahydrofolate. Analogs of dihydrofolate, such as aminopterin and targets. Thymidylate synthase and 
methotrexate (amethopterin), are potent competitive inhibitors (K; < 1 nM) dihydrofolate reductase are choice targets 


of dihydrofolate reductase. 
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in cancer chemotherapy because the 
generation of large quantities of precursors 
for DNA synthesis is required for rapidly 
dividing cancer cells. 


Methotrexate is a valuable drug in the treatment of many rapidly growing 
tumors, such as those in acute leukemia and choriocarcinoma, a cancer 
derived from placental cells. However, methotrexate kills rapidly replicating 
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cells whether they are malignant or not. Stem cells in bone marrow, epithelial 
cells of the intestinal tract, and hair follicles are vulnerable to the action of 
this folate antagonist, accounting for its toxic side effects, which include 
weakening of the immune system, nausea, and hair loss. 

Folate analogs such as trimethoprim have potent antibacterial and antipro- 
tozoal activity. Trimethoprim binds 105-fold less tightly to mammalian dihy- 
drofolate reductase than it does to reductases of susceptible microorganisms. 
Trimethoprim Small differences in the active-site clefts of these enzymes account for the 
highly selective antimicrobial action. The combination of trimethoprim and 
sulfamethoxazole (an inhibitor of folate synthesis) is widely used to treat 
infections such as bronchitis, traveler’s diarrhea, and urinary tract infections. 


25.4 Key Steps in Nucleotide Biosynthesis Are Regulated 
by Feedback Inhibition 


Nucleotide biosynthesis is regulated by feedback inhibition in a manner 
similar to the regulation of amino acid biosynthesis (Section 24.3). These 
regulatory pathways ensure that the various nucleotides are produced in the 
required quantities. 


Pyrimidine biosynthesis is regulated by aspartate 
transcarbamoylase 


Aspartate transcarbamoylase, one of the key enzymes for the regulation of 
pyrimidine biosynthesis in bacteria, was described in detail in Chapter 10. 
Recall that ATCase is inhibited by CTP, the final product of pyrimidine bio- 
synthesis, and stimulated by ATP. 
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Carbamoyl phosphate synthetase II is also a site of feedback inhibition in 
both prokaryotes and eukaryotes. 


The synthesis of purine nucleotides is controlled by 
feedback inhibition at several sites 


The regulatory scheme for purine nucleotides is more complex than that for 
pyrimidine nucleotides (Figure 25.15). 


1. The committed step in purine nucleotide biosynthesis is the conversion 


FIGURE 25.15 Control of purine of PRPP into phosphoribosylamine by glutamine phosphoribosyl amidotrans- 


biosynthesis. Feedback inhibition controls ferase. This important enzyme is feedback-inhibited by many purine ribo- 
both the overall rate of purine biosynthesis nucleotides. It is noteworthy that AMP and GMP, the final products of the 
and the balance between AMP and GMP pathway, synergistically inhibit the amidotransferase. 
production. 
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2. Inosinate is the branch point in the synthesis of AMP and GMP. The 
reactions leading away from inosinate are sites of feedback inhibition. AMP 
inhibits the conversion of inosinate into adenylosuccinate, its immediate 
precursor. Similarly, GMP inhibits the conversion of inosinate into xan- 
thylate, its immediate precursor. 


3. As already noted, GTP is a substrate in the synthesis of AMP, whereas 
ATP is a substrate in the synthesis of GMP (Figure 25.7). This reciprocal 
substrate relation tends to balance the synthesis of adenine and guanine 
ribonucleotides. 


Note that the synthesis of PRPP by PRPP synthetase is highly regulated 
even though it is not the committed step in purine synthesis. Mutations 
have been identified in PRPP synthetase that result in a loss of allosteric 
response to nucleotides without any effect on catalytic activity of the 
enzyme. A consequence of this mutation is an overabundance of purine 
nucleotides that can result in gout, a pathological condition discussed later 
in the chapter. 


The synthesis of deoxyribonucleotides is controlled by the 
regulation of ribonucleotide reductase 


The reduction of ribonucleotides to deoxyribonucleotides is precisely con- 
trolled by allosteric interactions. Each polypeptide of the R1 subunit of the 
aerobic FE. coli ribonucleotide reductase contains two allosteric sites: one of 
them controls the overall activity of the enzyme, and the other regulates sub- 
strate specificity (Figure 25.16A). The overall catalytic activity of ribonucleotide 
reductase is diminished by the binding of dATP, which signals an abundance 
of deoxyribonucleotides (Figure 25.16B). The binding of ATP reverses this 
feedback inhibition. The binding of dATP or ATP to the substrate-specificity 
control site enhances the reduction of UDP and CDP, the pyrimidine nucleo- 
tides. The binding of thymidine triphosphate (TTP) promotes the reduction of 
GDP and inhibits the further reduction of pyrimidine ribonucleotides. The 
subsequent increase in the level of dGTP stimulates the reduction of ATP to 
dATP. This complex pattern of regulation supplies the appropriate balance of 
the four deoxyribonucleotides needed for the synthesis of DNA. 


e Ribonucleotide reductase is an attractive target for cancer therapy 
and a number of clinically approved anticancer drugs mimic sub- 
strates and regulators of the enzyme. The pyrimidine analog gemcitabine, 
once converted to the diphosphate form in vivo, becomes a suicide inhibitor 


FIGURE 25.16 Regulation of 
ribonucleotide reductase. (A) Each 
subunit in the R1 dimer contains two 
allosteric sites in addition to the active site. 
One site regulates the overall activity and 
the other site regulates substrate specificity. 
(B) The patterns of regulation with regard to 
different nucleoside diphosphates 
demonstrated by ribonucleotide reductase. 
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of the reductase used to treat advanced pancreatic cancer. The purine ana- 
logs clofarabine and cladribine, upon conversion to their triphosphate forms 
in vivo, are dATP analogs and act as allosteric inhibitors of the enzyme. 
Clofarabine is used to treat pediatric acute myeloid leukemia while cladri- 
bine is effective against some forms of chronic lymphoid leukemia. 


25.5 Disruptions in Nucleotide Metabolism Can 
Cause Pathological Conditions 


Nucleotides are vital to a host of biochemical processes. It is not surprising, 
then, that disruption of nucleotide metabolism would have a variety of physi- 
ological effects. The nucleotides of a cell undergo continual turnover. 
Nucleotides are hydrolytically degraded to nucleosides by nucleotidases. The 
phosphorolytic cleavage of nucleosides to free bases and ribose 1-phosphate 
(or deoxyribose 1-phosphate) is catalyzed by nucleoside phosphorylases. Ribose 
1-phosphate is isomerized by phosphoribomutase to ribose 5-phosphate, a 
substrate in the synthesis of PRPP. Some of the bases are reused to form 
nucleotides by salvage pathways. Others are degraded to products that are 
excreted (Figure 25.17). A deficiency of an enzyme can disrupt these path- 
ways, leading to a pathological condition. 


The loss of adenosine deaminase activity results in severe 
combined immunodeficiency 


e The pathway for the degradation of AMP includes an extra step 
because adenosine is not a substrate for nucleoside phosphorylase. 
First, the phosphate is removed by a nucleotidase to yield the nucleoside 
adenosine (Figure 25.17). In the extra step, adenosine is deaminated by 
adenosine deaminase to form inosine. 

A deficiency in adenosine deaminase activity is associated with some forms 
of severe combined immunodeficiency (SCID), an immunological disorder. 
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FIGURE 25.17 Purine catabolism. Purine bases are converted first into N N ‘e) N N O 
xanthine and then into urate for excretion. Xanthine oxidase catalyzes two H H H H 
steps in this process. Urate Uric acid 


Persons with the disorder have acute recurring infections, often leading to 
death at an early age. SCID is characterized by a loss of T cells, which are cru- 
cial to the immune response (Section 34.5). Although the biochemical basis of 
the disorder is not clearly established, a lack of adenosine deaminase results in 
an increase of 50 to 100 times the normal level of dATP, which inhibits ribo- 
nucleotide reductase and, consequently, DNA synthesis. Moreover, adenosine 
itself is a powerful signal molecule with a role in a number of regulatory path- 
ways. Disruption in the levels of adenosine may also be deleterious. SCID is 
often called the “bubble boy disease” because its treatment may include com- 
plete isolation of the patient from the environment. Adenosine deaminase 
deficiency has been successfully treated by gene therapy. 


Gout is induced by high serum levels of urate 


e Inosine generated by adenosine deaminase is subsequently metabo- 
lized by nucleoside phosphorylase to hypoxanthine. Xanthine oxidase, 
a molybdenum- and iron-containing flavoprotein, oxidizes hypoxanthine to 
xanthine and then to uric acid. Molecular oxygen, the oxidant in both reac- 
tions, is reduced to H,O,, which is decomposed to H,O and O, by catalase. 
Uric acid loses a proton at physiological pH to form urate. In human beings, 
urate is the final product of purine degradation and is excreted in the urine. 

High serum levels of urate (hyperuricemia) induce the painful joint disease 
gout. In this disease, the sodium salt of urate crystallizes in the fluid and lining 
of the joints. The small joint at the base of the big toe is a common site for 
sodium urate buildup, although the salt accumulates at other joints also. 
Painful inflammation results when cells of the immune system engulf the 
sodium urate crystals. The kidneys, too, may be damaged by the deposition of 
urate crystals. Gout is acommon medical problem, affecting 1% of the popu- 
lation of Western countries. It is nine times as common in men as in women. 

Administration of allopurinol, an analog of hypoxanthine, is one treat- 
ment for gout. The mechanism of action of allopurinol is interesting: it acts 
first as a substrate and then as an inhibitor of xanthine oxidase. The oxidase 
hydroxylates allopurinol to alloxanthine (oxipurinol), which then remains 
tightly bound to the active site. The binding of alloxanthine keeps the 
molybdenum atom of xanthine oxidase in the +4 oxidation state instead of 
it returning to the +6 oxidation state as in a normal catalytic cycle. We see 
here another example of suicide inhibition. 

The synthesis of urate from hypoxanthine and xanthine decreases soon 
after the administration of allopurinol. The serum concentrations of hypo- 
xanthine and xanthine rise, and that of urate drops. 


aa The average serum level of urate in human beings is close to the solu- 

bility limit and is higher than levels found in other primates. What is 
the selective advantage of a urate level so high that it teeters on the brink of 
gout in many people? It turns out that urate has a markedly beneficial action. 
Urate is a highly effective scavenger of reactive oxygen species. Indeed, urate 
is about as effective as ascorbate (vitamin C) as an antioxidant. The increased 
level of urate in human beings may protect against reactive oxygen species 
that are implicated in a host of pathological conditions (Table 18.3). 


Lesch-Nyhan syndrome is a dramatic consequence of mutations 
in a salvage-pathway enzyme 


e Mutations in genes that encode nucleotide biosynthetic enzymes can 
reduce levels of needed nucleotides and can lead to an accumulation 
of intermediates. A nearly total absence of hypoxanthine- guanine phospho- 
ribosyltransferase (HGPRT) has unexpected and devastating consequences. 
The most striking expression of this inborn error of metabolism, called the 
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Lesch—Nyhan syndrome, is compulsive self-destructive behavior. At age 2 or 3, 
children with this disease begin to bite their fingers and lips and will chew 
them off if unrestrained. These children also behave aggressively toward 
others. Mental deficiency and spasticity are other characteristics of the 
Lesch—Nyhan syndrome. Elevated levels of urate in the serum lead to the 
formation of kidney stones early in life, followed by the symptoms of gout 
years later. The disease is inherited as a sex-linked recessive disorder. 

What is the connection between the absence of HGPRT activity and the 
behavioral characteristics of Lesch-Nyhan syndrome? The answer is not 
clear, but it is possible to speculate. The brain has limited capacity for de 
novo purine synthesis. Consequently, lack of HGPRT results in a deficiency 
of purine nucleotides. ATP and ADP, formed from inosinate, are especially 
important in the brain as signal molecules. These nucleotides bind to and 
activate G-protein coupled receptors that regulate the dopamine secreting 
neurons. Thus, the lack of HGPRT results in an imbalance of key neu- 
rotransmitters. Moreover, the guanosine nucleotides required for G-protein 
function may also be in short supply. The Lesch-Nyhan syndrome demon- 
strates that the salvage pathway for the synthesis of IMP and GMP is not of 
minor importance. Moreover, the Lesch-Nyhan syndrome reveals that 
abnormal behavior such as self-mutilation and extreme hostility can be caused 
by the absence of a single enzyme. Psychiatry will no doubt benefit from the 
unraveling of the molecular basis of such mental disorders. 


Folic acid deficiency promotes birth defects such as spina bifida 


e Spina bifida is one of a class of birth defects characterized by the 

incomplete or incorrect formation of the neural tube early in develop- 
ment. In the United States, the prevalence of neural-tube defects is approxi- 
mately 1 case per 1000 births. A variety of studies have demonstrated that 
the prevalence of neural-tube defects is reduced by as much as 70% when 
women take folic acid as a dietary supplement before and during the first 
trimester of pregnancy. One hypothesis is that more folate derivatives are 
needed for the synthesis of DNA precursors when cell division is frequent 
and substantial amounts of DNA must be synthesized. 


SUMMARY 


25.1 The Pyrimidine Ring Is Assembled de Novo or Recovered by 

Salvage Pathways 

The pyrimidine ring is assembled first and then linked to ribose 
phosphate to form a pyrimidine nucleotide. 5-Phosphoribosyl-1- 
pyrophosphate is the donor of the ribose phosphate moiety. The 
synthesis of the pyrimidine ring starts with the formation of carbamo- 
ylaspartate from carbamoyl phosphate and aspartate, a reaction cata- 
lyzed by aspartate transcarbamoylase. Dehydration, cyclization, and 
oxidation yield orotate, which reacts with PRPP to give orotidylate. 
Decarboxylation of this pyrimidine nucleotide yields UMP. CTP is 
then formed by the amination of UTP. 


25.2 Purine Bases Can Be Synthesized de Novo or Recycled by 

Salvage Pathways 

The purine ring is assembled from a variety of precursors: glutamine, 
glycine, aspartate, N!°-formyltetrahydrofolate, and CO,. The com- 
mitted step in the de novo synthesis of purine nucleotides is the forma- 
tion of 5-phosphoribosylamine from PRPP and glutamine. The purine 
ring is assembled on ribose phosphate, in contrast with the de novo 
synthesis of pyrimidine nucleotides. The addition of glycine, followed 


25.3 


25.4 


25.5 


by formylation, amination, and ring closure, yields 5-aminoimidazole 
ribonucleotide. This intermediate contains the completed five- 
membered ring of the purine skeleton. The addition of CO,, the nitro- 
gen atom of aspartate, and a formyl group, followed by ring closure, 
yields inosinate, a purine ribonucleotide. AMP and GMP are formed 
from IMP. Purine ribonucleotides can also be synthesized by a salvage 
pathway in which a preformed base reacts directly with PRPP. 


Deoxyribonucleotides Are Synthesized by the Reduction of 
Ribonucleotides Through a Radical Mechanism 

Deoxyribonucleotides, the precursors of DNA, are formed in E. coli by 
the reduction of ribonucleoside diphosphates. These conversions are cata- 
lyzed by ribonucleotide reductase. Electrons are transferred from NADPH 
to sulfhydryl groups at the active sites of this enzyme by thioredoxin. A 
tyrosyl free radical generated by an iron center in the reductase initiates a 
radical reaction on the sugar, leading to the exchange of H for OH at C-2’. 
TMP is formed by the methylation of (UMP. The donor of a methylene 
group anda hydride in this reaction is N°, N!°-methylenetetrahydrofolate, 
which is converted into dihydrofolate. Tetrahydrofolate is regenerated by 
the reduction of dihydrofolate by NADPH. Dihydrofolate reductase, 
which catalyzes this reaction, is inhibited by folate analogs such as ami- 
nopterin and methotrexate. These compounds and fluorouracil, an 
inhibitor of thymidylate synthase, are used as anticancer drugs. 


Key Steps in Nucleotide Biosynthesis Are Regulated by 

Feedback Inhibition 

Pyrimidine biosynthesis in FE. coli is regulated by the feedback inhibi- 
tion of aspartate transcarbamoylase, the enzyme that catalyzes the 
committed step. CTP inhibits and ATP stimulates this enzyme. The 
feedback inhibition of glutamine-PRPP amidotransferase by purine 
nucleotides is important in regulating their biosynthesis. Disruption 
of the regulation and activity of ribonucleotide reductase by drugs is 
an effective chemotherapy for some types of cancer. 


Disruptions in Nucleotide Metabolism Can Cause 

Pathological Conditions 

Severe combined immunodeficiency results from the absence of ade- 
nosine deaminase, an enzyme in the purine degradation pathway. 
Purines are degraded to urate in human beings. Gout, a disease that 
affects joints and leads to arthritis, is associated with an excessive 
accumulation of urate. The Lesch-Nyhan syndrome, a genetic disease 
characterized by self-mutilation, mental deficiency, and gout, is 
caused by the absence of hypoxanthine-guanine phosphoribosyltrans- 
ferase. This enzyme is essential for the synthesis of purine nucleotides 
by the salvage pathway. Neural-tube defects are more frequent when a 
pregnant woman is deficient in folate derivatives early in pregnancy, 
possibly because of the important role of these derivatives in the syn- 
thesis of DNA precursors. 


KEY TERMS 
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PROBLEMS 


1. From the beginning or extract and save and reuse. 
Differentiate between the de novo synthesis of nucleotides 
and salvage-pathway synthesis. 


2. Finding their roots 1. Identify the source of the atoms in 
the pyrimidine ring 


3. Finding their roots 2. Identify the source of the atoms in 
the purine ring. 


4. Multifaceted. List some of the biochemical roles played 
by nucleotides. 


5. An s instead of a t? Differentiate between a nucleoside 
and a nucleotide. 


6. Associate ’em. 


(a) Excessive urate 1. Spina bifida 
(b) Lack of adenosine 2. Precursor to both ATP 
deaminase and GTP 
(c) Lack of HGPRT 3. Purine 
(d) Carbamoyl! phosphate 4, Deoxynucleotide synthe- 
sis 


.UTP 
. Lesch-Nyhan disease 


(e) Inosinate 5 
6 
7. Immunodeficiency 
8 
9 
0 


(f£) Ribonucleotide reductase 


(g) Lack of folic acid . Pyrimidine 
(h) Glutamine phosphoribo- 9. Gout 
syl transferase 10. First step in pyrimidine 
(i) Single ring synthesis 
(j) Bicyclic ring 11. Committed step in 


(k) Precursor to CTP purine synthesis 


7. Safe passage. What is substrate channeling? How does it 
affect enzyme efficiency? 


8. Activated ribose phosphate. Write a balanced equation for 
the synthesis of PRPP from glucose through the oxidative 
branch of the pentose phosphate pathway. 


9. Making a pyrimidine. Write a balanced equation for the 
synthesis of orotate from glutamine, CO, and aspartate. 


10. Identifying the donor. What is the activated reactant in 
the biosynthesis of each of the following compounds? 

(a) Phosphoribosylamine (c) Orotidylate (from orotate) 
(b) Carbamoylaspartate (d) Phosphoribosylanthranilate 


11. Inhibiting purine biosynthesis. Amidotransferases are 
inhibited by the antibiotic azaserine (O-diazoacetyl-L- 
serine), which is an analog of glutamine. 


“ie 
n~ ao : 


0 H3N COoO- 


Azaserine 


Which intermediates in purine biosynthesis would accu- 
mulate in cells treated with azaserine? 


12. The price of methylation. Write a balanced equation for 
the synthesis of TMP from dUMP that is coupled to the 


conversion of serine into glycine. 


13. Sulfa action. Bacterial growth is inhibited by sulfanil- 
amide and related sulfa drugs, and there is a concomitant 
accumulation of 5-aminoimidazole-4-carboxamide ribonu- 
cleotide. This inhibition is reversed by the addition of 
p-aminobenzoate. 


{son 


Sulfanilamide 


Propose a mechanism for the inhibitory effect of sulfa- 
nilamide. 


14. HAT medium. Mutant cells unable to synthesize nucle- 
otides by salvage pathways are very useful tools in molecu- 
lar and cell biology. Suppose that cell A lacks thymidine 
kinase, the enzyme catalyzing the phosphorylation of thy- 
midine to thymidylate, and that cell B lacks hypoxanthine- 
guanine phosphoribosyl transferase. 


(a) Cell A and cell B do not proliferate in a HAT medium 
containing hypoxanthine, aminopterin or amethopterin 
(methotrexate), and thymine. However, cell C, formed by 
the fusion of cells A and B, grows in this medium. Why? 


(b) Suppose that you want to introduce foreign genes into 
cell A. Devise a simple means of distinguishing between cells 
that have taken up foreign DNA and those that have not. 


15. Vitamin requirement. What is the role of folate in 
pyrimidine synthesis and what is the consequence of a 
folate deficiency during development? 


16. Bringing equilibrium. What is the reciprocal substrate 
relation in the synthesis of ATP and GTP? 


17. Find the label. Suppose that cells are grown on amino 
acids that have all been labeled at the a carbons with !3C. 
Identify the atoms in cytosine and guanine that will be 


labeled with 'C. 


18. Need a map? Describe the path for the synthesis of 
TTP from UTP. 


19. Adjunct therapy. Allopurinol is sometimes given to 
patients with acute leukemia who are being treated with 
anticancer drugs. Why is allopurinol used? 


20. A hobbled enzyme. Both side-chain oxygen atoms of 
aspartate 27 at the active site of dihydrofolate reductase form 
hydrogen bonds with the pteridine ring of folates. The 


importance of this interaction was assessed by studying two 
mutants at this position, Asn 27 and Ser 27. The dissociation 
constant of methotrexate was 0.07 nM for the wild type, 1.9 
nM for the Asn 27 mutant, and 210 nM for the Ser 27 
mutant, at 25°C. Calculate the standard free energy of the 
binding of methotrexate by these three proteins. What is the 
decrease in binding energy resulting from each mutation? 


21. Needed supplies. Why are cancer cells especially sensi- 
tive to inhibitors of TMP synthesis? 


22. Alternative routes. Orotic aciduria results when one of the 
enzyme activities of UMP synthetase is absent. This syn- 
drome is characterized by large amounts of orotic acid in the 
blood and urine, megaloblastic anemia (characterized by large, 
immature, and dysfunctional red blood cells) and retarded 
growth. Suggest a possible treatment for this condition. 


23. Correcting deficiencies. Suppose that a person is found 
who is deficient in an enzyme required for IMP synthesis. 
How might this person be treated? 


24. Labeled nitrogen. Purine biosynthesis is allowed to 
take place in the presence of [!°N]aspartate, and the newly 
synthesized GTP and ATP are isolated. What positions 
are labeled in the two nucleotides? 


25. On the trail of carbons. Tissue culture cells were incu- 
bated with glutamine labeled with !5N in the amide group. 
Subsequently, IMP was isolated and found to contain some 
15N. Which atoms in IMP were labeled? 


26. Mechanism of action. What is the biochemical basis of 
allopurinol treatment for gout? 


27. Changed inhibitor. Xanthine oxidase treated with allo- 
purinol results in the formation of a new compound that is 
an extremely potent inhibitor of the enzyme. Propose a 
structure for this compound. 


28. Calculate the ATP footprint. How many molecules of 
ATP are required to synthesize one molecule of CTP from 
scratch? 


29. Blockages. What intermediate in purine synthesis will 
accumulate if a strain of bacteria is lacking each of the fol- 
lowing biochemicals? 


(a) Aspartate 
(b) Tetrahydrofolate 


(c) Glycine 
(d) Glutamine 


Mechanism Problems 


30. The same and not the same. Write out mechanisms for 
the conversion of phosphoribosylamine into glycinamide 
ribonucleotide and of xanthylate into guanylate. 


31. Closing the ring. Propose a mechanism for the conver- 
sion of 5-formamidoimidazole-4-carboxamide ribonucleo- 
tide into inosinate. 
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32. Different strokes. Human beings contain two different 
carbamoyl phosphate synthetase enzymes. One uses gluta- 
mine as a substrate, whereas the other uses ammonia. What 
are the functions of these two enzymes? 


33. A generous donor. What major biosynthetic reactions 
utilize PRPP? 


34. They’re everywhere! Nucleotides play a variety of roles 
in the cell. Give an example of a nucleotide that acts in each 
of the following roles or processes. 

e) Transfer of electrons 
f) DNA sequencing 

g) Chemotherapy 

h) Allosteric effector 


(a) Second messenger 

(b) Phosphoryl-group transfer 
(c) Activation of carbohydrates 
(d) Activation of acetyl groups 


oe es ee 


35. Pernicious anemia. Purine biosynthesis is impaired by 
vitamin B,, deficiency. Why? How might fatty acid and 
amino acid metabolism also be affected by a vitamin B,, 
deficiency? 


36. Folate deficiency. Suppose someone was suffering 
from a folate deficiency. What cells would you think 
might be most affected? Symptoms may include diarrhea 
and anemia. 


37. Hyperuricemia. Many patients with glucose 6-phos- 
phatase deficiency have high serum levels of urate. 
Hyperuricemia can be induced in normal people by the 
ingestion of alcohol or by strenuous exercise. Propose a 
common mechanism that accounts for these findings. 


38. Labeled carbon. Succinate uniformly labeled with '*C is 
added to cells actively engaged in pyrimidine biosynthesis. 
Propose a mechanism by which carbon atoms from succi- 
nate could be incorporated into a pyrimidine. At what posi- 
tions is the pyrimidine labeled? 


39. Something funny going on here. Cells were incubated 
with glucose labeled with !4C in carbon 2, shown in red in 
the structure below. Later, uracil was isolated and found 
to contain '*C in carbons 4 and 6. Account for this label- 
ing pattern. 


H 2 
HC—OH 
HO—CH 


HC——OH 


HC——OH 


CH,0H 


40. Side effects. Azathioprine is an immunosuppressant 
drug used in kidney transplants. In vivo, azathioprine is 
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metabolized to the hypoxanthine analog 6-mercaptopu- 
rine, which is then converted into 6-mercaptopurine 
ribose monophosphate, the active form of the drug. 
6-Mercaptopurine ribose monophosphate also inhibits de 
novo purine synthesis, reducing uric acid levels in the 
blood and urine. However, when administered to Lesch— 
Nyhan patients, there was no effect on uric acid levels. 
Explain why this is the case. 


41. Exercising muscle. Some interesting reactions take place 
in muscle tissue to facilitate the generation of ATP for con- 
traction. In muscle contraction, ATP is converted into ADP. 


Adenylate kinase converts two molecules of ADP into a 
molecule of ATP and AMP. 


Adenylosuccinate 


Adenylosuccinate % 
lyase 
Fumarate 
GDP + P; 
Adenylate Aarne i 
kinase ‘Adenivl fi 
losuccinate 
2 ADP ATP + AMP NUCLEOTIDE synthetase 
CYCLE 
Aspartate + GTP 
H20 
AMP deaminase 
NH3 IMP 


(a) Why is this reaction beneficial to contracting muscle? 


(b) Why is the equilibrium for the adenylate kinase approx- 
imately equal to 1? 


Muscle can metabolize AMP by using the purine nucleo- 
tide cycle. The initial step in this cycle, catalyzed by AMP 
deaminase, is the conversion of AMP into IMP. 


(c) Why might the deamination of AMP facilitate ATP 


formation in muscle? 


(d) How does the purine nucleotide cycle assist the aerobic 
generation of ATP? 


42. A common step. What three reactions transfer an amino 
group from aspartate to yield the aminated product and 
fumarate? 


43. Your pet duck. You suspect that your pet duck has gout. 
Why should you think twice before administering a dose of 
allopurinol-laced bread? 


Data Interpretation Problem 


44, Not all absences are equal. In order to better understand 
the nature of the defects in HGPRT in patients suffering 
from Lesch—Nyhan disease, cell lines from two Lesch— 
Nyhan patients (LND 1 and LND 2) as well as a normal 
individual were established. HGPRT was then purified 
from each of the cell lines using the techniques examined in 
Chapter 3. Using the purified enzyme, the basic kinetics of 
the three enzymes were compared, as shown in Figure A. In 
each case, the same amount of purified enzyme was used in 
the analysis. 


(a) Describe the results shown in the figure. 


(A) 


° 
fo) 
l 


LND 1 
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ul 
oO 


25 50 75 
Hypoxanthine (44M) 


(b) Suggest some possible explanations for the appearance 
of Lesch—Nyhan disease even in the presence of apparently 
normal enzyme. 


(c) The stability of the HGPRT was tested to determine 
if the LND 1 enzyme was more labile. The same amounts 
of the normal and LND 1 enzyme were incubated at 37 °C 
in the absence of any substrates or products. At various 
times, a portion of the enzyme was removed and assayed for 
enzyme activity. The results are shown in Figure B. Explain 
these results. 
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The Biosynthesis of Membrane 
Lipids and Steroids 


his chapter examines the biosynthesis of three important compo- 

nents of biological membranes—phospholipids, sphingolipids, and 
cholesterol—which were introduced in Chapter 12. Triacylglycerols also 
are considered here because the pathway for their synthesis overlaps that 
of phospholipids. Cholesterol is of interest both as a membrane compo- 
nent and as a precursor of potent signal molecules, such as the steroid 
hormones progesterone, testosterone, estrogen, and cortisol. 

The transport and uptake of cholesterol vividly illustrate a recurring 
mechanism for the entry of metabolites and signal molecules into cells. 
Cholesterol is transported in blood by the low-density lipoprotein (LDL) 
and taken up into cells by the LDL receptor on the cell surface. The LDL 
receptor is absent in people with familial hypercholesterolemia, a genetic 
disease. People lacking the receptor have markedly elevated cholesterol lev- 
els in the blood and cholesterol deposits on blood vessels, and they are prone 
to childhood heart attacks. Indeed, cholesterol is implicated in the develop- 
ment of atherosclerosis in people who do not have genetic defects. Thus, the 
regulation of cholesterol synthesis and transport can be a source of espe- 
cially clear insight into the role that our understanding of biochemistry 
plays in medicine. 


CHAPTER 


Fats such as the triacylglycerol molecule (below) are widely 
used to store excess energy for later use and to fulfill other 
purposes, illustrated by the insulating blubber of whales. 
The natural tendency of fats to exist in nearly water free 
forms makes these molecules well suited to these 

roles. [(Left) Frangois Gohier/Science Source.] 
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CHAPTER 26 _ The Biosynthesis of 
Membrane Lipids and Steroids 


HO—C=H 0 
HC 
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Glycerol 3-phosphate 


FIGURE 26.1 PATHWAY INTEGRATION: 
Sources of intermediates in the 
synthesis of triacylglycerols and 
phospholipids. Phosphatidate, synthesized 
from dihydroxyacetone phosphate (DHAP) 
produced in gluconeogenesis and fatty 
acids, can be further processed to produce 
triacylglycerol or phospholipids. 
Phospholipids and other membrane lipids 
are continuously produced in all cells. 
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26.1 Phosphatidate ls a Common Intermediate in the 
Synthesis of Phospholipids and Triacylglycerols 


Lipid synthesis requires the coordinated action of gluconeogenesis and fatty 
acid metabolism, as illustrated in Figure 26.1. The common step in the syn- 
thesis of both phospholipids for membranes and triacylglycerols for energy 
storage is the formation of phosphatidate (diacylglycerol 3-phosphate). In 
mammalian cells, phosphatidate is synthesized in the endoplasmic reticulum 
and the outer mitochondrial membrane. The pathway begins with glycerol 
3-phosphate, which is formed primarily by the reduction of dihydroxyace- 
tone phosphate (DHAP) synthesized by the gluconeogenic pathway, and to 
a lesser extent by the phosphorylation of glycerol. The addition of two fatty 
acids to glycerol-3-phosphate yields phosphatidate. First, acyl coenzyme A 
contributes a fatty acid chain to form lysophosphatidate and, then, a second 
acyl CoA contributes a fatty acid chain to yield phosphatidate. 


Usually saturated 


R 0 R O 
R,CO—CoA CoA | Se cH, R,CO—CoA CoA Ne cu, 
A : i I : 
OHO—C=H 0 OCH 9 
pf a a 
H2C R— H2C RR 
oO 6 Sos 
Oo ie) 
Usually unsaturated 
Lysophosphatidate Phosphatidate 


These acylations are catalyzed by glycerol phosphate acyltransferase. In most 
phosphatidates, the fatty acid chain attached to the C-1 atom is saturated, 
whereas the one attached to the C-2 atom is unsaturated. Phosphatidate can 
also be synthesized from diacylglycerol (DAG), in what is essentially a sal- 
vage pathway, by the action of diacylglycerol kinase: 


Diacylglycerol + ATP — > phosphatidate + ADP 


The phospholipid and triacylglycerol pathways diverge at phosphati- 
date. In the synthesis of triacylglycerols, a key enzyme in the regulation of 
lipid synthesis, phosphatidic acid phosphatase, hydrolyzes phosphatidate to 
give a diacylglycerol. This intermediate is acylated to a triacylglycerol 
through the addition of a third fatty acid chain in a reaction that is catalyzed 
by diglyceride acyltransferase. Both enzymes are associated in a triacylglycerol 
synthetase complex that is bound to the endoplasmic reticulum membrane. 


@ 
Glycerol 
Lactate (===> DHAP ==> 3-phosphate 
(ADIPOSE TISSUE OR DIET > / 
Glycerol Pi 
Triacylglycerol ——_ @) 
Free } 2 
S fatty acids J *For phospholipid synthesis, either 


phosphatidate or the alcohol must 
be activated by reaction with an 
NTP (4). 


Active pathways: 

1. Gluconeogenesis, Chapter 16 

2. Triacylglycerol breakdown, Chapter 22 
3. Triacylglycerol synthesis, Chapter 26 
4. Phospholipid synthesis, Chapter 26 
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The liver is the primary site of triacylglycerol synthesis. From the liver, the 
triacylglycerols are transported to the muscles for energy conversion or to 
the adipose cells for storage. 


The synthesis of phospholipids requires an activated intermediate 


Membrane-lipid synthesis continues in the endoplasmic reticulum and in 
the Golgi apparatus. Phospholipid synthesis requires the combination of a 
diacylglycerol with an alcohol. As in most anabolic reactions, one of the 
components must be activated. In this case, either the diacylglycerol or the 
alcohol may be activated, depending on the source of the reactants. 

The synthesis of some phospholipids begins with the reaction of phos- 
phatidate with cytidine triphosphate (CTP) to form the activated diacylglycerol, 
cytidine diphosphodiacylglycerol (CDP-diacylglycerol). This reaction, like those 
of many biosyntheses, is driven forward by the hydrolysis of pyrophosphate. 
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Phosphatidate CDP-diacylglycerol 


The activated phosphatidyl unit then reacts with the hydroxyl group of an 
alcohol to form a phosphodiester linkage. If the alcohol is inositol, the prod- 
ucts are phosphatidylinositol and cytidine monophosphate (CMP). 
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Subsequent phosphorylations catalyzed by specific kinases lead to the syn- 
thesis of phosphatidylinositol 4, 5-bisphosphate, the precursor of two intracellu- 
lar messengers—diacylglycerol and inositol 1,4,5-trisphosphate (Section 14.2). 
If the alcohol is phosphatidylglycerol, the products are diphosphatidylglycerol 
(cardiolipin) and CMP. In eukaryotes, cardiolipin is synthesized in the mito- 
chondria and is located exclusively in inner mitochondrial membranes. 
Cardiolipin plays an important role in the organization of the protein compo- 
nents of oxidative phosphorylation. For example, it is required for the full 
activity of cytochrome c oxidase (Section 18.3). 
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The fatty acid components of phospholipids may vary, and thus cardio- 
lipin, as well as most other phospholipids, represents a class of molecules 
rather than a single species. As a result, a single mammalian cell may con- 
tain thousands of distinct phospholipids. Phosphatidylinositol is unusual in 
that it has a nearly fixed fatty acid composition. Stearic acid usually occu- 
pies the C-1 position and arachidonic acid the C-2 position. 


Some phospholipids are synthesized from an activated alcohol 


Phosphatidylethanolamine, the major phospholipid of the inner leaflet of 
cell membranes, is synthesized from the alcohol ethanolamine. To acti- 
vate the alcohol, ethanolamine is phosphorylated by ATP to form the 
precursor, phosphorylethanolamine. This precursor then reacts with CTP 
to form the activated alcohol, CDP-ethanolamine. The phosphoryletha- 
nolamine unit of CDP-ethanolamine is transferred to a diacylglycerol to 
form phosphatidylethanolamine. 


Phosphatidylcholine is an abundant phospholipid 


The most common phospholipid in mammals is phosphatidylcholine, com- 
prising approximately 50% of the membrane mass. Dietary choline is acti- 
vated in a series of reactions analogous to those in the activation of 
ethanolamine. CTP-phosphocholine cytidylyltransferase (CCT) catalyzes 
the formation of CDP-choline, the rate-limiting step in phosphatidylcho- 
line synthesis. CCT is an amphitropic enzyme, a class of enzymes whose 
regulator ligand is the membrane itself. A portion of the enzyme, normally 
associated with the membrane, detects a fall in phosphatidylcholine as an 
alteration in the physical properties of the membrane. When this occurs, 
another portion of the enzyme is inserted into the membrane, leading to 
enzyme activation. Indeed, the k,,,/K,, value (Section 8.4) increases three 
orders of magnitude upon activation, resulting in the restoration of phos- 
phatidylcholine levels. 


Earlier (Section 22.4) we examined how cancer cells increase lipogen- 

esis to meet the fatty acid needs for membrane synthesis. Evidence is 
accumulating that CCT is specifically activated in some cancers to generate 
the required phosphocholine. 


The importance of phosphatidylcholine is attested to by the fact that 
the liver possesses an enzyme, phosphatidylethanolamine methyltransferase, 
which synthesizes phosphatidylcholine from phosphatidylethanolamine 
when dietary choline is insufficient. The amino group of this phosphatidy]l- 
ethanolamine is methylated three times to form phosphatidylcholine. 
S-Adenosylmethionine is the methyl donor (Section 24.2). 
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Thus, phosphatidylcholine can be produced by two distinct pathways in 
mammals, ensuring that this phospholipid can be synthesized even if the 
components for one pathway are in limited supply. 


Excess choline is implicated in the development of heart disease 


e Choline is a popular dietary supplement that is believed by some 

to enhance liver and neuronal function. Although the effectiveness 
of choline supplements is not established, the dangers of excess choline 
consumption are becoming clear. Gut bacteria convert excess choline 
into trimethylamine (TMA), a gas that smells like rotten fish, and the 
liver converts the absorbed TMA into trimethylamine- N-oxide 
(TMAO). TMAO stimulates cholesterol uptake by macrophages, a 
process that can result in atherosclerosis (p. 787). Foods rich in phos- 
phatidylcholine, such as red meats and dairy products, may also result in 
TMAO production. 


Base-exchange reactions can generate phospholipids 


Phosphatidylserine makes up 10% of the phospholipids in mammals. 
This phospholipid is synthesized in a base-exchange reaction of serine with 
phosphatidylcholine or phosphatidylethanolamine. In the reaction, serine 
replaces choline or ethanolamine. 


Phosphatidylcholine + serine ——> choline + phosphatidylserine 


Phosphatidylethanolamine + serine —> 
ethanolamine + phosphatidylserine 


Phosphatidylserine is normally located in the inner leaflet of the plasma 
membrane bilayer but is moved to the outer leaflet in apoptosis 
(Section 18.6). There, it serves to attract phagocytes to consume the cell 
remnants after apoptosis is complete. Phosphatidylserine is translocated 
from one side of the membrane to the other by an ATP-binding cassette 
translocase (Section 13.2). 

Note that a cytidine nucleotide plays the same role in the synthesis 
of these phosphoglycerides as a uridine nucleotide does in the formation of 
glycogen (Section 21.4). In all of these biosyntheses, an activated intermedi- 
ate (UDP-glucose, CDP-diacylglycerol, or CDP-alcohol) is formed from a 
phosphorylated substrate (glucose 1-phosphate, phosphatidate, or a phos- 
phorylalcohol) and a nucleoside triphosphate (UTP or CTP). The activated 
intermediate then reacts with a hydroxyl group (the terminus of glycogen, 
an alcohol, or a diacylglycerol). 
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FIGURE 26.2 Synthesis of ceramide. 


Palmitoyl CoA and serine combine to 
initiate the synthesis of ceramide. 


Sphingolipids are synthesized from ceramide 


We now turn from glycerol-based phospholipids to another class of 
membrane lipid—the sphingolipids. These lipids are found in the plasma 
membranes of all eukaryotic cells, although the concentration is highest 
in the cells of the central nervous system. The backbone of a sphingolipid 
is sphingosine, rather than glycerol. Palmitoyl CoA and serine condense to 
form 3-ketosphinganine. The serine—palmitoy] transferase catalyzing this 
reaction is the rate-limiting step in the pathway and requires pyridoxal 
phosphate, revealing again the dominant role of this cofactor in transfor- 
mations that include amino acids. Ketosphinganine is then reduced to 
dihydrosphingosine before conversion into ceramide, a lipid consisting 
of a fatty acid chain attached to the amino group of a sphingosine back- 
bone (Figure 26.2). 

In all sphingolipids, the amino group of ceramide is acylated. The termi- 
nal hydroxyl group also is substituted (Figure 26.3). In sphingomyelin, a 
component of the myelin sheath covering many nerve fibers, the substitu- 
ent is phosphorylcholine, which comes from phosphatidylcholine. In a 
cerebroside, the substituent is glucose or galactose. UDP-glucose or UDP- 
galactose is the sugar donor. 


Gangliosides are carbohydrate-rich sphingolipids that 
contain acidic sugars 


Gangliosides are the most complex sphingolipids. In a ganglioside, an oligo- 
saccharide chain is linked to the terminal hydroxyl group of ceramide by a 
glucose residue (Figure 26.4). This oligosaccharide chain contains at least 
one acidic sugar, either N-acetylneuraminate or N-glycolylneuraminate. 
These acidic sugars are called sialic acids. Their nine-carbon backbones 
are synthesized from phosphoenolpyruvate (a three-carbon unit) and 
N-acetylmannosamine 6-phosphate (a six-carbon unit). 

Gangliosides are synthesized by the ordered, step-by-step addition of 
sugar residues to ceramide. The synthesis of these complex lipids requires 
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(CH2)12 UDP-glucose 
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FIGURE 26.3 Synthesis of sphingolipids. 
Ceramide is the starting point for the formation (CHa)i2 
of sphingomyelin and gangliosides. CHs 
Cerebroside GalNAc 
4 4 
: B B © Ceramide 
the activated sugars UDP-glucose, UDP-galactose, and UDP-N- 83 - 
acetylgalactosamine, as well as the CMP derivative of N-acetylneuraminate. = 
CMP-N-acetylneuraminate is synthesized from CTP and N- 
acetylneuraminate. The sugar composition of the resulting ganglioside is NAN 
determined by the specificity of the glycosyltransferases in the cell. Almost FIGURE 26.4 Ganglioside Guy. 
200 different gangliosides have been characterized (see Figure 26.4 for the This ganglioside consists of five 
composition of ganglioside G, ;,). monosaccharides linked to ceramide: one 
glucose (Glc) molecule, two galactose (Gal) 
Ganglioside-binding by the cholera toxin is the first step in the ele aloes She N= aceiielacigeainine 
(GalNAc) molecule, and one 


development of cholera, a pathological condition characterized by 
severe diarrhea (Section 14.5). Enterotoxigenic FE. coli, the most common 
cause of diarrhea, including traveler’s diarrhea, produces a toxin that also 
gains access to the cell by first binding to gangliosides. Gangliosides are 
also crucial for binding immune-system cells to sites of injury in the R2 = H, N-acetylneuraminate 
inflammatory response. Ro = OH, N-glycolylneuraminate 


N-acetylneuraminate (NAN) molecule. 


Sphingolipids confer diversity on lipid structure and function 


H 

e The structures of sphingolipids and the more abundant glycero- soe eae ao coer 

phospholipids are very similar (Figure 12.8). Given the structural ol R, 
similarity of these two types of lipids, why are sphingolipids required at R, H oH 
all? Indeed, the prefix “sphingo” was applied to capture the “sphinxlike” H OH 
properties of this enigmatic class of lipids. Although the precise role of OH H 
sphingolipids is not firmly established, progress toward solving the riddle 
of their function is being made. As discussed in Chapter 12, sphingolipids | 
are important components of lipid rafts, highly organized regions of the H——C—OH 
plasma membrane that are important in signal transduction. Sphingosine, R= oe oS 


sphingosine 1-phosphate, and ceramide serve as second messengers in the | 
regulation of cell growth, differentiation, and death. For instance, ceramide CH,OH 
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FIGURE 26.5 Lysosome with lipids. 

An electron micrograph of a lysosome 
engorged with lipids. Such lysosomes are 
sometimes described as being “onion-skin” 
like because the layers of undigested lipids 
resemble a sliced onion. [Graphics & 
Photography Service/Custom Medical 

Stock Photo—All rights reserved.] 


derived from a sphingolipid initiates programmed cell death in some cell 
types (p. 775) and may contribute to the development of type 2 diabetes 
(Chapter 27). 


Respiratory distress syndrome and Tay-Sachs disease result 
from the disruption of lipid metabolism 


e Respiratory distress syndrome is a pathological condition resulting 
from a failure in the biosynthetic pathway for dipalmitoylphospha- 
tidylcholine. This phospholipid, in conjunction with specific proteins 
and other phospholipids, is found in the extracellular fluid that sur- 
rounds the alveoli of the lung. Its function is to decrease the surface ten- 
sion of the fluid to prevent lung collapse at the end of the expiration 
phase of breathing. Premature infants may suffer from respiratory dis- 
tress syndrome because their immature lungs do not synthesize enough 
dipalmitoylphosphatidylcholine. 

Tay-Sachs disease is caused by a failure of lipid degradation: an 
inability to degrade gangliosides. Gangliosides are found in highest con- 
centration in the nervous system, particularly in gray matter, where they 
constitute 6% of the lipids. Gangliosides are normally degraded inside 
lysosomes by the sequential removal of their terminal sugars but, in Tay— 
Sachs disease, this degradation does not take place. As a consequence, 
neurons become significantly swollen with lipid-filled lysosomes 
(Figure 26.5). An affected infant displays weakness and retarded psycho- 
motor skills before 1 year of age. The child is demented and blind by age 2 
and usually dies before age 3. 

The ganglioside content of the brain of an infant with Tay—Sachs disease 
is greatly elevated. The concentration of ganglioside G,, is many times higher 
than normal because its terminal N-acetylgalactosamine residue is removed very 
slowly or not at all. The missing or deficient enzyme is a specific B-N- 
acetylhexosaminidase. 
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Tay—Sachs disease can be diagnosed in the course of fetal development. 
Cells obtained from either the villi of the placenta (chorionic villus sam- 
pling) or from the amniotic fluid (amniocentesis) are tested for the presence 
of the defective gene. Tay—Sachs disease was especially prominent among 
Ashkenazi Jews (descendants of Jews from central and eastern Europe). A 
genetic testing program, initiated in the early 1970s upon the development 
of a simple blood test to identify carriers, has virtually eliminated the dis- 
ease in the population. 


Ceramide metabolism stimulates tumor growth 


e Ceramide is a precursor for sphingomyelin, cerebroside, and ganglio- 
sides. Ceramide itself, however, induces programmed cell death or 
apoptosis (Section 18.6). Recall that cancer cells require all types of lipids for 


membrane formation (p. 771 and Section 22.4). How do cancer cells prevent 775 
ceramide-induced cell death? It appears that these cells destroy the apop- 26.1 Synthesis of Membrane Lipids 
totic signal by converting it into a pro-mitotic signal. Ceramidase removes 

the fatty acid from the amino group of ceramide, generating sphingosine. 

Sphingosine is then converted into sphingosine 1-phosphate by sphingosine 

kinase, which stimulates cell division. 


+HN ATP ADP 
OH wv, 
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(CH2)12 (CHa)i2 (CHa)i2 
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Ceramide Sphingosine Sphingosine 1-phosphate 


Thus, cancer cells convert a potentially lethal signal molecule into one that 
promotes tumor growth. Efforts are underway to develop inhibitors of 
ceramidase for use as chemotherapeutic agents. 


Phosphatidic acid phosphatase is a key regulatory enzyme 
in lipid metabolism 


Although the details of the regulation of lipid synthesis remain to be eluci- 
dated, evidence suggests that phosphatidic acid phosphatase (PAP), working 
in concert with diacylglycerol kinase, plays a key role in the regulation of 
lipid synthesis. PAP, also called lipin 1 in mammals, controls the extent to 
which triacylglycerols are synthesized relative to phospholipids and regu- 
lates the type of phospholipid synthesized (Figure 26.6). For instance, when 
PAP activity is high, phosphatidate is dephosphorylated and diacylglycerol 
is produced, which can react with the appropriate activated alcohols to yield 
phosphatidylethanolamine, phosphatidylserine, or phosphatidylcholine. 
Diacylglycerol can also be converted into triacylglycerols, and evidence 
suggests that the formation of triacylglycerols may act as a fatty acid buffer. 
This buffering helps to regulate the levels of diacylglycerol and sphingolip- 
ids, which serve signaling functions. 

When PAP activity is lower, phosphatidate is used as a precursor 
for different phospholipids, such as phosphatidylinositol and cardio- 
lipin. Moreover, phosphatidate is a signal molecule itself. Phosphatidate 


bncophetidic tid FIGURE 26.6 Regulation of lipid 


phosphatase : si : ; 
synthesis. Phosphatidic acid phosphatase is 
H,0 P; Phosphatidylethanolamine _ the key regulatory enzyme in lipid synthesis. 
ehasshaiauhacetel : 4 Piysabhadavichali When active, PAP generates diacylglycerol 
eae Phosphatidate Diacylglcyerol a alee ec (DAG), which can react with activated 
Cardiolipin <= / \ “==> Phosphatidylserine alcohols to form phospholipids or with fatty 
ADP ATP Triacylglycerol acyl CoA to form triacylglycerols. When PAP 


is inactive, phosphatidate is converted into 
Diacylglycerol CMP-DAG for the synthesis of different 
Ene phospholipids. PAP also controls the amount 
of DAG and phosphatidate, both of which 
Second messengers function as second messengers. 
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FIGURE 26.7 Labeling of cholesterol. |Isotope-labeling 
experiments reveal the source of carbon atoms in 
cholesterol synthesized from acetate labeled in its methyl 


group (blue) or carboxylate atom (red). 


Cholesterol 
“Cholesterol is the most highly decorated 
small molecule in biology. Thirteen Nobel 
Prizes have been awarded to scientists who 
devoted major parts of their careers to 
cholesterol. Ever since it was isolated from 
gallstones in 1784, cholesterol has exerted 
an almost hypnotic fascination for scientists 
from the most diverse areas of science and 
medicine. . . . Cholesterol is a Janus-faced 
molecule. The very property that makes it 
useful in cell membranes, namely its absolute 
insolubility in water, also makes it lethal.” 
—Michael Brown and Joseph Goldstein, 
on the occasion of their receipt of a 
Nobel Prize for elucidating the control 
of blood levels of cholesterol. 
Nobel Lectures (1985) © The Nobel 
Foundation, 1985 


H,C 


regulates the growth of endoplasmic reticulum and nuclear membranes 
and acts as a cofactor that stimulates the expression of genes in phospho- 
lipid synthesis. 

What are the signal molecules that regulate the activity of PAP? CDP- 
diacylglycerol, phosphatidylinositol, and cardiolipin enhance PAP activity, 
and sphingosine and dihydrosphingosine inhibit it. The enzyme also under- 
goes extensive phosphorylation and dephosphorylation. When phosphory- 
lated, the enzyme resides in the cytoplasm. Upon dephosphorylation, the 
enzyme associates with the endoplasmic reticulum, the location of its sub- 
strate phosphatidate. Details of the covalent modification are under 
investigation. 


cs Studies in mice clearly show the importance of PAP for the regulation 

of fatty acid synthesis. The loss of PAP function prevents normal 
adipose-tissue development, leading to lipodystrophy (severe loss of body 
fat) and insulin resistance. Excess PAP activity results in obesity. The regu- 
lation of phospholipid synthesis is an exciting area of research that will be 
active for some time to come. 


26.2 Cholesterol Is Synthesized from Acetyl 
Coenzyme A in Three Stages 


We now turn our attention to the synthesis of the fundamental 


Cc lipid cholesterol. This steroid modulates the fluidity of animal 
i" cell membranes (Section 12.5) and is the precursor of steroid 
hormones such as progesterone, testosterone, estradiol, and cor- 
tisol. All 27 carbon atoms of cholesterol are derived from acetyl 

; CoA in a three-stage synthetic process (Figure 26.7). 
sm, 1. Stage one is the synthesis of isopentenyl pyrophosphate, 


an activated isoprene unit that is the key building block of 
cholesterol. 


2. Stage two is the condensation of six molecules of isopentenyl 
pyrophosphate to form squalene. 


3. In stage three, squalene cyclizes and the tetracyclic product 
is subsequently converted into cholesterol. 


The first stage takes place in the cytoplasm, and the next two in the 
endoplasmic reticulum. 


The synthesis of mevalonate, which is activated as isopentenyl 
pyrophosphate, initiates the synthesis of cholesterol 


The first stage in the synthesis of cholesterol is the formation of isopentenyl 
pyrophosphate from acetyl CoA. This set of reactions starts with the forma- 
tion of 3-hydroxy-3-methylglutaryl CoA (HMG CoA) from acetyl CoA and 
acetoacetyl CoA. This intermediate is reduced to mevalonate for the synthe- 
sis of cholesterol (Figure 26.8). Recall that, alternatively, 3-hydroxy-3- 
methylglutaryl CoA may be generated in the mitochondria and processed to 
form ketone bodies, which are subsequently secreted to provide fuel for other 
tissues, notably the brain under starvation conditions (Section 22.3). 

The synthesis of mevalonate is the committed step in cholesterol formation. 
The enzyme catalyzing this irreversible step, 3-hydroxy-3-methylglutaryl 
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FIGURE 26.8 Fates of 3-hydroxy-3-methylglutaryl CoA. In the ~OOC 
cytoplasm, HMG-CoA is converted into mevalonate. In mitochondria, 
it is converted into acetyl CoA and acetoacetate. MITOCHONDRIA Acetoacetate 


(@) 


CoA reductase (HMG-CoA reductase), is an important control site in cho- 
lesterol biosynthesis, as will be discussed shortly. 


3-Hydroxy-3-methylglutaryl CoA + 2 NADPH + 2H* —> 
mevalonate + 2NADP* + CoA 


HMG-CoA reductase is an integral membrane protein in the endoplasmic 
reticulum. 

Mevalonate is converted into 3-isopentenyl pyrophosphate in three consecu- 
tive reactions requiring ATP (Figure 26.9). In the last step, the release of CO, 
yields isopentenyl pyrophosphate, an activated isoprene unit that is a key build- 
ing block for many important biomolecules throughout the kingdoms of life. 


Squalene (C,,) is synthesized from six molecules of 
isopentenyl pyrophosphate (C,) 


Squalene is synthesized from isopentenyl pyrophosphate by the reaction CH; 
sequence 


Cs > Cio > Cis > C30 H2C i 
; . . . : . ; Isoprene 
This stage in the synthesis of cholesterol starts with the isomerization of 


isopentenyl pyrophosphate to dimethylallyl pyrophosphate. 


CH; CH; 


ee ee 


Isopentenyl pyrophosphate Dimethylallyl pyrophosphate 


FIGURE 26.9 Synthesis of isopentenyl 
These two isomeric C, units (one of each type) condense to form a C,,) pyrophosphate. This activated intermediate 


 s : : is formed from mevalonate in three steps 
compound: isopentenyl pyrophosphate attacks an allylic carbocation FecalirnseonTe (lowed by -adece bon eine: 


CcoOo- COO- COO- + 
(ee) 
eZ > H2C 
> CHz 
y CH,O _O a 
ae eye 
R? id RB? 
Py fs aN 
O_O 0 0O O_O 0 0O 
Mevalonate 5-Phospho- 5-Pyrophospho- 3-Isopentenyl 
mevalonate mevalonate pyrophosphate 


777 


3-Isopentenyl CH; 


OPO;P0;2- 


hosphat 
Poerre a J CH;  OPO3P033- CH;  OPO3P053- 
Ho ) fi a y sie 2 
CH;  OPO3P032- CH; He cH Ht c . CH; HoC c 
PP; | H> H | by 
CH, <om + _-CH2 CH> i CH> 
—_ a ae a in 
H H H 
Allylic substrate Allylic Geranyl 
carbocation (or farnesyl) 
pyrophosphate 


FIGURE 26.10 Condensation mechanism in cholesterol synthesis. The isomeric C, units 
dimethylallyl pyrophosphate and isopentenyl pyrophosphate join to form geranyl pyrophosphate. 
The same mechanism is used to add an additional isopentenyl pyrophosphate to form farnesy! 


pyrophosphate. 


CH; 


ee 


Dimethylallyl pyrophosphate 


CH; 


oe oe 


Isopentenyl pyrophosphate 


formed from dimethylallyl pyrophos- 
phate to yield geranyl pyrophosphate 
(Figure 26.10). The same kind of reaction 
takes place again: geranyl pyrophosphate 
is converted into an allylic carbonium 
ion, which is attacked by isopentenyl 
pyrophosphate. The resulting C,, com- 
pound is called farnesyl pyrophosphate. 


a The same enzyme, gevanyl transferase, 
CH; catalyzes each of these condensations. 
The last step in the synthesis of 
CH; = OPO;P0;2- squalene is a reductive tail-to-tail conden- 
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oe 
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Farnesyl pyrophosphate 
Farnesyl pyrophosphate + NADPH + H* 


2 PP; + NADPt 


CH; 
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sation of two molecules of farnesyl pyro- 
phosphate catalyzed by the endoplasmic 
reticulum enzyme squalene synthase. 


2 Farnesyl pyrophosphate(Cjs) 
+ NADPH —> squalene(C39) 
+ 2PP; + NADP” + H™ 


The reactions leading from C, units to 
squalene, a C,, isoprenoid, are summa- 
rized in Figure 26.11. 


Squalene cyclizes to form cholesterol 


The final stage of cholesterol biosyn- 
thesis starts with the cyclization of 
squalene (Figure 26.12). Squalene is 
first activated by conversion into squa- 
lene epoxide (2,3-oxidosqualene) in a 
reaction that uses O, and NADPH. 
Squalene epoxide is then cyclized to 
lanosterol by oxidosqualene cyclase. The 
enzyme holds squalene epoxide in an 
appropriate conformation and initiates 


FIGURE 26.11 Squalene synthesis. One 
molecule of dimethyallyl pyrophosphate and 
two molecules of isopentenyl pyrophosphate 
condense to form farnesyl pyrophosphate. The 
tail-to-tail coupling of two molecules of farnesy! 
pyrophosphate yields squalene. 
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+ 
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FIGURE 26.12 Squalene cyclization. The formation of the steroid nucleus from HO : 
squalene begins with the formation of squalene epoxide. This intermediate = H 


is protonated to form a carbocation that cyclizes to form a tetracyclic structure, H3C CH; 
which rearranges to form lanosterol. Lanosterol 


the reaction by protonating the epoxide oxygen. The carbocation formed 
spontaneously rearranges to produce lanosterol. Lanosterol is converted 
into cholesterol in a multistep process by the removal of three methyl 
groups, the reduction of one double bond by NADPH, and the migration of 
the other double bond (Figure 26.13). 


CH; 


26.3 The Complex Regulation of Cholesterol Biosynthesis 
Takes Place at Several Levels 


Cholesterol can be obtained from the diet or it can be synthesized de 
novo. Cholesterol biosynthesis is one of the most highly regulated meta- 
bolic pathways known. Biosynthetic rates may vary several hundredfold, 


depending on how much cholesterol is consumed in the diet. An adult on a HC “CH; 
low-cholesterol diet typically synthesizes about 800 mg of cholesterol per cancetarsl 
day. The liver is the major site of cholesterol synthesis in mammals, 
although the intestine also forms significant amounts. The rate of choles- 19 steps 
terol formation by these organs is highly responsive to the cellular level of HCOOH + 2 CO, 
cholesterol. This feedback regulation is mediated primarily by changes in the 

a H3C. 
amount and activity of 3-hydroxy-3-methylglutaryl CoA reductase. As ce 
described earlier (p. 776), this enzyme catalyzes the formation of mevalon- ? 
ate, the committed step in cholesterol biosynthesis. HMG CoA reductase is ns ie 
controlled in multiple ways: CH; 
1. The rate of synthesis of reductase mRNA is controlled by the sterol regula- 
tory element binding protein (SREBP), a member of a family of transcription 
factors that regulate the proteins required for lipid synthesis. This tran- HO 
scription factor binds to a short DNA sequence called the sterol regulatory Cholesterol 
element (SRE) on the 5’ side of the reductase gene when cholesterol levels Pieune sis Gudene ein: 
are low, enhancing transcription of the gene. In its inactive state, the SREBP kancetefol'ls converted into cholestaralin 
resides in the endoplasmic reticulum membrane, where it is associated with a complex process. 
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FIGURE 26.14 The SREBP pathway. SREBP resides in the endoplasmic reticulum (ER), 
where it is bound to SCAP by its regulatory (Reg) domain. When cholesterol levels fall, SCAP 
and SREBP move to the Golgi complex, where SREBP undergoes successive proteolytic 
cleavages by a serine protease and a metalloprotease. The released DNA-binding domain 
moves to the nucleus to alter gene expression. [Information from an illustration provided by 

Dr. Michael Brown and Dr. Joseph Goldstein.] 


the SREBP cleavage activating protein (SCAP), an integral membrane pro- 
tein. SCAP is the cholesterol sensor. When cholesterol levels fall, SCAP 
escorts SREBP in small membrane vesicles to the Golgi complex, where it 
is released from the membrane by two specific proteolytic cleavages 
(Figure 26.14). The first cleavage frees a fragment of SREBP from SCAP, 
whereas the second cleavage releases the regulatory domain from the mem- 
brane. The released protein migrates to the nucleus and binds the SRE of 
the HMG-CoA reductase gene, as well as several other genes in the choles- 
terol biosynthetic pathway, to enhance transcription. When cholesterol 
levels rise, the proteolytic release of the SREBP is blocked, and the SREBP 
in the nucleus is rapidly degraded by proteasomes located in the nucleus. 
These two events halt the transcription of genes of the cholesterol biosyn- 
thetic pathways. 

What is the molecular mechanism that retains SCAP-SREBP in the 
ER when cholesterol is present but allows movement to the Golgi complex 
when cholesterol concentration is low? When cholesterol is low, SCAP 
binds to vesicular proteins that facilitate the transport of SCAP-SREBP to 
the Golgi apparatus, as heretofore described. When cholesterol is present, 
SCAP binds cholesterol, which causes a structural change in SCAP so that 
it binds to another endoplasmic reticulum protein called Insig (insulin- 
induced gene) (Figure 26.15). Insig is the anchor that retains SCAP and 
thus SREBP in the endoplasmic reticulum in the presence of cholesterol. 
The interactions between SCAP and Insig can also be forged when Insig 
binds 25-hydroxycholesterol, a metabolite of cholesterol. Thus, two distinct 
steroid—protein interactions prevent the inappropriate movement of SCAP— 
SREBP to the Golgi complex. 


ER Cholesterol 


Cytoplasm 


Insig SCAP SREBP 


FIGURE 26.15 Insig regulates SCAP-SREBP movement. In the presence of cholesterol, 
Insig interacts with SCAP-SREBP and prevents the activation of SREBP. Cholesterol binding 
to SCAP or 25-hydroxycholesterol binding to Insig facilitates the interaction of Insig and 
SCAP, retaining SCAP-SREBP in the endoplasmic reticulum. [Information from M. S. Brown 
and J. L. Goldstein. Cholesterol feedback: From Schoenheimer’s bottle to Scap’s MELADL, 

J. Lipid Res. 50:515-S27, 2009.] 


2. The rate of translation of reductase mRNA is inhibited by nonsterol 
metabolites derived from mevalonate. 


3. The degradation of the reductase is stringently controlled. The enzyme is 
bipartite: its cytoplasmic domain carries out catalysis and its membrane 
domain senses signals that lead to its degradation. The membrane domain 
may undergo structural changes in response to increasing concentrations of 
sterols such as lanosterol and 25-hydroxycholesterol. Under these conditions, 
the reductase appears to bind to a subset of Insigs that are also associated 
with the ubiquitinating enzymes (Figure 26.16). The reductase is polyubiq- 
uitinated and subsequently extracted from the membrane in a process that 
requires gerenylgeraniol. The extracted reductase is then degraded by the 
proteasome (Section 23.2). The combined regulation at the levels of tran- 
scription, translation, and degradation can alter the amount of enzyme in 
the cell more than 200-fold. 


4. Phosphorylation decreases the activity of the reductase. This enzyme, like 
acetyl CoA carboxylase (which catalyzes the committed step in fatty acid 
synthesis, Section 22.5), is switched off by an AMP-activated protein 
kinase. Thus, cholesterol synthesis ceases when the ATP level is low. 
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FIGURE 26.16 Insig facilitates the degradation of HMG-CoA reductase. In the presence 
of sterols, a subclass of Insig associated with ubiquitinating enzymes binds HMG-CoA 
reductase. This interaction results in the ubiquitination of the enzyme. This modification and 
the presence of geranylgeraniol results in extraction of the enzyme from the membrane and 
degradation by the proteasome. [Information from R. A. DeBose-Boyd. Feedback regulation of 
cholesterol synthesis: Sterol-accelerated ubiquitination and degradation of HMG CoA reductase, 
Cell Res. 18:609-621, 2008.] 
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TABLE 26.1 Properties of plasma lipoproteins 


Composition (%) 


Plasma lipoproteins Density (g ml!) Diameter(nm) Apolipoprotein Physiological role TAG CE C PL P 

Chylomicron <0.95 75-1200 B-48, C, E Dietary fat transport 86 3 1 

Very low density 0.95-1.006 30-80 B-100, C, E Endogenous fat By 14 7 18 
lipoprotein transport 

Intermediate-density 1.006-1.019 15-35 B-100, E LDL precursor 38 30 8 23 11 
lipoprotein 

Low-density lipoprotein 1.019-1.063 18-25 B-100 Cholesterol transport 10 38 8 22. 21 

High-density lipoprotein —_1.063-1.21 7.5-20 A Reverse cholesterol 5-10 14-21 3-7 19-29 33-57 

transport 


Abbreviations: TAG, triacylglyerol; CE, cholesteryl ester; C, free cholesterol; PL, phospholipid; P, protein. 


500 nm 


FIGURE 26.17 Site of cholesterol 
synthesis. Electron micrograph of a part of 
a liver cell actively engaged in the synthesis 
and secretion of very low density lipoprotein 
(VLDL). The arrow points to a vesicle that is 
releasing its content of VLDL particles. 
[Courtesy of Dr. George Palade/Yale 
University, Harvey Cushing/John Hay Whitney 
Medical Library.] 
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Lipoproteins transport cholesterol and triacylglycerols 
throughout the organism 


Cholesterol and triacylglycerols are transported in body fluids in the form 
of lipoprotein particles, which are important for a number of reasons. First, 
lipoprotein particles are the means by which triacylglycerols are delivered to 
tissues, from the intestine or liver, for use as fuel or for storage. Second, the 
fatty acid constituents of the triacylglycerol components of the lipoprotein 
particles are incorporated into phospholipids for membrane synthesis. 
Likewise, cholesterol is a vital component of membranes and is a precursor 
to the powerful signal molecules, the steroid hormones. Finally, cells are not 
able to degrade the steroid nucleus. Consequently, the cholesterol must be 
used biochemically or excreted by the liver. Excess cholesterol plays a role 
in the development of atherosclerosis. Lipoprotein particles function in 
cholesterol homeostasis, transporting the molecule from sites of synthesis to 
sites of use, and finally to the liver for excretion. 

Each lipoprotein particle consists of a core of hydrophobic lipids sur- 
rounded by a shell of more-polar lipids and proteins. The protein components 
of these macromolecular aggregates, called apoproteins, have two roles: they 
solubilize hydrophobic lipids and contain cell-targeting signals. Apolipoproteins 
are synthesized and secreted by the liver and the intestine. Lipoprotein par- 
ticles are classified according to increasing density (Table 26.1): chylomicrons, 
chylomicron remnants, very low density lipoproteins (VLDLs), intermediate- 
density lipoproteins (IDLs), low-density lipoproteins (LDLs), and high-density 
lipoproteins (HDLs). These classes have numerous subtypes. Moreover, lipo- 
protein particles can shift between classes as they release or pick up cargo, 
thereby changing their density. 

Triacylglycerols, cholesterol, and other lipids obtained from the diet are 
carried away from the intestine in the form of large chylomicrons (Section 
22.1). These particles have a very low density because triacylglycerols con- 
stitute about 90% of their content. Apolipoprotein B-48 (apo B-48), a large 
protein (240 kDa), forms an amphipathic spherical shell around the fat 
globule; the external face of this shell is hydrophilic. The triacylglycerols in 
chylomicrons are released through hydrolysis by lipoprotein lipases. These 
enzymes are located on the lining of blood vessels in muscle and other tis- 
sues that use fatty acids as fuels or in the synthesis of lipids. The liver then 
takes up the cholesterol-rich residues, known as chylomicron remnants. 

Lipoprotein particles are also crucial for the transport of lipids from the 
liver, which is a major site of triacylglycerol and cholesterol synthesis, to other 
tissues in the body (Figure 26.17). Triacylglycerols and cholesterol in excess of 
the liver’s own needs are exported into the blood in the form of very low den- 
sity lipoproteins. These particles are stabilized by two apolipoproteins—apo 
B-100 (513 kDa) and apo E (34 kDa). Triacylglycerols in very low density 


wa Unesterified cholesterol 
al Phospholipid 

[_] cholesteryl ester 

ey Apolipoprotein B-100 


lipoproteins, as in chylomicrons, are hydrolyzed by lipases on capillary sur- 
faces, with the released fatty acids being taken up by the muscle and other 
tissues. The resulting remnants, which are rich in cholesteryl esters, are 
called intermediate-density lipoproteins. These particles have two fates. Half 
of them are taken up by the liver for processing, and half are converted into 
low-density lipoprotein by the removal of more triacylglycerol by tissue 
lipases that absorb the released fatty acids. Interestingly, apo B-100, one of 
the largest proteins known, is a longer version of apo B-48, the protein com- 
ponent of chylomicrons. Both apo B proteins are encoded by the same gene 
and produced from the same initial RNA transcript. In the intestine, RNA 
editing (Section 29.3) modifies the transcript to generate the mRNA for apo 
B-48, the truncated form. 

Low-density lipoprotein is the major carrier of cholesterol in blood 
(Figure 26.18). It contains a core of some 1500 cholesterol molecules esteri- 
fied to fatty acids; the most common fatty acid chain in these esters is linole- 
ate, a polyunsaturated fatty acid. A shell of phospholipids and unesterified 
cholesterol molecules surrounds this highly hydrophobic core. The shell 
also contains a single copy of apo B-100, which is recognized by target cells. 
The vole of LDL is to transport cholesterol to peripheral tissues and regulate de 
novo cholesterol synthesis at these sites, as described on page 784. A different 
purpose is served by high-density lipoprotein, which picks up cholesterol 
released into the plasma from dying cells and from membranes undergoing 
turnover and delivers the cholesterol to the liver for excretion. An acyltrans- 
ferase in HDL esterifies these cholesterols, which are then returned by 
HDL to the liver (Figure 26.19). 
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FIGURE 26.18 Schematic model of low- 
density lipoprotein. The LDL particle is 
approximately 22 nm (220 A) in diameter. 


FIGURE 26.19 An overview of lipoprotein 
particle metabolism. Fatty acids are 
abbreviated FFA. [Information from 

J. G. Hardman (Ed.), L. L. Limbird (Ed.), and 
A. G. Gilman (Consult. Ed.), Goodman and 
Gilman’s The Pharmacological Basis of 
Therapeutics, 10th ed. (McGraw-Hill, 2001), 

p. 975, Fig. 36.1.] 
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FIGURE 26.20 Receptor-mediated endocytosis. The process of 
receptor-mediated endocytosis is illustrated for the cholesterol- 
carrying complex, low-density lipoprotein (LDL): (1) LDL binds to a 
specific receptor, the LDL receptor; (2) this complex invaginates to 
form an endosome; (3) after separation from its receptor, the LDL- 
containing vesicle fuses with a lysosome, leading to the 
degradation of the LDL and the release of the cholesterol. 


Low-density lipoproteins play a central role in 
cholesterol metabolism 


Cholesterol metabolism must be precisely regulated to 
prevent atherosclerosis. The mode of control in the liver, 
the primary site of cholesterol synthesis, has already been 
considered: dietary cholesterol reduces the activity and 
amount of 3-hydroxy-3-methylglutaryl CoA reductase, 
the enzyme catalyzing the committed step. In general, 
cells outside the liver and intestine obtain cholesterol from 
the plasma rather than synthesizing it de novo. Specifically, 
their primary source of cholesterol is the low-density lipopro- 
tein. The process of LDL uptake, called receptor-mediated 
endocytosis, serves as a paradigm for the uptake of many 
molecules (Figure 26.20). 

Endocytosis begins when apolipoprotein B-100 on the 
surface of an LDL particle binds to a specific receptor pro- 
tein on the plasma membrane of nonhepatic cells. The 
receptors for LDL are localized in specific regions called 
coated pits, which contain a protein called clathrin. The 
receptor-LDL complex is then internalized by endocytosis; 
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that is, the plasma membrane in the vicinity of the complex 
invaginates and then fuses to form an endocytic vesicle called an endosome. 
The endosome is acidified by an ATP-dependent proton pump homologous 
to the Nat/K*t ATPase (Section 13.2), which causes the receptor to release 
its cargo. Some of the receptor is returned to the cell membrane in a recycling 
vesicle, while a portion is degraded along with its cargo (p. 787). The round- 
trip time for a receptor is about 10 minutes; in its lifetime of about a day, each 
receptor may bring hundreds of LDL particles into the cell. The vesicles 
containing LDL and in some cases the receptor subsequently fuse with lyso- 
somes, acidic vesicles that carry a wide array of degradative enzymes. The 
protein component of LDL is hydrolyzed to free amino acids. The choles- 
teryl esters in LDL are hydrolyzed by a lysosomal acid lipase. The released 
unesterified cholesterol can then be used for membrane biosynthesis. Alternatively, 
it can be reestevified for storage inside the cell. In fact, free cholesterol activates 
acyl CoA:cholesterol acyltransferase (ACAT), the enzyme catalyzing this 
reaction. Reesterified cholesterol contains mainly oleate and palmitoleate, 
which are monounsaturated fatty acids, in contrast with the cholesteryl esters 
in LDL, which are rich in linoleate, a polyunsaturated fatty acid (Table 12.1). 
It is imperative that the cholesterol be reesterified. High concentrations of 
unesterified cholesterol disrupt the integrity of cell membranes. 

The synthesis of the LDL receptor is itself subject to feedback regula- 
tion. Studies of cultured fibroblasts show that, when cholesterol is abundant 
inside the cell, new LDL receptors are not synthesized, and so the uptake of 
additional cholesterol from plasma LDL is blocked. The gene for the LDL 
receptor, like that for the reductase, is regulated by SREBP, which binds to 
a sterol regulatory element that controls the rate of mRNA synthesis. 


The absence of the LDL receptor leads to hypercholesterolemia 
and atherosclerosis 


e The pioneering studies of familial hypercholesterolemia by Michael 
Brown and Joseph Goldstein revealed the physiological importance 
of the LDL receptor. The total concentration of cholesterol and LDL in 
the blood plasma is markedly elevated in this genetic disorder, which 
results from a mutation at a single autosomal locus. The cholesterol level 
in the plasma of homozygotes is typically 680 mg dl~!, compared with 


(A) (B) 


300 mg dl~! in heterozygotes (clinical assay results are often expressed in 
milligrams per deciliter, which is equal to milligrams per 100 milliliters). 
A value of < 200 mg dl“! is regarded as desirable, but many people have 
higher levels. In familial hypercholesterolemia, cholesterol is deposited in 
various tissues because of the high concentration of LDL cholesterol in the 
plasma. Nodules of cholesterol called xanthomas are prominent in skin and 
tendons. LDL accumulates under the endothelial cells lining the blood 
vessels. Of particular concern is the oxidation of the excess LDL to form 
oxidized LDL (oxLDL), which can instigate the inflammatory response 
by the immune system, a response that has been implicated in the develop- 
ment of cardiovascular disease. The oxLDL is taken up by immune 
system cells called macrophages, which become engorged to form foam 
cells. These foam cells become trapped in the walls of the blood vessels and 
contribute to the formation of atherosclerotic plaques that cause arterial 
narrowing and lead to heart attacks (Figure 26.21). In fact, most homozy- 
gotes die of coronary artery disease in childhood. The disease in heterozy- 
gotes (1 in 500 people) has a milder and more variable clinical course. 

The molecular defect in most cases of familial hypercholesterolemia is an 
absence or deficiency of functional receptors for LDL. Receptor mutations that 
disrupt each of the stages in the endocytotic pathway have been identified. 
Homozygotes have almost no functional receptors for LDL, whereas het- 
erozygotes have about half the normal number. Consequently, the entry of 
LDL into liver and other cells is impaired, leading to an increased level 
of LDL in the blood plasma. Furthermore, less IDL enters liver cells 
because IDL entry, too, is mediated by the LDL receptor. Consequently, 
IDL stays in the blood longer in familial hypercholesterolemia, and more of 
it is converted into LDL than in normal people. All deleterious conse- 
quences of an absence or deficiency of the LDL receptor can be attributed 
to the ensuing elevated level of LDL cholesterol in the blood. 


Mutations in the LDL receptor prevent LDL release and 
result in receptor destruction 


e One class of mutations that results in familial hypercholesterolemia 

generates receptors that are reluctant to give up the LDL cargo. Let us 
begin by examining the makeup of the receptor. The human LDL receptor 
is a 160-kDa glycoprotein, which is composed of six different types of 
domains (Figure 26.22A). The amino-terminal region of the receptor, which 
is the site of LDL binding, consists of seven homologous LDL receptor type- 
A (LA) domains, with domains 4 and 5 most important for LDL binding. 


FIGURE 26.21 The effects of excess 


cholesterol. Cross section of (A) a normal 


artery and (B) an artery blocked by a 
cholesterol-rich plaque. [SPL/Photo 
Researchers. ] 
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FIGURE 26.22 LDL receptor releases 
LDL in the endosome. (A) A schematic 
representation of the domain structure of 


(A) (B) 
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A second type of domain is homologous to one found in the epidermal 
growth factor (EGF). This domain is repeated three times, and in between 


the LDL receptor. (B) In the endosome, the the second and third repeat is a propeller structure consisting of six bladelike 
receptor converts from the open structure into domains. This part of the receptor is crucial in releasing LDL in the endo- 
the closed structure, resulting in the release some. The fourth domain, which is very rich in serine and threonine resi- 
of the LDL inte the endesome, [Information dues, contains O-linked sugars. These oligosaccharides function as struts to 


from |. D. Campbell, Biochem. Soc. Trans. 31 
(pt. 6p):1107-1114, 2008, Fig. 1A.] 
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keep the receptor extended from the membrane so that the LDL-binding 
domain is accessible to LDL. The fifth type of domain consists of 22 hydro- 
phobic residues that span the plasma membrane. The sixth and final domain 
consists of 50 residues and emerges on the cytoplasmic side of the mem- 
brane, where it controls the interaction of the receptor with coated pits and 
participates in endocytosis. 

How does the LDL receptor relinquish its cargo on entering the endo- 
some? The receptor exists in two interconvertible states: an extended or open 
state, capable of binding LDL, and a closed state that results in release of the 
LDL in the endosome. The receptor maintains the open state while in the 
plasma membrane, on binding LDL, and throughout its journey to the endo- 
some. The conversion from the open state into the closed state takes place on 
exposure to the acidic environment of the endosome (Figure 26.22B). Three 
contiguous modules LA7, EGFA, and EGFB rigidly position the propeller 
module to facilitate displacement of the LDL as the closed state is formed. 
Under neutral pH, aspartate residues of the propeller blades form hydrogen 
bonds that hold each blade to the rest of the propeller structure. Exposure to 
the low-pH environment of the endosome causes the propeller-like structures 
to interact with the LDL-binding domain. This interaction displaces the 
LDL, which is then digested by the lysosome. The receptor is often returned 
to the plasma membrane to again bind LDL. The importance of this process 
is highlighted by the fact that more than half of the point mutations that result 
in familial hypercholesterolemia are due to disruptions in the interconversion 
between the open state and the closed state. These mutations result in a failure 
to release the LDL cargo and loss of the receptor by degradation. 


Cycling of the LDL receptor is regulated 


PCSK9 (proprotein convertase subtilisin/kexin type) is a protease, 

secreted by the liver, which plays a crucial role in the regulation of 
cycling of the LDL receptor. Despite the fact that PCSK9 is a protease, enzy- 
matic activity of the protein is not required for cycling regulation. PCSK9 in 
the blood binds to the EGFA domain of the receptor (Figure 26.22). PCSK9 
locks the receptor in the open conformation even in the acidic conditions of 
the endosome. Failure to adopt the closed conformation prevents the recep- 
tor from returning to the plasma membrane, and it is degraded in the lyso- 
some along with its cargo. 

Individuals having a mutation that reduces the amount of PCSK9 in the 
blood have greatly reduced levels of LDL in the blood and display an almost 
90% reduction in the rate of cardiovascular disease. Presumably, reduced 
levels of PCSK9 allow more receptor cycling and more efficient removal of 
LDL from the blood. Much research in now being directed at inhibiting 
PCSK9-mediated receptor degradation in individuals with high cholesterol 
levels. Phase 3 trials (Section 36.4) are underway, testing the efficacy of a 
monoclonal antibody to PCSK9. Monoclonal antibody treatments are 
expensive, so the search continues for a chemical inhibitor of the protein. 


HDL appears to protect against atherosclerosis 


e Although the events that result in atherosclerosis take place rapidly 

in familial hypercholesterolemia, a similar sequence of events takes 
place in people who develop atherosclerosis over decades. In particular, the 
formation of foam cells and plaques are especially hazardous occurrences. 
HDL and its role in returning cholesterol to the liver appear to be important 
in mitigating these life-threatening circumstances. 

HDL has a number of antiatherogenic properties, including the inhibi- 
tion of LDL oxidation, but the best-characterized property is the removal 
of cholesterol from cells, especially macrophages. Earlier, we learned that 
HDL retrieves cholesterol from other tissues in the body to return the cho- 
lesterol to the liver for excretion as bile or in the feces. This transport, called 
reverse cholesterol transport, is especially important in regard to macro- 
phages. Indeed, when the transport fails, macrophages become foam cells 
and facilitate the formation of plaques. Macrophages that collect cholesterol 
from LDL normally transport the cholesterol to HDL particles. The more 
HDL, the more readily this transport takes place and the less likely that the 
macrophages will develop into foam cells. Presumably, this robust reverse 
cholesterol transport accounts for the observation that higher HDL levels 
confer protection against atherosclerosis. 

The importance of reverse cholesterol transport is illustrated by the occur- 
rence of mutations that inactivate a cholesterol-transport protein in endothelial 
cells and macrophages, ABCA1 (ATP-binding cassette transporter, subfamily 
A1) (Figure 13.7). Loss of activity of cholesterol-transport protein ABCA1 
results in a very rare condition called Tangier disease, which is characterized by 
HDL deficiency, accumulation of cholesterol in macrophages, and premature 
atherosclerosis. Under normal conditions, the apoprotein component of HDL, 
apoA-I, binds to ABCA1 to facilitate LDL transport. Moreover, the interac- 
tion between apoA-I and ABCA1 initiates a signal transduction pathway in 
the endothelial cells that inhibits the inflammatory response. 

Until recently, high levels of HDL-bound cholesterol (“good choles- 
terol’’) relative to LDL-bound cholesterol (“bad cholesterol’’) were believed 
to protect against cardiovascular disease. This belief was based on epide- 
miological studies. However, a number of recent clinical trials revealed that 
increased levels of HDL-bound cholesterol had no protective effects at all. 
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FIGURE 26.23 Lovastatin, a competitive 
inhibitor of HMG-CoA reductase. The 
part of the structure that resembles the 
3-hydroxy-3-methylglutaryl moiety is 
shown in red. 


These studies do not discount the protective effects of HDL alone, but they 
illustrate the danger of equating free HDL and cholesterol-bound HDL. 


The clinical management of cholesterol levels can be 
understood at a biochemical level 


e Homozygous familial hypercholesterolemia can be treated only by a 

liver transplant. A more generally applicable therapy is available for 
heterozygotes and others with high levels of cholesterol. The goal is to reduce 
the amount of cholesterol in the blood by stimulating the synthesis of more than 
the customary number of LDL receptors. We have already observed that the 
production of LDL receptors is controlled by the cell’s need for cholesterol. 
The therapeutic strategy is to deprive the cell of ready sources of choles- 
terol. When cholesterol is required, the amount of mRNA for the LDL 
receptor rises and more receptors are found on the cell surface. This state 
can be induced by a two-pronged approach. First, the reabsorption of bile 
salts from the intestine is inhibited. Bile salts are cholesterol derivatives that 
promote the absorption of dietary cholesterol and dietary fats. Second, de 
novo synthesis of cholesterol is blocked. 

The reabsorption of bile is impeded by oral administration of positively 
charged polymers, such as cholestyramine, that bind negatively charged bile 
salts and are not themselves absorbed. Cholesterol synthesis can be effec- 
tively blocked by a class of compounds called statins. A well-known example 
of such acompound is lovastatin, which is also called mevacor (Figure 26.23). 
These compounds are potent competitive inhibitors (K; = 1 nM) of HMG- 
CoA reductase, the essential control point in the biosynthetic pathway. 
Plasma cholesterol levels decrease by 50% in many patients given both 
lovastatin and inhibitors of bile-salt reabsorption. Lovastatin and other 
inhibitors of HMG-CoA reductase are widely used to lower the plasma- 
cholesterol level in people who have atherosclerosis, which is the leading 
cause of death in industrialized societies. Preliminary studies suggest that 
reducing levels of PCSK9 (p. 787) and HMG-CoA reductase activity may 
be an especially effective means of reducing cholesterol levels. The develop- 
ment of statins as effective drugs is further described in Chapter 36. 


26.4 Important Derivatives of Cholesterol Include 
Bile Salts and Steroid Hormones 


Although cholesterol is well known in its own right as a contributor to the 
development of heart disease, metabolites of cholesterol—the steroid 
hormones—also appear in the news frequently. Indeed, steroid-hormone 
abuse seems to be as prominent in the sports pages as any athlete is. In addi- 
tion to steroid hormones, cholesterol is a precursor for two other important 
molecules: bile salts and vitamin D. We begin with a look at the bile salts, 
molecules crucial for the uptake of lipids in the diet. 


Bile salts. Bile salts are polar derivatives of cholesterol. These compounds 
are highly effective detergents because they contain both polar and nonpolar 
regions. Bile salts are synthesized in the liver, stored and concentrated in the 
gallbladder, and then released into the small intestine. Bile salts, the major 
constituent of bile, solubilize dietary lipids. Solubilization increases the 
effective surface area of lipids with two consequences: (1) more surface area 
is exposed to the digestive action of lipases and (2) lipids are more readily 
absorbed by the intestine. Bile salts are also the major breakdown products 
of cholesterol. The bile salts glycocholate, the primary bile salt, and tauro- 
cholate are shown in Figure 26.24. 


FIGURE 26.24 Synthesis of bile salts. 
The OH groups in red are added to 
cholesterol, as are the groups shown 

in blue. 
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e In addition to bile salts, bile is composed of cholesterol, phospho- 

lipids, and the breakdown products of heme, bilirubin, and biliverdin 
(Section 24.4). If too much cholesterol is present in the bile, it will precipi- 
tate to form gallbladder stones (cholelithiasis). These stones may block bile 
secretion and inflame the gallbladder, a condition called cholecystitis. 
Symptoms include pain in the upper right abdomen, especially after a fatty 
meal, and nausea. If need be, the gall bladder is removed, and bile flows 
from the liver through the bile duct directly into the intestine. 
Steroid hormones. Cholesterol is the precursor of the five major classes 
of steroid hormones: progestagens, glucocorticoids, mineralocorticoids, 
androgens, and estrogens (Figure 26.25). These hormones are powerful 
signal molecules that regulate a host of organismal functions. Progesterone, 
a progestogen, prepares the lining of the uterus for the implantation of an 
ovum. Progesterone is also essential for the maintenance of pregnancy by 
preventing premature uterine contractions. Androgens (such as testosterone) 
are responsible for the development of male secondary sex 
characteristics, whereas estrogens (such as estradiol) are Cholesterol (C57) 
required for the development of female secondary sex 
characteristics. Estrogens, along with progesterone, also Vv 
participate in the ovarian cycle. Glucocorticoids (such as Pregnenolone (C,,) 
cortisol) promote gluconeogenesis and glycogen synthesis, 
enhance the degradation of fat and protein, and inhibit the ¥ 
inflammatory response. They enable animals to respond to Progestogens (C;) 
stress; indeed, the absence of glucocorticoids can be fatal. 
Mineralocorticoids (primarily aldosterone) act on the distal 
tubules of the kidney to increase the reabsorption of Nat 
and the excretion of K* and H™, which leads to an increase 
in blood volume and blood pressure. The major sites of as 

; Glucocorticoids Androgens 
synthesis of these classes of hormones are the corpus (Co) Winaeaincannenide (Cy) 
luteum, for progestogens; the testes, for androgens; the (Co) 
ovaries, for estrogens; and the adrenal cortex, for glucocor- Vv 
ticoids and mineralocorticoids. Estrogens 
Steroid hormones bind to and activate receptor mole- (Cia) 


cules that serve as transcription factors to regulate gene 
expression (Section 32.2). These small similar molecules 


FIGURE 26.25 Biosynthetic relations of classes of steroid 
hormones and cholesterol. 
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CH; 


Hydroxyl group 


above plane CHz 


HO 
3B-Hydroxy 


CH; 


Hydroxyl group 
below plane 


HO” 


3a-Hydroxy 


CH; 


H 


5B-Hydrogen 
(cis fusion) 


CH; 


FIGURE 26.26 Cholesterol 
carbon numbering. The 
numbering scheme for the 
carbon atoms in cholesterol 
and other steroids. 


are able to have greatly differing effects because the slight structural differ- 
ences among them allow interactions with specific receptor molecules. 


Letters identify the steroid rings and numbers identify the 
carbon atoms 


Carbon atoms in steroids are numbered, as shown for cholesterol in 
Figure 26.26. The rings in steroids are denoted by the letters A, B, C, 
and D. Cholesterol contains two angular methyl groups: the C-19 methyl 
group is attached to C-10, and the C-18 methyl group is attached to C-13. 
The C-18 and C-19 methyl groups of cholesterol lie above the plane con- 
taining the four rings. A substituent that is above the plane is termed 
B oriented, whereas a substituent that is below the plane is @ oriented. 

If a hydrogen atom is attached to C-5, it can be either @ or B oriented. 
The A and B steroid rings are fused in a trans conformation if the C-5 
hydrogen is @ oriented, and cis if it is B oriented. The absence of a Greek 
letter for the C-5 hydrogen atom on the steroid nucleus implies a trans 
fusion. The C-5 hydrogen atom is a oriented in all steroid hormones that 
contain a hydrogen atom in that position. In contrast, bile salts have a 
B-oriented hydrogen atom at C-5. Thus, a cis fusion is characteristic of the 
bile salts, whereas a trans fusion is characteristic of all steroid hormones that 
possess a hydrogen atom at C-5. A trans fusion yields a nearly planar struc- 
ture, whereas a cis fusion gives a buckled structure. 


AQ H 
5a-Hydrogen H 
(trans fusion) 


Steroids are hydroxylated by cytochrome P450 monooxygenases 
that use NADPH and O, 


The addition of OH groups plays an important role in the synthesis of choles- 
terol from squalene and in the conversion of cholesterol into steroid hormones 
and bile salts. All these hydroxylations require NADPH and O,. The oxygen 
atom of the incorporated hydroxyl group comes from O, rather than from 
HO. Whereas one oxygen atom of the O, molecule goes into the substrate, 
the other is reduced to water. The enzymes catalyzing these reactions are called 
monooxygenases (or mixed-function oxygenases). Recall that a monooxygenase 
also participates in the hydroxylation of aromatic amino acids (Section 23.5). 


RH + O, + NADPH + H* — ROH + H,O + NADP* 


Hydroxylation requires the activation of oxygen. In the synthesis of steroid 
hormones and bile salts, activation is accomplished by members of the cyto- 
chrome P450 family, a family of cytochromes that absorb light maximally at 
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ROH Substrate 


RH © Oo) 
[e) 
ces 
Cees 
H,0 e<— 
6) @ 4 2 FIGURE 26.27 Cytochrome P450 
2Ht ox id mechanism. 1. Substrate binds to the 
enzyme. 2. Adrenodoxin donates an 
RHO” electron, reducing the heme iron. 


NADPH NADp* 3: Oxygen binds to Fe**. 4. Adrenodoxin 
Cre**5 donates a second electron. 5. The bond 
between the oxygen atoms is cleaved, 
® RH Lae a a molecule of water is released, and 
an Fe** =O intermediate is formed. 
Cfe?*S 0, 6. The Fe** =O intermediate forms the 
e hydroxylated product and returns the 


: iron to the Fe’* state, ready for another 
Adrenodoxin a : Y 
reaction cycle. 


450 nm when complexed in vitro with exogenous carbon monoxide. These 
membrane-anchored proteins (~50 kDa) contain a heme prosthetic group. 
Oxygen is activated through its binding to the iron atom in the heme group. 

Because the hydroxylation reactions promoted by P450 enzymes are 
oxidation reactions, it is at first glance surprising that they also consume the 
reductant NADPH. NADPH transfers its high-potential electrons to a 
flavoprotein, which transfers them, one at a time, to adrenodoxin, a non- 
heme iron protein. Adrenodoxin transfers one electron to reduce the ferric 
(Fe3*) form of P450 to the ferrous (Fe?*) form (Figure 26.27). 

Without the addition of this electron, P450 will not bind oxygen. Recall 
that only the ferrous form (Fe**) of myoglobin and hemoglobin binds oxy- 
gen (Section 7.1). The binding of O, to the heme is followed by the accep- 
tance of a second electron from adrenodoxin. The acceptance of this second 
electron leads to cleavage of the O—O bond. One of the oxygen atoms is 
then protonated and released as water. The remaining oxygen atom forms a 
highly reactive ferryl (Fe++) Fe=O intermediate. This intermediate 
abstracts a hydrogen atom from the substrate RH to form Re. This transient 
free radical captures the OH group from the iron atom to form ROH, the 
hydroxylated product, returning the iron atom to the ferric state. 


The cytochrome P450 system is widespread and performs 
a protective function 


es The cytochrome P450 system, which in mammals is located primar- 
ily in the smooth endoplasmic reticulum of the liver and small intes- 
tine, is also important in the detoxification of foreign substances (xenobiotic 
compounds). For example, the hydroxylation of phenobarbital, a barbitu- 
rate, increases its solubility and facilitates its excretion. Likewise, polycyclic 
aromatic hydrocarbons that are ingested by drinking contaminated water 
are hydroxylated by P450, providing sites for conjugation with highly polar 
units (e.g., glucuronate or sulfate) that markedly increase the solubility of 
the modified aromatic molecule. One of the most relevant functions of the 
cytochrome P450 system to human beings is its role in metabolizing drugs 
such as caffeine and ibuprofen (Chapter 36). Some members of the cyto- 
chrome P450 system also metabolize ethanol (Section 27.6). The duration of 
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action of many medications depends on their rate of inactivation by the 
P450 system. Despite its general protective role in the removal of foreign 
chemicals, the action of the P450 system is not always beneficial. Some of 
the most powerful carcinogens are generated from harmless compounds by the 
P450 system in vivo in the process of metabolic activation (Figure 28.34). In 
plants, the cytochrome P450 system plays a role in the synthesis of toxic 
compounds as well as the pigments of flowers. 


Earlier (Section 9.1), we examined the use of protease inhibitors to 

treat HIV infection. These inhibitors, in conjunction with inhibitors 
of other HIV enzymes, have drastically reduced deaths due to AIDS. The 
effectiveness of these drugs is sometimes compromised because P450 
enzymes, such as cytochrome P450-3A4, inactivate them. The drug ritona- 
vir was developed in the search for more effective inhibitors. Interestingly, 
ritonavir, although a protease inhibitor, is also a potent inhibitor of cyto- 
chrome P450-3A4. This serendipitous “side-effect” allows the clinician to 
administer low doses of ritonavir in combination with other protease inhibi- 
tors, effectively increasing the concentrations of these drugs while allowing 
a reduction in dosage and dosing frequency. 


Pregnenolone, a precursor of many other steroids, is formed 
from cholesterol by cleavage of its side chain 


Steroid hormones contain 21 or fewer carbon atoms, whereas cholesterol 
contains 27. Thus, the first stage in the synthesis of steroid hormones is the 
removal of a six-carbon unit from the side chain of cholesterol to form preg- 
nenolone. The side chain of cholesterol is hydroxylated at C-20 and then at 
C-22, and the bond between these carbon atoms is subsequently cleaved by 
desmolase. Three molecules of NADPH and three molecules of O, are con- 
sumed in this six-electron oxidation. 


OH 
CH 3 H 3C Oo 
CHz 
Cholesterol 20«,228-Dihydroxycholesterol Pregnenolone 


Progesterone and corticosteroids are synthesized from pregnenolone 


Progesterone is synthesized from pregnenolone in two steps. The 3-hydroxyl 
group of pregnenolone is oxidized to a 3-keto group, and the A® double bond 
is isomerized to a A+ double bond (Figure 26.28). Cortisol, the major gluco- 
corticoid, is synthesized from progesterone by hydroxylations at C-11, C-17, 
and C-21; C-17 must be hydroxylated before C-21 is hydroxylated, whereas 
C-11 can be hydroxylated at any stage. The enzymes catalyzing these hydrox- 
ylations are highly specific. The initial step in the synthesis of aldosterone, 
the major mineralocorticoid, is the hydroxylation of progesterone at C-21. 
The resulting deoxycorticosterone is hydroxylated at C-11. The oxidation 
of the C-18 angular methyl group to an aldehyde then yields aldosterone. 


Androgens and estrogens are synthesized from pregnenolone 


Androgens and estrogens also are synthesized from pregnenolone through 
the intermediate progesterone. Androgens contain 19 carbon atoms. The 
synthesis of androgens starts with the hydroxylation of progesterone at 


Hs O 
CH; 
CH; CH; 
e 
H O 


Pregnenolone Progesterone 
21 
HOH) 
HO CH; 
CH; 
FIGURE 26.28 Pathways for the formation of 10) 
progesterone, cortisol, and aldosterone. Corticosterone 


C-17 (Figure 26.29). The side chain consisting of C-20 and C-21 is then 
cleaved to yield androstenedione, an androgen. Testosterone, another andro- 
gen, 1s formed by the reduction of the 17-keto group of androstenedione. 
‘Testosterone, through its actions in the brain, is paramount in the develop- 
ment of male sexual behavior. It is also important for the maintenance of the 
testes and the development of muscle mass. Owing to the latter activity, 
testosterone is referred to as an anabolic steroid. Testosterone is reduced by 
5a-reductase to yield dihydrotestosterone (DHT), a powerful embryonic 
androgen that instigates the development and differentiation of the male 
phenotype (Problems 21 and 24). 


OH 
CHz 
CHz 
NADPH + H* = NADPt 
Sf ‘ 
5a-Reductase 
Oo H 
H 
Testosterone 5a-Dihydrotestosetrone 


Estrogens are synthesized from androgens by the loss of the C-19 angular 
methyl group and the formation of an aromatic A ring. Estrone, an estrogen, 
is derived from androstenedione, whereas estradiol, the biologically most 
potent estrogen, is formed from testosterone. Estradiol can also be formed 
from estrone. The formation of the aromatic A ring 1s catalyzed by the P450 
enzyme aromatase. 


ee Because breast and ovarian cancers frequently depend on estrogens for 

growth, aromatase inhibitors are often used as a treatment for these can- 
cers. Anastrozole is a competitive inhibitor of the enzyme, whereas exemestane 
is a suicide inhibitor that covalently modifies and inactivates the enzyme. 


Cortisol 


Aldosterone 


NC CN 
H3C CH 
CH3 CH3 


Anastrozole 


CH2 


Exemestane 
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FIGURE 26.29 Pathways for the formation 


of androgens and estrogens. Estrone Estradiol 


Vitamin D is derived from cholesterol by the ring-splitting 

activity of light 

Cholesterol is also the precursor of vitamin D, which plays an essential role in 
the control of calcium and phosphorus metabolism. 7-Dehydrocholesterol (pro- 
vitamin D,) is photolyzed by the ultraviolet light of sunlight to previtamin D,, 
which spontaneously isomerizes to vitamin D, (Figure 26.30). Vitamin D, 


CH; 


Ultraviolet 
light 


HO 
7-Dehydrocholesterol 


CH; 


FIGURE 26.30 Vitamin 

D synthesis. The pathway 
for the conversion of 
7-dehydrocholesterol into 
vitamin D3 and then into 
calcitriol, the active 
hormone. 
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Calcitriol Vitamin D; 
(1,25-Dihydroxycholecalciferol) (Cholecalciferol) 


(cholecalciferol) is converted into calcitriol (1,25-dihydroxycholecalciferol), 
the active hormone, by hydroxylation reactions in the liver and kidneys. 
Although not a steroid, vitamin D acts in an analogous fashion. It binds to 
a receptor, structurally similar to the steroid receptors, to form a complex 
that functions as a transcription factor, regulating gene expression. 


e Vitamin D deficiency in childhood produces rickets, a disease charac- 

terized by inadequate calcification of cartilage and bone. Rickets was 
so common in seventeenth-century England that it was called the “chil- 
dren’s disease of the English.” The 7-dehydrocholesterol in the skin of these 
children was not photolyzed to previtamin D,, because there was little 
sunlight for many months of the year. Furthermore, their diets provided 
little vitamin D, because most naturally occurring foods have a low content 
of this vitamin. Fish-liver oils are a notable exception. Cod-liver oil, 
abhorred by generations of children because of its unpleasant taste, was 
used in the past as a rich source of vitamin D. Today, the most reliable 
dietary sources of vitamin D are fortified foods. Milk, for example, is forti- 
fied to a level of 400 international units per quart (10 wg per quart). The 
recommended daily intake of vitamin D is 200 international units until 
age 50, after which it increases with age. In adults, vitamin D deficiency 
leads to softening and weakening of bones, a condition called osteomalacia. 
The occurrence of osteomalacia in Muslim women who are clothed so that 
only their eyes are exposed to sunlight is a striking reminder that vitamin D 
is needed by adults as well as by children. 

Research over the past few years indicates that vitamin D may play a 
much larger biochemical role than simply the regulation of bone metabolism. 
Muscle seems to be a target of vitamin D action. In muscle, vitamin D 
appears to affect a number of biochemical processes with the net effect being 
enhanced muscle performance. Studies also suggest that vitamin D prevents 
cardiovascular disease, reduces the incidence of a variety of cancers, and pro- 
tects against autoimmune diseases including diabetes. Moreover, vitamin D 
deficiency appears to be more common than previously thought. In the 
United States, 75% of Blacks and many Hispanics and Asians have insuffi- 
cient blood levels of vitamin D. This recent research on vitamin D shows 
again the dynamic nature of biochemical investigations. Vitamin D, a chemi- 
cal whose biochemical role was believed to be well established, now offers 
new frontiers of biomedical research. 


SUMMARY 


26.1 Phosphatidate Is a Common Intermediate in the Synthesis of 

Phospholipids and Triacylglycerols 

Phosphatidate is formed by successive acylations of glycerol 3- 
phosphate by acyl CoA. Hydrolysis of its phosphoryl group fol- 
lowed by acylation yields a triacylglycerol. CDP-diacylglycerol, the 
activated intermediate in the de novo synthesis of several phospho- 
lipids, is formed from phosphatidate and CTP. The activated phos- 
phatidyl unit is then transferred to the hydroxyl group of a polar 
alcohol, such as inositol, to form a phospholipid such as phospha- 
tidylinositol. In mammals, phosphatidylethanolamine is formed by 
CDP-ethanolamine and diacylglycerol. Phosphatidylethanolamine 
is methylated by S-adenosylmethionine to form phosphatidylcho- 
line. In mammals, this phosphoglyceride can also be synthesized by 
a pathway that utilizes dietary choline. CDP-choline is the activated 
intermediate in this route. 
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26.2 


26.3 


26.4 


Sphingolipids are synthesized from ceramide, which is formed 
by the acylation of sphingosine. Gangliosides are sphingolipids that 
contain an oligosaccharide unit having at least one residue of 
N-acetylneuraminate or a related sialic acid. They are synthesized by 
the step-by-step addition of activated sugars, such as UDP-glucose, 
to ceramide. 


Cholesterol Is Synthesized from Acetyl Coenzyme A in Three Stages 
Cholesterol is a steroid component of animal membranes and a pre- 
cursor of steroid hormones. The committed step in its synthesis is the 
formation of mevalonate from 3-hydroxy-3-methylglutaryl CoA 
(derived from acetyl CoA and acetoacetyl CoA). Mevalonate is con- 
verted into isopentenyl pyrophosphate (C,), which condenses with its 
isomer, dimethylallyl pyrophosphate (C;), to form geranyl pyrophos- 
phate (C,,)). The addition of a second molecule of isopentenyl pyro- 
phosphate yields farnesyl pyrophosphate (C,.), which condenses with 
itself to form squalene (C,,). This intermediate cyclizes to lanosterol 
(C4), which is modified to yield cholesterol (C,,). 


The Complex Regulation of Cholesterol Biosynthesis 

Takes Place at Several Levels 

In the liver, cholesterol synthesis is regulated by changes in the 
amount and activity of 3-hydroxy-3-methylglutaryl CoA reductase. 
Transcription of the gene, translation of the mRNA, and degradation 
of the enzyme are stringently controlled. In addition, the activity of 
the reductase is regulated by phosphorylation. 

Triacylglycerols exported by the intestine are carried by chylomi- 
crons and then hydrolyzed by lipases lining the capillaries of target 
tissues. Cholesterol and other lipids in excess of those needed by the 
liver are exported in the form of very low density lipoprotein. After 
delivering its content of triacylglycerols to adipose tissue and other 
peripheral tissue, VLDL is converted into intermediate-density lipo- 
protein and then into low-density lipoprotein. IDL and LDL carry 
cholesteryl esters, primarily cholesteryl linoleate. Liver and peripheral 
tissue cells take up LDL by receptor-mediated endocytosis. The LDL 
receptor, a protein spanning the plasma membrane of the target cell, 
binds LDL and mediates its entry into the cell. Absence of the LDL 
receptor in the homozygous form of familial hypercholesterolemia 
leads to a markedly elevated plasma level of LDL cholesterol and the 
deposition of cholesterol on blood-vessel walls, which in turn may 
result in childhood heart attacks. High-density lipoproteins transport 
cholesterol from the peripheral tissues to the liver. 


Important Derivatives of Cholesterol Include Bile 
Salts and Steroid Hormones 
In addition to bile salts, which facilitate the digestion of lipids, five 
major classes of steroid hormones are derived from cholesterol: pro- 
gestogens, glucocorticoids, mineralocorticoids, androgens, and estro- 
gens. Hydroxylations by P450 monooxygenases that use NADPH and 
O, play an important role in the synthesis of steroid hormones and bile 
salts from cholesterol. P450 enzymes, a large superfamily, also partici- 
pate in the detoxification of drugs and other foreign substances. 
Pregnenolone (C,,) is an essential intermediate in the synthesis of 
steroids. This steroid is formed by scission of the side chain of choles- 
terol. Progesterone (C,,), synthesized from pregnenolone, is the precur- 
sor of cortisol and aldosterone. Hydroxylation of progesterone and 
cleavage of its side chain yields androstenedione, an androgen (C,,). 


Estrogens (C,.) are synthesized from androgens by the loss of an angu- 797 


lar methyl group and the formation of an aromatic A ring. Vitamin D, 


Problems 


which is important in the control of calctum and phosphorus metabo- 
lism, is formed from a derivative of cholesterol by the action of light. 


KEY TERMS 


phosphatidate (p. 768) 

triacylglycerol (p. 769) 

phospholipid (p. 769) 

cytidine diphosphodiacylglycerol 
(CDP-diacylglycerol) (p. 769) 

sphingolipid (p. 772) 


(p. 776) 


ceramide (p. 772) 
cerebroside (p. 772) 
ganglioside (p. 772) 


| PROBLEMS 


cholesterol (p. 776) 

mevalonate (p. 776) 

3-hydroxy-3-methylglutaryl CoA 
reductase (HMG-CoA reductase) 


3-isopentenyl pyrophosphate (p. 777) 
sterol regulatory element binding protein 
(SREBP) (p. 779) 


lipoprotein particles (p. 782) 


low-density lipoprotein (LDL) (p. 782) 
high-density lipoprotein (HDL) 

(p. 782) 
receptor-mediated endocytosis (p. 784) 
reverse cholesterol transport (p. 787) 
bile salt (p. 788) 
steroid hormone (p. 789) 


cytochrome P450 monooxygenase 
(p. 790) 


1. Different roles. Describe the roles of glycerol 3-phosphate, 
phosphatidate, and diacylglycerol in triacylglycerol and 
phospholipid synthesis. 


2. Needed supplies. How is the glycerol 3-phosphate 
required for phosphatidate synthesis generated? 


3. Making fat. Write a balanced equation for the synthesis 
of a triacylglycerol, starting from glycerol and fatty acids. 


4. Making a phospholipid. Write a balanced equation for the 
synthesis of phosphatidylethanolamine by the de novo path- 
way, starting from ethanolamine, glycerol, and fatty acids. 


5. ATP needs. How many _ high-phosphoryl-transfer- 
potential molecules are required to synthesize phosphati- 
dylethanolamine from ethanolamine and diacylglycerol? 
Assume that the ethanolamine is the activated component. 


6. Identifying differences. Differentiate among sphingomy- 
elin, a cerebroside, and a ganglioside. 


7. Let’s count the ways. There may be 50 ways to leave your 
lover, but, in principle, there are only three ways to make a 
glycerol-based phospholipid. Describe the three pathways. 


8. Activated donors. What is the activated reactant in each 
of the following biosyntheses? 


(a) Phosphatidylinositol from inositol 

(b) Phosphatidylethanolamine from ethanolamine 

(c) Ceramide from sphingosine 

(d) Sphingomyelin from ceramide 

(e) Cerebroside from ceramide 

(f) Ganglioside G),, from ganglioside Gy, 

(g) Farnesyl pyrophosphate from geranyl pyrophosphate 


9. No DAG, no TAG. What would be the effect of a mutation 
that decreased the activity of phosphatidic acid phosphatase? 


10. Like Wilbur and Orville. Match each term with its 
description. 


(a) Phosphatidate_ 1. Glycerol-based 
(b) Triacylglycerol membrane lipid 
(c) Phospholipid 2. Product of the committed 
(d) Sphingolipid step in cholesterol 
(e) Cerebroside____ synthesis 
(f) Ganglioside___ 3. Ceramide with either 
(g) Cholesterol glucose or galactose 
(h) Mevalonate___ attached 
(i) Lipoprotein particle___— 4. Storage form of fatty 
(j) Steroid hormone ____ acids 

5. Squalene is a precursor 


to this molecule 

. Transports cholesterol 
and lipids 

. Derived from cholesterol 

. Precursor to both 
phospholipids and 
triacylglycerols 

. Formed from ceramide 
by the attachment of 
phosphocholine 

10. Ceramide with multiple 

carbohydrates attached 


11. The Law of Three Stages. What are the three stages 
required for the synthesis of cholesterol? 


nD 


oo NI 


No) 


12. Many regulations to follow. Outline the mechanisms of 
the regulation of cholesterol biosynthesis. 


13. Telltale labels. What is the distribution of isotopic 
labeling in cholesterol synthesized from each of the follow- 
ing precursors? 

(a) Mevalonate labeled with !*C in its carboxyl carbon atom 


(b) Malonyl CoA labeled with '4C in its carboxyl carbon atom 
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14. Too much, too soon. What is familial hypercholesterol- 
emia and what are its causes? 


15. Familial hypercholesterolemia. Several classes of LDL- 
receptor mutations have been identified as causes of this 
disease. Suppose that you have been given cells from 
patients with different mutations, an antibody specific for 
the LDL receptor that can be seen with an electron micro- 
scope, and access to an electron microscope. What differ- 
ences in antibody distribution might you expect to find in 
the cells from different patients? 


16. Breakfast conversation. You and a friend are eating 
breakfast together. While eating, your friend is reading the 
back of her cereal box and comes across the following state- 
ment: “Cholesterol plays beneficial roles in your body, 
making cells, hormones, and tissues.” Knowing that you are 
taking biochemistry, she asks if the statement makes sense. 
What do you reply? 


17. A good thing. What are statins? What is their pharma- 
cological function? 


18. Too much of a good thing. Would the development of a 
“super statin” that inhibited all HMG CoA reductase 
activity be a useful drug? Explain. 


19. RNA editing. A shortened version (apo B-48) of apoli- 
poprotein B is formed by the intestine, whereas the full- 
length protein (apo B-100) is synthesized by the liver. A 
glutamine codon (CAA) is changed into a stop codon. 
Propose a simple mechanism for this change. 


20. A means of entry. Describe the process of receptor- 
mediated endocytosis by using LDL as an example. 


21. Inspiration for drug design. Some actions of androgens 
are mediated by dihydrotestosterone, which is formed by 
the reduction of testosterone. This finishing touch is cata- 
lyzed by an NADPH-dependent 5a-reductase (p. 793). 
Chromosomal XY males with a genetic deficiency of this 
reductase are born with a male internal urogenital tract but 
predominantly female external genitalia. These people are 
usually reared as girls. At puberty, they masculinize because 
the testosterone level rises. The testes of these reductase- 
deficient men are normal, whereas their prostate glands 
remain small. How might this information be used to design 
a drug to treat benign prostatic hypertrophy, a common con- 
sequence of the normal aging process in men? A majority of 
men older than age 55 have some degree of prostatic 
enlargement, which often leads to urinary obstruction. 


22. Drug idiosyncrasies. Debrisoquine, a B-adrenergic block- 
ing agent, has been used to treat hypertension. The optimal 
dose varies greatly (20-400 mg daily) in a population of 
patients. The urine of most patients taking the drug contains 
a high level of 4-hydroxydebrisoquine. However, those most 
sensitive to the drug (about 8% of the group studied) excrete 


debrisoquine and very little of the 4-hydroxy derivative. 
Propose a molecular basis for this drug idiosyncrasy. Why 
should caution be exercised in giving other drugs to patients 
who are very sensitive to debrisoquine? 


Debrisoquine 


23. Removal of odorants. Many odorant molecules are 
highly hydrophobic and concentrate within the olfactory 
epithelium. They would give a persistent signal indepen- 
dent of their concentration in the environment if they were 
not rapidly modified. Propose a mechanism for converting 
hydrophobic odorants into water-soluble derivatives that 
can be rapidly eliminated. 


24. Development difficulties. Propecia (finasteride) is a syn- 
thetic steroid that functions as a competitive and specific 
inhibitor of 5a-reductase (p. 793), the enzyme responsible 
for the synthesis of dihydrotestosterone from testosterone. 


Finasteride 


N 
H 


In addition to its use as a treatment for benign prostatic 
hypertrophy (Problem 21), finasteride is also widely used to 
retard the development of male pattern hair loss. Pregnant 
women are advised to avoid handling this drug. Why is it 
vitally important that pregnant women avoid contact with 
Propecia? 


25. Life-style consequences. Human beings and the plant 
Arabidopsis evolved from the same distant ancestor possess- 
ing a small number of cytochrome P450 genes. Human 
beings have approximately 50 such genes, whereas 
Arabidopsis has more than 250 of them. Propose a role for 
the large number of P450 isozymes in plants. 


26. Personalized medicine. The cytochrome P450 system 
metabolizes many medicinally useful drugs. Although all 
human beings have the same number of P450 genes, indi- 
vidual polymorphisms exist that alter the specificity and 
efficiency of the proteins encoded by the genes. How could 
knowledge of individual polymorphisms be useful clinically? 


27. Honeybee crisis. In 2006, there was a sudden, unex- 
plained die-off of honeybee colonies throughout the United 
States. The die-off was economically significant because 
one-third of the human diet comes from insect-pollinated 
plants, and honeybees are responsible for 80% of the polli- 
nation. In October of 2006, the sequence of the honeybee 


genome was reported. Interestingly, the genome was found 
to contain far fewer cytochrome P450 genes than do the 
genomes of other insects. Suggest how the die-off and the 
paucity of P450 genes may be related. 


28. Missing enzyme. Congenital adrenal hyperplasia is a life- 
threatening condition that results from a deficiency in the 
P450 enzyme steroid 21-hydroxylase. This enzyme cata- 
lyzes the first step in the conversion of progesterone into 
cortisol and aldosterone (Figures 26.28 and 26.29). A char- 
acteristic of congenital adrenal hyperplasia is an increase in 
sex hormone production. Explain why this is the case. 


29. Let the sun shine in. At a biochemical level, vitamin D 
functions like a steroid hormone (Chapter 31). Therefore, it 
is sometimes referred to as an honorary steroid. Why is 
vitamin D not an actual steroid? 


Mechanism Problems 


30. An interfering phosphate. In the course of the overall 
reaction catalyzed by HMG-CoA reductase, a histidine 
residue protonates a coenzyme A thiolate, CoA— 5’, gener- 
ated in an earlier step. 


-NH 


CoA—S™ ite 
Wh ) 
His Ser 


NH 
CoA—SH LON os 


A nearby serine residue can be phosphorylated by AMP- 
dependent kinase, which results in a loss of activity. Propose 
an explanation for why phosphorylation of the serine resi- 
due inhibits enzyme activity. 


31. Demethylation. Methyl amines are often demethylated 
by cytochrome P450 enzymes. Propose a mechanism for 
the formation of methylamine from dimethylamine cata- 
lyzed by cytochrome P450. What is the other product? 


Chapter Integration Problems 


32. Similarities. Compare the role of CTP in phosphoglyc- 
eride synthesis with the role of UTP in glycogen synthesis. 


33. Hold on tight or you might be thrown to the cytoplasm. 
Many proteins are modified by the covalent attachment of 
a farnesyl (C,,;) or a geranylgeranyl (C,,)) unit to the car- 
boxyl-terminal cysteine residue of the protein. Suggest why 
this modification might occur. 


34. Fork in the road. 3-Hydroxy-3-methylglutaryl CoA is on 
the pathway for cholesterol biosynthesis. It is also a component 
of another pathway. Name the pathway. What determines 
which pathway 3-hydroxy-3-methylglutaryl CoA follows? 


799 
Problems 


35. Requires a club membership. How is methionine metab- 
olism related to the synthesis of phosphatidylcholine? 


36. Drug resistance. Dichlorodiphenyltrichloroethane (DDT) 
is a potent insecticide rarely used today because of its effects 
on other forms of life. In insects, DDT disrupts sodium chan- 
nel function, leading to eventual death. Mosquitos have devel- 
oped resistance to DDT and other insecticides that function in 
a similar fashion. Suggest two means by which DDT resis- 
tance might develop. 


37. ATP requirements. Explain how cholesterol synthesis 
depends on the activity of ATP-citrate lyase. 


Data Interpretation and Chapter Integration Problem 


38. Cholesterol feeding. Mice were divided into four groups, 
two of which were fed a normal diet and two of which were 
fed a cholesterol-rich diet. HMG-CoA reductase mRNA 
and protein from liver were then isolated and quantified. 
Graph A shows the results of the mRNA isolation. 
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(a) What is the effect of cholesterol feeding on the amount 
of HMG-CoA reductase mRNA? 


(b) What is the purpose of also isolating the mRNA for the 
protein actin, which is not under the control of the sterol 
regulatory element? 


HMG-CoA reductase protein was isolated by precipitation 
with a monoclonal antibody to HMG-CoA reductase. The 
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amount of HMG-CoA protein in each group is shown in 
graph B. 


(c) What is the effect of the cholesterol diet on the amount 
of HMG-CoA reductase protein? 


(d) Why is this result surprising in light of the results in 
graph A? 


(e) Suggest possible explanations for the results in graph B. 


39. Gaucher disease. Gaucher disease, the most common 
lysosomal storage disease in humans, is caused by muta- 
tions in the gene encoding glucocerebrosidase (GCase), the 
lysosomal enzyme that degrades glucocerebrosides. The 
disease has a variety of symptoms depending on the severity 
of the disease and the person affected. Identical twins can 
display very different levels of severity. Symptoms include 
bone pain, enlarged liver, excessive fatigue, and mental 
retardation. 

In order to better understand the disease, researchers 
undertook a series of experiments to characterize the nature 
of the defect in the enzyme. Cells were obtained from a per- 
son without the disease (control) as well as from a patient 
with the disease (GD). The cells were grown in culture, the 
enzyme was isolated, and enzyme activity of 10 wg of each 
sample was measured (Figure A). 


Control GD 
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[Data from J. Lu, Proc. Natl. Acad. Sci. 
U.S. A. 107:21665-21670, 2012.] 
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(a) What do the results tell about the catalytic activity of 
the enzyme from the GD cells? Why were these results sur- 
prising to the researchers? 


Again, cells were grown in culture and cell extracts were 
made from the control and GD samples. Care was taken 
to insure that the same number of cells were used in both 
extractions. A western blot with antibodies to GCase was 


performed on the cell extracts, with the results shown in 
Figure B. 


Control 


— 


— 
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(B) 
[Data from J. Lu, Proc. Natl. Acad. Sci. U. S. A. 
107:21665-21670, 2012,] 


(b) Provide two possible explanations for the result shown 
in Figure B. Why was it crucial to insure that the extract was 
made from the same number of cells? What was the purpose 
of including the western blot of actin? 


(c). The researchers then measured the amount of mRNA 
for the enzyme in both the control and GD cells. The 
amount of mRNA was identical in the two samples. With 
this information, reinterpret the results shown in Figure B. 


Next, cells from the control and GD individuals were 
grown in the presence and absence of a potent proteasome 
inhibitor. The amount of GCase was determined using 
western blots. The results are shown in Figure C. 
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(9) 

[Data from J. Lu, Proc. Natl. Acad. Sci. 

U.S. A. 107:21665-21670, 2012.] 
(d) Suggest the nature of the defect in the GD enzyme. 
What is the significance of the increase in activity of the 
enzyme from the normal cell observed in the presence of the 
inhibitor? 


The Integration of Metabolism 


Glucose 


Insulin 


The image above shows a detail of runners on a Greek amphora ‘i 
painted in the sixth century B.C. Athletic feats, as well as others as 
seemingly simple as the maintenance of blood-glucose levels, require 
elaborate metabolic integration. The representation at the right shows the 
organs that have essential roles in the metabolic integration that regulates 
blood-glucose levels during exercise and at rest. Insulin and leptin 
(secreted by adipocytes) are two of the hormones that modulate the 
metabolic pathways of organs throughout the body such that adequate 
energy is available to meet the demands of living. [(Left) Copyright © The 
Metropolitan Museum of Art/Art Resource, NY] 


Pancreas 


VV have been examining the biochemistry of metabolism one pathway 
at a time. We have seen how useful energy is extracted from fuels and 
used to power biosynthetic reactions and signal-transduction pathways. In 
Chapters 28 through 30, we will extend our study of biosynthetic reactions 
to the synthesis of nucleic acids and proteins. Before we do that, however, in 
this chapter we will take a step back to examine large-scale biochemical 
interactions that constitute the physiology of the organism. In keeping with 
a central theme of life—energy manipulations—we will look at the regula- 
tion of energy at the organismal level, which boils down to an apparently 
simple but actually quite complex question: At the biochemical level, how 
does an organism know when to eat and when to refrain from eating? The 
ability to maintain adequate but not excessive energy stores is called caloric 
homeostasis or energy homeostasis. 

Next, we will examine a significant perturbation of caloric homeostasis— 
obesity—and how this physiological condition affects insulin action, fre- 
quently resulting in diabetes. We then turn our attention to a biochemical 
consideration of one of the most beneficial activities that humans can 


CHAPTER 


Wad 


rae Intestine 


OUTLINE 
27.1 Caloric Homeostasis Is a Means 
of Regulating Body Weight 


27.2 The Brain Plays a Key Role in 
Caloric Homeostasis 


27.3 Diabetes Is a Common Metabolic 
Disease Often Resulting from 
Obesity 


27.4 Exercise Beneficially Alters the 
Biochemistry of Cells 


27.5 Food Intake and Starvation 
Induce Metabolic Changes 


27.6 Ethanol Alters Energy Metabolism 
in the Liver 


801 


802 


CHAPTER 27 The Integration 
of Metabolism 


engage in—exercise—and see how exercise mitigates the effects of diabetes 
as well as how different forms of exercise use different sources of fuel. 

At the opposite end of the physiological spectrum from obesity and 
overnutrition are fasting and starvation, and we will examine the biochemi- 
cal responses to these challenges. The chapter ends with a consideration of 
another biochemical energy perturbation—excess alcohol consumption. 

We have already encountered instances of organismal energy regulation 
when we considered the actions of insulin and glucagon. Recall that insulin, 
secreted by the B cells of the pancreas, causes glucose to be removed from the 
blood and stimulates the synthesis of glycogen and lipids. Glucagon, secreted 
by the a cells of the pancreas, has effects opposite those of insulin. Glucagon 
increases the level of blood glucose by stimulating glycogen breakdown and 
gluconeogenesis. This chapter introduces two other hormones that play key 
roles in caloric homeostasis. Leptin and adiponectin, secreted by the adipose 
tissue, work in concert with insulin to regulate caloric homeostasis. 


27.1 Caloric Homeostasis Is a Means of 
Regulating Body Weight 


By now in our study of biochemistry, we are well aware of the fact that car- 
bohydrates and lipids are sources of energy. We consume these energy 
sources as foods, convert the energy into ATP, and use the ATP to power 
our lives. Like all energy transformations, our energy consumption and 
expenditure are governed by the laws of thermodynamics. Recall that the 
First Law of Thermodynamics states that energy can neither be created nor 
destroyed. Translated into the practical terms of our diets, 


Energy consumed = energy expended + energy stored 


This simple equation has severe physiological and health implications: 
according to the First Law of Thermodynamics, if we consume more energy 
than we expend, we will become overweight or obese. Obesity is generally 
defined as a body mass index (BMI) of more than 30 kg m~?, whereas over- 
weight is defined as a BMI of more than 25 kg m~? (Figure 27.1). Recall 
that excess fat is stored in adipocytes as triacylglycerols. The number of 
adipocytes becomes fixed in adults, and so obesity results in engorged adi- 
pocytes. Indeed, the cells may increase as much as 1000-fold in size. 

We are all aware that many of us, especially in the developed world, are 
becoming obese or have already attained that state. In the United States, 
obesity has become an epidemic, with nearly 30% of adults classified as such. 
Obesity is identified as a risk factor in a host of pathological conditions includ- 
ing diabetes mellitus, hypertension, and cardiovascular disease (Table 27.1). 
The cause of obesity is quite simple in most cases: more food is consumed 
than is needed, and the excess calories are stored as fat. We will consider the 
biochemical basis of pathologies caused by obesity later in the chapter. 

Before we undertake a biochemical analysis of the results of overcon- 
sumption, let us consider why the obesity epidemic is occurring in the first 
place. There are several possible, overlapping explanations. The first is a 
commonly held view that our bodies are programmed to rapidly store excess 
calories in times of plenty, an evolutionary adaptation from times past when 
humans were not assured of having ample food, as many of us are today. 
Consequently, we store calories as if a fast might begin tomorrow, but no 
such fast arrives. A second possible explanation is that we no longer face the 
risks of predation. Evidence indicates that predation was a common cause of 
death for our ancestors. An obese individual would more likely have been 
culled from a group of our ancestors than would a more nimble, lean indi- 
vidual. As the risk of predation declined, leanness became less beneficial. 


Height in feet and inches (in cm) 


4'g" A'10" 5'0" 5'2" 5'4" 5/6” 5'8" 5'10" 6'0" 62" 6a" 
(142) (147) (152) (157) (163) (168) (173) (178) (183) (188) (193) 


260 (117.9) | 58 54 51 48 45 42 40 37 35 33 32 


250 (113.4) | 56 52 49 46 43 40 38 36 34 32 30 


240 (108.9) | 54 50 47 44 41 39 36 34 33 31 29 


230 (104.3) | 52 | 48 | 45 | 42 | 39 | 37 | 35 | 33 | 31 | 30 | 28 


220 99.8) | 49 | 46 | 43 | 40 | 38 | 36 | 33 | 32 | 30 | 28 | 27 


210 (95.3) | 47 | 44 | 41 | 38 | 36 | 34 | 32 | 30 | 28 | 27 | 26 


200 90.7) | 45 | 42 | 39 | 37 | 34 | 32 | 30 | 29 | 27 | 26 | 24 


190 (86.2) | 43 | 40 | 37 | 35 | 33 | 31 | 29 | 27 | 26 | 24 | 23 


180 (81.6) | 40 | 38 | 35 | 33 | 31 | 29 | 27 | 26 | 24 | 23 | 22 


170 (77.1) (Bee «29 | 27 | 26 eas | 
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80 (36.3) | 18 17 16 15 14 13 12 11 11 10 10 


weight 
>30 | Obese BMI = ae te 
: height 
25-30 | Overweight 
18.5-25 | Normal 
<18.5 | Underweight 


A third possibility, one currently receiving much attention, is that 
calorie-dense highly palatable foods—foods rich in sugar and fats—that are 
readily accessible to most people in the developed world, act as drugs, trig- 
gering the same reward pathways in the brain that are triggered by drugs 
such as cocaine. These reward pathways can be strong enough to override 
appetite-suppressing signals. Fourth, a growing body of research suggests 
that our intestinal microbiome—the bacteria that inhibit our guts—plays a 
significant role in how we process our food. For instance, germ-free mice do 
not become obese, even when given unfettered access to a high-fat diet. 
However, when such mice are exposed to the intestinal flora of obese mice, 
the mice now become obese even on a normal chow diet. Moreover, the 
intestinal microbiome of obese mice triggers an inflammatory response that 
may blunt the effect of signal molecules that normally regulate the desire to 
eat as well as how the mice process the calories they consume. Many of these 
results have been extended to humans. Finally, it is evident that individuals 
respond differently to obesity-inducing environmental conditions and that 


TABLE 27.1 Health consequences of obesity or being overweight 


Coronary heart disease 

Type 2 diabetes 

Cancers (endometrial, breast, colon and others) 

Hypertension (high blood pressure) 

Dyslipidemia (disruption of lipid metabolism, e.g., high cholesterol and triacylglycerols) 
Stroke 

Liver and gallbladder disease 

Sleep apnea and respiratory problems 

Osteoarthritis (degeneration of cartilage and underlying bone at a joint) 

Gynecological problems (abnormal menses, infertility) 


Information from: Centers for Disease Control and Prevention Web site (www.cdc.gov). 
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27.1 Caloric Homeostasis 


FIGURE 27.1 Body mass index (BMI). 
The BMI value for an individual person is a 
reliable indicator of obesity for most people. 
[Data from the Centers for Disease Cortrol.] 
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FIGURE 27.2 Satiation signals. 
Cholecystokinin (CCK) and glucagon-like 
peptide 1 (GLP-1) are signal molecules that 
induce feelings of satiety in the brain. CCK 
is secreted by specialized cells of the small 
intestine in response to a meal and 
activates satiation pathways in the brain. 
GLP-1, secreted by L cells in the intestine, 
also activates satiation pathways in the 
brain and potentiates insulin action in 

the pancreas. [Information from S. C. Wood, 
Cell Metab. 9:489-498, 2009, Fig. 1.] 


this difference has a large genetic component. Various studies have indi- 
cated the heritability of fat mass to be between 30% and 70%, indicating that 
the tendency toward obesity may be highly heritable. 

Regardless of why we may havea propensity to gain weight, this propensity 
can be counteracted behaviorally—by eating less and exercising more. It also 
appears that it is easier to prevent weight gain than it is to lose weight. Clearly, 
caloric homeostasis is a complicated biological phenomenon. Understanding 
this phenomenon will engage biomedical research scientists for some time 
to come. 

As disturbing as the obesity epidemic is, an equally intriguing, almost 
amazing observation is that many people are able to maintain an approxi- 
mately constant weight throughout adult life, despite consuming tons of 
food over a lifetime (Problems 1 and 2). Although will power, exercise, and 
a bathroom scale often play a role in this homeostasis, some biochemical 
signaling must be taking place to achieve this remarkable physiological feat. 


e 27.2 The Brain Plays a Key Role in Caloric 
Homeostasis 


What makes this remarkable balance of energy input and output possible? 
As one might imagine, the answer is complicated, entailing many biochemi- 
cal signals as well as a host of behavioral factors. We will focus on a few key 
biochemical signals, and divide our discussion into two parts: short-term 
signals that are active during a meal and long-term signals that report on the 
overall energy status of the body. These signals originate in the gastrointes- 
tinal tract, the B cells of the pancreas, and fat cells. The primary target of 
these signals is the brain, in particular a groups of neurons in a region of the 
hypothalamus called the arcuate nucleus. 


Signals from the gastrointestinal tract induce feelings of satiety 


Short-term signals relay feelings of satiety from the gut to various regions of 
the brain and thus reduce the urge to eat (Figure 27.2). The best-studied 
short-term signal is cholecystokinin (CCK). Cholecystokinin is actually a 
family of peptide hormones of various lengths (from 8 to 58 amino acids in 
length, depending on posttranslational processing) secreted into the blood 
by cells in the duodenum and jejunum regions of the small intestine as a 


Pancreas 


Increased insulin secretion 
Increased insulin biosynthesis 


Increased satiety 
Decreased food intake 
Decreased body weight 


postprandial satiation signal. The CCK binds to the CCK receptor, a 
G-protein-coupled receptor (Section 14.1) located in various peripheral 
neurons, that relay signals to the brain. This binding initiates a signal- 
transduction pathway in the brain that generates a feeling of satiety. CCK 
also plays an important role in digestion, stimulating the secretion of pan- 
creatic enzymes and bile salts from the gallbladder. 

Another important gut signal is glucagon-like peptide 1 (GLP-1), a hor- 
mone of approximately 30 amino acids in length. GLP-1 is secreted by 
intestinal L cells, hormone-secreting cells located throughout the lining of 
the gastrointestinal tract. GLP-1 has a variety of effects, all apparently 
facilitated by binding to its receptor, another G-protein-coupled receptor. 
Like CCK, GLP-1 induces feelings of satiety that inhibit further eating. 
GLP-1 also potentiates glucose-induced insulin secretion by the 8 cells of 
the pancreas while inhibiting glucagon secretion. 

Although we have examined only two short-term signals, many others 
are believed to exist (Table 27.2). Most of the short-term signals thus far 
identified are appetite suppressants. Ghrelin, a peptide that is 28 amino 
acids in length and secreted by the stomach, acts on regions of the hypothala- 
mus to stimulate appetite through its receptor, a G-protein-coupled recep- 
tor. Ghrelin secretion increases before a meal and decreases afterward. 


Leptin and insulin regulate long-term control over caloric homeostasis 


‘Two key signal molecules regulate energy homeostasis over the time scale of 
hours or days: leptin, which is secreted by the adipocytes, and insulin, which is 
secreted by the 8 cells of the pancreas. Leptin reports on the status of the triac- 
ylglycerol stores, whereas insulin reports on the status of glucose in the blood— 
in other words, of carbohydrate availability. We will consider leptin first. 

Adipose tissue was formerly considered an inert depot of triacylglycer- 
ols. However, recent work has shown that adipose tissue is an active endo- 
crine tissue, secreting signal molecules called adipokines, such as leptin, that 
regulate a host of physiological processes. Leptin is secreted by the adipo- 
cytes in direct proportion to the amount of fat present. The more fat in a 
body, the more leptin is secreted. Leptin binding to its receptor throughout 
the body increases the sensitivity of muscle and the liver to insulin, stimu- 
lates B oxidation of fatty acids, and decreases triacylglycerol synthesis. 

Let us consider the effects of leptin in the brain. Leptin binds to its 
receptor, thereby activating a signal transduction pathway. The leptin 
receptor is found in various regions of the brain, but particularly in the arcu- 
ate nucleus of the hypothalamus. There, one population of neurons expresses 
appetite-stimulating (orexigenic) peptides, called neuropeptide Y (NPY) 
and agouti-related peptide (AgRP). Fasting stimulates the production of 
NPY and AgRP owing to the decrease in leptin levels that results from 
diminishing adipose tissue (Figure 27.3A). Leptin, on the other hand, 
inhibits the NPY/AgRP neurons, preventing the release of NPY and AgRP 
and thus repressing the desire to eat (Figure 27.3B). 

The second population of neurons containing leptin receptors expresses 
a large precursor polypeptide, proopiomelanocortin (POMC). In response 
to leptin binding to its receptor on POMC neurons, POMC is proteo- 
lytically processed to yield a variety of signal molecules, one of which, 
melanocyte-stimulating hormone (MSH), is especially important in this 
context. MSH, originally discovered as a stimulator of melanocytes (cells 
that synthesize the pigment melanin), activates appetite-suppressing 
(anorexigenic) neurons and thus inhibits food consumption. Fasting 
inhibits MSH activity and thus stimulates eating. AgRP inhibits MSH 
activity by acting as an antagonist, binding to the MSH receptor but 


TABLE 27.2 Gastrointestinal peptides 
that regulate food intake 


Appetite-suppressing signals 
Cholecystokinin 
Glucagon-like peptide 1 
Glucagon-like peptide 2 
Amylin 
Somatostatin 
Bombesin 
Enterostatin 
Apolipoprotein A-[V 
Gastric inhibitory peptide 


Appetite-enhancing peptides 
Ghrelin 


Information from: M. H. Stipanuk, ed., 


Biochemical, Physiological, Molecular Aspects of 
Human Nutrition, 2d ed. (Saunders/Elsevier, 2006), 


p. 627, Box 22-1. 


805 


(A) Decrease in fat cell mass 


y 


Decrease in leptin expression 


y 


Decrease in leptin action in hypothalamus 


4 “ 


Activation of NPY and Inhibition of 
AgRP-producing POMC-producing 
neurons neurons 


y y 


Increase in NPY and Decrease 
AgRP expression in MSH 
and release expression 


N 4 


Increase in food intake 


(B) Increase in fat cell mass 


Increase in leptin expression 
" 
v 
Increase in leptin action in hypothalamus 
Inhibition of NPY Activation of 


and AgRP-producing POMC-producing 
neurons neurons 


y y 


Decrease in NPY and Increase in 
AgRP expression MSH 
and release expression 


' 4 


Decrease in food intake 


FIGURE 27.3 The effects of leptin in the 
brain. Leptin is an adipokine secreted by 
adipose tissue in direct relation to fat mass. 
(A) When leptin levels fall, as in fasting, 
appetite-enhancing neuropeptides NPY and 
AgRP are secreted, whereas the secretion 
of appetite-suppressing signals such as 
MSH is inhibited. (B) When fat mass 
increases, leptin inhibits NPY and AgRP 
secretion while stimulating the release 

of appetite-suppressing hormone 

MSH. [Information from M. H. Stipanuk, 
Biochemical, Physiological, & Molecular 
Aspects of Human Nutrition, 2d ed. 
(Saunders/Elsevier, 2006), Fig. 22-2.] 
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failing to activate the receptor (Figure 27.3). Thus, the net effect of leptin 
binding to its receptor is the initiation of a complex signal-transduction 
pathway that ultimately curtails food intake. 

Insulin receptors are also present in the hypothalamus, although the 
mechanism of insulin action in the brain is less clear than that of leptin. 
Insulin appears to inhibit NPY/AgRP-producing neurons, thus inhibiting 
food consumption. 


Leptin is one of several hormones secreted by adipose tissue 


Leptin was the first adipokine discovered because of the dramatic effects of 
its absence. Researchers discovered a strain of mice called ob/ob mice, which 
lack leptin and, as a result, are extremely obese (refer to Chapter 22 opening 
figure on p. 643). These mice display hyperphagia (overeating), hyperlipid- 
emia (accumulation of triacylglycerols in muscle and liver), and an insensi- 
tivity to insulin. Since the discovery of leptin, other adipokines have been 
detected. For instance, adiponectin is another signal molecule produced by 
the adipocytes. Secretion of adiponectin falls in direct proportion to increases 
in fat mass. A key function of adiponectin appears to be increasing the sensi- 
tivity of the organism to insulin. Both leptin and adiponectin exert their 
effects through the key regulatory enzyme, AMP-activated protein kinase 
(AMPK). Recall that this enzyme is active when AMP levels are elevated 
and ATP levels are diminished, and this activation leads to a decrease in 
anabolism and an increase in catabolism, most notably an increase in fatty 
acid oxidation (Section 22.6). In insulin-resistant obese animals such as the 
ob/ob mice, leptin levels increase while those of adiponectin decrease. 

Adipocytes also produce two hormones, RBP4 (originally discovered as 
a retinol binding protein) and resistin, that promote insulin resistance. 
Although it is unclear why adipocytes secrete hormones that facilitate insu- 
lin resistance, a pathological condition, we can speculate on the reason. 
These signal molecules may help to fine-tune the actions of leptin and 
adiponectin or perhaps to act as “brakes” on the action of leptin and adipo- 
nectin to prevent hypoglycemia in the fasted state. Some evidence indicates 
that enlarged adipocytes that result from obesity may secrete higher levels 
of insulin-antagonizing hormones and thus contribute to insulin resistance. 
Resistin has recently been implicated as a causal factor in the increased inci- 
dence of cardiovascular disease in obese individuals. 


Leptin resistance may be a contributing factor to obesity 


If leptin is produced in proportion to body-fat mass and leptin inhibits eat- 
ing, why do people become obese? Obese people, in most cases, have both 
functioning leptin receptors and a high blood concentration of leptin. The 
failure to respond to the anorexigenic effects of leptin is called leptin resis- 
tance. What is the basis of leptin resistance? 

As for most questions in the exciting area of energy homeostasis, the 
answer is not well worked out, but recent evidence suggests that a group of 
proteins called suppressors of cytokine signaling (SOCS) may take part. These 
proteins fine-tune some hormonal systems by inhibiting receptor action. 
SOCS proteins inhibit receptor signaling by a number of means. Consider, 
for example, the effect of SOCS proteins on the insulin receptor. Recall that 
insulin stimulates the autophosphorylation of tyrosine residues on the insulin 
receptor, which in turn phosphorylates [RS-1, initiating the insulin-signaling 
pathway (Figure 27.4A). SOCS proteins bind to phosphorylated tyrosine 
residues on receptors or other members of the signal-transduction pathway, 
thereby disrupting signal flow and thus altering the cell’s biochemical activity 
(Figure 27.4B). In other cases, the binding of SOCS proteins to components 
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FIGURE 27.4 Suppressors of cytokine signaling (SOCS) regulate receptor function. 

(A) Insulin binding results in phosphorylation of the receptor and subsequent phosphorylation 
of IRS-1. These processes initiate the insulin-signaling pathway. (B) SOCS proteins disrupt 
interactions of components of the insulin-signaling pathway by binding phosphorylated 
proteins and thereby inhibiting the pathway. The binding of a signal component by SOCS 
results in proteasomal degradation in some cases. (IRS-1, insulin-receptor substrate 1; SOCS, 
suppressor of cytokine signaling.) 


of the signal-transduction pathway may also enhance proteolytic degradation 
of these components by the proteasome (Section 23.2). Evidence in support 
of a role for SOCS in leptin resistance comes from mice that have had SOCS 
selectively deleted from POMC-expressing neurons. These mice display an 
enhanced sensitivity to leptin and are resistant to weight gain even when fed 
a high-fat diet. The reason why the activity of SOCS proteins increases, lead- 
ing to leptin resistance, remains to be determined. 


Dieting is used to combat obesity 


Given the obesity epidemic that we currently face and its associated disorders, 
much attention has been focused on determining the most effective weight-loss 
diet. In general, two categories of diet try to help us control our caloric intake— 
low-carbohydrate diets and low-fat diets. Low-carbohydrate diets usually 
emphasize protein consumption. Although studies of the effects of diets on 
humans are immensely complex, data are beginning to accumulate suggesting 
that low-carbohydrate—high-protein diets may be the most effective for losing 
weight. The exact reasons are not clear, but there are two common hypotheses. 
First, proteins seem to induce a feeling of satiation more effectively than do fats 
or carbohydrates. Second, proteins require more energy to digest than do fats or 
carbohydrates, and the increased energy expenditure contributes to weight loss. 
For instance, some studies show that a diet that is 30% protein requires almost 
30% more energy to digest than that required by a diet that is 10% protein. 
The mechanisms by which protein-rich diets enhance energy expenditure and 
feelings of satiation remain to be determined. Regardless of the type of diet, the 
adage ‘‘Eat less, exercise more” always applies. 


27.3 Diabetes Is a Common Metabolic Disease 
Often Resulting from Obesity 


Having taken an overview of the regulation of body weight, we now examine 
the biochemical results when regulation fails because of behavior, genetics, 
or a combination of the two. The most common result of such a failure is 
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Diabetes 

Named for the excessive urination in the 
disease. Aretaeus, a Cappadocian physician 
of the second century A.D., wrote: “The 
epithet diabetes has been assigned to the 
disorder, being something like passing of 
water by a siphon.” He perceptively 
characterized diabetes as “being a melting- 
down of the flesh and limbs into urine.” 


Mellitus 

From Latin, meaning “sweetened with 
honey.” Refers to the presence of sugar in 
the urine of patients having the disease. 
Mellitus distinguishes this disease from 
diabetes insipidus, which is caused by 
impaired renal reabsorption of water. 


Insulin 
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obesity, a condition in which excess energy is stored as triacylglycerols. 
Recall that all excess food consumption is ultimately converted into triacyl- 
glycerols. Humans maintain about a day’s worth of glycogen and, after these 
stores have been replenished, excess carbohydrates are converted into fats 
and then into triacylglycerols. Amino acids are not stored at all, and so excess 
amino acids are ultimately converted into fats also. Thus, regardless of the 
type of food consumed, excess consumption results in increased fat stores. 

We begin our consideration of the effects of disruptions in caloric 
homeostasis with diabetes mellitus, a complex disease characterized by 
grossly abnormal fuel usage: glucose is overproduced by the liver and under- 
utilized by other organs. The incidence of diabetes mellitus (usually referred 
to simply as diabetes) is about 5% of the world population. Type 1 diabetes is 
caused by the autoimmune destruction of the insulin-secreting B cells in the 
pancreas and usually begins before age 20. Type 1 diabetes is also referred 
to as insulin-dependent diabetes, meaning that the affected person requires 
the administration of insulin to live. Most diabetics, in contrast, have a nor- 
mal or even higher level of insulin in their blood, but they are unresponsive 
to the hormone, a characteristic called insulin resistance. This form of the 
disease, known as type 2 diabetes, typically arises later in life than does the 
insulin-dependent form. Type 2 diabetes accounts for approximately 90% 
of the diabetes cases throughout the world and is the most common meta- 
bolic disease in the world. In the United States, it is the leading cause of 
blindness, kidney failure, and amputation. Obesity is a significant predispos- 
ing factor for the development of type 2 diabetes. 


Insulin initiates a complex signal-transduction pathway in muscle 


What is the biochemical basis of insulin resistance? How does insulin resis- 
tance lead to failure of the B cells of the pancreas that results in type 2 diabe- 
tes? How does obesity contribute to 
this progression? To answer these 
questions and begin to unravel the 
mysteries of metabolic disorders, let 
us examine the mechanism of action 
of insulin in muscle, the largest tissue 
target of insulin. Indeed, muscle uses 
about 85% of the glucose ingested 
during a meal. 

In a normal cell, insulin binds 
to its receptor, which then autophos- 
phorylates on tyrosine residues, 
with each subunit of the receptor 
phosphorylating its partner. Phos- 
phorylation of the receptor generates 
binding sites for insulin-receptor 
substrates (IRSs), such as IRS-1 
(Figure 27.5). Subsequent phosphor- 
ylation of IRS-1 by the tyrosine 
kinase activity of the insulin receptor 
engages the insulin-signaling path- 
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FIGURE 27.5 Insulin signaling. The binding of insulin results in the cross-phosphorylation 
and activation of the insulin receptor. Phosphorylated sites on the receptor act as binding 
sites for insulin-receptor substrates such as IRS-1. The lipid kinase phosphoinositide 3-kinase 
binds to phosphorylated sites on IRS-1 through its regulatory domain and then converts PIP, 
into PIP. Binding to PIP, activates PIP,-dependent protein kinase, which phosphorylates and 
activates kinases such as Akt. Activated Akt can then diffuse throughout the cell and continue 
the signal-transduction pathway. 


way (1). Phosphorylated IRS-1 binds 
to phosphoinositide 3-kinase (PI3K) 
and activates it. PI3K catalyzes the 
conversion of phosphatidylinositol 
4,5-bisphosphate (PIP,) into phos- 
phatidylinositol 3,4,5-trisphosphate 


(PIP;), a second messenger (2). PIP; activates the phosphatidylinositol- 
dependent protein kinase (PDK) (3), which in turn activates several other 
kinases, most notably Akt (4), also known as protein kinase B (PKB). Akt 
facilitates the translocation of the glucose transporter (GLUT4)-containing 
vesicles to the cell membrane, which leads to a more robust absorption of 
glucose from the blood. Moreover, Akt phosphorylates and inhibits glycogen 
synthase kinase (GSK3). Recall that GSK3 inhibits glycogen synthase 
(Section 21.5). Thus, insulin leads to the absorption of glucose from the 
blood, activation of glycogen synthase, and enhanced glycogen synthesis. 

Like all signal pathways, the insulin-signaling cascade must be capable 
of being turned off. Three different processes contribute to the down- 
regulation of insulin signaling. First, phosphatases deactivate the insulin 
receptor and destroy a key second messenger. Tyrosine phosphatase IB 
removes phosphoryl groups from the receptor, thus inactivating it. The 
second messenger PIP, is inactivated by the phosphatase PTEN (phospha- 
tase and tensin homolog), which dephosphorylates it, forming PIP,, which 
itself has no second-messenger properties. 


Phosphatidylinositol 4,5-bisphosphate (PIP,) 


Second, the IRS protein can be inactivated by phosphorylation on serine 
residues by specific Ser/Thr kinases. These kinases are activated by overnu- 
trition and other stress signals and may play a role in the development of 
insulin resistance. Finally, SOCS proteins, the regulatory proteins discussed 
earlier, interact with the insulin receptor and IRS-1 and apparently facilitate 
their proteolytic degradation by the proteasome complex. 


Metabolic syndrome often precedes type 2 diabetes 


With our knowledge of the key components of energy homeostasis, let us 
begin our investigation of the biochemical basis of insulin resistance and 
type 2 diabetes. Obesity is a contributing factor to the development of insu- 
lin resistance, which is an early development on the path to type 2 diabetes. 
Indeed, a cluster of pathologies—including insulin resistance, hyperglyce- 
mia, dyslipidemia (high blood levels of triacylglycerols, cholesterol, and 
low-density lipoproteins)—often develop together. This clustering, called 
metabolic syndrome, is thought to be a predecessor of type 2 diabetes. 
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FIGURE 27.6 The storage capacity of fat tissue can be 
exceeded in obesity. In caloric excess, the storage capacity 
of adipocytes can be exceeded with deleterious results. The 


A consequence of obesity is that the amount of triacyl- 
glycerols consumed exceeds the adipose tissue’s storage 
capacity. As a result, other tissues begin to accumulate fat, 
most notably liver and muscle (Figure 27.6). For reasons to 
be presented later in the chapter, this accumulation results 
in insulin resistance and ultimately in pancreatic failure. We 
will focus on muscle and the £ cells of the pancreas. 


Excess fatty acids in muscle modify metabolism 


We have seen many times the importance of fats as fuels. 
In regard to obesity, more fats are present than can be pro- 
cessed by muscle. Although the rate of B oxidation 
increases in response to the high concentration of fats, 
mitochondria are not capable of processing all of the fatty 
acids by B oxidation; fatty acids accumulate in the mito- 
Blood vessels chondria and eventually spill over into the cytoplasm. 
Indeed, the inability to process all of the fatty acids results 
in their reincorporation into triacylglycerols and the accu- 
mulation of fat in the cytoplasm. In the cytoplasm, levels 
of diacylglycerol and ceramide (a component of sphingo- 
lipids) also increase. Diacylglycerol is a second messenger 


excess fat accumulates in other tissues, resulting in biochemical that activates protein kinase C (PKC) (Section 11), 
malfunction of the tissues. When the pancreas, muscle, liver, and When active, PKC and other Ser/Thr protein kinases are 
cells lining the blood vessels are affected, metabolic syndrome, capable of phosphorylating IRS and reducing the ability 


a condition that often precedes type 2 diabetes, may result. 
[Information from S. Frdéjd6, H. Vidal, and L. Pirola, Biochim. Biophys. 


Acta 1792:83-92, 2009, Fig. 1.] 
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of IRS to propagate the insulin signal. Saturated and trans 
unsaturated fatty acids may also activate kinases that 
block the insulin signal. Ceramide or its metabolites 
inhibit glucose uptake and glycogen synthesis, apparently by inhibiting PDK 
and Akt (p. 808). The result is a diet-induced insulin resistance (Figure 27.7). 


Insulin resistance in muscle facilitates pancreatic failure 


What is the effect of overnutrition on the pancreas? This question is impor- 
tant because a primary function of the pancreas is to respond to the presence 
of glucose in the blood by secreting insulin, a process referred to as glucose- 
stimulated insulin secretion (GSIS). Indeed, the B cell is a virtual insulin fac- 
tory. Proinsulin mRNA, which encodes proinsulin, the precursor to insulin, 
constitutes 20% of the total mRNA in the pancreas. Proinsulin constitutes 
50% of the total protein synthesized in the pancreas. 

Glucose enters the B cells of the pancreas through the glucose transporter 
GLUT2. Recall that GLUT2 will allow glucose transport only when blood 
glucose is plentiful, ensuring that insulin is secreted only when glucose is 
abundant, such as after a meal (Section 16.2). The B cell metabolizes glucose 
to CO, and H,O in the process of cellular respiration, generating ATP 
(Chapters 16, 17, and 18). The resulting increase in the ATP/ADP ratio 
closes an ATP-sensitive K* channel that, when open, allows potassium to 
flow out of the cell (Figure 27.8). The resulting alteration in the cellular ionic 
environment opens a Ca”* channel. The influx of Ca** causes insulin- 
containing secretory vesicles to fuse with the cell membrane and release 
insulin into the blood. Thus, the increase in energy charge resulting from the 
metabolism of glucose has been translated by the membrane proteins into a 
physiological response—the secretion of insulin and the subsequent removal 
of glucose from the blood. 

What aspect of B cell function ultimately fails as a result of overnutri- 
tion, causing the transition from insulin resistance to full-fledged type 2 
diabetes? Recall that, under normal circumstances, the B cells of the 
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FIGURE 27.7 Excess fat in peripheral tissues can result in insulin insensitivity. 

Excess fat accumulation in peripheral tissues, most notably muscle, can disrupt some signal- 
transduction pathways and inappropriately activate others. In particular, diacylglycerols and 
ceramide activate stress-induced pathways that interfere with insulin signaling, resulting in 
insulin resistance. (Abbreviations: DAG, diacylglycerol; TGs, triacylglycerols; ROS, reactive 
oxygen species; CA1, carnitine acyltransferase 1; GLUT4, glucose transporter 4; IR, insulin 


receptor; ETC, electron-transport chain; TCA, tricarboxylic acid cycle.) 


pancreas synthesize large amounts of proinsulin. The proinsulin folds into 
its three-dimensional structure in the endoplasmic reticulum, is processed 
to insulin, and is subsequently packaged into vesicles for secretion. As insu- 
lin resistance develops in the muscle, the B cells respond by synthesizing yet 
more insulin in a futile attempt to drive insulin action. The ability of the 
endoplasmic reticulum to process all of the proinsulin and insulin becomes 


compromised, a condition known as endoplasmic reticulum 
(ER) stress, and unfolded or misfolded proteins accumu- 
late. ER stress initiates a signal pathway called the unfolded 
protein response (UPR), a pathway intended to save the 
cell. UPR consists of several steps. First, general protein 
synthesis is inhibited so as to prevent more proteins from 
entering the ER. Second, chaperone synthesis is stimu- 
lated. Recall that chaperones are proteins that assist the 
folding of other proteins (Section 2.6). Third, misfolded 
proteins are removed from the ER and are subsequently 
delivered to the proteasome for destruction. Finally, if the 
described response fails to alleviate the ER stress, pro- 
grammed cell death is triggered, which ultimately leads to 
8 cell death and full-fledged type 2 diabetes. 

What are the treatments for type 2 diabetes? Most are 
behavioral in nature. Diabetics are advised to count calo- 
ries, making sure that energy intake does not exceed energy 
output; to consume a diet rich in vegetables, fruits, and 
grains; and to get plenty of aerobic exercise. Note that these 
guidelines are the same as those for healthy living, even for 


FIGURE 27.8 Insulin release is regulated 
by ATP. The metabolism of glucose by 
glycolysis and cellular respiration increases 
the concentration of ATP, which causes an 
ATP-sensitive potassium channel to close. 
The closure of this channel alters the charge 
across the membrane (is) and causes a 
calcium channel to open. The influx of 
calcium causes insulin-containing granules 
to fuse with the plasma membrane, 
releasing insulin into the blood. 
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FIGURE 27.9 Regulation of glycolysis and gluconeogenesis. Phosphofructokinase 
is the key enzyme in the regulation of glycolysis, whereas fructose 1,6-bisphosphatase 
is the principal enzyme controlling the rate of gluconeogenesis. Note the reciprocal 


Gluconeogenesis 


those not suffering from type 2 diabetes. Treatments specific for type 2 dia- 
betes include the monitoring of blood-glucose concentration so that it is 
within the target range (normal is 3.6 to 6.1 mM). For those who are not able 
to maintain proper glucose levels with the behaviors described herein, drug 
treatments are required. The administration of insulin may be necessary 
upon pancreatic failure, and treatment with metformin (Glucophage), which 
activates AMPK, may be effective. AMPK promotes the oxidation of fats 
while inhibiting fat synthesis and storage. It also stimulates glucose uptake 
and storage by muscle while inhibiting gluconeogenesis in the liver. 


Metabolic derangements in type 1 diabetes result from insulin 
insufficiency and glucagon excess 


We now turn to the more-straightforward type 1 diabetes. In type 1 diabetes, 
insulin production is insufficient because of autoimmune destruction of the 
B cells of the pancreas. Consequently, the glucagon/insulin ratio is at higher- 
than-normal levels. In essence, the diabetic person is in biochemical fasting 
mode despite a high concentration of blood glucose. Because insulin is defi- 
cient, the entry of glucose into adipose and muscle cells is impaired. The liver 
becomes stuck in a gluconeogenic and ketogenic state. The gluconeogenic state 
is characterized by excessive production of glucose. The excessive level of glu- 
cagon relative to that of insulin leads to a decrease in the amount of fructose 

2,6-bisphosphate (F-2, 6-BP), which stimu- 


Fructose 1,6-bisphosphate lates glycolysis and inhibits gluconeogenesis in 
the liver. Hence, glycolysis is inhibited and 

H20 Fructose 1,6-bisphosphatase | gluconeogenesis is stimulated because of the 
Activated by citrate opposite effects of F-2, 6-BP on phosphofruc- 

Inhibited by AMP tokinase and fructose-1, 6-bisphosphatase 

Pi Inhibited by F-2,6-BP (Section 16.4; Figure 27.9). Essentially, the 


cells’ response to a lack of insulin amplifies the 


Fructose 6-phosphate amount of glucose in the blood. The high glu- 


cagon/insulin ratio in diabetes also promotes 
glycogen breakdown. Hence, an excessive 
amount of glucose is produced by the liver and 
released into the blood. Glucose is excreted in 


relation between the pathways and the signal molecules. the urine (hence the name mellitus) when its 


concentration in the blood exceeds the reab- 

sorptive capacity of the renal tubules. Water 
accompanies the excreted glucose, and so an untreated diabetic in the acute 
phase of the disease is hungry and thirsty. 

Because carbohydrate utilization is impaired, a lack of insulin leads to the 
uncontrolled breakdown of lipids and proteins, resulting in the ketogenic state. 
Large amounts of acetyl CoA are then produced by B oxidation. However, 
much of the acetyl CoA cannot enter the citric acid cycle, because there is insuf- 
ficient oxaloacetate for the condensation step. Recall that mammals can syn- 
thesize oxaloacetate from pyruvate, a product of glycolysis, but not from acetyl 
CoA; instead, they generate ketone bodies (Section 22.3). A striking feature of 
diabetes is the shift in fuel usage from carbohydrates to fats; glucose, more abun- 
dant than ever, is spurned. In high concentrations, ketone bodies overwhelm the 
kidney’s capacity to maintain acid-base balance. The untreated diabetic can go 
into a coma because of a lowered blood-pH level and dehydration. Interestingly, 
diabetic ketosis is rarely a problem in type 2 diabetes because insulin is active 
enough to prevent excessive lipolysis in liver and adipose tissue. 

What is the treatment for type 1 diabetes? Many of the behaviors recom- 
mended for type 2 diabetes apply to type 1: watching calories, exercising, 
and eating a healthy diet. Likewise, blood-glucose levels must be moni- 
tored. Insulin treatments are required for survival. 


cz 27.4 Exercise Beneficially Alters the Biochemistry 
of Cells 


Skeletal muscle accounts for ~40% of total body mass and ~35% of resting 
metabolic activity. Moreover, skeletal muscle is the largest target tissue for insu- 
lin. Given its biochemical prominence, increased muscle activity—exercise— 
coupled with a healthy diet, is one of the most effective treatments for diabetes 
as well as a host of other pathological conditions. These include coronary dis- 
ease, hypertension, depression, and a variety of cancers. With regard to diabe- 
tes, exercise increases the insulin sensitivity of people who are insulin resistant 
or type 2 diabetics. What is the basis of this beneficial effect? 


Mitochondrial biogenesis is stimulated by muscular activity 


When muscle is stimulated to contract during exercise by receiving nerve 
impulses from motor neurons, calcium is released from the sarcoplasmic reticu- 
lum. Calcium induces muscle contraction, as will be discussed in Chapter 35. 
Recall that calcium is also a potent second messenger and frequently works in 
association with the calcium-binding protein calmodulin (Section 14.1). In its 
capacity as a second messenger, calcium stimulates various calctum-dependent 
enzymes, suchas calmodulin-dependent protein kinase. The calctum-dependent 
enzymes, as well as AMPK, subsequently activate particular transcription- 
factor complexes. As we will see in Chapters 29 and 31, transcription factors are 
proteins that control gene expression. Two patterns of gene expression, in par- 
ticular, change in response to regular exercise (Figure 27.10). Regular exercise 
enhances the production of proteins required for fatty acid metabolism, such as 
the enzymes of B oxidation. Interestingly, fatty acids themselves function as 
signal molecules to activate the transcription of enzymes of fatty acid metabo- 
lism. Additionally, another set of transcription factors activated by the calctum 
signal cascade initiates a signal pathway that increases mitochondrial biogenesis. 
In concert, the increase in fatty acid oxidizing capability and additional mitochon- 
dria allow for the efficient metabolism of fatty acids. Because an excess of fatty 
acids results in insulin resistance, as already discussed, efficient metabolism of 
fatty acids results in an increase in insulin sensitivity. Indeed, muscles of well- 
trained athletes may contain high concentrations of triacylglycerols and still 
maintain exquisite sensitivity to insulin. These altera- 
tions of muscle biochemistry are some of the molecular 
manifestations of the training effect of exercise. 
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FIGURE 27.10 Exercise results in 
mitochondrial biogenesis and enhanced 
fat metabolism. An action potential causes 
Ca?* release from the sarcoplasmic 
reticulum (SR), the muscle-cell equivalent 
of the endoplasmic reticulum. The Ca?*, 
in addition to instigating muscle 
contraction, activates nuclear transcription 
factors that stimulate the expression of 
specific genes. The products of these 
genes, in conjunction with the products 

of mitochondrial genes, are responsible 
for mitochondrial biogenesis. Fatty acids 
activate a different set of genes that 
increase the fatty acid oxidation capability 
of mitochondria. [Information from 

D. A. Hood, J Appl. Physiol. 90:1137- 

1157, 2001, Fig. 2.] 
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In keeping with our theme of energy use under differ- 
ent physiological conditions, we now examine how 
fuels are used in different types of exercise. The fuels 
used in anaerobic exercises—sprinting, for example— 
differ from those used in aerobic exercises—such 
as distance running. The selection of fuels during these 
different forms of exercise illustrates many important 
facets of energy transduction and metabolic integra- 
tion. ATP directly powers myosin, the protein imme- 
diately responsible for converting chemical energy into 
movement (Chapter 35). However, the amount of ATP 
in muscle is small. Hence, the power output and, in 
turn, the velocity of running depend on the rate of ATP 
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production from other fuels. As shown in Table 27.3, 
creatine phosphate (phosphocreatine) can swiftly 
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FIGURE 27.11 Dependency of the velocity 
of running on the duration of the race. The 
values shown are world track records. [Data 
from trackandfieldnews.com.] 
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TABLE 27.3 Fuel sources for muscle contraction 


Maximal rate of Total 

ATP production ~P available 
Fuel source (mmol s~') (mmol) 
Muscle ATP 223) 
Creatine phosphate 73.3 446 
Conversion of muscle glycogen into lactate 39.1 6,700 
Conversion of muscle glycogen into CO, 16.7 84,000 
Conversion of liver glycogen into CO, 6.2 19,000 
Conversion of adipose-tissue fatty acids into CO, 6.7 4,000,000 


Note: Fuels stored are estimated for a 70-kg person having a muscle mass of 28 kg. 
Data from E. Hultman and R. C. Harris. In Principles of Exercise Biochemistry, edited by J. R. Poortmans 
(Karger, 2004), pp. 78-119. 


transfer its high-potential phosphoryl group to ADP to generate ATP. 
However, the amount of creatine phosphate, like that of ATP itself, is lim- 
ited. Creatine phosphate and ATP can power intense muscle contraction 
for 5 to 6s. Maximum speed in a sprint can thus be maintained for only 5 
to 6 s (Figure 15.7). Thus, the winner in a 100-meter sprint is the runner 
who both achieves the highest initial velocity and then slows down the least. 

During a ~10 second sprint, the ATP level in muscle drops from 5.2 to 
3.7 mM, and that of creatine phosphate decreases from 9.1 to 2.6 mM. 
Anaerobic glycolysis provides fuel to make up for the loss of ATP and cre- 
atine phosphate. A 100-meter sprint is powered by stored ATP, creatine phos- 
phate, and the anaerobic glycolysis of muscle glycogen. The conversion of 
muscle glycogen into lactate can generate a good deal more ATP, but the 
rate is slower than that of phosphoryl-group transfer from creatine phos- 
phate. Because of anaerobic glycolysis, the blood-lactate level is elevated 
from 1.6 to 8.3 mM. The release of H* from the intensely active muscle 
concomitantly lowers the blood pH from 7.42 to 7.24. Sprinting is powered 
by fast twitch muscle fiber (type IIb) specialized for anaerobic glycolysis. 
Recall that one of the effects of high-intensity training is to increase the 
amount of lactate transporters in the membranes of slow twitch fibers 
(type 1), which remove lactate from the blood and slow the fall in blood pH 
(Section 16.4). Nonetheless, the pace of a 100-meter sprint cannot be sus- 
tained in a 1000-meter run (~132 s) for two reasons. First, creatine pho- 
sphate is consumed within a few seconds. Second, the lactate produced 
would cause acidosis. Thus, alternative fuel sources are needed. 

Let us examine the fuel use of a middle-distance runner. The com- 
plete oxidation of muscle glycogen to CO, by aerobic respiration sub- 
stantially increases the energy available to power the 1000-meter run, but 
this aerobic process is a good deal slower than anaerobic glycolysis. 
Because ATP is produced more slowly by oxidative phosphorylation 

than by glycolysis (Table 27.3), the runner’s pace is 
necessarily slower than in a 100-meter sprint. The 
championship velocity for the 1000-meter run is about 
7.6ms_!, compared with approximately 10.4 ms! for 
the 100-meter event (Figure 27.11). 
The running of a marathon (26 miles 385 yards, or 
42,200 meters) requires a different selection of fuels and is 
Marathon run characterized by cooperation (depending on the capabili- 
Spence, ties of the runner) between muscle, liver, and adipose tis- 
sue. Liver glycogen complements muscle glycogen as an 
\ energy store that can be tapped. Elite marathoners can use 
the aerobic combustion of glucose to power the entire 


5 | | 
1 10 100 


Running time (s) 


10,000 race, provided they consume fuels during the race. 
However, for most of us, the total body glycogen stores 


(103 mol of ATP at best) are insufficient to provide the 150 mol of ATP 
needed for this grueling event. Much larger quantities of ATP can be 
obtained by the oxidation of fatty acids derived from the breakdown of fat 
in adipose tissue, but the maximal rate of ATP generation is slower yet than 
that of glycogen oxidation and is more than 10-fold slower than that with 
creatine phosphate. Thus, ATP is generated much move slowly from high- 
capacity stores than from limited ones, accounting for the different velocities 
of anaerobic and aerobic events. Fats are rapidly consumed in activities such 
as distance running, explaining why extended aerobic exercise is beneficial 
for people who are insulin resistant. 

Is it possible to determine the contribution of each fuel as a function of 
exercise intensity? The percentage contribution of each fuel can be measured 
with the use of a respirometer, which measures the respiratory quotient 
(RQ), the ratio of CO, produced to O, consumed. Consider the complete 
combustion of glucose: 


C6H1206 + 6 O2 —> 6 CO? + 6 H2O 
Glucose 


The RQ for glucose is 1. Now consider the oxidation of a typical fatty acid, 
palmitate: 


C1 6H320% + 23 Op —> 16 CO, + 16 H,0 
Palmitate 


The RQ for palmitate oxidation is 0.7. Thus, as aerobic exercise intensity 
increases, the RQ will rise from 0.7 (only fats are used as fuel) to 1.0 (only 
glucose is used as fuel). Between these values, a mixture of fuels is used 
(Figure 27.12). 

What is the optimal mix of fuels for use during a marathon? As suggested 
above, this is a complex question that varies with the athlete and level of train- 
ing. Studies have shown that, when muscle glycogen has been depleted, the 
power output of the muscle falls to approximately 50% of maximum. Power 
output decreases despite the fact that ample supplies of fat are available, sug- 
gesting that fats can supply only about 50% of maximal aerobic effort. Indeed, 
depletion of glycogen stores during a race is referred to as “bonking” or “hit- 
ting the wall,” with the result that the athlete must greatly reduce the pace. 

How is an optimal mix of these fuels achieved? A low blood-sugar level 
leads to a high glucagon/insulin ratio, which in turn mobilizes fatty acids from 
adipose tissue. Fatty acids readily enter muscle, where they 
are degraded by oxidation to acetyl CoA and then to CO,. ee 
The elevated acetyl CoA level decreases the activity of the 
pyruvate dehydrogenase complex to block the conversion of 
pyruvate into acetyl CoA. Hence, fatty acid oxidation 
decreases the funneling of glucose into the citric acid cycle 
and oxidative phosphorylation. Glucose is spared so that just 
enough remains available at the end of the marathon to 
increase the pace as the finish line draws near. The simulta- 
neous use of both fuels gives a higher mean velocity than 
would be attained if glycogen were totally consumed before 
the start of fatty acid oxidation. It is important to bear in 
mind that fuel utilization is only one of many factors that 
determine running ability. 

If carbohydrate-rich meals are consumed after glycogen 
depletion, glycogen stores are rapidly restored. In addition, 
glycogen synthesis continues during the consumption of 
carbohydrate-rich meals, increasing glycogen stores far 
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FIGURE 27.12 An idealized 
representation of fuels use as a function 
of aerobic exercise intensity. (A) With 
increased exercise intensity, the use of fats 
as fuels falls as the utilization of glucose 
increases. (B) The respiratory quotient (RQ) 
measures the alteration in fuel use. 
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27.5 Food Intake and Starvation Induce 
Metabolic Changes 


Thus far, we have been considering metabolism in the context of excess 
consumption of calories, as in obesity, or extreme caloric needs, as in exer- 
cise. We now look at the opposite physiological condition—lack of calories. 


The starved-fed cycle is the physiological response to a fast 


We begin with a physiological condition called the starved—fed cycle, which 
we all experience in the hours after an evening meal and through the night’s 
fast. This nightly starved—fed cycle has three stages: the well-fed state after 
a meal, the early fasting during the night, and the refed state after breakfast. 
A major goal of the many biochemical alterations in this period is to main- 
tain glucose homeostasis—that is, a constant blood-glucose concentration. 
Maintaining glucose homeostasis is crucial because glucose is normally the 
only fuel source for the brain. As discussed earlier, the major defect in dia- 
betes is the inability to perform this vital task. The two primary signals 
regulating the starved—fed cycle are insulin and glucagon. 


1. The well-fed, or postprandial, state. After we consume and digest an 
evening meal, glucose and amino acids are transported from the intestine to 
the blood. The dietary lipids are packaged into chylomicrons and trans- 
ported to the blood by the lymphatic system. This fed condition leads to the 
secretion of insulin, which signals the fed state. Insulin stimulates glycogen 
synthesis in both muscle and the liver, suppresses gluconeogenesis by the 
liver, and stimulates protein synthesis. Insulin also accelerates glycolysis in 
the liver, which in turn increases the synthesis of fatty acids. 

The liver helps to limit the amount of glucose in the blood during times 
of plenty by storing it as glycogen so as to be able to release glucose in times 
of scarcity. How is the excess blood glucose present after a meal removed? 
The liver is able to trap large quantities of glucose because it possesses an 
isozyme of hexokinase called glucokinase, which converts glucose into glu- 
cose 6-phosphate, which cannot be transported out of the cell. Recall that 
glucokinase has a high K,, value and is thus active only when blood-glucose 
concentration is high (Section 16.2). Furthermore, glucokinase is not inhib- 
ited by glucose 6-phosphate as hexokinase is. Consequently, the liver forms 
glucose 6-phosphate move rapidly as the blood-glucose concentration rises. The 
increase in glucose 6-phosphate, which activates glycogen synthase, coupled 
with insulin action leads to a buildup of glycogen stores. The hormonal effects 
on glycogen synthesis and storage are reinforced by a direct action of glu- 
cose itself. Phosphorylase a is a glucose sensor in addition to being the enzyme 
that cleaves glycogen. When the glucose level is high, the binding of glucose 
to phosphorylase a renders the enzyme susceptible to the action of a phos- 
phatase that converts it into phosphorylase b, which does not readily 
degrade glycogen (Figure 21.24). Thus, glucose allosterically shifts the glyco- 
gen system from a degradative to a synthetic mode. 

The high insulin level in the fed state also promotes the entry of glucose 
into muscle and adipose tissue. Insulin stimulates the synthesis of glycogen 
by muscle as well as by the liver. The entry of glucose into adipose tissue 
provides glycerol 3-phosphate for the synthesis of triacylglycerols. The 
action of insulin also extends to amino acid and protein metabolism. Insulin 
promotes the uptake of branched-chain amino acids (valine, leucine, and 
isoleucine) by muscle. Indeed, insulin has a general stimulating effect on 
protein synthesis, which favors a building up of muscle protein. In addition, 
it inhibits the intracellular degradation of proteins. 


2. The early fasting, or postabsorptive, state. The blood-glucose concentra- 
tion begins to drop several hours after a meal, leading to a decrease in insulin 
secretion and a rise in glucagon secretion by the a cells of the pancreas. The 
regulation of glucagon secretion is poorly understood, but when glucose is 
abundant, B cells inhibit glucagon secretion. When glucose concentration 
falls, the inhibition is relieved and glucagon is secreted by the a cells of the 
pancreas. Just as insulin signals the fed state, glucagon signals the starved 
state, serving to mobilize glycogen stores when there is no dietary intake of 
glucose. The main target organ of glucagon is the liver. Glucagon stimulates 
glycogen breakdown and inhibits glycogen synthesis by triggering the 
cyclic AMP cascade leading to the phosphorylation and activation of phos- 
phorylase and the inhibition of glycogen synthase (Section 21.5). Glucagon 
also inhibits fatty acid synthesis by diminishing the production of pyruvate 
and by maintaining the inactive phosphorylated state of acetyl CoA carbox- 
ylase. In addition, glucagon stimulates gluconeogenesis in the liver and 
blocks glycolysis by lowering the level of F-2, 6-BP (Figure 27.9). 

All known actions of glucagon are mediated by protein kinases that are 
activated by cyclic AMP. The activation of the cyclic AMP cascade results 
in a higher level of phosphorylase a activity and a lower level of glycogen 
synthase a activity. Glucagon’s effect on this cascade is reinforced by the 
low concentration of glucose in the blood. The diminished binding of glu- 
cose to phosphorylase a makes the enzyme less susceptible to the hydrolytic 
action of the phosphatase. Instead, the phosphatase remains bound to phos- 
phorylase a, and so the synthase stays in the inactive phosphorylated form. 
Consequently, there is a rapid mobilization of glycogen. 

The large amount of glucose formed by the hydrolysis of glucose 
6-phosphate derived from glycogen is then released from the liver into the 
blood. The entry of glucose into muscle and adipose tissue decreases in 
response to a low insulin level. The diminished utilization of glucose by 
muscle and adipose tissue also contributes to the maintenance of the blood- 
glucose concentration. The net result of these actions of glucagon is to mark- 
edly increase the release of glucose by the liver. Both muscle and the liver use 
fatty acids as fuel when the blood-glucose concentration drops, saving the 
glucose for use by the brain and red blood cells. Thus, the blood-glucose 
concentration is kept at or above 4.4 mM by three major factors: (1) the mobi- 
lization of glycogen and the release of glucose by the liver, (2) the release of 
fatty acids by adipose tissue, and (3) the shift in the fuel used from glucose to 
fatty acids by muscle and the liver. 

What is the result of the depletion of the liver’s glycogen stores? 
Gluconeogenesis from lactate and alanine continues, but this process mere- 
ly replaces glucose that had already been converted into lactate and alanine 
by tissues such as muscle and red blood cells. Moreover, the brain oxidizes 
glucose completely to CO, and H,O. Thus, for the net synthesis of glucose 
to take place, another source of carbon is required. Glycerol released from 
adipose tissue on lipolysis provides some of the carbon atoms, with the 
remaining carbon atoms coming from the hydrolysis of muscle proteins. 


3. The refed state. What are the biochemical responses to a hearty break- 
fast? Fat is processed exactly as it is processed in the normal fed state. 
However, that is not the case for glucose. The liver does not initially absorb 
glucose from the blood, but, instead, leaves it for the other tissues. 
Moreover, the liver remains in a gluconeogenic mode. Now, however, the 
newly synthesized glucose is used to replenish the liver’s glycogen stores. 
As the blood-glucose concentration continues to rise, the liver completes 
the replenishment of its glycogen stores and begins to process the remaining 
excess glucose for fatty acid synthesis. 
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FIGURE 27.13 Fuel choice during 
starvation. The plasma levels of fatty acids 
and ketone bodies increase in starvation, 
whereas that of glucose decreases. 


TABLE 27.4 Fuel reserves in a typical 70-kg man 


Available energy in kilojoules (kcal) 
Organ Glucose or glycogen Triacylglycerols Mobilizable proteins 
Blood 250 (60) 20 45) 0 (0) 
Liver 1700 (400) 2000 (450) 1700 (400) 
Brain 30. (8) 0 (0) 0 (0) 
Muscle 5000 (1200) 2000 (450) 100,000 (24,000) 
Adipose tissue 330 (80) 560,000 (135,000) 170 (40) 


Data from G. F. Cahill, Jr., Clin. Endocrinol. Metab. 5:398, 1976. 


Metabolic adaptations in prolonged starvation minimize 
protein degradation 


Earlier, we considered the metabolic results of overnutrition, a condition 
becoming all too common in prosperous nations. Let us now examine the 
opposite extreme. What are the adaptations if fasting is prolonged to the point 
of starvation, a circumstance affecting nearly a billion people worldwide? 
A typical well-nourished 70-kg man has fuel reserves totaling about 670,000 kJ 
(161,000 kcal; Table 27.4). The energy need for a 24-hour period ranges from 
about 6700 kJ (1600 kcal) to 25,000 kJ (6000 kcal), depending on the extent of 
activity. Thus, stored fuels suffice to meet caloric needs in starvation for 1 to 3 
months. However, the carbohydrate reserves are exhausted in only a day. 

Even under starvation conditions, the blood-glucose concentration must 
be maintained above 2.2 mM. The first priority of metabolism in starvation is 
to provide sufficient glucose to the brain and other tissues (such as red blood 
cells) that are absolutely dependent on this fuel. However, precursors of glu- 
cose are not abundant. Most energy is stored in the fatty acyl moieties of 
triacylglycerols. Moreover, recall that fatty acids cannot be converted into 
glucose, because acetyl CoA resulting from fatty acid breakdown cannot be 
transformed into pyruvate (Section 22.3). The glycerol moiety of triacyl- 
glycerol can be converted into glucose, but only a limited amount is avail- 
able. The only other potential source of glucose is the carbon skeletons of 
amino acids derived from the breakdown of proteins. However, proteins are 
not stored, and so any breakdown will necessitate a loss of function. Thus, 
the second priority of metabolism in starvation is to preserve protein, which is 
accomplished by shifting the fuel being used from glucose to fatty acids and 
ketone bodies (Figure 27.13). 

The metabolic changes on the first day of starvation are like those after 
an overnight fast. The low blood-sugar level leads to decreased secretion of 
insulin and increased secretion of glucagon. The dominant metabolic processes 
are the mobilization of triacylglycerols in adipose tissue and gluconeogenesis by 
the liver. The liver obtains energy for its own needs by oxidizing fatty acids 
released from adipose tissue. The concentrations of acetyl CoA and citrate 
consequently increase, which switches off glycolysis. The uptake of glucose 
by muscle is markedly diminished because of the low insulin level, whereas 
fatty acids enter freely. Consequently, muscle uses no glucose and relies exclu- 
sively on fatty acids for fuel. The B oxidation of fatty acids by muscle halts 
the conversion of pyruvate into acetyl CoA, because acetyl CoA stimulates 
the phosphorylation of the pyruvate dehydrogenase complex, which renders 
it inactive (Section 17.3). Hence, any available pyruvate, lactate, and alanine 
are exported to the liver for conversion into glucose. Glycerol derived from 
the cleavage of triacylglycerols is another raw material for the synthesis of 
glucose by the liver. 

Proteolysis also provides carbon skeletons for gluconeogenesis. During 
starvation, degraded proteins are not replenished and serve as carbon 


TABLE 27.5 Fuel metabolism in starvation 


Amount formed or consumed 
in 24 hours (grams) 


Fuel exchanges and consumption 3d day 40th day 
Fuel use by the brain 
Glucose 100 40 
Ketone bodies 50 100 
All other use of glucose 50 40 
Fuel mobilization 
Adipose-tissue lipolysis 180 180 
Muscle-protein degradation 75 20 
Fuel output of the liver 
Glucose 150 80 
Ketone bodies 150 150 


sources for glucose synthesis. Initial sources of protein are those that turn 
over rapidly, such as proteins of the intestinal epithelium and the secretions 
of the pancreas. Proteolysis of muscle protein provides some of the three- 
carbon precursors of glucose. However, survival for most animals depends 
on being able to move rapidly, which requires a large muscle mass, and so 
muscle loss must be minimized. 

How is the loss of muscle curtailed? After about 3 days of starvation, 
the liver forms large amounts of the ketone bodies, acetoacetate and D-3- 
hydroxybutyrate (Figure 27.14). Their synthesis from acetyl CoA increases 
markedly because the citric acid cycle is unable to oxidize all the acetyl units 
generated by the degradation of fatty acids. Gluconeogenesis depletes the 
supply of oxaloacetate, which is essential for the entry of acetyl CoA into the 
citric acid cycle. Consequently, the liver produces large quantities of ketone 
bodies, which are released into the blood. At this time, the brain begins to 
consume significant amounts of acetoacetate in place of glucose. After 3 days of 
starvation, about a quarter of the energy needs of the brain are met by ketone 
bodies (Table 27.5). The heart also uses ketone bodies as fuel. 

After several weeks of starvation, ketone bodies become the major fuel of the 
brain. Acetoacetate is activated by the transfer of CoA from succinyl CoA 
to give acetoacetyl CoA (Figure 27.15). Cleavage by thiolase then yields two 
molecules of acetyl CoA, which enter the citric acid cycle. In essence, ketone 
bodies are equivalents of fatty acids that are an accessible fuel source for the 
brain. Only 40 g of glucose is then needed per day for the brain, compared 
with about 120 g in the first day of starvation. The effective conversion of 
fatty acids into ketone bodies by the liver and their use by the brain markedly 
diminishes the need for glucose. Hence, less muscle is degraded than in the first 
days of starvation. The breakdown of 20 g of muscle daily compared with 
75 g early in starvation is most important for survival. A person’s survival 
time is mainly determined by the size of the triacylglycerol depot. 

What happens after depletion of the triacylglycerol stores? The only 
source of fuel that remains is protein. Protein degradation accelerates, and 
death inevitably results from a loss of heart, liver, or kidney function. 


e 27.6 Ethanol Alters Energy Metabolism 
in the Liver 


Ethanol has been a part of the human diet for centuries, partly because of its 
intoxicating effects and partly because alcoholic beverages provided a safe 
means of hydration when pure water was scarce. However, its consumption 
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in excess can result in a number of health problems, most notably liver dam- 
age. What is the biochemical basis of these health problems? 


Ethanol metabolism leads to an excess of NADH 


Ethanol cannot be excreted and must be metabolized, primarily by the liver. 
This metabolism is accomplished by two pathways. The first pathway 
comprises two steps. The first step, catalyzed by the enzyme alcohol dehy- 
drogenase, takes place in the cytoplasm: 


Alcohol 
CH,CH,OH + NAD+ “=> CH,CHO + NADH + H* 
Ethanol Acetaldehyde 


The second step, catalyzed by aldehyde dehydrogenase, takes place in 
mitochondria: 


Aldehyde 
CH,CHO + NAD+ + H,O “> CH,COO- + NADH + H* 
Acetaldehyde Acetate 


Note that ethanol consumption leads to an accumulation of NADH. This high 
concentration of NADH inhibits gluconeogenesis by preventing the oxida- 
tion of lactate to pyruvate. In fact, the high concentration of NADH will 
cause the reverse reaction to predominate, and lactate will accumulate. The 
consequences may be hypoglycemia and lactic acidosis. 

The overabundance of NADH also inhibits fatty acid oxidation. An 
important metabolic purpose of fatty acid oxidation is to generate NADH 
for ATP generation by oxidative phosphorylation, but an alcohol consum- 
ers NADH needs are met by ethanol metabolism. In fact, the excess 
NADH signals that conditions are right for fatty acid synthesis. Hence, 
triacylglycerols accumulate in the liver, leading to a condition known as 
“fatty liver” that is exacerbated in obese persons. The biochemical effects of 
ethanol consumption can be quite rapid. For instance, fat accumulates in the 
liver within a few days of moderate alcohol consumption. This accumula- 
tion is reversible with a decrease in alcohol intake. 

The second pathway for ethanol metabolism uses cytochrome P450 
enzymes. This means of ethanol metabolism is also called the microsomal 
ethanol-oxidizing system (MEOS). This cytochrome P450-dependent path- 
way (Section 26.4) generates acetaldehyde and subsequently acetate while 
oxidizing biosynthetic reducing power, NADPH, to NADP*. Because it 
uses oxygen, this pathway generates free radicals that damage tissues. 
Moreover, because the system consumes NADPH, the antioxidant glutathi- 
one cannot be regenerated (Section 20.5), exacerbating the oxidative stress. 

What are the effects of the other metabolites of ethanol? Liver mitochon- 
dria can convert acetate into acetyl CoA in a reaction requiring ATP. The 
enzyme is the thiokinase that normally activates short-chain fatty acids. 


Acetate + coenzyme A + ATP ——> acetyl CoA + AMP + PP; 
PP; — 2 P; 


However, further processing of the acetyl CoA by the citric acid cycle is 
blocked, because NADH inhibits two important citric acid cycle regulatory 
enzymes—isocitrate dehydrogenase and a-ketoglutarate dehydrogenase. 
The accumulation of acetyl CoA has several consequences. First, ketone 
bodies will form and be released into the blood, aggravating the acidic con- 
dition already resulting from the high lactate concentration. The processing 
of the acetate in the liver becomes inefficient, leading to a buildup of acetal- 
dehyde. This very reactive compound forms covalent bonds with many 


important functional groups in proteins, impairing protein function. If 
ethanol is consistently consumed at high levels, the acetaldehyde can sig- 
nificantly damage the liver, eventually leading to cell death. 

Liver damage from excessive ethanol consumption occurs in three stages. 
The first stage is the aforementioned development of fatty liver. In the second 
stage—alcoholic hepatitis—groups of cells die and inflammation results. This 
stage can itself be fatal. In stage three—cirrhosis—fibrous structure and scar 
tissue are produced around the dead cells. Cirrhosis impairs many of the liv- 
er’s biochemical functions. The cirrhotic liver is unable to convert ammonia 
into urea, and blood levels of ammonia rise. Ammonia is toxic to the nervous 
system and can cause coma and death. Cirrhosis of the liver arises in about 
25% of alcoholics, and about 75% of all cases of liver cirrhosis are the result of 
alcoholism. Viral hepatitis is a non-alcoholic cause of liver cirrhosis. 


Excess ethanol consumption disrupts vitamin metabolism 


The adverse effects of ethanol are not limited to the metabolism of ethanol 
itself. Vitamin A (retinol) is converted into retinoic acid, an important signal 
molecule for growth and development in vertebrates, by the same dehydrog- 
enases that metabolize ethanol. Consequently, this activation does not take 
place in the presence of ethanol, which acts as a competitive inhibitor. 
Moreover, the P450 enzymes induced by ethanol inactivate retinoic acid. 
These disruptions in the retinoic acid signaling pathway are believed to be 
responsible, at least in part, for fetal alcohol syndrome as well as the devel- 
opment of a variety of cancers. 

The disruption of vitamin A metabolism is a direct result of the biochemi- 
cal changes induced by excess ethanol consumption. Other disruptions 
in metabolism result from another common characteristic of alcoholics— 
malnutrition. Alcoholics will frequently drink instead of eating. A dramatic 
neurological disorder, referred to as Wernicke—Korsakoff syndrome, results 
from insufficient intake of the vitamin thiamine. Symptoms include mental 
confusion, unsteady gait, and lack of fine motor skills. The symptoms of 
Wernicke—Korsakoff syndrome are similar to those of beriberi (Section 17.4) 
because both conditions result from a lack of thiamine. Thiamine is converted 
into the coenzyme thiamine pyrophosphate, a key constituent of the pyruvate 
dehydrogenase complex. Recall that this complex links glycolysis with the 
citric acid cycle. Disruptions in the pyruvate dehydrogenase complex are 
most evident as neuromuscular disorders because the nervous system is nor- 
mally dependent on glucose for energy generation. 

Alcoholic scurvy is occasionally observed because of an insufficient 
ingestion of vitamin C. Vitamin C is required for the formation of stable col- 
lagen fibers. The symptoms of scurvy include skin lesions and blood-vessel 
fragility. Most notable are bleeding gums, the loss of teeth, and periodontal 
infections. Gums are especially sensitive to a lack of vitamin C because the 
collagen in gums turns over rapidly. What is the biochemical basis for 
scurvy? Vitamin C is required for the synthesis of 4-hydroxyproline, an 
amino acid necessary for collagen stability. To form this unusual amino acid, 
proline residues on the amino side of glycine residues in nascent collagen 
chains become hydroxylated. One oxygen atom from O, becomes attached 
to C-4 of proline while the other oxygen atom is taken up by a-ketoglutarate, 
which is converted into succinate (Figure 27.16). This reaction is catalyzed 
by prolyl hydroxylase, a dioxygenase, which requires an Fe?* ion to activate 
O,. The enzyme also converts a-ketoglutarate into succinate without 
hydroxylating proline. In this partial reaction, an oxidized iron complex is 
formed, which inactivates the enzyme. How is the active enzyme regener- 
ated? Ascorbate (vitamin C) comes to the rescue by reducing the ferric ion of 
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FIGURE 27.16 Formation of 4-hydroxyproline. Proline is hydroxylated at C-4 by the action 
of prolyl hydroxylase, an enzyme that activates molecular oxygen. 


the inactivated enzyme. In the recovery process, ascorbate is oxidized to 
dehydroascorbic acid (Figure 27.17). Thus, ascorbate serves here as a spe- 
cific antioxidant. Why does impaired hydroxylation have such devastating 
consequences? Collagen synthesized in the absence of ascorbate is less stable 
than the normal protein. Hydroxyproline stabilizes the collagen triple helix 
by forming interstrand hydrogen bonds. The abnormal fibers formed by 
insufficiently hydroxylated collagen account for the symptoms of scurvy. 


HQ 4 HQ 4 HQ 4 


Ascorbic acid Ascorbate Dehydroascorbic acid 


FIGURE 27.17 Forms of ascorbic acid (vitamin C). Ascorbate is the ionized form of 
vitamin C, and dehydroascorbic acid is the oxidized form of ascorbate. 


SUMMARY 


27.1 Caloric Homeostasis Is a Means of Regulating Body Weight 
Many people are able to maintain a near-constant body weight through- 
out adult life. This ability is a demonstration of caloric homeostasis, a 
physiological condition in which energy needs match energy intake. 
When energy intake is greater than energy needs, weight gain results. In 
the developed world, obesity is at epidemic proportions and is impli- 
cated as a contributing factor in a host of pathological conditions. 


27.2 The Brain Plays a Key Role in Caloric Homeostasis 
Various signal molecules act on the brain to control appetite. Short- 
term signals such as CCK and GLP-1 relay satiety signals to the brain 
while eating is in progress. Long-term signals include leptin and insu- 
lin. Leptin, secreted by adipose tissue in direct proportion to adipose- 
tissue mass, is an indication of fat stores. Leptin inhibits eating. Insulin 
also works in the brain, signaling carbohydrate availability. 

Leptin acts by binding to a receptor in brain neurons, which initiates 
signal-transduction pathways that reduce appetite. Obesity can develop 
in individuals with normal amounts of leptin and the leptin receptor, 
suggesting that such individuals are leptin resistant. Suppressors of 
cytokine signaling may inhibit leptin signaling, leading to leptin resis- 
tance and obesity. 


27.3 


27.4 


27.5 


Diabetes Is a Common Metabolic Disease Often Resulting from Obesity 
Diabetes is the most common metabolic disease in the world. Type 1 
diabetes results when insulin is absent due to autoimmune destruc- 
tion of the B cells of the pancreas. Type 2 diabetes is characterized by 
normal or higher levels of insulin, but the target tissues of insulin, 
notably muscle, do not respond to the hormone, a condition called 
insulin resistance. Obesity is a significant predisposing factor for 
type 2 diabetes. 

In muscle, excess fats accumulate in an obese individual. These fats 
are processed to second messengers that activate signal-transduction 
pathways that inhibit insulin signaling, leading to insulin resistance. 
Insulin resistance in target tissues ultimately leads to pancreatic B-cell 
failure. The pancreas tries to compensate for a lack of insulin action by 
synthesizing more insulin, resulting in ER stress and subsequent acti- 
vation of apoptotic pathways that lead to B-cell death. 

Type 1 diabetes is due to metabolic derangements resulting in an 
insufficiency of insulin and an excess of glucagon relative to a person’s 
needs. The result is an elevated blood-glucose level, the mobilization 
of triacylglycerols, and excessive ketone-body formation. Accelerated 
ketone-body formation can lead to acidosis, coma, and death in untreated 
insulin-dependent diabetics. 


Exercise Beneficially Alters the Biochemistry of Cells 

Exercise is a useful prescription for insulin resistance and type 2 
diabetes. Muscle activity stimulates mitochondrial biogenesis in a 
calcitum-dependent manner. The increase in the number of mitochon- 
dria facilitates fatty acid oxidation in the muscle, resulting in increased 
insulin sensitivity. 

Fuel choice in exercise is determined by the intensity and duration 
of the bout of exercise. Sprinting and marathon running are powered 
by different fuels to maximize power output. The 100-meter sprint is 
powered by stored ATP, creatine phosphate, and anaerobic glycolysis. 
In contrast, the oxidation of both muscle glycogen and fatty acids 
derived from adipose tissue is essential in the running of a marathon, 
a highly aerobic process. 


Food Intake and Starvation Induce Metabolic Changes 

Insulin signals the fed state; it stimulates the formation of glycogen 
and triacylglycerols and the synthesis of proteins. In contrast, gluca- 
gon signals a low blood-glucose level; it stimulates glycogen break- 
down and gluconeogenesis by the liver and triacylglycerol hydrolysis 
by adipose tissue. After a meal, the rise in the blood-glucose level leads 
to an increased secretion of insulin and a decreased secretion of gluca- 
gon. Consequently, glycogen is synthesized in muscle and the liver. 
When the blood-glucose level drops several hours later, glucose is 
then formed by the degradation of glycogen and by the gluconeogenic 
pathway, and fatty acids are released by the hydrolysis of triacylglyc- 
erols. The liver and muscle then increasingly use fatty acids instead of 
glucose to meet their own energy needs so that glucose is conserved for 
use by the brain and the red blood cells. 

The metabolic adaptations in starvation serve to minimize protein 
degradation. Large amounts of ketone bodies are formed by the liver 
from fatty acids and released into the blood within a few days after the 
onset of starvation. After several weeks of starvation, ketone bodies 
become the major fuel of the brain. The diminished need for glucose 
decreases the rate of muscle breakdown, and so the likelihood of sur- 
vival is enhanced. 
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KEY TERMS 


27.6 Ethanol Alters Energy Metabolism in the Liver 
The oxidation of ethanol results in an unregulated overproduction of 
NADH, which has several consequences. A rise in the blood levels of 
lactic acid and ketone bodies causes a fall in blood pH, or acidosis. The 
liver is damaged because the excess NADH causes excessive fat for- 
mation as well as the generation of acetaldehyde, a reactive molecule. 


Severe liver damage can result. 


caloric homeostasis (energy 
homeostasis) (p. 801) 


cholecystokinin (CCK) (p. 804) 


glucagon-like peptide 1 (GLP-1) (p. 805) 


leptin (p. 805) 


adiponectin (p. 806) 
leptin resistance (p. 806) 
type 1 diabetes (p. 808) 


insulin resistance (p. 808) 


type 2 diabetes (p. 808) 


endoplasmic reticulum (ER) 
stress (p. 811) 


unfolded protein response 


(UPR) (p. 811) 
starved—fed cycle (p. 816) 


insulin (p. 805) 


PROBLEMS 


metabolic syndrome (p. 809) 


glucose homeostasis (p.816) 


1. Weight control. As disturbing as the obesity epidemic is, 
an equally intriguing, almost amazing observation is that 
many people are able to maintain an approximately con- 
stant weight throughout adult life. A few simple calcula- 
tions of a simplified situation illustrates how remarkable 
this feat is. Consider a 120-pound woman whose weight 
does not change significantly between the ages of 25 and 
65. Let us say that the woman requires 8,400 kJ (2000 kcal) 
day !. For simplicity’s sake, let us assume that the wom- 
an’s diet consists predominantly of fatty acids derived from 
lipids. The energy density of fatty acids is 38 kJ (9 kcal) 
g_!. How much food has she consumed over the 40 years 
in question? 


2. Spare tire. Suppose that our test subject from problem 1 
gained 55 pounds between the ages of 25 and 65 (a sadly 
common occurrence), and that her weight at 65 years of age 
is 175 pounds. Calculate how many excess calories she con- 
sumed per day to gain the 55 pounds over 40 years. Assume 
that our test subject is 5 feet 6 inches tall. What is her BMI? 
Would she be considered obese at 175 lbs? 


3. Depot fat. Adipose tissue was once only considered a 
storage site for fat. Why is this view no longer considered 
correct? 


4. Balancing act. What is meant by caloric homeostasis? 


5. Dynamic duo. What are the key hormones responsible 
for maintaining caloric homeostasis? 


6. Dual roles. What two biochemical roles does CCK play? 
GLP-1? 


7. Failure to communicate. Leptin inhibits eating and is 
secreted in amounts in direct proportion to body fat. 
Moreover, obese people have normal amounts of leptin and 
leptin receptor. Why, then, do people become obese? 


8. Many signals. Match the characteristic (a1) with the 
appropriate hormone (1-6). 


(a) Secreted by adipose tissue 1. leptin 
2. adiponectin 
(b) Stimulates liver gluco- 3. GLP-1 
neogenesis 4, CCK 
(c) GPCR pathway 5. insulin 
(d) Satiety signal 6. glucagon 


(e) Enhances insulin secretion 


(f) Secreted by the pancreas 
during a fast 

(g) Secreted after a meal 

(h) Stimulates glycogen syn- 
thesis 

(i) Missing in type 1 diabetes 


9. A key chemical. What are the sources of glucose 
6-phosphate in liver cells? 


10. Neither option is good. Differentiate between type 1 and 
type 2 diabetes. 


11. Fighting diabetes. Leptin is considered an “anti- 
diabetogenic” hormone. Explain. 


12. Metabolic energy and power. The rate of energy expen- 
diture of a typical 70-kg person at rest is about 70 watts 
(W), like that of a light bulb. 


(a) Express this rate in kilojoules per second and in kilo- 
calories per second. 


(b) How many electrons flow through the mitochondrial 
electron-transport chain per second under these conditions? 


(c) Estimate the corresponding rate of ATP production. 


(d) The total ATP content of the body is about 50 g. Estimate 
how often an ATP molecule turns over in a person at rest. 


13. Respiratory quotient (RQ). This classic metabolic index 
is defined as the volume of CO, released divided by the 


volume of O, consumed. 


(a) Calculate the RQ values for the complete oxidation of 
glucose and of tripalmitoylglycerol. 


(b) What do RQ measurements reveal about the contribu- 
tions of different energy sources during intense exercise? 
(Assume that protein degradation is negligible.) 


14. Camel’s hump. Compare the H,O yield from the com- 
plete oxidation of 1 g of glucose with that of 1 g of tripalmit- 
oylglycerol. Relate these values to the evolutionary selection 
of the contents of a camel’s hump. 


15. Hungry—nourished. What is meant by the starved—fed 
cycle? 


16. Of course, too much is bad for you. What are the primary 
means of processing ethanol? 


17. Started out with burgundy, but soon hit the harder stuff. 
Describe the three stages of ethanol consumption that lead 
to liver damage and possibly death. 


18. The wages of sin. How long does a person have to jog to 
offset the calories obtained from eating 10 macadamia nuts 
(75 kJ, or 18 kcal, per nut)? (Assume an incremental power 
consumption of 400 W.) 


19. Sweet hazard. Ingesting large amounts of glucose 
before a marathon might seem to be a good way of increas- 
ing the fuel stores. However, experienced runners do not 
ingest glucose before a race. What is the biochemical reason 
for their avoidance of this potential fuel? (Hint: Consider 
the effect of glucose ingestion on the level of insulin.) 


20. Lipodystrophy. Lipodystrophy is a condition in which 
an individual lacks adipose tissue. The muscles and liver 
from such individuals are insulin resistant, and both tissues 
accumulate large amounts of triacylglycerols (hyperlipid- 
emia). The administration of leptin partly ameliorates this 
condition. What does it indicate about the relation of adi- 
pose tissue to insulin action? 


21. Therapeutic target. What would be the effect of a muta- 
tion in the gene for PTP1B (protein tyrosine phosphatase 
1B) that inactivated the enzyme in a person who has type 2 
diabetes? 


22. An effect of diabetes. Insulin-dependent diabetes is 
often accompanied by hypertriglyceridemia, which is an 
excess blood level of triacylglycerols in the form of very low 
density lipoproteins. Suggest a biochemical explanation. 


23. Sharing the wealth. The hormone glucagon signifies 
the starved state, yet it inhibits glycolysis in the liver. How 
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does this inhibition of an energy-production pathway ben- 
efit the organism? 


24. Compartmentation. Glycolysis takes place in the cyto- 
plasm, whereas fatty acid degradation takes place in mito- 
chondria. What metabolic pathways depend on the interplay 
of reactions that take place in both compartments? 


25. Kwashiorkor. The most common form of malnutrition 
in children in the world, kwashiorkor, is caused by a diet 
having ample calories but little protein. The high levels of 
carbohydrate result in high levels of insulin. What is the 
effect of high levels of insulin on 


(a) lipid utilization? 
(b) protein metabolism? 


(c) Children suffering from kwashiorkor often have large 
distended bellies caused by water from the blood leaking 
into extracellular spaces. Suggest a biochemical basis for 
this condition. 


26. One for all, all for one. How is the metabolism of the 
liver coordinated with that of skeletal muscle during strenu- 
ous exercise? 


27. A little help, please? What is the advantage of convert- 
ing pyruvate into lactate in skeletal muscle? 


28. Fuel choice. What is the major fuel for resting muscle? 
What is the major fuel for muscle under strenuous work 
conditions? 


29. Hefty reimbursement. Endurance athletes sometimes 
follow the exercise-and-diet plan described here: 7 days 
before an event, do exhaustive exercises so as to all 
but deplete glycogen stores. For the next 2 to 3 days, 
consume few carbohydrates and do moderate- to low- 
intensity exercises. Finally, 3 to 4 days before the event, 
consume a diet rich in carbohydrates. Explain the bene- 
fits of this regime. 


30. Oxygen deficit. After light exercise, the oxygen con- 
sumed in recovery is approximately equal to the oxygen 
deficit, which is the amount of additional oxygen that 
would have been consumed had oxygen consumption 
reached steady state immediately. How is the oxygen con- 
sumed in recovery used? 


31. Excess post-exercise oxygen consumption. The oxygen 
consumed after strenuous exercise stops is significantly 
greater than the oxygen deficit and is termed excess post- 
exercise oxygen consumption (EPOC). Why is so much more 
oxygen required after intense exercise? 


32. Psychotropic effects. Ethanol is unusual in that it is 
freely soluble in both water and lipids. Thus, it has access to 
all regions of the highly vascularized brain. Although the 
molecular basis of ethanol action in the brain is not clear, 
ethanol evidently influences a number of neurotransmitter 
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receptors and ion channels. Suggest a biochemical explana- 
tion for the diverse effects of ethanol. 


33. Fiber type. Skeletal muscle has several distinct fiber 
types. Type I is used primarily for aerobic activity, whereas 
type IIb is specialized for short, intense bursts of activity. 
How could you distinguish between these types of muscle 
fiber if you viewed them with an electron microscope? 


34. Tour de France. Cyclists in the Tour de France (more 
than 2000 miles in 3 weeks) require about 836,000 kJ 
(200,000 kcal) of energy, or 41,840 kJ (10,000 kcal) day~! 
(a resting male requires, 8368 kJ, or 2000 kcal, day~'). 


(a) With the assumptions that the energy yield of ATP is 
about 50.2 kJ (12 kcal) mol~! and that ATP has a molecular 
weight of 503 g mol~!, how much ATP would be expended 
by a Tour de France cyclist? 


(b) Pure ATP can be purchased at a cost of approximately 
$150 per gram. How much would it cost to power a cyclist 
through the Tour de France if the ATP had to be purchased? 


35. Responding to stress. Why does it make good physiolog- 
ical sense that regular bouts of prolonged exercise will result 
in mitochondrial biogenesis? 


36. Burning fats. How does the activation of AMP-activated 
protein kinase (AMPK) during aerobic exercise foster the 
switch to fatty acid oxidation for distance running? 


37. Saving protein. What metabolic and hormonal changes 
account for decreased gluconeogenesis during the first sev- 
eral weeks of starvation in humans? 


38. Did I do that!? Alcohol consumption on an empty stom- 
ach results in some interesting biochemical as well as embar- 
rassing behavioral alterations. We will ignore the latter. 
Gluconeogenesis falls; there are increases in intracellular 
ratios of lactate to pyruvate, of glycerol 3-phosphate to dihy- 
droxyacetone phosphate, of glutamate to a-ketoglutarate, 
and of D-3-hydroxybutyrate to acetoacetate. Hypoglycemia 
develops rapidly. Blood pH also falls. Alcohol consumption 
by a well-fed individual does not lead to hypoglycemia or 
alteration in blood pH. 


(a) Why does ethanol consumption result in the altered 
ratios? 

(b) Why does hypoglycemia and blood acidosis result in 
the hungry individual? 


(c) Why does the well-fed person not experience 
hypoglycemia? 


39. Skip the candy bar. Humans can survive longer by total 
fasting than on a diet consisting entirely of small amounts of 
carbohydrates. What is going on? 


40. Too much of a good thing. What is the relation between 
fatty acid oxidation and insulin resistance in the muscle? 


41. Aneurin? Really? Why are the symptoms of beriberi 
similar to those of Wernicke—Korsakoff syndrome? 


Data Interpretation Problem 


42. Lactate threshold. The graph shows the relation between 
blood-lactate levels, oxygen consumption, and heart rate 
during exercise of increasing intensity. The values for oxy- 
gen consumption and heart rate are indicators of the degree 
of exertion. 
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(a) Why is some lactate produced even when exercise is 
moderate? 


(b) Biochemically, what is taking place when the lactate 
concentration begins to rise rapidly, a point called the lac- 
tate threshold? 


(c) Endurance athletes will sometimes measure blood-lac- 
tate levels during training so that they know their lactate 
threshold. Then, during events, they will race just at or 
below their lactate threshold until the late stages of the race. 
Biochemically, why is this practice wise? 


(d) Training can increase the lactate threshold. Explain. 


Chapter Integration Problems 


43. Feeling the burn. Under certain circumstances, the respi- 
ratory quotient (RQ value) for an athlete exercising 
intensely can rise above one. How is this possible? 


44. So many channels. Like cable TV. Describe the role of ion 
channels in insulin secretion by the B cells of the pancreas. 
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DNA Replication, Repair, and 
Recombination 
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Faithful copying is essential to the storage of genetic 
information. With the precision of a diligent monk copying an 
illuminated manuscript, a DNA polymerase (above) copies 
DNA strands, preserving the precise sequence of bases with 
very few errors. [(Left) The Pierpont Morgan Library/Art 
Resource. ] 


OUTLINE 

erhaps the most exciting aspect of the structure of DNA deduced by 

Watson and Crick was, as expressed in their words, that the “specific 28.1 DNA Replication Proceeds 
pairing we have postulated immediately suggests a possible copying mech- by the Polymerization of 
anism for the genetic material.” A double helix separated into two single Deoxyribonucleoside 
strands can be replicated because each strand serves as a template on which Triphosphates Along a Template 
its complementary strand can be assembled (Figure 28.1). To preserve the 28.2 DNA Unwinding and Supercoiling 
information encoded in DNA through many cell divisions, copying of the Are Controlled by 
genetic information must be extremely faithful. To replicate the human Topoisomerases 


genome without mistakes, an error rate of less than 1 bp per 6 X 10° bp 


F : ; ; 28.3 DNA Replication Is Highl 
must be achieved. Such remarkable accuracy is achieved through a multi- 7 ee ne 


layered system of accurate DNA synthesis (which has an error rate of 1 per ese 

10°—10* bases inserted), proofreading during DNA synthesis (which 28.4 Many Types of DNA Damage Can 

reduces that error rate to approximately 1 per 10°—107 bp), and postrepli- Be Repaired 

cation mismatch repair (which reduces the error rate to approximately 28.5 DNA Recombination Plays 

1 per 1100" bp). Important Roles in Replication, 
Even after DNA has been initially replicated, the genome is still not safe. Repair, and Other Processes 


Although DNA is remarkably robust, ultraviolet light as well as a range of 
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Original parent molecule 


as 


First-generation daughter molecules 


FIGURE 28.1 DNA replication. Each 
strand of one double helix (shown in blue) 
acts as a template for the synthesis of a 
new complementary strand (shown in red). 


Starting genome 
Irradiated genome 
Reassembled genome 


(fragmented) 
Spontaneously 


C 0 1.5 3 


FIGURE 28.3 DNA repair in action. The 
bacterium Deinococcus radiodurans can 
reassemble its genome over the course of 
3 hours after irradiation with gamma rays to 
fragment the genome into many pieces. 
To aid analysis, the genomic DNA samples 
were digested with a restriction enzyme 
that cuts at only a few sites within the 
genome. [Adapted from K. Zahracka et al., 
Nature 445:569-578, 2006.] 
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chemical species can damage DNA, introducing changes in the DNA 
sequence (mutations) or lesions that can block further DNA replication 
(Figure 28.2). All organisms contain DNA-repair systems that detect DNA 
damage and act to preserve the original sequence. Mutations in genes that 
encode components of DNA-repair systems are key factors in the develop- 
ment of cancer. Among the most potentially devastating types of DNA 
damage are double-stranded breaks in DNA. With both strands of the 
double helix broken in a local region, neither strand is intact to act as a tem- 
plate for future DNA synthesis. A mechanism used to repair such lesions 
relies on DNA recombination—that is, the reassortment of DNA sequences 
present on two different double helices. In addition to its role in DNA 
repair, recombination is crucial for the generation of genetic diversity in 
meiosis. Recombination is also the key to generating a highly diverse reper- 
toire of genes for key molecules in the immune system (Chapter 34). 


Replication 
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FIGURE 28.2 DNA Replication, damage, and repair. Some errors (shown as a black dot) 
may arise in the replication processes. Additional defects (shown in yellow), including 
modified bases, cross-links, and single- and double-strand breaks, are introduced into 
DNA by subsequent DNA-damaging reactions. Many of the errors are detected and 
subsequently repaired. 


Se The bacterium Deinococcus radiodurans illustrates the extraordinary 

power of DNA repair systems. This bacterium was discovered in 1956 
when scientists were studying the use of high doses of gamma radiation to 
sterilize canned meat. In some cases, the meat still spoiled due to the growth 
of a bacterial species that withstood doses of gamma radiation more than 
1000 times larger than those that would kill a human being. Each D. radio- 
durans cell contains between 4 and 10 copies of its genome. Even when these 
bacterial chromosomes are broken into many fragments by the ionizing 
radiation, they can reassemble and recombine to regenerate the intact 
genome with essentially no loss of information (Figure 28.3). These cells can 
also survive extreme desiccation, that is, drying out, much better than other 
organisms. This ability is believed to be the selective advantage that favored 
the evolution of this and related species. 


28.1 DNA Replication Proceeds by the Polymerization of 
Deoxyribonucleoside Triphosphates Along a Template 


The base sequences of newly synthesized DNA must faithfully match the 
sequences of parent DNA. To achieve faithful replication, each strand 
within the parent double helix acts as a template for the synthesis of a new 
DNA strand with a complementary sequence. The building blocks for the 
synthesis of the new strands are deoxyribonucleoside triphosphates. They 
are added, one at a time, to the 3’ end of an existing strand of DNA. 
Although this reaction is in principle quite simple, it is significantly 
complicated by specific features of the DNA double helix. First, the two 
strands of the double helix run in opposite directions. Because DNA 


strand synthesis always proceeds in the 5’-to-3’ direction, the DNA rep- 
lication process must have special mechanisms to accommodate the 
oppositely directed strands. Second, the two strands of the double helix 
interact with one another in such a way that the bases, key templates for 
replication, are on the inside of the helix. Thus, the two strands must be 
separated from each other so as to generate appropriate templates. 
Finally, the two strands of the double helix wrap around each other. 
Thus, strand separation also entails the unwinding of the double helix. 
This unwinding creates supercoils that must themselves be resolved as 
replication continues. We begin with a consideration of the chemistry 
that underlies the formation of the phosphodiester backbone of newly 
synthesized DNA. 


DNA polymerases require a template and a primer 


DNA polymerases catalyze the formation of polynucleotide chains. Each 
incoming nucleoside triphosphate first forms an appropriate base pair with 
a base in the template. Only then does the DNA polymerase link the incom- 
ing base with the predecessor in the chain. Thus, DNA polymerases are 
template-directed enzymes. 

DNA polymerases add nucleotides to the 3’ end of a polynucleotide chain. 
The polymerase catalyzes the nucleophilic attack by the 3'-hydroxyl-group 
terminus of the polynucleotide chain on the a phosphoryl group of the nucleo- 
side triphosphate to be added (Figure 4.25). To initiate this reaction, DNA 
polymerases require a primer with a free 3’-hydroxyl group already base- 
paired to the template. They cannot start from scratch by adding nucleotides 
to a free single-stranded DNA template. RNA polymerase, in contrast, can 
initiate RNA synthesis without a primer, as we shall see in Chapter 29. 


All DNA polymerases have structural features in common 


The three-dimensional structures of anumber of DNA polymerase enzymes 
are known. The first such structure was elucidated by Tom Steitz and 
coworkers, who determined the structure of the so-called Klenow fragment 
of DNA polymerase I from E. coli (Figure 28.4). This fragment comprises 
two main parts of the full enzyme, including the polymerase unit. This unit 
approximates the shape of a right hand with domains that are referred to as 
the fingers, the thumb, and the palm. In addition to the polymerase, the 
Klenow fragment includes a domain with 3’ — 5’ exonuclease activity that 
participates in proofreading and correcting the polynucleotide product. 


bi al DNA polymerases are remarkably similar in overall shape, although 
they differ substantially in detail. At least five structural classes 
have been identified; some of them are clearly homologous, whereas others 
appear to be the products of convergent evolution. In all cases, the finger 
and thumb domains wrap around DNA and hold it across the enzyme’s 
active site, which comprises residues primarily from the palm domain. 
Furthermore, all DNA polymerases use similar strategies to catalyze the 
polymerase reaction, making use of a mechanism in which two metal ions 
take part. 


Two bound metal ions participate in the polymerase reaction 


Like all enzymes with nucleoside triphosphate substrates, DNA polymer- 
ases require metal ions for activity. Examination of the structures of DNA 
polymerases with bound substrates and substrate analogs reveals the pres- 
ence of two metal ions in the active site. One metal ion binds both the 
deoxynucleoside triphosphate (dNTP) and the 3’-hydroxyl group of the 
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Primer 
The initial segment of a polymer that is to be 
extended on which elongation depends. 


Template 
A sequence of DNA or RNA that directs the 
synthesis of a complementary sequence. 


Exonuclease 


k FIGURE 28.4 DNA polymerase 
structure. The first DNA polymerase 
structure determined was that of a 
fragment of E. coli DNA polymerase | called 
the Klenow fragment. Notice that, like other 
DNA polymerases, the polymerase unit 
resembles a right hand with fingers (blue), 
palm (yellow), and thumb (red). The Klenow 
fragment also includes an exonuclease 
domain that removes incorrect nucleotide 
bases. [Drawn from 1DPI.pdb.] 


FIGURE 28.5 DNA polymerase 
mechanism. Two metal ions (typically, 
Mg?*) participate in the DNA polymerase 
reaction. One metal ion coordinates the 
8'-hydroxyl group of the primer, whereas 
the other metal ion interacts only with the 
dNTP. The phosphoryl group of the 
nucleoside triphosphate bridges between 
the two metal ions. The hydroxyl group of 
the primer attacks the phosphoryl group to 
form a new O—P bond. 


FIGURE 28.7 Minor-groove interactions. 
DNA polymerases donate two hydrogen 
bonds to base pairs in the minor groove. 
Hydrogen-bond acceptors are present in 
these two positions for all Watson—Crick base 
pairs, including the A—T base pair shown. 
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primer, whereas the other interacts only with the dNTP (Figure 28.5). The 
two metal ions are bridged by the carboxylate groups of two aspartate resi- 
dues in the palm domain of the polymerase. These side chains hold the 
metal ions in the proper positions and orientations. The metal ion bound to 
the primer activates the 3’-hydroxyl group of the primer, facilitating its 
attack on the a phosphoryl group of the dNTP substrate in the active site. 
The two metal ions together help stabilize the negative charge that accumu- 
lates on the pentacoordinate transition state. The metal ion initially bound 
to dNTP stabilizes the negative charge on the pyrophosphate product. 


The specificity of replication is dictated by complementarity 
of shape between bases 


DNA must be replicated with high fidelity. Each base added to the growing 
chain should, with high probability, be the Watson—Crick complement of the 
base in the corresponding position in the template strand. The binding of the 
dNTP containing the proper base is favored by the formation of a base pair 
with its partner on the template strand. Although hydrogen bonding contrib- 
utes to the formation of this base pair, overall shape complementarity is cru- 
cial. Studies show that a nucleotide with a base that is very similar in shape to 
adenine but lacks the ability to form base-pairing hydrogen bonds can still 
direct the incorporation of thymidine, both in vitro and in vivo (Figure 28.6). 
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FIGURE 28.6 Shape complementarity. The base analog on the right has the same shape 
as adenosine, but groups that form hydrogen bonds between base pairs have been replaced 
by groups (shown in red) not capable of hydrogen bonding. Nonetheless, studies reveal that, 
when incorporated into the template strand, this analog directs the insertion of thymidine in 
DNA replication. 


An examination of the crystal structures of various DNA polymerases 
reveals why shape complementarity is so important. First, residues of the 
enzyme form hydrogen bonds with the minor-groove side of the base pair in 
the active site (Figure 28.7). In the minor groove, hydrogen-bond acceptors 
are present in the same positions for all Watson—Crick base pairs. These 
interactions act as a “ruler” that measures whether a properly spaced base 
pair has formed in the active site. 

Second, DNA polymerases close down around the incoming dNTP 
(Figure 28.8). The binding of a deoxyribonucleoside triphosphate into the 
active site of a DNA polymerase triggers a conformational change: the finger 
domain rotates to form a tight pocket into which only a properly shaped base 
pair will readily fit. Many of the residues lining this pocket are important to 
ensure the efficiency and fidelity of DNA synthesis. For example, mutation of 
a conserved tyrosine residue that forms part of the pocket results in a poly- 
merase that is approximately 40 times as error prone as the parent polymerase. 


dNTP 


dNTP inside 
tight pocket 


hk FIGURE 28.8 Shape selectivity. The binding of a deoxyribonucleoside triphosphate 


(dNTP) to DNA polymerase induces a conformational change, generating a tight pocket 
for the base pair consisting of the dNTP and its partner on the template strand. Such a 


conformational change is possible only when the dNTP corresponds to the Watson—Crick 


partner of the template base. [Drawn from 2BDP.pdb and 1T7P.pdb.] 


An RNA primer synthesized by primase enables DNA 
synthesis to begin 


DNA polymerases cannot initiate DNA synthesis without a primer, a 
section of nucleic acid having a free 3’ end that forms a double helix 
with the template. How is this primer formed? An important clue 
came from the observation that RNA synthesis is essential for the 
initiation of DNA synthesis. In fact, RNA primes the synthesis of 
DNA. An RNA polymerase called primase synthesizes a short stretch 
of RNA (about five nucleotides) that is complementary to one of the 
template DNA strands (Figure 28.9). Primase, like other RNA poly- 
merases, can initiate synthesis without a primer. After DNA synthesis 
has been initiated, the short stretch of RNA is removed by hydrolysis 
and replaced by DNA. 


One strand of DNA is made continuously, whereas the other 
strand is synthesized in fragments 


Both strands of parental DNA serve as templates for the synthesis 
of new DNA. The site of DNA synthesis is called the replication 
fork because the complex formed by the newly synthesized daugh- 
ter helices arising from the parental duplex resembles a two- 
pronged fork. Recall that the two strands are antiparallel; that is, 
they run in opposite directions. During DNA replication, both 
daughter strands appear on cursory examination to grow in the 
same direction. However, all known DNA polymerases synthesize 
DNA in the 5’ —> 3’ direction but not in the 3’ — 5’ direction. How 
then does one of the daughter DNA strands appear to grow in the 
3' + 5’ direction? 

This dilemma was resolved by Reiji Okazaki, who found that a 
significant proportion of newly synthesized DNA exists as small frag- 
ments. These units of about a thousand nucleotides (called Okazaki 
fragments) are present briefly in the vicinity of the replication fork 
(Figure 28.10). 
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FIGURE 28.9 Priming. DNA replication is primed by 
a short stretch of RNA that is synthesized by primase, 
an RNA polymerase. The RNA primer is removed at a 
later stage of replication. 
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FIGURE 28.10 Okazaki fragments. At a replication 
fork, both strands are synthesized in the 5’ —> 3’ 
direction. The leading strand is synthesized 
continuously, whereas the lagging strand is synthesized 
in short pieces termed Okazaki fragments. 
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CHAPTER 28 DNA Replication Repair 
and Recombination 


FIGURE 28.11 DNA ligase reaction. DNA 
ligase catalyzes the joining of one DNA 
strand with a free 3’-hydroxyl group to 
another with a free 5’-phosphoryl group. 

In eukaryotes and archaea, ATP is cleaved 
to AMP and PP; to drive this reaction. 

In bacteria, NAD* is cleaved to AMP and 
nicotinamide mononucleotide (NMN). 
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kK FIGURE 28.12 Helicase structure. The 
structure of the hexameric helicase from 


bacteriophage T7. One of the six subunits is shown in 
yellow with the P-loop NTPase shown in purple. The 
loops that participate in DNA binding are highlighted 
by a yellow oval. Notice that each subunit interacts 
closely with its neighbors and that the DNA-binding 


loops line the hole in the center of the 
structure. [Drawn from 1EOK.pdb.] 


As replication proceeds, these fragments become covalently joined 
through the action of the enzyme DNA ligase to form a continuous daugh- 
ter strand. The other new strand is synthesized continuously. The strand 
formed from Okazaki fragments is termed the lagging strand, whereas the 
one synthesized without interruption is the leading strand. The discontinu- 
ous assembly of the lagging strand enables 5’ — 3’ polymerization at the 
nucleotide level to give rise to overall growth in the 3’ — 5’ direction. 


DNA ligase joins ends of DNA in duplex regions 


The joining of Okazaki fragments requires an enzyme that catalyzes the 
joining of the ends of two DNA chains. The existence of circular DNA 
molecules also points to the existence of such an enzyme. In 1967, scientists 
in several laboratories simultaneously discovered DNA ligase. This enzyme 
catalyzes the formation of a phosphodiester linkage between the 3'-hydroxyl 
group at the end of one DNA chain and the 5'-phosphoryl group at the end of 
the other (Figure 28.11). An energy source is required to drive this thermo- 
dynamically uphill reaction. In eukaryotes and archaea, ATP is the energy 
source. In bacteria, NAD* typically plays this role. 


base (NAD*) + base = base 
A 
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DNA ligase cannot link two molecules of single-stranded DNA or cir- 
cularize single-stranded DNA. Rather, ligase seals breaks in double-stranded 
DNA molecules. The enzyme from E. coli ordinarily forms a phosphodiester 
bridge only if there are at least a few bases of single-stranded DNA on 
the end of a double-stranded fragment that can come together with 
those on another fragment to form base pairs. Ligase encoded by T4 
bacteriophage can link two blunt-ended double-helical fragments, a 
capability that is exploited in recombinant DNA technology. 


The separation of DNA strands requires specific helicases 
and ATP hydrolysis 


For a double-stranded DNA molecule to replicate, the two strands of 
the double helix must be separated from each other, at least locally. This 
separation allows each strand to act as a template on which a new poly- 
nucleotide chain can be assembled. Specific enzymes, termed helicases, 
utilize the energy of ATP hydrolysis to power strand separation. 
Helicases are a large and diverse family of enzymes taking part 
in many biological processes. The helicases in DNA replication are 
typically oligomers containing six subunits that form a ring structure. 
The structure of one such helicase, that from bacteriophage T’7, has 
been a source of considerable insight into the helicase mechanism 
(Figure 28.12). Each of the subunits within this hexameric structure 
has acore structure that includes a P-loop NT Pase domain (Figure 9.49). 
In addition to the P-loop, each subunit has two loops that extend 
toward the center of the ring structure and interact with DNA. Each 
subunit interacts closely with its two neighbors within the ring struc- 
ture. Closer examination of this structure reveals that the ring deviates 
significantly from six-fold symmetry. This deviation is even more 


apparent when helicase is crystallized in the presence of the nonhydrolyzable 
ATP analog AMP-PNP. 
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The AMP-PNP binds to only four of the six subunits within the ring 
(Figure 28.13). Furthermore, the four nucleotide-binding sites are not iden- 
tical but fall into two classes. One class appears to be well positioned to bind 
ATP but not catalyze its hydrolysis, whereas the other class is more well 
suited to catalyze the hydrolysis but not release the hydrolysis products. 
The classes are analogous to myosin’s two different conformations—one for 
binding ATP and one for hydrolyzing it (Section 9.4). Finally, the six sub- 
units fall into three classes with regard to their orientation with respect to 
the overall ring structure, with differences in rotation around an axis in the 
plane of the ring of approximately 30°. These differences in orientation 
affect the position of the two DNA-binding loops in each subunit. 

These observations are consistent with the following mechanism for the 
helicase (Figure 28.14). Only a single strand of DNA can fit through the 
center on the ring. This single strand binds to loops on two adjacent sub- 
units, one of which has bound ATP and the other of which has bound 
ADP + P.. The binding of ATP to the domains that initially had no bound 
nucleotides leads to a conformational change within the entire hexamer, 
leading to the release of ADP + P; from two subunits and the binding of the 
single-stranded DNA by one of the domains that just bound ATP. This 
conformational change pulls the DNA through the center of the hexamer. 
The protein acts as a wedge, forcing the two strands of the double helix 
apart. This cycle then repeats itself, moving two bases along the DNA 
strand with each cycle. 


28.2 DNA Unwinding and Supercoiling Are Controlled 
by Topoisomerases 


As a helicase moves along unwinding DNA, the DNA in front of the helli- 
case will become overwound in the absence of other changes. As discussed 
in Section 4.2, DNA double helices that are torsionally stressed tend to fold 
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hk FIGURE 28.13 Helicase asymmetry. 
The structure of the T7 helicase complexes 
with the ATP analog AMP-PNP is shown. 
The three classes of helicase subunits are 
shown in blue, red, and yellow. The rotation 
relative to the plane of the hexamer is 
shown for each subunit. Notice that only 
four of the subunits, those shown in blue 
and yellow, bind AMP-PNP. [Drawn from 
1EOK.pdb.] 


FIGURE 28.14 Helicase mechanism. One 
of the strands of the double helix passes 
through the hole in the center of the 
helicase, bound to the loops of two 
adjacent subunits. Two of the subunits do 
not contain bound nucleotides. On the 
binding of ATP to these two subunits and 
the release of ADP + P, from two other 
subunits, the helicase hexamer undergoes 
a conformational change, pulling the DNA 
through the helicase. The helicase acts as 
a wedge to force separation of the two 
strands of DNA. 
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up on themselves to form tertiary structures created by supercoiling. We will 
first consider the supercoiling of DNA in quantitative terms and then turn 
to topoisomerases, enzymes that can directly modulate DNA winding and 
supercoiling. Supercoiling is most readily understood by considering circu- 
lar DNA molecules, but it also applies to linear DNA molecules constrained 
to be in loops by other means. Most DNA molecules inside cells are subject 
to supercoiling. 

Consider a linear 260-bp DNA duplex in the B-DNA form 
(Figure 28.15A). Because the number of base pairs per turn in an unstressed 
DNA molecule averages 10.4, this linear DNA molecule has 25 (260/10.4) 
turns. The ends of this helix can be joined to produce a relaxed circular DNA 
(Figure 28.15B). A different circular DNA can be formed by unwinding 
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(D) Lk = 23, Tw ~ 23, Wr ~O 
Unwound circle 


Linear DNA unwound by two right-hand turns 
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(right-handed) 


FIGURE 28.15 Linking number. The relations between the linking number (Lk), twisting 
number (Tw), and writhing number (Wr) of a circular DNA molecule revealed schematically. 
[Information from W. Saenger, Principles of Nucleic Acid Structure (Springer Verlag, 1984), p. 452.] 


the linear duplex by two turns before joining its ends (Figure 28.15C). 
What is the structural consequence of unwinding before ligation? Two lim- 
iting conformations are possible. The DNA can fold into a structure con- 
taining 23 turns of B helix and an unwound loop (Figure 28.15D). 
Alternatively, the double helix can fold up to cross itself. Such crossings are 
called supercoils. In particular, a supercoiled structure with 25 turns of 
B helix and 2 turns of right-handed (termed negative) superhelix can be 
formed (Figure 28.15E). 

Supercoiling markedly alters the overall form of DNA. A supercoiled 
DNA molecule is more compact than a relaxed DNA molecule of the same 
length. Hence, supercoiled DNA moves faster than relaxed DNA when 
analyzed by centrifugation or electrophoresis. Unwinding will cause super- 
coiling in circular DNA molecules, whether covalently closed or con- 
strained in closed configurations by other means. 


The linking number of DNA, a topological property, determines 
the degree of supercoiling 


Our understanding of the conformation of DNA is enriched by concepts 
drawn from topology, a branch of mathematics dealing with structural prop- 
erties that are unchanged by deformations such as stretching and bending. 
A key topological property of a circular DNA molecule is its linking number 
(Lk), which is equal to the number of times that a strand of DNA winds in 
the right-handed direction around the helix axis when the axis lies in a 
plane, as in Figure 28.15A. For the relaxed DNA shown in Figure 28.15B, 
Lk = 25. For the partly unwound molecule shown in part D and the super- 
coiled one shown in part E, Lk = 23 because the linear duplex was unwound 
two complete turns before closure. Molecules differing only in linking num- 
ber are topological isomers, or topoisomers, of one another. Topoisomers of 
DNA can be interconverted only by cutting one or both DNA strands and then 
rejoining them. 

The unwound DNA and supercoiled DNA shown in Figure 28.15D 
and E are topologically identical but geometrically different. They have 
the same value of Lk but differ in twist (Tw) and writhe (Wr). Although 
the rigorous definitions of twist and writhe are complex, twist is a mea- 
sure of the helical winding of the DNA strands around each other, 
whereas writhe is a measure of the coiling of the axis of the double 
helix—that is, supercoiling. A right-handed coil is assigned a negative 
number (negative supercoiling) and a left-handed coil is assigned a posi- 
tive number (positive supercoiling). 

Is there a relation between Tw and Wr? Indeed, there is. Topology tells 
us that the sum of Tw and Wr is equal to Lk. 


Lk = Tw + Wr 


In Figure 28.15, the partly unwound circular DNA has Tw ~ 23, meaning the 
helix has 23 turns and Wr ~ 0, meaning the helix has not crossed itself to cre- 
ate a supercoil. The supercoiled DNA, however has Tw ~ 25 and Wr ~ —2. 
These forms can be interconverted without cleaving the DNA chain because 
they have the same value of Lk—namely, 23. The partitioning of Lk (which 
must be an integer) between Tw and Wr (which need not be integers) is deter- 
mined by energetics. The free energy is minimized when about 70% of the 
change in Lk is expressed in Wr and 30% is expressed in Tw. Hence, the most 
stable form would be one with Tw = 24.4 and Wr = —1.4. Thus, a lowering of 
Lk causes both right-handed (negative) supercoiling of the DNA axis and unwind- 
ing of the duplex. Topoisomers differing by just 1 in Lk, and consequently 
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k FIGURE 28.17 Structure of 
topoisomerase |. The structure of a 
complex between a fragment of human 
topoisomerase | and DNA is shown. Notice 
that DNA lies in a central cavity within the 
enzyme. [Drawn from 1EJ9.pdb.] 


FIGURE 28.16 Topoisomers. An electron micrograph showing negatively supercoiled and 
relaxed DNA. [Courtesy of Dr. Jack Griffith.] 


by 0.7 in Wr, can be readily separated by agarose gel electrophoresis because 
their hydrodynamic volumes are quite different; supercoiling condenses DNA 
(Figure 28.16). 


Topoisomerases prepare the double helix for unwinding 


Most naturally occurring DNA molecules are negatively supercoiled. What 
is the basis for this prevalence? As already stated, negative supercoiling 
arises from the unwinding or underwinding of the DNA. In essence, nega- 
tive supercoiling prepares DNA for processes requiring separation of the 
DNA strands, such as replication. Positive supercoiling condenses DNA as 
effectively, but it makes strand separation more difficult. 

The presence of supercoils in the immediate area of unwinding would, 
however, make unwinding difficult. Therefore, negative supercoils must be 
continuously removed, and the DNA relaxed, as the double helix unwinds. 
Specific enzymes called topoisomerases that introduce or eliminate super- 
coils were discovered by James Wang and Martin Gellert. Type I topoisom- 
erases catalyze the relaxation of supercoiled DNA, a thermodynamically 
favorable process. Type II topoisomerases utilize free energy from ATP 
hydrolysis to add negative supercoils to DNA. Both type I and type II 
topoisomerases play important roles in DNA replication as well as in tran- 
scription and recombination. 

These enzymes alter the linking number of DNA by catalyzing 
a three-step process: (1) the cleavage of one or both strands of DNA, 
(2) the passage of a segment of DNA through this break, and (3) the 
resealing of the DNA break. Type I topoisomerases cleave just one strand 
of DNA, whereas type II enzymes cleave both strands. The two types of 
enzymes have several common features, including the use of key tyro- 
sine residues to form covalent links to the polynucleotide backbone that 
is transiently broken. 


Type | topoisomerases relax supercoiled structures 


The three-dimensional structures of several type I topoisomerases have 
been determined (Figure 28.17). These structures reveal many features of 
the reaction mechanism. Human type I topoisomerase comprises four 
domains, which are arranged around a central cavity having a diameter of 
20 A, just the correct size to accommodate a double-stranded DNA mole- 
cule. This cavity also includes a tyrosine residue (Tyr 723), which acts as a 
nucleophile to cleave the DNA backbone in the course of catalysis. 
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FIGURE 28.18 Topoisomerase | mechanism. On binding to DNA, topoisomerase | cleaves 
one strand of the DNA by means of a tyrosine (Y) residue attacking a phosphoryl group. When 
the strand has been cleaved, it rotates in a controlled manner around the other strand. The 
reaction is completed by religation of the cleaved strand. This process results in partial or 
complete relaxation of a supercoiled plasmid. 


From analyses of these structures and the results of other studies, the 
relaxation of negatively supercoiled DNA molecules is known to proceed in 
the following manner (Figure 28.18). First, the DNA molecule binds inside 
the cavity of the topoisomerase. The hydroxyl group of tyrosine 723 attacks 
a phosphoryl group on one strand of the DNA backbone to form a phospho- 
diester linkage between the enzyme and the DNA, cleaving the DNA and 
releasing a free 5’-hydroxyl group. 
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With the backbone of one strand cleaved, the DNA can now rotate 
around the remaining strand, its movement driven by the release of 
energy stored because of the supercoiling. The rotation of the DNA 
unwinds the supercoils. The enzyme controls the rotation so that the 
unwinding is not rapid. The free hydroxyl group of the DNA attacks 
the phosphotyrosine residue to reseal the backbone and release tyrosine. 
The DNA is then free to dissociate from the enzyme. Thus, reversible 
cleavage of one strand of supercoiled DNA allows controlled rotation to 
partly relax the supercoils. 


Type Il topoisomerases can introduce negative supercoils through 
coupling to ATP hydrolysis 


Supercoiling requires an input of energy because a supercoiled molecule, in 
contrast with its relaxed counterpart, is torsionally stressed. The introduc- 
tion of an additional supercoil into a 3000-bp plasmid typically requires 
about 30 kJ mol~! (7 kcal mol~!). 
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DNA-segment- 
binding sites 


k FIGURE 28.19 Structure of topoisomerase II. The dimeric 
structure of a typical topoisomerase Il, that from the archaeon 
Sulfolobus shibatae. Notice that each half of the enzyme has one 
domain (shown in yellow) that contains a region for binding a DNA 
double helix and another domain (shown in green) that contains 
ATP binding sites. [Drawn from 2ZBK.pdb.] 


G segment 


Supercoiling can be catalyzed by type II topoisomer- 
ases. These elegant molecular machines couple the bind- 
ing and hydrolysis of ATP to the directed passage of one 
DNA double helix through another, temporarily cleaved 
DNA double helix. These enzymes have several mecha- 
nistic features in common with the type I topoisomerases. 

Topoisomerase II molecules are dimeric with a large 
internal cavity (Figure 28.19). The large cavity has gates at 
both the top and the bottom that are crucial to topoisom- 
erase action. The reaction begins with the binding of 
one double helix (hereafter referred to as the G, for gate, 
segment) to the enzyme (Figure 28.20). Each strand is 
positioned next to a tyrosine residue, one from each mono- 
mer, capable of forming a covalent linkage with the DNA 
backbone. This complex then loosely binds a second DNA 
double helix (hereafter referred to as the T, for transported, 
segment). Each monomer of the enzyme has a domain that 
binds ATP; this ATP binding leads to a conformational 
change that strongly favors the coming together of the two 
domains. As these domains come closer together, they trap 
the bound T segment. This conformational change also 
forces the separation and cleavage of the two strands of 
the G segment. Each strand is linked to the enzyme by a 
tyrosine—phosphodiester bond. Unlike the type I enzymes, 


T segment 


FIGURE 28.20 Topoisomerase || mechanism. Topoisomerase II first binds one DNA duplex 
termed the G (for gate) segment. The binding of ATP to the two N-terminal domains brings 
these two domains together. This conformational change leads to the cleavage of both 
strands of the G segment and the binding of an additional DNA duplex, the T segment. 

This T segment then moves through the break in the G segment and out the bottom of 

the enzyme. The hydrolysis of ATP resets the enzyme with the G segment still bound. 

The orientation of the entering T segment results in the introduction of negative supercoils. 
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the type II topoisomerases hold the DNA tightly so that it cannot rotate. 
The T segment then passes through the cleaved G segment and into the 
large central cavity. The ligation of the G segment leads to the release of the 
T segment through the gate at the bottom of the enzyme. The hydrolysis of 
ATP and the release of ADP and orthophosphate allow the ATP-binding 
domains to separate, preparing the enzyme to bind another T segment. The 
overall process leads to a decrease in the linking number by two. 


Ea The bacterial topoisomerase II (often called DNA gyrase) is the target 

of several antibiotics that inhibit the prokaryotic enzyme much more 
than the eukaryotic one. Novobiocin blocks the binding of ATP to gyrase. 
Nalidixic acid and ciprofloxacin, in contrast, interfere with the breakage and 
rejoining of DNA chains. These two gyrase inhibitors are widely used to treat 
urinary tract and other infections including those due to Bacillus anthracis 
(anthrax). Camptothecin, an antitumor agent, inhibits human topoisomerase I 
by stabilizing the form of the enzyme covalently linked to DNA. 


Nalidixic is A. 


28.3 DNA Replication Is Highly Coordinated 


DNA replication must be very rapid, given the sizes of the genomes and the 
rates of cell division. The E. coli genome contains 4.6 million base pairs and 
is copied in less than 40 minutes. Thus, 2000 bases are incorporated per 
second. Enzyme activities must be highly coordinated to replicate entire 
genomes precisely and rapidly. 

We begin our consideration of the coordination of DNA replication by 
looking at E. coli, which has been extensively studied. For this organism 
with a relatively small genome, replication begins at a single site and contin- 
ues around the circular chromosome. The coordination of eukaryotic DNA 
replication is much more complex because there are many initiation sites 
throughout the genome and an additional enzyme is needed to replicate the 
ends of linear chromosomes. 


DNA replication requires highly processive polymerases 


Replicative polymerases are characterized by their very high catalytic 
potency, fidelity, and processivity. Processivity refers to the ability of an 
enzyme to catalyze many consecutive reactions without releasing its sub- 
strate. These polymerases are assemblies of many subunits that have 
evolved to grasp their templates and not let go until many nucleotides have 
been added. The source of the processivity was revealed by the determina- 
tion of the three-dimensional structure of the 8, subunit of the E. coli rep- 
licative polymerase called DNA polymerase III (Figure 28.21). This unit 
keeps the polymerase associated with the DNA double helix. It has the form 
of a star-shaped ring. A 35-A-diameter hole in its center can readily accom- 
modate a duplex DNA molecule and leaves enough space between the 
DNA and the protein to allow rapid sliding during replication. To achieve a 
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® FIGURE 28.21 Structure of a sliding 
DNA clamp. The dimeric B subunit of DNA 
polymerase III forms a ring that surrounds 
the DNA duplex. Notice the central cavity 
through which the DNA template slides. 
Clasping the DNA molecule in the ring, 
the polymerase enzyme is able to move 
without falling off the DNA substrate. 
[Drawn from 2POL.pdb.] 
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Processive enzyme 

From the Latin procedere, “to go forward.” 
An enzyme that catalyzes multiple rounds 

of the elongation or digestion of a polymer 

while the polymer stays bound. A distributive 

enzyme, in contrast, releases its polymeric 

substrate between successive catalytic steps. 


FIGURE 28.22 Replication fork. A 
schematic view of the arrangement of DNA 
polymerase Ill and associated enzymes and 
proteins present in the replication of DNA. 
The helicase separates the two strands of 
the parent double helix, allowing DNA 
polymerases to use each strand as a 
template for DNA synthesis. Abbreviation: 
SSB, single-stranded-binding protein. 


catalytic rate of 1000 nucleotides polymerized per second requires that 
100 turns of duplex DNA (a length of 3400 A, or 0.34 mm) slide through 
the central hole of 8, per second. Thus, 8, plays a key role in replication by 
serving as a sliding DNA clamp. 

How does DNA become entrapped inside the sliding clamp? Replicative 
polymerases also include assemblies of subunits that function as clamp load- 
ers. These enzymes grasp the sliding clamp and, utilizing the energy of ATP 
binding, pull apart one of the interfaces between the two subunits of the 
sliding clamp. DNA can move through the gap, inserting itself through the 
central hole. ATP hydrolysis then releases the clamp, which closes around 
the DNA. 


The leading and lagging strands are synthesized 
in a coordinated fashion 


Replicative polymerases such as DNA polymerase III synthesize the leading 
and lagging strands simultaneously at the replication fork (Figure 28.22). 
DNA polymerase III begins the synthesis of the leading strand starting from 
the RNA primer formed by primase. The duplex DNA ahead of the poly- 
merase is unwound by a hexameric helicase called DnaB. Copies of single- 
stranded-binding protein (SSB) bind to the unwound strands, keeping the 
strands separated so that both strands can serve as templates. The leading 
strand is synthesized continuously by polymerase III. Topoisomerase II 
concurrently introduces right-handed (negative) supercoils to avert a topo- 
logical crisis. 

The mode of synthesis of the lagging strand is necessarily more complex. 
As mentioned earlier, the lagging strand is synthesized in fragments so that 
5’ —> 3’ polymerization leads to overall growth in the 3’ — 5’ direction. Yet 
the synthesis of the lagging strand is coordinated with the synthesis of the 
leading strand. How is this coordination accomplished? Examination of 
the subunit composition of the DNA polymerase ITI holoenzyme reveals an 
elegant solution (Figure 28.23). The holoenzyme includes two copies of the 
polymerase core enzyme, which consists of the DNA polymerase itself (the 
a subunit); the e subunit, a 3’-to-5’ proofreading exonuclease; another sub- 
unit called 8; and two copies of the dimeric B-subunit sliding clamp. The 
core enzymes are linked to a central structure having the subunit composition 
yT,58'x. The yr,58’ complex is the clamp loader, and the x and subunits 
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interact with the single-stranded- DNA-binding protein. The entire appara- 
tus interacts with the hexameric helicase DnaB. Eukaryotic replicative poly- 
merases have similar, albeit slightly more complicated, subunit compositions 
and structures. 

The lagging-strand template is looped out so that it passes through the 
polymerase site in one subunit of a dimeric polymerase III in the same 
direction as that of the leading-strand template in the other subunit, 5’ 3’. 
DNA polymerase III lets go of the lagging-strand template after adding 
about 1000 nucleotides by releasing the sliding clamp. A new loop is then 
formed, a sliding clamp is added, and primase again synthesizes a short 
stretch of RNA primer to initiate the formation of another Okazaki frag- 
ment. This mode of replication has been termed the trombone model because 
the size of the loop lengthens and shortens like the slide on a trombone 
(Figure 28.24). 
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FIGURE 28.23 DNA polymerase 
holoenzyme. Each holoenzyme consists of 
two copies of the polymerase core enzyme, 
which comprises the a, ¢, and @ subunits 
and two copies of the B subunit, linked to 
a central structure. The central structure 
includes the clamp-loader complex and 

the hexameric helicase DnaB. 


FIGURE 28.24 Trombone model. The 
replication of the leading and lagging 
strands is coordinated by the looping out of 
the lagging strand to form a structure that 
acts somewhat as a trombone slide, 
growing as the replication fork moves 
forward. When the polymerase on the 
lagging strand reaches a region that has 
been replicated, the sliding clamp is 
released and a new loop is formed. 
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DnaA 
hexamer 


FIGURE 28.26 Assembly of DnaA. 
Monomers of DnaA bind to their binding 
sites (Shown in yellow) in oriC and come 
together to form a complex structure, 
possibly the cyclic hexamer shown here. 
This structure marks the origin of replication 
and favors DNA strand separation in the 
AT-rich sites (green). 


FIGURE 28.27 Prepriming complex. The 
AT-rich regions are unwound and trapped 
by the single-stranded-binding protein 
(SSB). The hexameric DNA helicase DnaB 
is loaded on each strand. At this stage, the 
complex is ready for the synthesis of the 
RNA primers and assembly of the DNA 
polymerase III holoenzyme. 
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FIGURE 28.25 Origin of replication in E. coli. The oriC locus has a length of 245 bp. 
It contains a tandem array of three nearly identical 13-nucleotide sequences (green) and 
five binding sites (yellow) for the DnaA protein. 


The gaps between fragments of the nascent lagging strand are filled 
by DNA polymerase I. This essential enzyme also uses its 5’ — 3’ exo- 
nuclease activity to remove the RNA primer lying ahead of the poly- 
merase site. The primer cannot be erased by DNA polymerase III, 
because the enzyme lacks 5’ — 3’ editing capability. Finally, DNA ligase 
connects the fragments. 


DNA replication in Escherichia coli begins at a unique site 


In E. coli, DNA replication starts at a unique site within the entire 4.6 X 
10° bp genome. This origin of replication, called the oriC locus, is a 245-bp 
region that has several unusual features (Figure 28.25). The oriC locus con- 
tains five copies of a sequence that is the preferred binding site for the ori- 
gin-recognition protein DnaA. In addition, the locus contains a tandem 
array of 13-bp sequences that are rich in AT base pairs. Several steps are 
required to prepare for the start of replication: 


1. The binding of DnaA proteins to DNA is the first step in the preparation 
for replication. DnaA is a member of the P-loop NTPase family related to 
the hexameric helicases. Each DnaA monomer comprises an ATPase 
domain linked to a DNA-binding domain at its C-terminus. DnaA mole- 
cules are able to bind to each other through their ATPase domains; a group 
of bound DnaA molecules will break apart on the binding and hydrolysis of 
ATP. The binding of DnaA molecules to one another signals the start of the 
preparatory phase, and their breaking apart signals the end of that phase. 
The DnaA proteins bind to the five high-affinity sites in oriC and then 
come together with DnaA molecules bound to lower-affinity sites to form 
an oligomer, possibly a cyclic hexamer. The DNA is wrapped around the 
outside of the DnaA hexamer (Figure 28.26). 


2. Single DNA strands are exposed in the prepriming complex. With DNA 
wrapped around a DnaA hexamer, additional proteins are brought into 
play. The hexameric helicase DnaB is loaded around the DNA with the 
help of the helicase loader protein DnaC. Local regions of oriC, including 
the AT regions, are unwound and trapped by the single-stranded- DNA— 
binding protein. The result of this process is the generation of a structure 
called the prepriming complex, which makes single-stranded DNA accessi- 
ble to other proteins (Figure 28.27). Significantly, the primase, DnaG, is 
now able to insert the RNA primer. 


3. The polymerase holoenzyme assembles. The DNA polymerase III holoen- 
zyme assembles on the prepriming complex, initiated by interactions 
between DnaB and the sliding-clamp subunit of DNA polymerase III. 
These interactions also trigger ATP hydrolysis within the DnaA subunits, 
signaling the initiation of DNA replication. The breakup of the DnaA 
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FIGURE 28.28 Eukaryotic origins of replication. The image shows a single molecule of DNA 
containing two origins of replication. The origins were identified by labeling new replicated DNA 
in human cells first with one thymine analog (iodo-deoxyuridine, |-dU) and then another (chloro- 
deoxyuridine, Cl-dU). DNA molecules from these cells were then extended on a microscope slide 
and labeled with antibodies to I-dU (green) and Cl-dU (red) to visualize the DNA. This method 
allows the detection of replication origins as well as determination of the rate of DNA synthesis. 
[Courtesy Aaron Bensimon. Data from C. Conti, et al., Mol. Biol. Cell. 18:3059-3067, 2007.] 


assembly prevents additional rounds of replication from beginning at the 
replication origin. 


DNA synthesis in eukaryotes is initiated at multiple sites 


Replication in eukaryotes is mechanistically similar to replication in prokary- 
otes but is more challenging for a number of reasons. One of them is sheer 
size: E. coli must replicate 4.6 million base pairs, whereas a human diploid 
cell must replicate more than 6 billion base pairs. Second, the genetic infor- 
mation for E. coli is contained on 1 chromosome, whereas, in human beings, 
23 pairs of chromosomes must be replicated. Finally, whereas the E. coli 
chromosome is circular, human chromosomes are linear. Unless counter- 
measures are taken, linear chromosomes are subject to shortening with each 
round of replication. 

The first two challenges are met by the use of multiple origins of replica- 
tion. In human beings, replication requires about 30,000 origins of replication, 
with each chromosome containing several hundred. Each origin of replication 
is the starting site for a replication unit, or replicon. DNA replication can be 
monitored by single-molecule methods, revealing bidirectional synthesis from 
different sites (Figure 28.28). In contrast with E. coli, the origins of replication 
in human beings do not contain regions of sharply defined sequence. Instead, 
more broadly defined AT-rich sequences are the sites around which the origin 
of replication complexes (ORCs) are assembled. 


1. The assembly of the ORC is the first step in the preparation for replication. 
In human beings, the ORC is composed of six different proteins, each 
homologous to DnaA. These proteins come together to form a hexameric 
structure analogous to the assembly formed by DnaA. 


2. Licensing factors recruit a helicase that exposes single strands of DNA. 
After the ORC has been assembled, additional proteins are recruited, 
including Cdc6, a homolog of the ORC subunits, and Cdt1. These pro- 
teins, in turn, recruit a hexameric helicase with six distinct subunits called 
Mcm2-7. These proteins, including the helicase, are sometimes called 
licensing factors because they permit the formation of the initiation complex. 
After the initiation complex has formed, Mcm2-7 separates the parental 
DNA strands, and the single strands are stabilized by the binding of replica- 
tion protein A, a single-stranded- DNA-binding protein. 


3. Two distinct polymerases are needed to copy a eukaryotic replicon. An initia- 
tor polymerase called polymerase a begins replication but is soon replaced by 
amore processive enzyme. This process is called polymerase switching because 
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Mitosis 
begins 
FIGURE 28.29 Eukaryotic cell cycle. 
DNA replication and cell division must take 
place in a highly coordinated fashion in 
eukaryotes. Mitosis (M) takes place only 
after DNA synthesis (S). Two gaps (G, and 
G,) in time separate the two processes. 


TABLE 28.1 Some types of DNA polymerases 


Name Function 


Prokaryotic Polymerases 


DNA polymerase I Erases primer and fills in gaps on 
lagging strand 
DNA polymerase II (error-prone polymerase) DNA repair 
DNA polymerase III Primary enzyme of DNA synthesis 
Eukaryotic Polymerases 
DNA polymerase a Initiator polymerase 
Primase subunit Synthesizes the RNA primer 
DNA polymerase unit Adds stretch of about 20 nucleotides 


to the primer 
DNA polymerase B (error-prone polymerase) DNA repair 
DNA polymerase 8 Primary enzyme of DNA synthesis 


one polymerase has replaced another. This second enzyme, called DNA poly- 
merase 6, is the principal replicative polymerase in eukaryotes (Table 28.1). 

Replication begins with the binding of DNA polymerase a. This 
enzyme includes a primase subunit, used to synthesize the RNA primer, as 
well as an active DNA polymerase. After this polymerase has added a 
stretch of about 20 deoxynucleotides to the primer, another replication pro- 
tein, called replication factor C (RFC), displaces DNA polymerase a. 
Replication factor C attracts a sliding clamp called proliferating cell nuclear 
antigen (PCNA), which is homologous to the 8, subunit of E. coli poly- 
merase III. The binding of PCNA to DNA polymerase 6 renders the 
enzyme highly processive and suitable for long stretches of replication. 
Replication continues in both directions from the origin of replication until 
adjacent replicons meet and fuse. RNA primers are removed and the DNA 
fragments are ligated by DNA ligase. 

The use of multiple origins of replication requires mechanisms for ensur- 
ing that each sequence is replicated once and only once. The events of eukary- 
otic DNA replication are linked to the eukaryotic cell cycle (Figure 28.29). 
The processes of DNA synthesis and cell division are coordinated in the cell 
cycle so that the replication of all DNA sequences is complete before the 
cell progresses into the next phase of the cycle. This coordination requires 
several checkpoints that control the progression along the cycle. A family of 
small proteins termed cyclins are synthesized and degraded by proteasomal 
digestion in the course of the cell cycle. Cyclins act by binding to specific 
cyclin-dependent protein kinases and activating them. One such kinase, cyclin- 
dependent kinase 2 (cdk2) binds to assemblies at origins of replication and 
regulates replication through a number of interlocking mechanisms. 


Telomeres are unique structures at the ends of linear chromosomes 


Whereas the genomes of essentially all prokaryotes are circular, the chromo- 
somes of human beings and other eukaryotes are linear. The free ends of 
linear DNA molecules introduce several complications that must be resolved 
by special enzymes. In particular, complete replication of DNA ends is dif- 
ficult because polymerases act only in the 5’ —> 3’ direction. The lagging 
strand would have an incomplete 5’end after the removal of the RNA 
primer. Each round of replication would further shorten the chromosome. 
The first clue to how this problem is resolved came from sequence analy- 
ses of the ends of chromosomes, which are called telomeres (from the Greek 
telos, “‘an end”). Telomeric DNA contains hundreds of tandem repeats of a 
six-nucleotide sequence. One of the strands is G rich at the 3’ end, and it is 


G-rich strand 


$<“ 


slightly longer than the other strand. In human beings, the repeating G-rich 
sequence is AGGGTT. 

The structure adopted by telomeres has been extensively investigated. 
Evidence suggests that they may form large duplex loops (Figure 28.30). 
The single-stranded region at the very end of the structure has been pro- 
posed to loop back to form a DNA duplex with another part of the repeated 
sequence, displacing a part of the original telomeric duplex. This loop-like 
structure is formed and stabilized by specific telomere-binding proteins. 
Such structures would nicely mask and protect the end of the chromosome. 


Telomeres are replicated by telomerase, a specialized polymerase 
that carries its own RNA template 


How are the repeated sequences generated? An enzyme, termed telomerase, 
that executes this function has been purified and characterized. When a 
primer ending in GGTT is added to human telomerase in the presence of 
deoxynucleoside triphosphates, the sequences GGITTAGGGTT and 
GGTTAGGGTTAGGGTT, as well as longer products, are generated. 
Elizabeth Blackburn and Carol Greider discovered that the enzyme adding 
the repeats contains an RNA molecule that serves as the template for the 
elongation of the G-rich strand (Figure 28.31). Thus, telomerase carries the 
information necessary to generate the telomere sequences. The exact num- 
ber of repeated sequences is not crucial. 

Subsequently, a protein component of telomerases also was identified. 
This component is related to reverse transcriptases, enzymes first discovered 
in retroviruses that copy RNA into DNA (Section 5.2). Thus, telomerase is a 
specialized reverse transcriptase that carries its own template. Telomerase is 
generally expressed at high levels only in rapidly growing cells. Thus, telo- 
meres and telomerase can play important roles in cancer-cell biology and in 
cell aging. 


e Because cancer cells express high levels of telomerase, whereas most 
normal cells do not, telomerase is a potential target for anticancer 
therapy. A variety of approaches for blocking telomerase expression or block- 
ing its activity are under investigation for cancer treatment and prevention. 


28.4 Many Types of DNA Damage Can Be Repaired 


We have examined how even very large and complex genomes can, in prin- 
ciple, be replicated with considerable fidelity. However, DNA does become 
damaged, both in the course of replication and through other processes. 
Damage to DNA can be as simple as the misincorporation of a single base 
or it can take more-complex forms such as the chemical modification of 
bases, chemical cross-links between the two strands of the double helix, or 
breaks in one or both of the phosphodiester backbones. The results may be 
cell death or cell transformation, changes in the DNA sequence that can be 
inherited by future generations, or blockage of the DNA replication process 
itself. A variety of DNA-repair systems have evolved that can recognize 
these defects and, in many cases, restore the DNA molecule to its undam- 
aged form. We begin with some of the sources of DNA damage. 


FIGURE 28.30 Proposed model for 
telomeres. A single-stranded segment of 
the G-rich strand extends from the end 

of the telomere. In one model for telomeres, 
this single-stranded region invades the 
duplex to form a large duplex loop. 
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FIGURE 28.31 Telomere formation. 
Mechanism of synthesis of the G-rich strand 
of telomeric DNA. The RNA template of 
telomerase is shown in blue and the 
nucleotides added to the G-rich strand of 
the primer are shown in red. [Information from 
E. H. Blackburn, Nature 350:569-573, 1991.] 
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and Recombination 


Errors can arise in DNA replication 


Errors introduced in the replication process are the simplest source of 
damage in the double helix. With the addition of each base, there is the pos- 
sibility that an incorrect base might be incorporated, forming a non- Watson— 
Crick base pair. These non-Watson—Crick base pairs can locally distort the 
DNA double helix. Furthermore, such mismatches can be mutagenic; that 
is, they can result in permanent changes in the DNA sequence. When a 
double helix containing a non- Watson—Crick base pair is replicated, the two 
daughter double helices will have different sequences because the mis- 
matched base is very likely to pair with its Watson—Crick partner. Errors 
other than mismatches include insertions, deletions, and breaks in one or 
both strands. Furthermore, replicative polymerases can stall or even fall off 
a damaged template entirely. As a consequence, replication of the genome 
may halt before it is complete. 

A variety of mechanisms have evolved to deal with such interruptions, 
including specialized DNA polymerases that can replicate DNA across 
many lesions. A drawback is that such polymerases are substantially more 
error prone than are normal replicative polymerases. Nonetheless, these 
translesion or error-prone polymerases allow the completion of a draft 
sequence of the genome that can be at least partly repaired by DNA-repair 
processes. DNA recombination (Section 28.5) provides an additional mech- 
anism for salvaging interruptions in DNA replication. 


Bases can be damaged by oxidizing agents, alkylating agents, 
and light 


A variety of chemical agents can alter specific bases within DNA after 
replication is complete. Such mutagens include reactive oxygen species 
such as hydroxyl radical. For example, hydroxyl radical reacts with 
guanine to form 8-oxoguanine. 8-Oxoguanine is mutagenic because it 
often pairs with adenine rather than cytosine in DNA replication. Its 


8-Oxoguanine Adenine 


FIGURE 28.32 Oxoguanine-adenine base 

pair. When guanine is oxidized to 8-oxoguanine, the 
damaged base can form a base pair with adenine 
through an edge of the base that does not normally 
participate in base-pair formation. 


FIGURE 28.33 Adenine deamination. The 
base adenine can be deaminated to form 
hypoxanthine. Hypoxanthine forms base 
pairs with cytosine in a manner similar to 
that of guanine, and so the deamination 
reaction can result in mutation. 


choice of pairing partner differs from that of guanine because it uses a 
different edge of the base to form base pairs (Figure 28.32). Deamination 
is another potentially deleterious process. For example, adenine can be 
deaminated to form hypoxanthine (Figure 28.33). This process is 
mutagenic because hypoxanthine pairs with cytosine rather than thy- 
mine. Guanine and cytosine also can be deaminated to yield bases that 
pair differently from the parent base. 


Adenine Hypoxanthine 


In addition to oxidation and deamination, nucleotide bases are subject to 
alkylation. Electrophilic centers can be attacked by nucleophiles such as 
N-7 of guanine and adenine to form alkylated adducts. Some compounds 
are converted into highly active electrophiles through the action of enzymes 
that normally play a role in detoxification. A striking example is aflatoxin 
B,, acompound produced by molds that grow on peanuts and other foods. 
A cytochrome P450 enzyme (Section 26.4) converts this compound into a 


highly reactive epoxide (Figure 28.34). This agent reacts with the N-7 atom 
of guanosine to form a mutagenic adduct that frequently leads to a G-C-to- 
T—A transversion. 

The ultraviolet component of sunlight is a ubiquitous DNA-damaging 
agent. Its major effect is to covalently link adjacent pyrimidine residues 
along a DNA strand (Figure 28.35). Such a pyrimidine dimer cannot fit into 
a double helix, and so replication and gene expression are blocked until the 
lesion is removed. 
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Thymine dimer 


FIGURE 28.35 Cross-linked dimer of two thymine bases. Ultraviolet light induces cross- 
links between adjacent pyrimidines along one strand of DNA. 


A thymine dimer is an example of an intrastrand cross-link because both 
participating bases are in the same strand of the double helix. Cross-links 
between bases on opposite strands also can be introduced by various agents. 
Psoralens are compounds, produced by a number of plants including the 
common fig, that form such interstrand cross-links (Figure 28.36). Interstrand 
cross-links disrupt replication because they prevent strand separation. 

High-energy electromagnetic radiation such as x-rays can damage DNA 
by producing high concentrations of reactive species in solution. X-ray 
exposure can induce several types of DNA damage including single- and 
double-stranded breaks in DNA. This ability to induce such DNA damage 
led Hermann Muller to discover the mutagenic effects of x-rays in Drosophila 
in 1927. This discovery contributed to the development of Drosophila as one 
of the premier organisms for genetic studies. 


DNA damage can be detected and repaired by a variety of systems 


To protect the genetic message, a wide range of DNA-repair systems are 
present in most organisms. Many systems repair DNA by using sequence 
information from the uncompromised strand. Such single-strand replica- 
tion systems follow a similar mechanistic outline: 


1. Recognize the offending base(s). 
2. Remove the offending base(s). 
3. Repair the resulting gap with a DNA polymerase and DNA ligase. 


We will briefly consider examples of several repair pathways. Although 
many of these examples are taken from E. coli, corresponding repair sys- 
tems are present in most other organisms, including humans. 

The replicative DNA polymerases themselves are able to correct many 
DNA mismatches produced in the course of replication. For example, the ¢ 
subunit of E. coli DNA polymerase III functions as a 3'-to-5’ exonuclease. This 
domain removes mismatched nucleotides from the 3’ end of DNA by hydroly- 
sis. How does the enzyme sense whether a newly added base is correct? As a 
new strand of DNA is synthesized, it is proofread. If an incorrect base is 
inserted, then DNA synthesis slows down owing to the difficulty of threading 
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FIGURE 28.34 Aflatoxin activation. The 
compound, produced by molds that grow 
on peanuts, is activated by cytochrome 
P450 to form a highly reactive species that 
modifies bases such as guanine in DNA, 


leading to mutations. 


Psoralen 


FIGURE 28.36 A cross-linking 


agent. The compound psoralen and its 


derivatives can form interstrand cross- 


links through two reactive sites that can 


form adducts with nucleotide bases. 
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Template 
strand 


Exonuclease 
active site 


FIGURE 28.37 Proofreading. The growing 
polynucleotide chain occasionally leaves 
the polymerase site and migrates to the 
active site of the exonuclease. There, one 
or more nucleotides are excised from the 
newly synthesized chain, removing 
potentially incorrect bases. 


3! Mismatch ie 5! 


TITTlTTTTllIghttll ttt 


5’ a 


della llaaas banana 
[ow polymerase III 


denlelanaallelaaaalllaaadanlalaanlelelanaslelaal 
FIGURE 28.38 Mismatch repair. DNA 
mismatch repair in E. coli is initiated by 
the interplay of MutS, MutL, and MutH 
proteins. A G—T mismatch is recognized 
by MutS. MutH cleaves the backbone in 
the vicinity of the mismatch. A segment of 
the DNA strand containing the erroneous 
T is removed by exonuclease | and 
synthesized anew by DNA polymerase Ill. 
[Information from R. F. Service, Science 
263:1559-1560, 1994.] 
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a non-Watson—Crick base pair into the polymerase. In addition, the mis- 
matched base is weakly bound and therefore able to fluctuate in position. The 
delay from the slowdown allows time for these fluctuations to take the newly 
synthesized strand out of the polymerase active site and into the exonuclease 
active site (Figure 28.37). There, the DNA is degraded, one nucleotide at a time, 
until it moves back into the polymerase active site and synthesis continues. 

A second mechanism is present in essentially all cells to correct errors 
made in replication that are not corrected by proofreading (Figure 28.38). 
Mismatch-repair systems consist of at least two proteins, one for detecting the 
mismatch and the other for recruiting an endonuclease that cleaves the newly 
synthesized DNA strand close to the lesion to facilitate repair. In E. coli, these 
proteins are called MutS and MutL and the endonuclease is called MutH. 

Another mechanism of DNA repair is direct repair, one example of which 
is the photochemical cleavage of pyrimidine dimers. Nearly all cells contain a 
photoreactivating enzyme called DNA photolyase. The E. coli enzyme, a 35-kd 
protein that contains bound N°,N!°-methenyltetrahydrofolate and flavin 
adenine dinucleotide (FAD) cofactors, binds to the distorted region of DNA. 
The enzyme uses light energy—specifically, the absorption of a photon by 
the N°,N!°-methenyltetrahydrofolate coenzyme—to form an excited state 
that cleaves the dimer into its component bases. 

The excision of modified bases such as 3-methyladenine by the E. coli 
enzyme AIRA is an example of base-excision repair. The binding of this 
enzyme to damaged DNA flips the affected base out of the DNA double 
helix and into the active site of the enzyme (Figure 28.39). The enzyme then 


Backbone sugar 
in the apurinic 
site 


hk FIGURE 28.39 Structure of DNA-repair enzyme. A complex between the DNA-repair 
enzyme AlkA and an analog of a DNA molecule missing a purine base (an apurinic site) is 
shown. Notice that the backbone sugar in the apurinic site is flipped out of the double helix 
into the active site of the enzyme. [Drawn from 1BNK.pdb.] 


acts as a glycosylase, cleaving the glycosidic bond to release the damaged 
base. At this stage, the DNA backbone is intact, but a base is missing. 
This hole is called an AP site because it is apurinic (devoid of A or G) or 
apyrimidinic (devoid of C or T). An AP endonuclease recognizes this defect 
and nicks the backbone adjacent to the missing base. Deoxyribose phospho- 
diesterase excises the residual deoxyribose phosphate unit, and DNA poly- 
merase I inserts an undamaged nucleotide, as dictated by the base on the 
undamaged complementary strand. Finally, the repaired strand is sealed by 
DNA ligase. 

One of the best-understood examples of nucleotide-excision repair is uti- 
lized for the excision of a pyrimidine dimer. Three enzymatic activities are 
essential for this repair process in E. coli (Figure 28.40). First, an enzyme 
complex consisting of the proteins encoded by the uvrABC genes detects the 
distortion produced by the DNA damage. The UvrABC enzyme then cuts 
the damaged DNA strand at two sites, 8 nucleotides away from the dam- 
aged site on the 5’ side and 4 nucleotides away on the 3’ side. The 12-residue 
oligonucleotide excised by this highly specific excinuclease (from the Latin 
exci, “to cut out”) then diffuses away. DNA polymerase I enters the gap to 
carry out repair synthesis. The 3’ end of the nicked strand is the primer, and 
the intact complementary strand is the template. Finally, the 3’ end of the 
newly synthesized stretch of DNA and the original part of the DNA chain 
are joined by DNA ligase. 

DNA ligase is able to seal simple breaks in one strand of the DNA 
backbone. However, alternative mechanisms are required to repair 
breaks on both strands that are close enough together to separate the 
DNA into two double helices. Several distinct mechanisms are able to 
repair such damage. One mechanism, nonhomologous end joining (NHEJ), 
does not depend on other DNA molecules in the cell. In NHEJ, the free 
double-stranded ends are bound by a heterodimer of two proteins, Ku70 
and Ku80. These proteins stabilize the ends and mark them for subse- 
quent manipulations. Through mechanisms that are not yet well under- 
stood, the Ku70/80 heterodimers act as handles used by other proteins to 
draw the two double-stranded ends close together so that enzymes can 
seal the break. 

Alternative mechanisms of double-stranded-break repair can operate if 
an intact stretch of double-stranded DNA with an identical or very similar 
sequence is present in the cell. These repair processes use homologous 
recombination, presented in Section 28.5. 

As noted in Section 5.4, researchers can now take advantage of double- 
stranded-break repair mechanisms for executing targeted changes in 
eukaryotic genomes. Specific engineered nucleases are used to introduce 
double-strand breaks in particular locations within the genome. The DNA 
repair machinery in the targeted cells will then repair the break, through 
either NHEJ to disrupt the gene or homologous recombination-based repair 
with added double-stranded DNA fragments to introduce more elaborate 
modifications. For example, mutations associated with human diseases can 
be specifically introduced into the genomes of model organisms such as 
mice or zebrafish to examine the consequences. 


The presence of thymine instead of uracil in DNA permits 
the repair of deaminated cytosine 


Wa The presence in DNA of thymine rather than uracil, as in RNA, was 
an enigma for many years. Both bases pair with adenine. The only 
difference between them is a methyl group in thymine in place of the C-5 
hydrogen atom in uracil. Why is a methylated base employed in DNA and 
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FIGURE 28.40 Nucleotide-excision 
repair. Repair of a region of DNA 
containing a thymine dimer by the 
sequential action of a specific excinuclease, 
a DNA polymerase, and a DNA ligase. The 
thymine dimer is shown in blue and the 
new region of DNA is in red. [Information 
from P. C. Hanawalt, Endevour 31:83, 1982.] 
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FIGURE 28.41 Uracil repair. Uridine 
bases in DNA, formed by the deamination 
of cytidine, are excised and replaced 

by cytidine. 
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not in RNA? The existence of an active repair system to correct the deami- 
nation of cytosine provides a convincing solution to this puzzle. 

Cytosine in DNA spontaneously deaminates at a perceptible rate to 
form uracil. The deamination of cytosine is potentially mutagenic because 
uracil pairs with adenine, and so one of the daughter strands will contain 
a U-A base pair rather than the original C—G base pair. This mutation is 
prevented by a repair system that recognizes uracil to be foreign to DNA 
(Figure 28.41). The repair enzyme, uracil DNA glycosylase, is homologous 
to AlkA. The enzyme hydrolyzes the glycosidic bond between the uracil 
and deoxyribose moieties but does not attack thymine-containing nucleo- 
tides. The AP site generated is repaired to reinsert cytosine. Thus, the 
methyl group on thymine is a tag that distinguishes thymine from deaminated 
cytosine. If thymine were not used in DNA, uracil correctly in place would 
be indistinguishable from uracil formed by deamination. The defect 
would persist unnoticed, and so a C-G base pair would necessarily be 
mutated to U—A in one of the daughter DNA molecules. This mutation is 
prevented by a repair system that searches for uracil and leaves thymine 
alone. Thymine is used instead of uracil in DNA to enhance the fidelity of the 
genetic message. 


Some genetic diseases are caused by the expansion of repeats 
of three nucleotides 


e Some genetic diseases are caused by the presence of DNA sequences 

that are inherently prone to errors in the course of repair and replica- 
tion. A particularly important class of such diseases is characterized by the 
presence of long tandem arrays of repeats of three nucleotides. An example is 
Huntington disease, an autosomal dominant neurological disorder with a vari- 
able age of onset. The mutated gene in this disease expresses a protein in the 
brain called huntingtin, which contains a stretch of consecutive glutamine 
residues. These glutamine residues are encoded by a tandem array of CAG 
sequences within the gene. In unaffected persons, this array is between 6 and 
31 repeats, whereas, in those with the disease, the array is between 36 and 82 
repeats or longer. Moreover, the array tends to become longer from one gen- 
eration to the next. The consequence is a phenomenon called anticipation: the 
children of an affected parent tend to show symptoms of the disease at an 
earlier age than did the parent. 

The tendency of these trinucleotide repeats to expand is explained by the 
formation of alternative structures in the course of DNA repair. On cleav- 
age of the DNA backbone, part of the array can loop out without disrupting 
base-pairing outside this region. Then, in replication, DNA polymerase 
extends this strand through the remainder of the array, leading to an 
increase in the number of copies of the trinucleotide sequence. 

A number of other neurological diseases are characterized by expanding 
arrays of trinucleotide repeats. How do these long stretches of repeated amino 
acids cause disease? For huntingtin, it appears that the polyglutamine 
stretches become increasingly prone to aggregate as their length increases; the 
additional consequences of such aggregation are still under investigation. 


Many cancers are caused by the defective repair of DNA 


e As described in Chapter 14, cancers are caused by mutations in 

genes associated with growth control. Defects in DNA-repair sys- 
tems increase the overall frequency of mutations and, hence, the likeli- 
hood of cancer-causing mutations. Indeed, the synergy between studies 
of mutations that predispose people to cancer and studies of DNA repair 


TABLE 28.2 851 
28.4 Repair of DNA Damage 


Disease Selected Genes Repair Pathway 
Xeroderma pigmentosum (skin) XPA, XPB, XPC Nucleotide excision repair 
Lynch syndrome (colon cancer) MSH2, MLH1 Mismatch repair 

Breast and ovarian cancer BRCA1 and BRCA2 Double-strand break repair 
Renal and lung cancer OGG1 Base excision repair 


in model organisms has been tremendous in revealing the biochemistry 
of DNA-repair pathways. Genes for DNA-repair proteins are often 
tumor suppressor genes; that is, they suppress tumor development when 
at least one copy of the gene is free of a deleterious mutation. When both 
copies of a gene are mutated, however, tumors develop at rates greater 
than those for the population at large. People who inherit defects in a 
single tumor-suppressor allele do not necessarily develop cancer but are 
susceptible to developing the disease because only the one remaining 
normal copy of the gene must develop a new defect to further the devel- 
opment of cancer. Table 28.2 lists selected genes that are associated with 
diseases including cancer and that encode proteins involved with DNA 
repair processes. 

Consider, for example, xeroderma pigmentosum, a rare human skin dis- 
ease. The skin of an affected person is extremely sensitive to sunlight or 
ultraviolet light. In infancy, severe changes in the skin become evident and 
worsen with time. The skin becomes dry, and there is a marked atrophy of 
the dermis. Keratoses appear, the eyelids become scarred, and the cornea 
ulcerates. Skin cancer usually develops at several sites. Many patients die 
before age 30 from metastases of these malignant skin tumors. Studies of 
xeroderma pigmentosum patients have revealed that mutations occur in 
genes for a number of different proteins. These proteins are components of 
the human nucleotide-excision-repair pathway, including homologs of the 
UvrABC subunits. 

Defects in other repair systems can increase the frequency of other 
tumors. For example, hereditary nonpolyposis colorectal cancer (HNPCC, or 
Lynch syndrome) results from defective DNA mismatch repair. HNPCC is 
not rare—as many as 1 in 200 people will develop this form of cancer. 
Mutations in two genes, called hMSH2 and hMLH1, account for most 
cases of this hereditary predisposition to cancer. The striking finding is that 
these genes encode the human counterparts of MutS and MutL of E. coli. 
Mutations in hMSH2 and hMLH1 seem likely to allow mutations to accu- 
mulate throughout the genome. In time, genes important 1n controlling cell 


proliferation become altered, resulting in the onset of cancer. cl 
Not all tumor-suppressor genes are specific to particular types of cancer. 

The gene for a protein called p53 is mutated in more than half of all tumors. P 

The p53 protein helps control the fate of damaged cells. First, it plays a O—_p/__ 

central role in sensing DNA damage, especially double-stranded breaks. v4 

Then, after sensing damage, the protein either promotes a DNA-repair pl 

pathway or activates the apoptosis pathway, leading to cell death. Most cl 

mutations in the p53 gene are sporadic; that is, they occur in somatic cells cyclophosphamide 

rather than being inherited. People who inherit a deleterious mutation in 

one copy of the p53 gene suffer from Li-Fraumeni syndrome and have a high H3N cl 

probability of developing several types of cancer. 7 
Cancer cells often have two characteristics that make them especially a \ 

vulnerable to agents that damage DNA molecules. First, they divide fre- HN cl 


quently, and so their DNA replication pathways are more active than they Cisplatin 
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FIGURE 28.42 Ames test. (A) A petri plate 
containing about 109 Salmonella bacteria 
that cannot synthesize histidine and (B) a 
petri plate containing a filter-paper disc with 
a mutagen, which produces a large number 
of revertants that can synthesize histidine. 
After 2 days, the revertants appear as rings 
of colonies around the disc. The small 
number of visible colonies in plate A are 
spontaneous revertants. [Reprinted from 
Mutation Research/Environmental Mutagenesis 
and Related Subjects, 31:6, Ames, B.N., et al. 
Methods for detecting carcinogens and 
mutagens with the Salmonella/mammalian- 
microsome mutagenicity test, p. 347-363, 
Copyright 1975, with permission from Elsevier.] 


are in most cells. Second, as already noted, cancer cells often have defects in 
DNA-repair pathways. Several agents widely used in cancer chemotherapy, 
including cyclophosphamide and cisplatin, act by damaging DNA. Cancer 
cells are less able to avoid the effect of the induced damage than are normal 
cells, providing a therapeutic window for specifically killing cancer cells. 


Many potential carcinogens can be detected by their mutagenic 
action on bacteria 


Many human cancers are caused by exposure to chemicals that cause 
mutations. It is important to identify such compounds and to ascertain 
their potency so that human exposure to them can be minimized. Bruce 
Ames devised a simple and sensitive test for detecting chemical mutagens. 
In the Ames test, a thin layer of agar containing about 10° bacteria of a 
specially constructed tester strain of Salmonella is placed on a petri plate. 
These bacteria are unable to grow in the absence of histidine, because a 
mutation is present in one of the genes for the biosynthesis of this amino 
acid. The addition of a chemical mutagen to the center of the plate results 
in many new mutations. A small proportion of them reverse the original 
mutation, and histidine can be synthesized. These revertants multiply in 
the absence of an external source of histidine and appear as discrete colo- 
nies after the plate has been incubated at 37°C for 2 days (Figure 28.42). 
For example, 0.5 wg of 2-aminoanthracene gives 11,000 revertant colonies, 
compared with only 30 spontaneous revertants in its absence. A series of 
concentrations of a chemical can be readily tested to generate a dose— 
response curve. 

Some of the tester strains are responsive to base-pair substitutions, 
whereas others detect deletions or additions of base pairs (frameshifts). The 
sensitivity of these specially designed strains has been enhanced by the 
genetic deletion of their excision-repair systems. Potential mutagens enter 
the tester strains easily because the lipopolysaccharide barrier that normally 
coats the surface of Salmonella is incomplete in these strains. A key feature 
of this detection system is the inclusion of a mammalian liver homogenate. 
Recall that some potential carcinogens such as aflatoxin are converted into 
their active forms by enzyme systems in the liver or other mammalian tis- 
sues. Bacteria lack these enzymes, and so the test plate requires a few mil- 
ligrams of a liver homogenate to activate this group of mutagens. 

The Salmonella test is extensively used to help evaluate the mutagenic 
and carcinogenic risks of a large number of chemicals. This rapid and inex- 
pensive bacterial assay for mutagenicity complements epidemiological sur- 
veys and animal tests that are necessarily slower, more laborious, and far 
more expensive. The Salmonella test for mutagenicity is an outgrowth of 
studies of gene—protein relations in bacteria. It is a striking example of how 
fundamental research in molecular biology can lead directly to important 
advances in public health. 


28.5 DNA Recombination Plays Important Roles in 
Replication, Repair, and Other Processes 


Most processes associated with DNA replication function to copy the genetic 
message as faithfully as possible. However, several biochemical processes 
require the recombination of genetic material between two DNA molecules. 
In genetic recombination, two daughter molecules are formed by the 
exchange of genetic material between two parent molecules (Figure 28.43). 
Recombination is essential in the following processes. 
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FIGURE 28.43 Recombination. Two DNA molecules can recombine with each other to form 
new DNA molecules that have segments from both parent molecules. 


1. When replication stalls, recombination processes can reset the replica- 
tion machinery so that replication can continue. 


2. Some double-stranded breaks in DNA are repaired by recombination. 


3. In meiosis, the limited exchange of genetic material between paired 
chromosomes provides a simple mechanism for generating genetic diversity 
in a population. 


4. As we shall see in Chapter 34, recombination plays a crucial role in gen- 
erating molecular diversity for antibodies and some other molecules in the 
immune system. 


5. Some viruses employ recombination pathways to integrate their genetic 
material into the DNA of a host cell. 


6. Recombination is used to manipulate genes in, for example, the genera- 
tion of “gene knockout” mice and other targeted genome modifications 
(Section 5.4). 


Recombination is most efficient between DNA sequences that are similar in 
sequence. In homologous recombination, parent DNA duplexes align at 
regions of sequence similarity, and new DNA molecules are formed by the 
breakage and joining of homologous segments. 


RecA can initiate recombination by promoting strand invasion 


In many recombination pathways, a DNA molecule with a free end recom- 
bines with a DNA molecule having no free ends available for interaction. 
DNA molecules with free ends are the common result of double-stranded 
DNA breaks, but they may also be generated in DNA replication if the rep- 
lication complex stalls. This type of recombination has been studied exten- 
sively in E. coli, but it also takes place in other organisms through the action 
of proteins homologous to those of E. coli. Often dozens of proteins partici- 
pate in the complete recombination process. However, the key protein is 
RecA, the homolog of which is called RAD51 in human cells. To accomplish 
the exchange, the single-stranded DNA displaces one of the strands of the 
double helix (Figure 28.44). The resulting three-stranded structure is called 
a displacement loop or D-loop. This process is often referred to as strand inva- 
sion. Because a free 3’ end is now base-paired to a contiguous strand of 
DNA, the 3’ end can act as a primer to initiate new DNA synthesis. Strand 
invasion can initiate many processes, including the repair of double-stranded 
breaks and the reinitiation of replication after the replication apparatus has 
come off its template. In the repair of a break, the recombination partner is 
an intact DNA molecule with an overlapping sequence. 


5 Displacement loop 
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FIGURE 28.44 Strand invasion. This process, promoted by proteins such as RecA, can 
initiate recombination. 
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Recombination synapse 


FIGURE 28.45 Recombination 
mechanism. Recombination begins as two 
DNA molecules come together to form a 
recombination synapse. One strand from 
each duplex is cleaved by transesterification 
with the 3’ end of each of the cleaved 
strands linked to a tyrosine residue on the 
recombinase enzyme. New phosphodiester 
bonds are formed when a 5’ end of the 
other cleaved strand in the complex attacks 
these tyrosine-DNA adducts. After 
isomerization, these steps are repeated to 
form the recombined products. 
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Some recombination reactions proceed through Holliday-junction 
intermediates 


In recombination pathways for meiosis and some other processes, interme- 
diates form that are composed of four polynucleotide chains in a crosslike 
structure. Intermediates with these crosslike structures are often referred to 
as Holliday junctions, after Robin Holliday, who proposed their role in 
recombination in 1964. Such intermediates have been characterized by a 
wide range of techniques, including x-ray crystallography. 

Specific enzymes, termed recombinases, bind to these structures and 
resolve them into separated DNA duplexes. The Cre recombinase from 
bacteriophage P1 has been extensively studied. The mechanism begins with 
the recombinase binding to the DNA substrates (Figure 28.45). 

Four molecules of the enzyme and two DNA molecules come together to 
forma recombination synapse. The reaction begins with the cleavage of one strand 
from each duplex via transesterification reactions with the 3’-phosphoryl 
groups becoming linked to specific tyrosine residues in the recombinase while 
the 5’-hydroxyl group of each cleaved strand remains free. The free 5’ ends 
invade the other duplex and participate in transesterification reactions to form 
new phosphodiester bonds and free the tyrosine residues. These reactions 
result in the formation of a Holliday junction. This junction can then isomerize 
to form a structure in which the polynucleotide chains in the center of the struc- 
ture are reoriented. From this junction, the processes of strand cleavage and 
phosphodiester-bond formation repeat. The result is a synapse containing the 
two recombined duplexes. Dissociation of this complex generates the final 
recombined products. 

Cre catalyzes the formation of Holliday junctions as well as their resolution. In 
contrast, other proteins bind to Holliday junctions that have already been formed 
by other processes and resolve them into separate duplexes. In many cases, 
these proteins also promote the process of branch migration whereby a Holliday 
junction is moved along the two component double helices. Branch migration 
can affect which segments of DNA are exchanged in a recombination process. 


y 


Recombination can also occur during cell division between different, 
non-homologous chromosomes. These recombination events, termed 


translocations, can be harmless or can result in diseases, depending on the 
chromosomes involved and the genes that are modified or disrupted. The 
formation of the bcr-abl gene, which leads to chronic myelogenous leuke- 
mia, was discussed in section 14.5. 


SUMMARY 


28.1 


28.2 


28.3 


DNA Replication Proceeds by the Polymerization of 
Deoxyribonucleoside Triphosphates Along a Template 

DNA polymerases are template-directed enzymes that catalyze the 
formation of phosphodiester bonds by nucleophilic attack by the 
3’-hydroxyl group on the innermost phosphorus atom of a deoxyribo- 
nucleoside 5’-triphosphate. The complementarity of shape between 
correctly matched nucleotide bases is crucial to ensuring the fidelity of 
base incorporation. DNA polymerases cannot start chains de novo; a 
primer with a free 3’-hydroxyl group is required. Thus, DNA synthe- 
sis is initiated by the synthesis of an RNA primer, the task of a special - 
ized primase enzyme. After serving as a primer, the RNA is degraded 
and replaced by DNA. DNA polymerases always synthesize a DNA 
strand in the 5’-to-3’ direction. So that both strands of the double 
helix can be synthesized in the same direction simultaneously, one 
strand is synthesized continuously while the other is synthesized in 
fragments called Okazaki fragments. Gaps between the fragments are 
sealed by DNA ligases. ATP-driven helicases prepare the way for 
DNA replication by separating the strands of the double helix. 


DNA Unwinding and Supercoiling Are Controlled by Topoisomerases 

A key topological property of DNA is its linking number (Lk), which 
is defined as the number of times one strand of DNA winds around 
the other in the right-hand direction when the DNA axis is con- 
strained to lie in a plane. Molecules differing in linking number are 
topoisomers of one another and can be interconverted only by cutting 
one or both DNA strands; these reactions are catalyzed by topoisom- 
erases. Changes in linking number generally lead to changes in both 
the number of turns of double helix and the number of turns of super- 
helix. Topoisomerase II catalyzes the ATP-driven introduction of 
negative supercoils, which leads to the compaction of DNA and ren- 
ders it more susceptible to unwinding. Supercoiled DNA can be 
relaxed by topoisomerase I or topoisomerase II. Topoisomerase | acts 
by transiently cleaving one strand of DNA in a double helix, whereas 
topoisomerase II transiently cleaves both strands simultaneously. 


DNA Replication Is Highly Coordinated 
Replicative DNA polymerases are processive; that is, they catalyze the 
addition of many nucleotides without dissociating from the template. 
A major contributor to processivity is the DNA sliding clamp, such as 
the dimeric B subunit of the E. coli replicative polymerase. The sliding 
clamp has a ring structure that encircles the DNA double helix and 
keeps the enzyme and DNA associated. The DNA polymerase holo- 
enzyme is a large DNA-copying machine formed by two DNA poly- 
merase enzymes, one to act on each template strand, associated with 
other subunits including a sliding clamp and a clamp loader. 

The synthesis of the leading and lagging strands of a double- 
stranded DNA template is coordinated. As a replicative polymerase 


855 


Summary 


856 


CHAPTER 28 DNA Replication Repair 
and Recombination 


KEY TERMS 


28.4 


28.5 


moves along a DNA template, the leading strand is copied smoothly 
while the lagging strand forms loops that change length in the course 
of the synthesis of each Okazaki fragment. The mode of action is 
referred to as the trombone model. 

DNA replication is initiated at a single site within the E. coli 
genome. A set of specific proteins recognizes this origin of replication 
and assembles the enzymes needed for DNA synthesis, including a 
helicase that promotes strand separation. The initiation of replication 
in eukaryotes is more complex. DNA synthesis is initiated at thou- 
sands of sites throughout the genome. Complexes homologous to 
those in E. coli, but more complicated, are assembled at each eukary- 
otic origin of replication. A special polymerase called telomerase that 
relies on an RNA template synthesizes specialized structures called 
telomeres at the ends of linear chromosomes. 


Many Types of DNA Damage Can Be Repaired 

A wide variety of DNA damage can occur. For example, mismatched 
bases may be incorporated in the course of DNA replication or indi- 
vidual bases may be damaged by oxidation or alkylation after DNA 
replication. Other forms of damage are the formation of cross-links 
and the introduction of single- or double-stranded breaks in the DNA 
backbone. Several different repair systems detect and repair DNA 
damage. Repair begins with the process of proofreading in DNA rep- 
lication: mismatched bases that were incorporated in the course of 
synthesis are excised by exonuclease activity present in replicative 
polymerases. Some DNA lesions such as thymine dimers can be 
directly reversed through the action of specific enzymes. Other DNA- 
repair pathways act through the excision of single damaged bases 
(base-excision repair) or short segments of nucleotides (nucleotide- 
excision repair). Double-stranded breaks in DNA can be repaired by 
homologous or nonhomologous end-joining processes. Defects in 
DNA-repair components are associated with susceptibility to many 
different sorts of cancer. Such defects are a common target of cancer 
treatments. Many potential carcinogens can be detected by their 
mutagenic action on bacteria (the Ames test). 


DNA Recombination Plays Important Roles in Replication, Repair, 

and Other Processes 

Recombination is the exchange of segments between two DNA mol- 
ecules. Recombination is important in some types of DNA repair as 
well as other processes such as meiosis, the generation of antibody 
diversity, and the life cycles of some viruses. Some recombination 
pathways are initiated by strand invasion, in which a single strand at 
the end of a DNA double helix forms base pairs with one strand of 
DNA in another double helix and displaces the other strand. A com- 
mon intermediate formed in other recombination pathways is the 
Holliday junction, which consists of four strands of DNA that come 
together to form a crosslike structure. Recombinases promote recom- 
bination reactions through the introduction of specific DNA breaks 
and the formation and resolution of Holliday-junction intermediates. 
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(NHEJ) (p. 849) 
trinucleotide repeat (p. 850) 
tumor-suppressor gene (p. 851) 
Ames test (p. 852) 

RecA (p. 853) 

Holliday junction (p. 854) 
recombinase (p. 854) 
recombination synapse (p. 854) 


base-excision repair (p. 848) 


nucleotide-excision repair (p. 849) 


1. Activated intermediates. DNA polymerase I, DNA 
ligase, and topoisomerase I catalyze the formation of phos- 
phodiester bonds. What is the activated intermediate in the 
linkage reaction catalyzed by each of these enzymes? What 
is the leaving group? 


2. Life in a hot tub. An archaeon (Sulfolobus acidocaldarius) 
found in acidic hot springs contains a topoisomerase that 
catalyzes the ATP-driven introduction of positive super- 
coils into DNA. How might this enzyme be advantageous 
to this unusual organism? 


3. Which way? Provide a chemical explanation of why 
DNA synthesis proceeds in a 5’-to-3’ direction. 


4. Nucleotide requirement. DNA replication does not take 
place in the absence of the ribonucleotides ATP, CTP, 
GTP, and UTP. Propose an explanation. 


5. Close contact. Examination of the structure of DNA 
polymerases bound to nucleotide analogs reveals that con- 
served residues come within van der Waals contact of C-2’ 
of the bound nucleotide. What is the potential significance 
of this interaction? 


6. Molecular motors in replication. 


(a) How fast does template DNA spin (expressed in revo- 
lutions per second) at an E. coli replication fork? 


(b) What is the velocity of movement (in micrometers per 
second) of DNA polymerase II holoenzyme relative to the 
template? 


7. Wound tighter than a drum. Why would replication come 
to a halt in the absence of topoisomerase II? 


8. The missing link. One form of a plasmid shows a twist of 
Tw = 48 and a writhe of Wr = 3. 
(a) What is the linking number? 
(b) What would the value of writhe be for a form with twist 


Tw = 50 if the linking number is the same as that for the 
preceding form? 


9. Telomeres and cancer. Telomerase is not active in most 
human cells. Some cancer biologists have suggested that acti- 
vation of the telomerase gene would be a requirement for a 
cell to become cancerous. Explain why this might be the case. 


10. Backward? Bacteriophage T7 helicase moves along 
DNA in the 5’-to-3" direction. Other helicases have been 
reported to move in the 3’-to-5’ direction. Is there any fun- 
damental reason why you would expect helicases to move in 
one direction or the other? 


11. Nick translation. Suppose that you wish to make a 
sample of DNA duplex highly radioactive to use as a DNA 
probe. You have a DNA endonuclease that cleaves the DNA 
internally to generate 3’-OH and 5’-phosphoryl groups, 
intact DNA polymerase I, and radioactive dNTPs. Suggest 
a means for making the DNA radioactive. 


12. Revealing tracks. Suppose that replication is initiated in 
a medium containing moderately radioactive tritiated thy- 
mine. After a few minutes of incubation, the bacteria are 
transferred to a medium containing highly radioactive triti- 
ated thymidine. Sketch the autoradiographic pattern that 
would be seen for (a) undirectional replication and (b) bidi- 
rectional replication, each from a single origin. 


13. Mutagenic trail. Suppose that the single-stranded RNA 
from tobacco mosaic virus was treated with a chemical 
mutagen, that mutants were obtained having serine or leu- 
cine instead of proline at a specific position, and that further 
treatment of these mutants with the same mutagen yielded 
phenylalanine at this position. 


What are the plausible codon assignments for these four 
amino acids? 


14. Induced spectrum. DNA photolyases convert the energy 
of light in the near-ultraviolet or visible region (300-500 nm) 
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into chemical energy to break the cyclobutane ring of pyrim- 
idine dimers. In the absence of substrate, these photoreacti- 
vating enzymes do not absorb light of wavelengths longer 
than 300 nm. Why is the substrate-induced absorption band 
advantageous? 


15. Missing telomerase. Cells lacking telomerase can grow 
for several cell divisions without obvious defects. However, 
after more cell divisions, such cells tend to show chromo- 
somes that have fused together. Propose an explanation for 
the formation of the chromosomes. 


16. I need to unwind. With the assumption that the energy 
required to break an average base pair in DNA is 10 kJ 
mol! (2.4 kcal mol~'), estimate the maximum number of 
base pairs that could be broken per ATP hydrolyzed by a 
helicase operating under standard conditions. 


17. Triplet oxidation. The oxidation of guanine bases in the 
context of triplet repeats such as CAGCAGCAG can lead 
to the expansion of the repeat. Explain. 


Mechanism Problem 


18. A revealing analog. AMP-PNP, the B,y-imido analog 
of ATP (p. 833), is hydrolyzed very slowly by most 
ATPases. The addition of AMP-PNP to topoisomerase IT 
and circular DNA leads to the negative supercoiling of a 
single molecule of DNA per enzyme. DNA remains bound 
to the enzyme in the presence of this analog. What does this 
finding reveal about the catalytic mechanism? 


Data Interpretation and Chapter Integration Problems 


19. Like a ladder. Circular DNA from 5V40 virus was iso- 
lated and subjected to gel electrophoresis. The results are 
shown in lane A (the control) of the adjoining gel patterns. 


A B Cc 


[From W. Keller, PNAS 72:2553, 1975.] 


(a) Why does the DNA separate in agarose gel electropho- 
resis? How does the DNA in each band differ? 


The DNA was then incubated with topoisomerase I for 
5 minutes and again analyzed by gel electrophoresis with 
the results shown in lane B. 


(b) What types of DNA do the various bands represent? 


Another sample of DNA was incubated with topoisom- 
erase I for 30 minutes and again analyzed as shown in lane C. 


(c) What is the significance of the fact that more of the 
DNA is in slower-moving forms? 


20. Ames test. The illustration below shows four petri 
plates used for the Ames test. A piece of filter paper (white 
circle in the center of each plate) was soaked in one of four 
preparations and then placed on a petri plate. The four 
preparations contained (A) purified water (control), (B) a 
known mutagen, (C) a chemical whose mutagenicity is 
under investigation, and (D) the same chemical after treat- 
ment with liver homogenate. The number of revertants, 
visible as colonies on the petri plates, was determined in 
each case. 


(a) What was the purpose of the control plate, which was 
exposed only to water? 


(b) Why was it wise to use a known mutagen in the experi- 
mental system? 


(c) How would you interpret the results obtained with the 
experimental compound? 


(d) What liver components would you think are responsi- 
ble for the effects observed in preparation D? 


(C) + Experimental sample 


(D) + Experimental sample 
after treatment with 
liver homogenate 


CHAPTER 


RNA Synthesis and Processing 


Excised intron 
—_ 


mRNA precursor mRNA 


RNA synthesis is a key step in the expression of genetic 
information. For eukaryotic cells, the initial RNA transcript 
(the mRNA precursor) is often spliced, removing introns 
that do not encode protein sequences. Often, the same 
pre-mRNA is spliced differently in different cell types or at 
different developmental stages. In the image at the left, 


NA stores genetic information in a stable form that can be readily 

replicated. The expression of this genetic information requires its 
flow from DNA to RNA and, usually, to protein, as was introduced in 
Chapter 4. This chapter examines RNA synthesis, or transcription, which 
is the process of synthesizing an RNA transcript with the transfer of 
sequence information from a DNA template. We begin with a discussion of 
RNA polymerases, the large and complex enzymes that carry out the syn- 
thetic process. We then turn to transcription in bacteria and focus on the 
three stages of transcription: promoter binding and initiation, elongation 
of the nascent RNA transcript, and termination. We then examine tran- 
scription in eukaryotes, focusing on the distinctions between bacterial and 
eukaryotic transcription. 

RNA transcripts in eukaryotes are extensively modified, as exemplified 
by the capping of the 5’ end of an mRNA precursor and the addition of a 
long poly(A) tail to its 3’ end. One of the most striking examples of RNA 
modification is the splicing of mRNA precursors, which is catalyzed by 
spliceosomes, protein complexes consisting of small nuclear ribonucleopro- 
tein particles (snRNPs). Remarkably, some RNA molecules can splice 
themselves in the absence of protein. This landmark discovery by Thomas 
Cech and Sidney Altman revealed that RNA molecules can serve as cata- 
lysts, greatly influencing our view of molecular evolution. 


proteins associated with RNA splicing (stained with a 
fluorescent antibody) highlight regions of the newt 
genome that are being actively transcribed. [(Left) 
Courtesy of Dr. Mark B. Roth and Dr. Joseph G. Gall.] 
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29.1 RNA Polymerases Catalyze 
Transcription 


29.2 Transcription in Eukaryotes Is 
Highly Regulated 


29.3 The Transcription Products of 
Eukaryotic Polymerases Are 
Processed 


29.4 The Discovery of Catalytic RNA 
Was Revealing in Regard to Both 
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kK FIGURE 29.1 RNA polymerase 
structures. The three-dimensional 
structures of RNA polymerases from a 
prokaryote (Thermus aquaticus) and a 
eukaryote (Saccharomyces cerevisiae). The 
two largest subunits for each structure are 
shown in dark red and dark blue. Notice 
that both structures contain a central metal 
ion (green) in their active sites, near a large 
cleft on the right. The similarity of these 
structures reveals that these enzymes have 
the same evolutionary origin and have many 
mechanistic features in common. [Drawn 
from 116V.pdb and 1I6H.pdb.] 


RNA splicing is not merely a curiosity. Many genetic diseases have been 
associated with mutations that affect RNA splicing. Moreover, the same pre- 
mRNA can be spliced differently in various cell types, at different stages of 
development, or in response to other biological signals. In addition, individ- 
ual bases in some pre-mRNA molecules are changed ina process called RNA 
editing. One of the biggest surprises of the sequencing of the human genome 
was that only about 21,000 genes were identified compared with previous 
estimates of 100,000 or more. The ability of one gene to encode more than one 
distinct mRNA by alternative splicing and, hence, more than one protein 
may play a key role in expanding the repertoire of our genomes. 

Furthermore, the investigation of different classes of RNA molecules has 
been one of the most productive areas of biochemical research in recent years. 
In the next chapter we will explore the long-known ribosomal and transfer 
RNA molecules that are central to protein synthesis. We have also encountered 
microRNAs, molecules for which our understanding is still rapidly expanding. 
Many other classes of RNAs have been discovered more recently, including long 
non-coding RNAs, the functions of which are still under active investigation. 


RNA synthesis comprises three stages: Initiation, 
elongation, and termination 


RNA synthesis is catalyzed by large enzymes called RNA polymerases. The 
basic biochemistry of RNA synthesis is common to all organisms, com- 
monality that has been beautifully illustrated by the three-dimensional 
structures of representative RNA polymerases from prokaryotes and 
eukaryotes (Figure 29.1). Despite substantial differences in size and num- 
ber of polypeptide subunits, the overall structures of these enzymes are 
quite similar, revealing a common evolutionary origin. 

RNA synthesis, like all biological polymerization reactions, takes place 
in three stages: initiation, elongation, and termination. RNA polymerases 
perform multiple functions in this process: 


1. They search DNA for initiation sites, also called promoter sites or simply 
promoters. For instance, FE. coli DNA has about 2000 promoter sites in its 
4.8 X 10° bp genome. 


2. They unwind a short stretch of double-helical DNA to produce single- 
stranded DNA templates from which the sequence of bases can be easily 
read out. 


Prokaryotic RNA polymerase 


Eukaryotic RNA polymerase 


3. They select the correct ribonucleoside triphosphate and catalyze the 861 
formation of a phosphodiester bond. This process is repeated many times as 29.1 RNA Polymerase 
the enzyme moves along the DNA template. RNA polymerase is com- 

pletely processive—a transcript is synthesized from start to end by a single 

RNA polymerase molecule. 


4. They detect termination signals that specify where a transcript ends. 


5. They interact with activator and repressor proteins that modulate the 
rate of transcription initiation over a wide range. Gene expression is con- 
trolled substantially at the level of transcription, as will be discussed in 
detail in Chapters 31 and 32. 


The chemistry of RNA synthesis is identical for all forms of RNA, 
including messenger RNAs, transfer RNAs, ribosomal RNAs, and small 
regulatory RNAs. The basic steps just outlined apply to all forms. Their 
synthetic processes differ mainly in regulation, posttranscriptional process- 
ing, and the specific RNA polymerase that participates. 


29.1 RNA Polymerases Catalyze Transcription 


The fundamental reaction of RNA synthesis is the formation of a phospho- 
diester bond. The 3’'-hydroxyl group of the last nucleotide in the chain 
nucleophilically attacks the a phosphoryl group of the incoming nucleoside 
triphosphate with the concomitant release of a pyrophosphate. 
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This reaction is thermodynamically favorable, and the subsequent 
degradation of the pyrophosphate to orthophosphate locks the reac- 
tion in the direction of RNA synthesis. The catalytic sites of RNA 
polymerases include two metal ions, normally magnesium ions 
(Figure 29.2). One ion remains tightly bound to the enzyme, 
whereas the other ion comes in with the nucleoside triphosphate and 
leaves with the pyrophosphate. Three conserved aspartate residues Growing RNA chain 
participate in binding these metal ions. 

RNA polymerases are very large, complex enzymes. For example, 
the core of the RNA polymerase of E. coli consists of 5 kinds of sub- 
units with the composition a,BB'w (Table 29.1). A typical eukaryotic 
RNA polymerase is larger and more complex, having 12 subunits : ween 
and a total molecular mass of more than 0.5 million daltons. Despite 


Ribonucleoside 
triphosphate 


FIGURE 29.2 RNA polymerase active site. A model of the transition state for phosphodiester- 
bond formation in the active site of RNA polymerase. The 3’-hydroxyl group of the growing RNA 
chain attacks the a-phosphoryl group of the incoming nucleoside triphosphate, resulting in the 
release of pyrophosphate. This transition state is structurally similar to that in the active site of 
DNA polymerase (Figure 28.5). 


TABLE 29.1 Subunits of RNA polymerase from E. coli this complexity, the structures of RNA polymerases have 
been determined in detail by x-ray crystallography in work 


ee — eee pioneered by Roger Kornberg and Seth Darst. 
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region of approximately 17 base pairs. The edges of the 
bases that normally take part in Watson—Crick base pairs 
are exposed in the unwound region. We will begin with a 
detailed examination of the elongation process, including 
the role of the DNA template read by RNA polymerase 
and the reactions catalyzed by the polymerase, before 
returning to the more-complex processes of initiation and 
termination. 


RNA chains are formed de novo and grow in the 
5’-to-3’ direction 


Let us begin our examination of transcription by consider- 
ing the DNA template. The first nucleotide (the start site) 
of a DNA sequence to be transcribed is denoted as +1 and 
the second one as +2; the nucleotide preceding the start 
site is denoted as —1. These designations refer to the cod- 
ing strand of DNA. Recall that the sequence of the tem- 
plate strand of DNA is the complement of that of the RNA 
transcript (Figure 29.4). In contrast, the coding strand of 
DNA has the same sequence as that of the RNA transcript 
except for thymine (T) in place of uracil (U). The coding 
strand is also known as the sense (+) strand, and the tem- 
plate strand as the antisense (—) strand. 

In contrast with DNA synthesis, RNA synthesis can start 
de novo, without the requirement for a primer. Most newly 
synthesized RNA chains carry a highly distinctive tag on 


FIGURE 29.3 Transcription bubble. RNA polymerase separates the 5’ end: the first base at that end is either pppG or ppp. A. 
a region of the double helix to form a structure called the 


“transcription bubble.” The red (template) and blue (nontemplate) 
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The presence of the triphosphate moiety confirms that RNA synthesis 
starts at the 5’ end. 

The dinucleotide shown above is synthesized by RNA polymerase as 
part of the complex process of initiation, which will be discussed later in the 
chapter. After initiation takes place, RNA polymerase elongates the nucleic 


IAACGUAGGGUCACAUC... RNA transcript 
- GCATACAACACACCTTGCATCCCAGTGTAG... Template, or antisense (-), strand 
-CGTATGTTGTGTGGAACGTAGGGTCACATC... Coding, or sense (+), strand 
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FIGURE 29.4 Template and coding strands. The template, or antisense (—), strand is 
complementary in sequence to the RNA transcript. 
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acid chain in the following manner (Figure 29.5). A ribonucleoside triphos- 
phate binds in the active site of the RNA polymerase, directly adjacent to 
the growing RNA chain. The incoming ribonucleoside triphosphate forms 
a Watson—Crick base pair with the template strand. The 3’-hydroxyl group 
of the growing RNA chain, oriented and activated by the tightly bound 
metal ion, attacks the a-phosphoryl group to form a new phosphodiester 
bond, displacing pyrophosphate. 

To proceed to the next step, the RNA—DNA hybrid must move relative 
to the polymerase to bring the 3’ end of the newly added nucleotide into 
proper position for the next nucleotide to be added (Figure 29.6). This 
translocation step does not include breaking any bonds between base pairs 
and is reversible but, once it has taken place, the addition of the next nucleo- 
tide, favored by the triphosphate cleavage and pyrophosphate release and 
cleavage, drives the polymerization reaction forward. 

i= 


FIGURE 29.6 Translocation. After nucleotide addition, the RNA-DNA hybrid can translocate 
through the RNA polymerase, bringing a new DNA base into position to base-pair with an 
incoming nucleoside triphosphate. 
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The lengths of the RNA-DNA hybrid and of the unwound region of 
DNA stay rather constant as RNA polymerase moves along the DNA tem- 
plate. The length of the RNA—DNA hybrid is determined by a structure 
within the enzyme that forces the RNA-DNA hybrid to separate, allowing 
the RNA chain to exit from the enzyme and the DNA chain to rejoin its 
DNA partner (Figure 29.7). 


RNA polymerases backtrack and correct errors 


The RNA-DNA hybrid can also move in the direction opposite that of 
elongation (Figure 29.8). This backtracking is less favorable energetically 
than moving forward because it breaks the bonds between a base pair. 
However, backtracking is very important for proofreading. The incorpora- 
tion of an incorrect nucleotide introduces a non- Watson—Crick base pair. In 
this case, breaking the bonds between this base pair and backtracking is less 
costly energetically. After the polymerase has backtracked, the phosphodi- 
ester bond one base pair before the one that has just formed is adjacent to 
the metal ion in the active site. In this position, a hydrolysis reaction in 
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FIGURE 29.5 Elongation mechanism. A 
ribonucleoside triphosphate binds adjacent 
to the growing RNA chain and forms a 
Watson-Crick base pair with a base on the 
DNA template strand. The 3’-hydroxyl 
group at the end of the RNA chain attacks 
the newly bound nucleotide and forms a 
new phosphodiester bond, releasing 
pyrophosphate. 


FIGURE 29.7 RNA-DNA hybrid 
separation. A structure within RNA 
polymerase forces the separation of the 
RNA-DNA hybrid. Notice that the DNA strand 
exits in one direction and the RNA product 
exits in another. [Drawn from 1I6H.pdb.] 


FIGURE 29.8 Backtracking. The RNA- 
DNA hybrid can occasionally backtrack 
within the RNA polymerase. In the 
backtracked position, hydrolysis can take 
place, producing a configuration equivalent 
to that after translocation. Backtracking is 
more likely if a mismatched base is added, 
facilitating proofreading. 
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(A) GATATAATGAGCAAA 
(®): CGTATAATTTGACCS 
(© -GTTAACTAGTACGCA 
(D) GTGATACTGAGCACA 
() GTTTTCATGCCTCCA 

TATAAT 


FIGURE 29.9 Bacterial promoter 
sequences. A comparison of five 
sequences from prokaryotic promoters 
reveals a recurring sequence of TATAAT 
centered on position —10. The —10 
consensus sequence (in red) was deduced 
from a large number of promoter sequences. 
The sequences are from the (A) uvrD, 

(B) uncl, and (C) tro operons of E. coli, from 
(D) A phage, and from (E) 6X174 phage. 


which a water molecule attacks the phosphate can result in the cleavage of 
the phosphodiester bond and the release of a dinucleotide that includes the 
incorrect nucleotide. 

Studies of single molecules of RNA polymerase have confirmed that the 
enzymes hesitate and backtrack to correct errors. Furthermore, these proof- 
reading activities are often enhanced by accessory proteins. The final error 
rate of the order of one mistake per 10* or 10° nucleotides is higher than that 
for DNA replication, including all error-correcting mechanisms. The lower 
fidelity of RNA synthesis can be tolerated because mistakes are not trans- 
mitted to progeny. For most genes, many RNA transcripts are synthesized; 
a few defective transcripts are unlikely to be harmful. 


RNA polymerase binds to promoter sites on the DNA 
template to initiate transcription 


The elongation process is common to all organisms. In contrast, the pro- 
cesses of initiation and termination differ substantially in bacteria and 
eukaryotes. We begin with a discussion of these processes in bacteria, starting 
with initiation of transcription. The bacterial RNA polymerase discussed 
earlier with the composition a,8B’w is referred to as the core enzyme. The 
inclusion of an additional subunit produces the holoenzyme with composition 
a,BB’wo. The o subunit helps find sites on DNA where transcription 
begins, referred to as promoter sites or, simply, promoters. At these sites, the a 
subunit participates in the initiation of RNA synthesis and then dissociates 
from the rest of the enzyme. 

Sequences upstream of the promoter site are important in determining 
where transcription begins. A striking pattern was evident when the sequences 
of bacterial promoters were compared. Two common motifs are present on the 
upstream side of the transcription start site. They are known as the — 10 sequence 
and the — 35 sequence because they are centered at about 10 and 35 nucleotides 
upstream of the start site. The region containing these sequences is called the 
core promoter. The —10 and —35 sequences are each 6 bp long. Their consen- 
sus sequences, deduced from analyses of many promoters (Figure 29.9), are 


—35 -10 +1 
5~~T TGACA TATAAT Start 
site 


Promoters differ markedly in their efficacy. Some genes are transcribed 
frequently—as often as every 2 seconds in E. coli. The promoters for these 
genes are referred to as strong promoters. In contrast, other genes are tran- 
scribed much less frequently, about once in 10 minutes; the promoters for 
these genes are weak promoters. The —10 and —35 regions of most strong 
promoters have sequences that correspond closely to the consensus 
sequences, whereas weak promoters tend to have multiple substitutions at 
these sites. Indeed, mutation of a single base in either the —10 sequence or 
the —35 sequence can diminish promoter activity. The distance between 
these conserved sequences also is important; a separation of 17 nucleotides 
is optimal. Thus, the efficiency or strength of a promoter sequence serves to 
regulate transcription. Regulatory proteins that bind to specific sequences 
near promoter sites and interact with RNA polymerase (Chapter 31) also 
markedly influence the frequency of transcription of many genes. 

Outside the core promoter in a subset of highly expressed genes is the 
upstream element (also called the UP element for upstream element). This 
sequence is present from 40 to 60 nucleotides upstream of the transcription 
start site. The UP element is bound by the a subunit of RNA polymerase 
and serves to increase the efficiency of transcription by creating an additional 
interaction site for the polymerase. 


Sigma subunits of RNA polymerase recognize 
promoter sites 


To initiate transcription, the a,BB’w core of RNA 
polymerase must bind the promoter. However, it is the 
o subunit that makes this binding possible by enabling 
RNA polymerase to recognize promoter sites. In the pres- 
ence of the o subunit, the RNA polymerase binds 
weakly to the DNA and slides along the double helix 
until it dissociates or encounters a promoter. The o 
subunit recognizes the promoter through several inter- 
actions with the nucleotide bases of the promoter DNA. 
The structure of a bacterial RNA polymerase holoen- 
zyme bound to a promoter site shows the o subunit 
interacting with DNA at the —10 and —35 regions 
essential to promoter recognition (Figure 29.10). 
Therefore, the a subunit is responsible for the specific bind- 
ing of the RNA polymerase to a promoter site on the tem- 
plate DNA. The o subunit is generally released when the 
nascent RNA chain reaches 9 or 10 nucleotides in 
length. After its release, it can assist initiation by another 
core enzyme. Thus, the o subunit acts catalytically. 

E. coli has seven distinct o factors for recognizing 
several types of promoter sequences in E. coli DNA. ® FIGURE 29.10 RNA polymerase holoenzyme complex. Notice 
The type that recognizes the consensus sequences __ that the o subunit (blue) of the bacterial RNA polymerase holoenzyme 
described earlier is called «7° because it has a mass of makes sequence-specific contacts with the —10 (green) and —35 
70 kDa. A different o factor comes into play when the promoter elements (yellow). The active site of the polymerase is 
temperature is raised abruptly. E. coli responds by syn- revealed by bound metal ion (purple). [Drawn from 1L9Z.pdb]. 
thesizing 0°”, which recognizes the promoters of heat- 
shock genes. These promoters exhibit —10 sequences that are somewhat dif- 
ferent from the —10 sequence for standard promoters (Figure 29.11). The 
increased transcription of heat-shock genes leads to the coordinated synthesis 
of a series of protective proteins. Other o factors respond to environmental 
conditions, such as nitrogen starvation. These findings demonstrate that a 
plays the key role in determining where RNA polymerase initiates transcription. 


—10 element 


—35 element 


F Some other bacteria contain a much larger number of o factors. For 

example, the genome of the soil bacterium Streptomyces coelicolor 
encodes more than 60 o factors recognized on the basis of their amino acid 
sequences. This repertoire allows these cells to adjust their gene-expression 
programs to the wide range of conditions, in regard to nutrients and com- 
peting organisms, that they may experience. 


RNA polymerases must unwind the template double 
helix for transcription to take place 


Although RNA polymerases can search for promoter sites when bound to 
double-helical DNA, a segment of the DNA double helix must be unwound 
before synthesis can begin. The transition from the closed promoter complex 


—35 —10 
SsSTTGACA TATAAT+--3' Standard promoter 


5'i~~TNNCNCNCTTGAA CCCATNT 3' Heat-shock promoter 
5'eaC TCG GNAMeweeT TGC A 3’ Nitrogen-starvation promoter 
FIGURE 29.11 Alternative promoter sequences. A comparison of the consensus 


sequences of standard, heat-shock, and nitrogen-starvation promoters of E. coli. These 
promoters are recognized by o”°, 032, and o°, respectively. 
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(in which DNA is double helical) to the open promoter 
complex (in which a DNA segment is unwound) is an 
—_ _ = SS essential event in transcription (Figure 29.12). The free 
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es energy necessary to break the bonds between approxi- 


Closed complex Open complex mately 17 base pairs in the double helix is derived from 


FIGURE 29.12 DNA unwinding. The transition from the closed 
promoter complex to the open promoter complex requires the 


unwinding of approximately 17 base pairs of DNA. 


FIGURE 29.13 Transcription bubble. A 
schematic representation of a transcription 
bubble in the elongation of an RNA 
transcript. Duplex DNA is unwound at the 
forward end of RNA polymerase and 
rewound at its rear end. The RNA-DNA 
hybrid rotates during elongation. 
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additional interactions that are possible when the DNA 
distorts to wrap around the RNA polymerase and from 
interactions between the single-stranded DNA regions 
and other parts of the enzyme. These interactions stabilize 
the open promoter complex and help pull the template strand into the active 
site. The —35 element remains in a double-helical state, whereas the —10 
element is unwound. The stage is now set for the formation of the first phos- 
phodiester bond of the new RNA chain. 


Elongation takes place at transcription bubbles that move 
along the DNA template 


The elongation phase of RNA synthesis begins with the formation of the 
first phosphodiester bond. Repeated cycles of nucleotide addition can take 
place at this point. However, until about 10 nucleotides have been added, 
RNA polymerase sometimes releases the short RNA, which dissociates 
from the DNA. Once RNA polymerase passes this point, the enzyme stays 
bound to its template until a termination signal is reached. The region con- 
taining RNA polymerase, DNA, and nascent RNA corresponds to the 
transcription bubble (Figure 29.13). The newly synthesized RNA forms a 
hybrid helix with the template DNA strand. This RNA-DNA helix is 
about 8 bp long, which corresponds to nearly one turn of a double helix. The 
3'-hydroxyl group of the RNA in this hybrid helix is positioned so that it 
can attack the a-phosphorus atom of an incoming ribonucleoside triphos- 
phate. The core enzyme also contains a binding site for the coding strand of 
DNA. About 17 bp of DNA are unwound throughout the elongation phase, 
as in the initiation phase. The transcription bubble moves a distance of 
170 A (17 nm) in a second, which corresponds to a rate of elongation of 
about 50 nucleotides per second. Although rapid, it is much slower than the 
rate of DNA synthesis, which is 800 nucleotides per second. 


Sequences within the newly transcribed RNA signal termination 


In bacteria, the termination of transcription is as precisely controlled as its 
initiation. In the termination phase of transcription, the formation of phos- 
phodiester bonds ceases, the RNA-DNA hybrid dissociates, the unwound 
region of DNA rewinds, and RNA polymerase releases the DNA. What 
determines where transcription is terminated? The transcribed regions of DNA 
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templates contain stop signals. The simplest one is a palindromic GC-rich region 
followed by an AT-rich region. The RNA transcript of this DNA palindrome is 
self-complementary (Figure 29.14). Hence, its bases can pair to form a hairpin 
structure with a stem and loop, a structure favored by its high content of G and 
C residues. Guanine-cytosine base pairs are more stable than adenine—thy- 
mine pairs because of the extra hydrogen bond in the base pair. This stable 
hairpin is followed by a sequence of four or more uracil residues, which also 
are crucial for termination. The RNA transcript ends within or just after them. 
How does this combination hairpin—oligo(U) structure terminate tran- 
scription? First, RNA polymerase likely pauses immediately after it has 
synthesized a stretch of RNA that folds into a hairpin. Furthermore, the 
RNA-—DNA hybrid helix produced after the hairpin is unstable because its 
rU_-dA base pairs are the weakest of the four kinds. 
Hence, the pause in transcription caused by the hair- 
pin permits the weakly bound nascent RNA to dissoci- 
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ate from the DNA template and then from the enzyme. 5! 
The solitary DNA template strand rejoins its partner 
to re-form the DNA duplex, and the transcription 
bubble closes. 


Some messenger RNAs directly sense metabolite concentrations 


As we shall explore in Chapters 31 and 32, the expression of many genes is 
controlled in response to the concentrations of metabolites and signaling 
molecules within cells. One set of control mechanisms depends on the 
remarkable ability of some mRNA molecules to form special secondary 
structures, some of which are capable of directly binding small molecules. 
These structures are termed riboswitches. Consider a riboswitch that con- 
trols the synthesis of genes that participate in the biosynthesis of riboflavin 
in Bacillus subtilis (Figure 29.15). When flavin mononucleotide (FMN), a 
key intermediate in riboflavin biosynthesis, is present at high concentration, 
it binds to a conformation of the RNA transcript with an FMN-binding 
pocket that also includes a hairpin structure that favors premature termina- 
tion. By trapping the RNA transcript in this termination-favoring confor- 
mation, FMN prevents the production of functional mRNA. However, 
when FMN is present at low concentration, it does not readily bind to 
RNA, and an alternative conformation forms without the terminator, 
allowing the production of the full-length mRNA. The occurrence of ribo- 
switches serves as a vivid illustration of how RNAs are capable of forming 
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Low [FMN], terminator disrupted, 


High [FMN], terminator formed, 
full mRNA produced 


full mRNA not produced 
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FIGURE 29.14 Termination signal. A termination signal found at the 3’ 
end of an mRNA transcript consists of a series of bases that form a 
stable stem-loop structure and a series of U residues. 


hk FIGURE 29.15 Riboswitch. (A) The 
5'-end of an mRNA that encodes proteins 
engaged in the production of flavin 
mononucleotide (FMN) folds to form a 
structure that is stabilized by binding FMN. 
This structure includes a terminator that 
leads to premature termination of the 
mRNA. At lower concentrations of FMN, 
an alternative structure that lacks the 
terminator is formed, leading to the 
production of full-length mRNA. (B) The 
three-dimensional structure of a related 
FMN-binding riboswitch bound to FMN. 
The blue and yellow stretches correspond 
to regions highlighted in the same colors 
in part A. Notice how the yellow strand 
contacts the bound FMN, stabilizing the 
structure. [Drawn from 3F2Q.pdb]. 
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FIGURE 29.17 Mechanism for the termination of 
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FIGURE 29.16 Effect of p protein on the size of RNA transcripts. 


elaborate, functional structures, though we tend to depict them as simple 
lines in the absence of specific information. 


The rho protein helps to terminate the transcription of some genes 


RNA polymerase needs no help to terminate transcription at a hairpin 
followed by several U residues. At other sites, however, termination 
requires the participation of an additional factor. This discovery was 
prompted by the observation that some RNA molecules synthesized in 
vitro by RNA polymerase acting alone are longer than those made in vivo. 
The missing factor, a protein that caused the correct termination, was 
isolated and named rho (p). Additional information about the action of p 
was obtained by adding this termination factor to an incubation mixture 
at various times after the initiation of RNA synthesis (Figure 29.16). 
RNAs with sedimentation coefficients of 10S, 135, and 175 were obtained 
when p was added at initiation, a few seconds after initiation, and 2 min- 
utes after initiation, respectively. If no p was added, transcription yielded 
a 235 RNA product. It is evident that the template contains at least three 
termination sites that respond to p (yielding 105, 135, and 175 RNA) and 
one termination site that does not (yielding 235 RNA). Thus, specific 
termination at a site producing 235 RNA can take place in the absence of 
p. However, p detects additional termination signals that are not recog- 
nized by RNA polymerase alone. 

How does p provoke the termination of RNA synthe- 
sis? A key clue is the finding that p hydrolyzes ATP in the 
presence of single-stranded RNA but not in the presence of 
DNA or duplex RNA. Hexameric p is a helicase, homolo- 
gous to the helicases that we encountered in our consider- 
ation of DNA replication (Section 28.1). A stretch of 
nucleotides is bound in such a way that the RNA passes 
through the center of the structure (Figure 29.17). The p 
protein is brought into action by sequences located in the 
nascent RNA that are rich in cytosine and poor in guanine. 
The helicase activity of p enables the protein to pull the 
nascent RNA while pursuing RNA polymerase. When p 
catches RNA polymerase at the transcription bubble, it 
breaks the RNA-DNA hybrid helix by functioning as an 
RNA-DNA helicase. 

Proteins in addition to p may provoke termination. For 
example, the nusA protein enables RNA polymerase in E. coli 
to recognize a characteristic class of termination sites. A com- 


transcription by p protein. This protein is an ATP-dependent 
helicase that binds the nascent RNA chain and pulls it away 
from RNA polymerase and the DNA template. 


mon feature of protein-independent and protein-dependent ter- 
mination is that the functioning signals lie in newly synthesized 
RNA rather than in the DNA template. 


Some antibiotics inhibit transcription 


e Many antibiotics are highly specific inhibitors of biological processes 

in bacteria. Rifampicin and actinomycin are two antibiotics that 
inhibit bacterial transcription, although in quite different ways. Rifampicin 
is a semisynthetic derivative of rifamycins, which are derived from a strain 
of Amycolatopsis that is related to the bacterium that causes strep throat. 
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This antibiotic specifically inhibits the initiation of RNA synthesis. 
Rifampicin interferes with the formation of the first few phosphodiester 
bonds in the RNA chain. The structure of a complex between a prokaryotic 
RNA polymerase and rifampicin reveals that the antibiotic blocks the chan- 
nel into which the RNA-DNA hybrid generated by the enzyme must pass 
(Figure 29.18). The binding site is 12 A from the active site itself. Rifampicin 
can inhibit only the initiation of transcription, not elongation, because the 
RNA-DNA hybrid present in the enzyme during elongation prevents the 
antibiotic from binding. The pocket in which rifampicin binds is conserved 
among bacterial RNA polymerases, but not eukaryotic polymerases, and so 
rifampicin can be used as an antibiotic in antituberculosis therapy. 

Actinomycin D, a peptide-containing antibiotic from a different strain of 
Streptomyces, inhibits transcription by an entirely different mechanism. 
Actinomycin D binds tightly and specifically to double-helical DNA and thereby 
prevents it from being an effective template for RNA synthesis. The results of 
spectroscopic, hydrodynamic, and structural studies of complexes of actino- 
mycin D and DNA reveal that the phenoxazone ring of actinomycin slips in 


Rifampicin 


FIGURE 29.18 Antibiotic action. Rifampicin binds to a pocket in the channel that is normally 
occupied by the newly formed RNA-DNA hybrid. Thus, the antibiotic blocks elongation after 
only two or three nucleotides have been added. 
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FIGURE 29.19 Actinomycin-DNA 
complex structure. (A) The structure of a 
complex between a DNA duplex (shown 
as a space-filling model) and actinomycin 
B (shown as a ball-and-stick model). Two 
actinomycin B molecules are bound in the 
complex. (B) The structure of actinomycin B 
showing the phenoxazone ring. Notice how 
the phenoxazone (yellow) slides between 
base pairs of the DNA. Abbreviation: Me, 
methyl. [Drawn from 113W.pdb] 
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FIGURE 29.20 Primary transcript. 
Cleavage of this transcript produces 5S, 
16S, and 23S rRNA molecules and a 
tRNA molecule. Spacer regions are 
shown in yellow. 


870 


we Pro a 
p-Val CH; ia as 

cS N-MeVal 
WS O~ 


o” 


CH; 


Phenoxazone 


between base pairs in DNA (Figure 29.19). This mode of binding is called 
intercalation. At low concentrations, actinomycin D inhibits transcription 
without significantly affecting DNA replication or protein synthesis. Hence, 
actinomycin D is extensively used as a highly specific inhibitor of the formation 
of new RNA in both prokaryotic and eukaryotic cells. Its ability to inhibit the 
growth of rapidly dividing cells makes it an effective therapeutic agent in the 
treatment of some cancers. 


Precursors of transfer and ribosomal RNA are cleaved and chemically 
modified after transcription in prokaryotes 


In prokaryotes, messenger RNA molecules undergo little or no modifica- 
tion after synthesis by RNA polymerase. Indeed, many mRNA molecules 
are translated while they are being transcribed. In contrast, transfer RNA 
(tRNA) and ribosomal RNA (rRNA) molecules are generated by cleavage and 
other modifications of nascent RNA chains. For example, in E. coli, the three 
rRNAs and a tRNA are excised from a single primary RNA transcript that 
also contains spacer regions (Figure 29.20). Other transcripts contain arrays 
of several kinds of tRNA or of several copies of the same tRNA. The nucle- 
ases that cleave and trim these precursors of rRNA and tRNA are highly 
precise. Ribonuclease P (RNase P), for example, generates the correct 5’ 
terminus of all tRNA molecules in E. coli. Sidney Altman and his coworkers 
showed that this interesting enzyme contains a catalytically active RNA 
molecule. Ribonuclease III (RNase III) excises 55, 168, and 235 rRNA pre- 
cursors from the primary transcript by cleaving double-helical hairpin 
regions at specific sites. 

A second type of processing is the addition of nucleotides to the termini 
of some RNA chains. For example, CCA, a terminal sequence required for 
the function of all tRNAs, is added to the 3’ ends of tRNA molecules for 
which this terminal sequence is not encoded in the DNA. The enzyme 
that catalyzes the addition of CCA is atypical for an RNA polymerase in 
that it does not use a DNA template. A third type of processing is the 
modification of bases and ribose units of ribosomal RNAs. In prokaryotes, 
some bases of rRNA are methylated. Unusual bases are found in all tRNA 
molecules. They are formed by the enzymatic modification of a standard 
ribonucleotide in a tRNA precursor. For example, uridylate residues are 
modified after transcription to form ribothymidylate and pseudouridylate. 
These modifications generate diversity, allowing greater structural and 
functional versatility. 
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29.2 Transcription in Eukaryotes Is Highly Regulated 


We turn now to transcription in eukaryotes, a much more complex process 
than in bacteria. Eukaryotic cells have a remarkable ability to regulate pre- 
cisely the time at which each gene is transcribed and how much RNA is 
produced. This ability has allowed some eukaryotes to evolve into multicel- 
lular organisms, with distinct tissues. That is, multicellular eukaryotes use 
differential transcriptional regulation to create different cell types. Gene 
expression is influenced by three important characteristics unique to 
eukaryotes: the nuclear membrane, complex transcriptional regulation, and 
RNA processing. 


1. The Nuclear Membrane. In eukaryotes, transcription and translation take 
place in different cellular compartments: transcription takes place in the 
membrane-bounded nucleus, whereas translation takes place outside the 
nucleus in the cytoplasm. In bacteria, the two processes are closely coupled 
(Figure 29.21). Indeed, the translation of bacterial mRNA begins while 
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FIGURE 29.21 Transcription and 
translation. These two processes are 
closely coupled in prokaryotes whereas 
they are spatially and temporally separate 
in eukaryotes. (A) In prokaryotes, the 
primary transcript serves as MRNA and is 
used immediately as the template for 
protein synthesis. (B) In eukaryotes, MRNA 
precursors are processed and spliced in 
he nucleus before being transported to 
he cytoplasm for translation into protein. 
[Information from J. Darnell, H. Lodish, and 
D. Baltimore. Molecular Cell Biology, 2d ed. 
(Scientific American Books, 1990), p. 230.] 
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the transcript is still being synthesized. The spatial and temporal separation 
of transcription and translation enables eukaryotes to regulate gene expression 
in much move intricate ways, contributing to the richness of eukaryotic form 
and function. 


2. Complex Transcriptional Regulation. Like bacteria, eukaryotes rely on 
conserved sequences in DNA to regulate the initiation of transcription. 
But bacteria have only three promoter elements (the —10, —35, and UP 
elements), whereas eukaryotes use a variety of types of promoter elements, 
each identified by its own conserved sequence. Not all possible types will 
be present together in the same promoter. In eukaryotes, elements that regu- 
late transcription can be found at a variety of locations in DNA, upstream 
or downstream of the start site and sometimes at distances much farther 
from the start site than in prokaryotes. For example, enhancer elements 
located on DNA far from the start site increase the promoter activity of 
specific genes. 


3. RNA Processing. Although both bacteria and eukaryotes modify RNA, 
eukaryotes very extensively process nascent RNA destined to become mRNA. 
This processing includes modifications to both ends and, most signifi- 
cantly, splicing out segments of the primary transcript. RNA processing is 
described in Section 29.3. 


Three types of RNA polymerase synthesize RNA in eukaryotic cells 


In bacteria, RNA 1s synthesized by a single kind of polymerase. In con- 
trast, the nucleus of a typical eukaryotic cell contains three types of 
RNA polymerase differing in template specificity and location in the 
nucleus (Table 29.2). All these polymerases are large proteins, contain- 
ing from 8 to 14 subunits and having total molecular masses greater than 
500 kDa. RNA polymerase I is located in specialized structures within the 
nucleus called nucleoli, where it transcribes the tandem array of genes 
for 188, 5.88, and 285 rRNA. The other rRNA molecule (55 rRNA) and 
all the tRNA molecules are synthesized by RNA polymerase III, which 
is located in the nucleoplasm rather than in nucleoli. RNA polymerase IT, 
which also is located in the nucleoplasm, synthesizes the precursors 
of mRNA as well as several small RNA molecules, such as those of 
the splicing apparatus and many of the precursors to small regulatory 
RNAs. 

Although all eukaryotic RNA polymerases are homologous to one 
another and to prokaryotic RNA polymerases, RNA polymerase IT contains 
a unique carboxyl-terminal domain on the 220-kDa subunit called the CTD; 
this domain is unusual because it contains multiple repeats of a YSPTSPS 
consensus sequence. The activity of RNA polymerase II is regulated by 
phosphorylation mainly on the serine residues of the CTD. 

Another major distinction among the polymerases lies in their responses 
to the toxin a-amanitin, a cyclic octapeptide that contains several modified 
amino acids. 


TABLE 29.2 Eukaryotic RNA polymerases 


Type Location Cellular transcripts Effects of a-amanitin 
I Nucleolus 18S, 5.85, and 28S rRNA Insensitive 
II Nucleoplasm |= mRNA precursorsandsnRNA _ Strongly inhibited 


Il Nucleoplasm tRNA and 55 rRNA Inhibited by high concentrations 


a-Amanitin 
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Amanita phalloides, also called the death cap. 
[Jacana/Science Source.] 


a-Amanitin is produced by the poisonous mushroom 
Amanita phalloides, which is also called the death cap or the 
destroying angel. More than a hundred deaths result world- 
wide each year from the ingestion of poisonous mushrooms. 
a-Amanitin binds very tightly (Ky = 10 nM) to RNA 
polymerase IT and thereby blocks the elongation phase 


RNA polymerase | promoter 


= 


Upstream 
promoter 


element 
L 


Promoter 


of RNA synthesis. Higher concentrations of a-amanitin 
(1 pM) inhibit polymerase III, whereas polymerase I is 
insensitive to this toxin. This pattern of sensitivity is highly 
conserved throughout the animal and plant kingdoms. 
Eukaryotic polymerases also differ from each other in 
the promoters to which they bind. Eukaryotic genes, like 
prokaryotic genes, require promoters for transcription ini- 
tiation. Like prokaryotic promoters, eukaryotic promoters 
consist of conserved sequences that attract the polymerase 
to the start site. However, eukaryotic promoters differ dis- 
tinctly in sequence and position, depending on the type of 
RNA polymerase to which they bind (Figure 29.22). 


Enhancer 


Enhancer 


1. RNA Polymerase I. The ribosomal DNA (rDNA) 
transcribed by polymerase I is arranged in several hun- 
dred tandem repeats, each containing a copy of each of 
three rRNA genes. The promoter sequences are located in 
stretches of DNA separating the genes. At the transcrip- 
tional start site lies a TAT A-like sequence called the ribo- 
somal initiator element (rInr). Farther upstream, 150 to 200 bp from the start 
site, is the upstream promoter element (UPE). Both elements aid transcrip- 
tion by binding proteins that recruit RNA polymerase I. 


Type I: 5S rRNA 


2. RNA Polymerase II. Promoters for RNA polymerase IJ, like prokaryotic 
promoters, include a set of consensus sequences that define the start site and 
recruit the polymerase. However, the promoter can contain any combina- 
tion of a number of possible consensus sequences. Unique to eukaryotes, 
they also include enhancer elements that can be very distant (more than 1 
kb) from the start site. 


3. RNA Polymerase III. Promoters for RNA polymerase III are within the tran- 
scribed sequence, downstream of the start site. There are two types of intergenic 
promoters for RNA polymerase III. Type I promoters, found in the 55 rRNA 


RNA polymerase II promoter 


(eee fae 


Inr 


TATA box 
l | 
Promoter 


Promoter 


RNA polymerase III promoter 


Type Il: tRNA 


H B block 


FIGURE 29.22 Common eukaryotic 
promoter elements. Each eukaryotic RNA 
polymerase recognizes a set of promoter 
elements — sequences in DNA that promote 
transcription. The RNA polymerase | 
promoter consists of a ribosomal initiator 
(rinr) and an upstream promoter element 
(UPE). The RNA polymerase II promoter 
likewise includes an initiator element (Inr) 
and may also include either a TATA box or 
a downstream promoter element (DPE). 
Separate from the promoter region, 
enhancer elements bind specific 
transcription factors. RNA polymerase III 
promoters consist of conserved sequences 
that lie within the transcribed genes. 
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5' Tg2 Ag7 Tox Ags Ags Ass Aso 3’ 
TATA box 


FIGURE 29.23 TATA box. Comparisons of 
the sequences of more than 100 eukaryotic 
promoters led to the consensus sequence 
shown. The subscripts denote the 
frequency (%) of the base at that position. 


5’ GGNCAATCT3’ 
CAAT box 


5'GGGCGG3' 
GC box 


FIGURE 29.24 CAAT box and GC 

box. Consensus sequences for the CAAT 
and GC boxes of eukaryotic promoters for 
mRNA precursors. 


gene, contain two short conserved sequences known as the A block and the C 
block. Type II promoters, found intRNA genes, consist of two 11-bp sequenc- 
es, the A block and the B block, situated about 15 bp from either end of the gene. 


Three common elements can be found in the RNA polymerase II 
promoter region 


RNA polymerase II transcribes all of the protein-coding genes in eukaryotic 
cells. Promoters for RNA polymerase II, like those for bacterial polymer- 
ases, are generally located on the 5’ side of the start site for transcription. 
Because these sequences are on the same molecule of DNA as the genes 
being transcribed, they are called cis-acting elements. The most commonly 
recognized cis-acting element for genes transcribed by RNA polymerase II 
is called the TATA box on the basis of its consensus sequence (Figure 29.23). 
The TATA box is usually found between positions —30 and —100. Note 
that the eukaryotic TATA box closely resembles the prokaryotic —10 
sequence (TATAAT) but is farther from the start site. The mutation of a 
single base in the TATA box markedly impairs promoter activity. Thus, the 
precise sequence, not just a high content of AT pairs, is essential. 

The TATA box is often paired with an initiator element (Inr), a sequence 
found at the transcriptional start site, between positions —3 and +5. This 
sequence defines the start site because the other promoter elements are at vari- 
able distances from that site. Its presence increases transcriptional activity. 

A third element, the downstream core promoter element (DPE), is com- 
monly found in conjunction with the Inr in transcripts that lack the TATA 
box. In contrast with the TATA box, the DPE is found downstream of the 
start site, between positions +28 and +32. 

Additional regulatory sequences are located between —40 and —150. 
Many promoters contain a CAAT box, and some contain a GC box 
(Figure 29.24). Constitutive genes (genes that are continuously expressed 
rather than regulated) tend to have GC boxes in their promoters. The posi- 
tions of these upstream sequences vary from one promoter to another, in 
contrast with the quite constant location of the —35 region in prokaryotes. 
Another difference is that the CAAT box and the GC box can be effective 
when present on the template (antisense) strand, unlike the —35 region, 
which must be present on the coding (sense) strand. These differences 
between prokaryotes and eukaryotes correspond to fundamentally different 
mechanisms for the recognition of cis-acting elements. The —10 and —35 
sequences in prokaryotic promoters are binding sites for RNA polymerase 
and its associated o factor. In contrast, the TATA, CAAT, and GC boxes 
and other cis-acting elements in eukaryotic promoters are recognized by 
proteins other than by RNA polymerase itself. 


The TFIID protein complex initiates the assembly of the active 
transcription complex 


Cis-acting elements constitute only part of the puzzle of eukaryotic gene 
expression. Transcription factors that bind to these elements also are 
required. For example, RNA polymerase II is guided to the start site by a 
set of transcription factors known collectively as TFII (TF stands for tran- 
scription factor, and IJ refers to RNA polymerase II). Individual TFII fac- 
tors are called TFIIA, TFIIB, and so on. 

In TATA-box promoters, the key initial event is the recognition of the 
TATA box by the TATA-box-binding protein (TBP), a 30-kDa compo- 
nent of the 700-kDa TFIID complex (Figure 29.25). In TATA-less promot- 
ers, other proteins in the TFIID complex bind the core promoter elements 
but, because less is known about these interactions, we will consider only the 


FIGURE 29.25 Transcription initiation. Transcription factors TFIIA, B, D, E, F, and H are 
essential in initiating transcription by RNA polymerase II. The step-by-step assembly of these 
general transcription factors begins with the binding of TFIID (purple) to the TATA box. [The 
TATA-box-binding protein (TBP), a component of TFIID, recognizes the TATA box.] After 
assembly, TFIIH opens the DNA double helix and phosphorylates the carboxyl-terminal 
domain (CTD), allowing the polymerase to leave the promoter and begin transcription. 

The red arrow marks the transcription start site. 


TATA-box-TBP binding interaction. TBP binds 10° times as tightly to the 
TATA box as to nonconsensus sequences; the dissociation constant of the 
TBP-TATA-box complex is approximately 1 nM. TBP is a saddle-shaped 
protein consisting of two similar domains (Figure 29.26). The TATA box of 
DNA binds to the concave surface of TBP. This binding induces large confor- 
mational changes in the bound DNA. The double helix is substantially 
unwound to widen its minor groove, enabling it to make extensive contact with 
the antiparallel B strands on the concave side of TBP. Hydrophobic interac- 
tions are prominent at this interface. Four phenylalanine residues, for example, 
are intercalated between base pairs of the TATA box. The flexibility of AT-rich 
sequences is generally exploited here in bending the DNA. Immediately out- 
side the TATA box, classical B-DNA resumes. The TBP—TATA-box com- 
plex is distinctly asymmetric. The asymmetry is crucial for specifying a unique 
start site and ensuring that transcription proceeds unidirectionally. 

TBP bound to the TATA box is the heart of the initiation complex 
(Figure 29.25). The surface of the TBP saddle provides docking sites for the 
binding of other components. Additional transcription factors assemble on 
this nucleus in a defined sequence. TFIIA is recruited, followed by TFIIB; 
then TFIIF, RNA polymerase I, TFITE, and TFIIH join the other factors 
to form a complex called the basal transcription apparatus. During the for- 
mation of the basal transcription apparatus, the carboxyl-terminal domain 
(CTD) is unphosphorylated and plays a role in transcription regulation 
through its binding to an enhancer-associated complex called mediator 
(Section 32.2). The phosphorylated CTD stabilizes transcription elonga- 
tion by RNA polymerase II and recruits RNA-processing enzymes that act 
in the course of elongation. Phosphorylation of the CTD by TFIIH marks the 
transition from initiation to elongation. The importance of the carboxyl-ter- 
minal domain is highlighted by the finding that yeast containing mutant 
polymerase II with fewer than 10 repeats in the CTD are not viable. Most 
of the factors are released before the polymerase leaves the promoter and can 
then participate in another round of initiation. 


Multiple transcription factors interact with eukaryotic promoters 


The basal transcription complex described in the preceding section initiates 
transcription at a low frequency. Additional transcription factors that bind 
to other sites are required to achieve a high rate of mRNA synthesis. Their 
role is to selectively stimulate specific genes. Upstream stimulatory sites in 
eukaryotic genes are diverse in sequence and variable in position. Their 
variety suggests that they are recognized by many different specific pro- 
teins. Indeed, many transcription factors have been isolated, and their bind- 
ing sites have been identified by footprinting experiments. For example, 
heat-shock transcription factor (HSTF) is expressed in Drosophila after an 
abrupt increase in temperature. This 93-kDa DNA-binding protein binds 
to the following consensus sequence: 


5'-CNNGAANNTCCNNG~3’ 


Several copies of this sequence, known as the heat-shock response ele- 
ment, are present starting at a site 15 bp upstream of the TATA box. 


-50 -30 -10 +10 +30 


Inr \ 
DNA template 


RNA pol II 


DNA opened 
CTD phosphorylated 


k FIGURE 29.26 Complex formed by 
TATA-box-binding protein and DNA. 
The saddlelike structure of the protein sits 
atop a DNA fragment. Notice that the DNA 
is significantly unwound and bent. [Drawn 
from 1CDW.pdb.] 
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Heat-shock 
gene 


Heat-shock element 


HSTF differs from o*”, a heat-shock protein of E. coli (p. 865), in binding 
directly to response elements in heat-shock promoters rather than first 
becoming associated with RNA polymerase. 


Enhancer sequences can stimulate transcription at start 
sites thousands of bases away 


The activities of many promoters in higher eukaryotes are greatly increased 
by another type of cis-acting element called an enhancer. Enhancer sequences 
have no promoter activity of their own yet can exert their stimulatory actions 
over distances of several thousand base pairs. They can be upstream, down- 
stream, or even in the midst of a transcribed gene. Moreover, enhancers are 
effective when present on either the coding or non-coding DNA strand. 


ee A particular enhancer is effective only in certain cells. For example, the 
immunoglobulin enhancer functions in B lymphocytes but not else- 
where. Cancer can result if the relation between genes and enhancers is 
disrupted. In Burkitt lymphoma and B-cell leukemia, a chromosomal trans- 
location brings the proto-oncogene myc (a transcription factor itself) under the 
control of a powerful immunoglobin enhancer. The consequent dysregulation 
of the myc gene is believed to play a role in the progression of the cancer. 

Transcription factors and other proteins that bind to regulatory sites on DNA 
can be regarded as passwords that cooperatively open multiple locks, giving RNA 
polymerase access to specific genes. The discovery of promoters and enhancers 
has allowed us to gain a better understanding of how genes are selectively 
expressed in eukaryotic cells. The regulation of gene transcription, discussed 
in Chapter 32, is the fundamental means of controlling gene expression. 


a Although bacteria lack TBP, archaea utilize a TBP molecule that is 

structurally quite similar to the eukaryotic protein. In fact, transcrip- 
tional control processes in archaea are, in general, much more similar to 
those in eukaryotes than are the processes in bacteria. Many components of 
the eukaryotic transcriptional machinery evolved from those in an ancestor 
of archaea. 


29.3 The Transcription Products of Eukaryotic 
Polymerases Are Processed 


Virtually all the initial products of transcription are further processed in 
eukaryotes. For example, primary transcripts (pre-mRNA molecules), the 
products of RNA polymerase II action, acquire a cap at their 5’ ends and a 
poly(A) tail at their 3’ends. Most importantly, nearly all mRNA precursors in 
higher eukaryotes are spliced. Introns are precisely excised from primary tran- 
scripts, and exons are joined to form mature mRNAs with continuous mes- 
sages. Some mature mRNAsare only a tenth the size of their precursors, which 
can be as large as 30 kb or more. The pattern of splicing can be regulated in the 
course of development to generate variations on a theme, such as membrane- 
bound or secreted forms of antibody molecules. Alternative splicing enlarges 
the repertoire of proteins in eukaryotes and is one clear illustration of why the 
proteome is more complex than the genome. The particular processing steps 
and the factors taking part vary according to the type of RNA polymerase. 
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FIGURE 29.27 Processing of eukaryotic pre-rRNA. The mammalian pre-rRNA transcript 
contains the RNA sequences destined to become the 18S, 5.8S, and 28S rRNAs of the 
small and large ribosomal subunits. First, nucleotides are modified: small nucleolar 
ribonucleoproteins methylate specific nucleoside groups and convert selected uridines into 
pseudouridines (indicated by red lines). Next, the pre-rRNA is cleaved and packaged to form 
mature ribosomes, in a highly regulated process in which more than 200 proteins take part. 


RNA polymerase | produces three ribosomal RNAs 


Several RNA molecules are key components of ribosomes. RNA poly- 
merase I transcription produces a single precursor (455 in mammals) that 
encodes three RNA components of the ribosome: the 185 rRNA, the 285 
rRNA, and the 5.85 rRNA (Figure 29.27). The 185 rRNA is the RNA com- 
ponent of the small ribosomal subunit (40S), and the 28S and 5.85 rRNAs 
are two RNA components of the large ribosomal subunit (605). The other 
RNA component of the large ribosomal subunit, the 55 rRNA, is tran- 
scribed by RNA polymerase III as a separate transcript. 

The cleavage of the precursor into three separate rRNAs is actually the 
final step in its processing. First, the nucleotides of the pre-rRNA sequences 
destined for the ribosome undergo extensive modification, on both ribose 
and base components, directed by many small nucleolar ribonucleoproteins 
(snoRNPs), each of which consists of one snoRNA and several proteins. 
The pre-rRNA is assembled with ribosomal proteins, as guided by process- 
ing factors, in a large ribonucleoprotein. For instance, the small-subunit 
(SSU) processome is required for 185 rRNA synthesis and can be visualized 
in electron micrographs as a terminal knob at the 5’ ends of the nascent 
rRNAs (Fig. 29.28). Finally, rR NA cleavage (sometimes coupled with addi- 
tional processing steps) releases the mature rRNAs assembled with ribo- 
somal proteins as ribosomes. Like those of RNA polymerase I transcription 
itself, most of these processing steps take place in the cell nucleolus, a 
nuclear subcompartment. 


RNA polymerase III produces transfer RNA 


Eukaryotic tRNA transcripts are among the most processed of all RNA 
polymerase III transcripts. Like those of prokaryotic tRNAs, the 5’ 
leader is cleaved by RNase P, the 3’ trailer is removed, and CCA is added 
by the CCA-adding enzyme (Figure 29.29). Eukaryotic tRNAs are also 
heavily modified on base and ribose moieties; these modifications are 
important for function. In contrast with prokaryotic tRNAs, many 
eukaryotic pre-tRNAs are also spliced by an endonuclease and a ligase to 
remove an intron. 
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FIGURE 29.28 Visualization of rRNA 
transcription and processing in 
eukaryotes. Transcription of rRNA and its 
assembly into precursor ribosomes can be 
visualized by electron microscopy. The 
structures resemble Christmas trees: the 
trunk is the rDNA and each branch is a 
pre-rRNA transcript. Transcription starts 

at the top of the tree, where the shortest 
transcripts can be seen, and progresses 
down the rDNA to the end of the gene. 
The terminal knobs visible at the end of 
some pre-rRNA transcripts likely 
correspond to the SSU processome, 

a large ribonucleoprotein required for 
processing the pre-rRNA. [Reprinted by 
permission from Macmillan Publishers Ltd: 
Nature 417:967-970, F. Dragon et. al. 

A large nucleolar U3 ribonucleoprotein required 
for 18S ribosomal RNA biogenesis, © 2002.] 
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FIGURE 29.30 Capping the 5’ end. Caps 
at the 5’ end of eukaryotic mRNA include 
7-methylguanylate (red) attached by a 
triphosphate linkage to the ribose at the 


5’ end. N 


one of the riboses are methylated 


in cap O, one is methylated in cap 1, and 
both are methylated in cap 2. 


FIGURE 29.31 Polyadenylation of a 
primary transcript. A specific endonuclease 
cleaves the RNA downstream of AAUAAA. 
Poly(A) polymerase then adds about 250 
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FIGURE 29.29 Transfer RNA precursor processing. The conversion of a yeast tRNA 
precursor into a mature tRNA requires the removal of a 14-nucleotide intron (yellow), the 
cleavage of a 5’ leader (green), and the removal of UU and the attachment of CCA at the 
3’ end (red). In addition, several bases are modified. 


The product of RNA polymerase II, the pre-mRNA transcript, 
acquires a 5’ cap and a 3’ poly(A) tail 


Perhaps the most extensively studied transcription product is the product 
of RNA polymerase II: most of this RNA will be processed to mRNA. The 
immediate product of RNA polymerase II is sometimes referred to as precur- 
sor-to-messenger RNA, or pre-mRNA. Most pre-mRNA molecules are spliced 
to remove the introns. Moreover, both the 5’ and the 3’ ends are modified, and 

both modifications are retained as the pre-mRNA is converted into mRNA. 
As in prokaryotes, eukaryotic transcription usually begins with A or G. 
However, the 5’ triphosphate end of the nascent RNA chain is immediately 
modified. First, a phosphoryl group is released by hydrolysis. The diphos- 
phate 5’ end then attacks the a-phosphorus atom of GTP to form a very 
unusual 5’—5’ triphosphate linkage. This distinctive terminus is called a cap 
(Figure 29.30). The N-7 nitrogen of the terminal guanine is then methylated 
by S-adenosylmethionine to form cap 0. The adjacent riboses may be meth- 
ylated to form cap 1 or cap 2. Transfer RNA and ribosomal RNA molecules, 
in contrast with messenger RNAs and with small RNAs that participate in 
splicing, do not have caps. Caps contribute to the stability 


lat i : : ‘ 
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Template DNA 4 and nucleases. In addition, caps enhance the translation of 
soococescoec]escessesooool mRNA by eukaryotic protein-synthesizing systems. 
wes Ll As mentioned earlier, pre-mRNA is also modified at 
p AAUAAA ; 
\ : the 3’ end. Most eukaryotic mRNAs contain a polyadenyl- 
Nascent RNA Cleavage signal : ie 
ate, poly(A), tail at that end, added after transcription has 
sear e by specific ended. The DNA template does not encode this poly(A) 
peace tail. Indeed, the nucleotide preceding poly(A) is not the 
last nucleotide to be transcribed. Some primary tran- 
aides Pe meer ar " scripts contain hundreds of nucleotides beyond the 3’ end 
poly(A) polymerase of the mature mRNA. 
Pei How is the 3’ end of the pre-mRNA given its final form? 
5’ Cap AAUAAA—=AAAAA(A), —OH 3 ~—- Eukaryotic primary transcripts are cleaved by a specific endonu- 


Polyadenylated mRNA precursor 


clease that recognizes the sequence AAUAAA (Figure 29.31). 


Pri- Pre- 
microRNA microRNA 


Cleavage Cleavage 
al > ii) ——> 


Argonaute 


LTT > a >. 
Mature 
microRNA 


Cleavage does not take place if this sequence or a segment of some 20 nucleo- 
tides on its 3’ side is deleted. The presence of internal AAUAAA sequences in 
some mature mRNAs indicates that AAUAAA is only part of the cleavage 
signal; its context also is important. After cleavage of the pre- RNA by the endo- 
nuclease, a poly(A) polymerase adds about 250 adenylate residues to the 3’ end 
of the transcript; ATP is the donor in this reaction. 

The role of the poly(A) tail is still not firmly established despite much 
effort. However, evidence is accumulating that it enhances translation effi- 
ciency and the stability of mRNA. Blocking the synthesis of the poly(A) tail 
by exposure to 3’-deoxyadenosine (cordycepin) does not interfere with the 
synthesis of the primary transcript. Messenger RNA devoid of a poly(A) 
tail can be transported out of the nucleus. However, an mRNA molecule 
devoid of a poly(A) tail is usually a much less effective template for protein 
synthesis than is one with a poly(A) tail. Indeed, some mRNAs are stored 
in an unadenylated form and receive the poly(A) tail only when translation 
is imminent. The half-life ofan mRNA molecule may be determined in part 
by the rate of degradation of its poly(A) tail. 


Small regulatory RNAs are cleaved from larger precursors 


Cleavage plays a role in the processing of small single-stranded RNAs 
(approximately 20—23 nucleotides) called microRNAs. MicroRNAs play key 
roles in gene regulation in eukaryotes, as we shall see in Chapter 32. They are 
generated from initial transcripts produced by RNA polymerase II and, in 
some cases, RNA polymerase III. These transcripts fold into hairpin struc- 
tures that are cleaved by specific nucleases at various stages (Figure 29.32). 
The final single-stranded RNAs are bound by members of the Argonaute 
family of proteins where the RNAs help target the regulation of specific genes. 


RNA editing changes the proteins encoded by mRNA 


Remarkably, the amino acid sequence information encoded by some 
mRNAs 1s altered after transcription. RNA editing is the term for a change 
in the nucleotide sequence of RNA after transcription by processes other 
than RNA splicing. RNA editing is prominent in some systems already 
discussed. Apolipoprotein B (apo B) plays an important role in the transport 
of triacylglycerols and cholesterol by forming an amphipathic spherical 
shell around the lipids carried in lipoprotein particles (Section 26.3). Apo 
B exists in two forms, a 512-kDa apo B-100 and a 240-kDa apo B-48. The 
larger form, synthesized by the liver, participates in the transport of lipids 
synthesized in the cell. The smaller form, synthesized by the small intestine, 
carries dietary fat in the form of chylomicrons. Apo B-48 contains the 2152 
N-terminal residues of the 4536-residue apo B-100. This truncated mole- 
cule can form lipoprotein particles but cannot bind to the low-density- 
lipoprotein receptor on cell surfaces. What is the relationship between these 
two forms of apo B? Experiments revealed that a totally unexpected mecha- 
nism for generating diversity is at work: the changing of the nucleotide 
sequence of mRNA after its synthesis (Figure 29.33). A specific cytidine residue 
of mRNA is deaminated to uridine, which changes the codon at residue 2153 


FIGURE 29.32 Small regulatory RNA 
production. A pathway from a transcription 
product including a microRNA to the 
mature microRNA bound to an Argonaute 
protein. The initial transcription product, 

a pri-microRNA, is first cleaved to a small 
double-stranded RNA called a pre- 
microRNA. One of the strands of the pre- 
microRNA, the mature microRNA, is then 
bound by an Argonaute protein. 
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FIGURE 29.33 RNA editing. Enzyme- 
catalyzed deamination of a specific cytidine 
residue in the mRNA for apolipoprotein 
B-100 changes a codon for glutamine 
(CAA) to a stop codon (UAA). 
Apolipoprotein B-48, a truncated version 
of the protein lacking the LDL receptor- 
binding domain, is generated by this 
posttranscriptional change in the mRNA 
sequence. [Information from P. Hodges and 
J. Scott, Trends Biochem. Sci. 17:77, 1992.] 
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from CAA (Gln) to UAA (stop). The deaminase that catalyzes this reaction 
is present in the small intestine, but not in the liver, and is expressed only at 
certain developmental stages. 

RNA editing is not confined to apolipoprotein B. Glutamate opens cat- 
ion-specific channels in the vertebrate central nervous system by binding to 
receptors in postsynaptic membranes. RNA editing changes a single gluta- 
mine codon (CAG) in the mRNA for the glutamate receptor to the codon 
for arginine (read as CGG). The substitution of Arg for Gln in the receptor 
prevents Ca’, but not Na‘, from flowing through the channel. 

RNA editing is likely much more common than was formerly thought. 
The chemical reactivity of nucleotide bases, including the susceptibility to 
deamination that necessitates complex DNA-repair mechanisms, has been 
harnessed as an engine for generating molecular diversity at the RNA and, 
hence, protein levels. 

In trypanosomes (parasitic protozoans), a different kind of RNA editing 
markedly changes several mitochondrial mRNAs. Nearly half the uridine 
residues in these mRNAs are inserted by RNA editing. A guide RNA mol- 
ecule identifies the sequences to be modified, and a poly(U) tail on the guide 
donates uridine residues to the mRNAs undergoing editing. DNA sequences 
evidently do not always faithfully disclose the sequence of encoded pro- 
teins: functionally crucial changes to mRNA can take place. 


Sequences at the ends of introns specify splice sites 
in MRNA precursors 


Most genes in higher eukaryotes are composed of exons and introns (Section 
4.7). The introns must be excised and the exons must be linked to form the 
final mRNA in a process called RNA splicing. This splicing must be exqui- 
sitely sensitive: splicing just one nucleotide upstream or downstream of the 
intended site would create a one-nucleotide shift, which would alter the 
reading frame on the 3’ side of the splice to give an entirely different amino 
acid sequence, likely including a premature stop codon. Thus, the correct 
splice site must be clearly marked. Does a particular sequence denote the 
splice site? The sequences of thousands of intron—exon junctions within 
RNA transcripts are known. In eukaryotes from yeast to mammals, these 
sequences have a common structural motif: the intron begins with GU and 
ends with AG. The consensus sequence at the 5’ splice in vertebrates is 
AGGUAAGU, where the GU is invariant (Figure 29.34). At the 3’ end of 
an intron, the consensus sequence is a stretch of 10 pyrimidines (U or C; 
termed the polypyrimidine tract), followed by any base and then by C, and 
ending with the invariant AG. Introns also have an important internal site 
located between 20 and 50 nucleotides upstream of the 3’ splice site; it is 
called the branch site for reasons that will be evident shortly. In yeast, the 
branch-site sequence is nearly always VACUAAC, whereas, in mammals, 
a variety of sequences are found. 

The 5’ and 3’ splice sites and the branch site are essential for determining 
where splicing takes place. Mutations in each of these three critical regions 
lead to aberrant splicing. Introns vary in length from 50 to 10,000 nucleo- 
tides, and so the splicing machinery may have to find the 3’ site several 
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FIGURE 29.34 Splice sites. Consensus sequences for the 5’ splice site and the 3’ splice 
site are shown. Py stands for pyrimidine. 
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FIGURE 29.35 Splicing mechanism used for mRNA precursors. The upstream (5’) exon is 
shown in blue, the downstream (3’) exon in green, and the branch site in yellow. Y stands for 
a pyrimidine nucleotide, R for a purine nucleotide, and N for any nucleotide. The 5’ splice site 
is attacked by the 2’-OH group of the branch-site adenosine residue. The 3’ splice site is 
attacked by the newly formed 3’-OH group of the upstream exon. The exons are joined, and 
the intron is released in the form of a lariat. [Information from P. A. Sharp, Cell 42:397-408, 1985.] 


thousand nucleotides away. Specific sequences near the splice sites (in both 
the introns and the exons) play an important role in splicing regulation, par- 
ticularly in designating splice sites when there are many alternatives. 
Researchers are currently attempting to determine the factors that contrib- 
ute to splice-site selection for individual mRNAs. Despite our knowledge of 
splice-site sequences, predicting pre-mRNAs and their protein products 
from genomic DNA sequence information remains a challenge. 


Splicing consists of two sequential transesterification reactions 


The splicing of nascent mRNA molecules is a complicated process. It 
requires the cooperation of several small RNAs and proteins that form a 
large complex called a spliceosome. However, the chemistry of the splicing 
process is simple. Splicing begins with the cleavage of the phosphodiester 
bond between the upstream exon (exon 1) and the 5’ end of the intron 
(Figure 29.35). The attacking group in this reaction is the 2’-OH group of 
an adenylate residue in the branch site. A 2’—5’ phosphodiester bond is 
formed between this A residue and the 5’ terminal phosphate of the intron. 
This reaction is a transesterification. 
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Note that this adenylate residue is also joined to two other nucleotides by 
normal 3’—5' phosphodiester bonds (Figure 29.36). Hence a branch is gen- 
erated at this site, and a lariat intermediate is formed. 
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FIGURE 29.36 Splicing branch point. The 
structure of the branch point in the lariat 
intermediate in which the adenylate residue 
is joined to three nucleotides by 
phosphodiester bonds. The new 2'-to-5' 
linkage is shown in red, and the usual 
8’-to-5' linkages are shown in blue. 
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TABLE 29.3 Small nuclear ribonucleoprotein particles (SnRNPs) in the splicing 
of mRNA precursors 


Size of snRNA 
snRNP (nucleotides) Role 
U1 165 Binds the 5’ splice site 
U2 185 Binds the branch site and forms part of the catalytic center 
U5 116 Binds the 5’ splice site and then the 3’ splice site 
U4 145 Masks the catalytic activity of U6 
U6 106 Catalyzes splicing 


The 3’-OH terminus of exon 1 then attacks the phosphodiester bond 
between the intron and exon 2. Exons 1 and 2 become joined, and the intron 
is released in lariat form. Again, this reaction is a transesterification. 
Splicing is thus accomplished by two transestevification reactions rather than 
by hydrolysis followed by ligation. The first reaction generates a free 3’-OH 
group at the 3’ end of exon 1, and the second reaction links this group to the 
5'-phosphate of exon 2. The number of phosphodiester bonds stays the same 
during these steps, which is crucial because it allows the splicing reaction 
itself to proceed without an energy source such as ATP or GTP. 


Small nuclear RNAs in spliceosomes catalyze the 
splicing of MRNA precursors 


The nucleus contains many types of small RNA molecules with fewer than 
300 nucleotides, referred to as snRNAs (small nuclear RNAs). A few of 
them—designated U1, U2, U4, U5, and U6—are essential for splicing 
mRNA precursors. The secondary structures of these RNAs are highly 
conserved in organisms ranging from yeast to human beings. These RNA 
molecules are associated with specific proteins to form complexes termed 
snRNPs (small nuclear ribonucleoprotein particles); investigators often 
speak of them as “snurps.” Spliceosomes are large (60S) dynamic assemblies 
composed of snRNPs, hundreds of other proteins called splicing factors, and 
the mRNA precursors being processed (Table 29.3). 

In mammalian cells, splicing begins with the recognition of the 5’ splice 
site by the U1 snRNP (Figure 29.37). U1 snRNA contains a highly con- 
served six-nucleotide sequence, not covered by protein in the snRNP, that 
base-pairs to the 5’ splice site of the pre-mRNA. This binding initiates 
spliceosome assembly on the pre-mRNA molecule. 


5’ splice site mRNA precursor 
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U2 snRNP then binds the branch site in the intron by base-pairing 
between a highly conserved sequence in U2 snRNA and the pre-mRNA. 


FIGURE 29.37 Spliceosome assembly and action. U1 binds the 5’ splice site and U2 binds 
to the branch point. A preformed U4-U5-U6 complex then joins the assembly to form the 
complete spliceosome. The U6 snRNA re-folds and binds the 5’ splice site, displacing U1. 
Extensive interactions between U6 and U2 displace U4. Then, in the first transesterification 
step, the branch-site adenosine attacks the 5’ splice site, making a lariat intermediate. U5 
holds the two exons in close proximity, and the second transesterification takes place, with 
the 5’ splice-site hydroxyl group attacking the 3’ splice site. These reactions result in the 
mature spliced mRNA and a lariat form of the intron bound by U2, U5, and U6. [Information 
from T. Villa, J. A. Pleiss, and C. Guthrie, Cel! 109:149-152, 2002.] 
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U2 snRNP binding requires ATP hydrolysis. A preassembled U4-U5-U6 
tri-snRNP joins this complex of U1, U2, and the mRNA precursor to form 
the spliceosome. This association also requires ATP hydrolysis. 

A revealing view of the interplay of RNA molecules in this assembly 
came from examining the pattern of cross-links formed by psoralen, a reagent 
that joins neighboring pyrimidines in base-paired regions on treatment with 
light. These cross-links suggest that splicing takes place in the following way. 
First, U5 interacts with exon sequences in the 5’ splice site and subsequently 
with the 3’ exon. Next, U6 disengages from U4 and undergoes an intramo- 
lecular rearrangement that permits base-pairing with U2 as well as interac- 
tion with the 5’ end of the intron, displacing U1 from the spliceosome. The 
U2-U6 helix is indispensable for splicing, suggesting that U2 and U6 
snRNAs probably form the catalytic center of the spliceosome (Figure 29.38). 
U4 serves as an inhibitor that masks U6 until the specific splice sites are 
aligned. These rearrangements result in the first transesterification reaction, 
cleaving the 5’ exon and generating the lariat intermediate. 

Further rearrangements of RNA in the spliceosome facilitate the second 
transesterification. In these rearrangements, U5 aligns the free 5’ exon with 
the 3’ exon such that the 3'-hydroxyl group of the 5’ exon is positioned to 
nucleophilically attack the 3’ splice site to generate the spliced product. U2, 
U5, and U6 bound to the excised lariat intron are released to complete the 
splicing reaction. 

Many of the steps in the splicing process require ATP hydrolysis. How 
is the free energy associated with ATP hydrolysis used to power splicing? 
To achieve the well-ordered rearrangements necessary for splicing, ATP- 
powered RNA helicases must unwind RNA helices and allow alternative 
base-pairing arrangements to form. Thus, two features of the splicing pro- 
cess are noteworthy. First, RNA molecules play key roles in directing the 
alignment of splice sites and in carrying out catalysis. Second, ATP-powered 
helicases unwind RNA duplex intermediates that facilitate catalysis and induce 
the release of snRNPs from the mRNA. 


Transcription and processing of MRNA are coupled 


Although the transcription and processing of mRNAs have been described 
herein as separate events in gene expression, experimental evidence suggests 
that the two steps are coordinated by the carboxyl-terminal domain of RNA 
polymerase IT. We have seen that the CTD consists of a unique repeated seven- 
amino-acid sequence, YSPTSPS. Either 5, or 5; or both may be phosphory- 
lated in the various repeats. The phosphorylation state of the CTD is controlled 
by a number of kinases and phosphatases and leads the CTD to bind many of 


FIGURE 29.38 Splicing catalytic 
center. The catalytic center of the 


spliceosome is formed by U2 snRNA (red) 


and U6 snRNA (green), which are base 
paired. U2 is also base paired to the 
branch site of the mRNA 


precursor. [Information from H. D. Madhani 


and C. Guthrie, Cell 71:803-817, 1992.] 
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FIGURE 29.39 The CTD: Coupling 
transcription to pre-mRNA 
processing. The transcription factor TFIIH 
phosphorylates the carboxyl-terminal 
domain (CTD) of RNA polymerase Il, 
signaling the transition from transcription 
initiation to elongation. The phosphorylated 
CTD binds factors required for pre-mRNA 
capping, splicing, and polyadenylation. 
These proteins are brought in close 
proximity to their sites of action on the 
nascent pre-mRNA as it is transcribed 
during elongation. [Information from P. A. 
Sharp, TIBS 30:279-281, 2005.] 


FIGURE 29.40 A splicing mutation that 
causes thalassemia. An A-to-G mutation 
within the first intron of the gene for the 
human hemoglobin B chain creates a new 5’ 
splice site (GU). Both 5’ splice sites are 
recognized by the U1 snRNP; thus splicing 
may sometimes create a normal mature 
mRNA and an abnormal mature MRNA that 
contains intron sequences. The normal 
mature mRNA is translated into a hemoglobin 
8 chain. Because it includes intron 
sequences, the abnormal mature MRNA now 
has a premature stop codon and is degraded. 
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the proteins having roles in RNA transcription and processing. 
The CTD contributes to efficient transcription by recruiting 
these proteins to the pre-mRNA (Figure 29.39), including: 


1. capping enzymes, which methylate the 5’ guanine on the 
pre-mRNA immediately after transcription begins; 


2. components of the splicing machinery, which initiate the 
excision of each intron as it is synthesized; and 


3. anendonuclease that cleaves the transcript at the poly(A) 
addition site, creating a free 3’-OH group that is the target 
for 3’ adenylation. 


These events take place sequentially, directed by the 
phosphorylation state of the CTD. 


Mutations that affect pre-mRNA splicing cause disease 


e Mutations in either the pre-mRNA (cis-acting) or the 
splicing factors (trans-acting) can cause defective pre- 
mRNA splicing. Mutations in the pre-mRNA cause some 
forms of thalassemia, a group of hereditary anemias charac- 
terized by the defective synthesis of hemoglobin (Section 
7.4). Cis-acting mutations that cause aberrant splicing can 
occur at the 5’ or 3’ splice sites in either of the two introns of 
the hemoglobin B chain or in its exons. The mutations usu- 
ally result in an incorrectly spliced pre-mRNA that, because 
of a premature stop codon, cannot encode a full-length pro- 
tein. The defective mRNA is normally degraded rather than 
translated. Mutations in the 5’ splice site may alter that site 
such that the splicing machinery cannot recognize it, forcing 
the machinery to find another 5’ splice site in the intron and 
introducing the potential for a premature stop codon. Mutations in the 
intron itself may create a new 5’ splice site; in this case, either one of the two 
splice sites may be recognized (Figure 29.40). Consequently, some normal 
protein can be made, and so the disease is less severe. Mutations affecting 
splicing have been estimated to cause at least 15% of all genetic diseases. 
Disease-causing mutations may also appear in splicing factors. Retinitis 
pigmentosa is a disease of acquired blindness, first described in 1857, with an 
incidence of 1/3500. About 5% of the autosomal dominant form of retinitis 
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Most human pre-mRNAs can be spliced in alternative ways 
to yield different proteins 


Alternative splicing is a widespread mechanism for generating protein diver- 
sity. Different combinations of exons from the same gene may be spliced into 
a mature RNA, producing distinct forms of a protein for specific tissues, 
developmental stages, or signaling pathways. What controls which splicing 
sites are selected? The selection is determined by the binding of trans-acting 
splicing factors to cis-acting sequences in the pre-mRNA. Most alternative 
splicing leads to changes in the coding sequence, resulting in proteins with 
different functions. Alternative splicing provides a powerful mechanism for 
expanding the versatility of genomic sequences through combinatorial control. 
Consider a gene with five positions at which splicing can take place. With the 
assumption that these alternative splicing pathways can be regulated inde- 
pendently, a total of 2° = 32 different mRNAs can be generated. 
Sequencing of the human genome has revealed that most pre-mRNAs are 
alternatively spliced, leading to a much greater number of proteins than would 
be predicted from the number of genes. An example of alternative splicing lead- 
ing to the expression of two different proteins, each in a different tissue, is pro- 
vided by the gene encoding both calcitonin and calcitonin-gene-related peptide 
(CGRP; Figure 29.41). In the thyroid gland, the inclusion of exon 4 in one 
splicing pathway produces calcitonin, a peptide hormone that regulates calctum 
and phosphorus metabolism. In neuronal cells, the exclusion of exon 4 in 
another splicing pathway produces CGRP, a peptide hormone that acts as a 
vasodilator. A single pre-mRNA thus yields two very different peptide hor- 
mones, depending on cell type. In this case, only two proteins result from alter- 
native splicing; however, in other cases, many more can be produced. An 
extreme example is the Drosophila pre-mRNA that encodes DSCAM, a neuro- 
nal protein affecting axon connectivity. Alternative splicing of this pre-mRNA 
has the potential to produce 38,016 different combinations of exons, a greater 
number than the total number of genes in the Drosophila genome. However, 
only a fraction of these potential mRNAs appear to be produced, owing to regu- 
latory mechanisms that are not yet well understood. Several human diseases 
that can be attributed to defects in alternative splicing are listed in Table 29.4. 


TABLE 29.4 Selected human diseases attributed to defects in alternative splicing 


Disorder Gene or its product 

Acute intermittent porphyria Porphobilinogen deaminase 

Breast and ovarian cancer BRCA1 

Cystic fibrosis CFTR 

Frontotemporal dementia T protein 

Hemophilia A Factor VIII 

HGPRT deficiency (Lesch-Nyhan syndrome) | Hypoxanthine-guanine phosphoribosyltransferase 
Leigh encephalomyelopathy Pyruvate dehydrogenase Ela 

Severe combined immunodeficiency Adenosine deaminase 

Spinal muscle atrophy SMMN1 or SMN2 


In thyroid 
cells 
A 


Calcitonin 


FIGURE 29.41 An example of alternative 


splicing. In human beings, two very 


different hormones are produced from a 


single calcitonin/CGRP pre-mRNA. 


Alternative splicing produces the mature 


mRNA for either calcitonin or CGRP 


(calcitonin-gene-related protein), depending 


on the cell type in which the gene is 
expressed. Each alternative transcript 
incorporates one of two alternative 
polyadenylation signals (A) present in 
the pre-mRNA. 


885 


886 


CHAPTER 29 RNA Synthesis 


and Processing 


Upstream 


exon “SS 


Intron 


Downstream 
exon 


al 


Further understanding of alternative splicing and the mechanisms of splice-site 
selection will be crucial to understanding how the proteome represented by the 
human genome is expressed. 


29.4 The Discovery of Catalytic RNA Was Revealing in 
Regard to Both Mechanism and Evolution 


RNAs form a surprisingly versatile class of molecules. As we have seen, 
splicing is catalyzed largely by RNA molecules, with proteins playing a 
secondary role. Another enzyme that contains a key RNA component is 
ribonuclease P, which catalyzes the maturation of tRNA by endonucleolytic 
cleavage of nucleotides from the 5’ end of the precursor molecule. Finally, 
as we shall see in Chapter 30, the RNA component of ribosomes is the cata- 
lyst that carries out protein synthesis. 

The versatility of RNA first became clear from observations of the 
processing of ribosomal RNA in a single-cell eukaryote. In Tetrahymena 
(a ciliated protozoan), a 414-nucleotide intron is removed from a 6.4-kb 
precursor to yield the mature 26S rRNA molecule (Figure 29.42). In an 
elegant series of studies of this splicing reaction, Thomas Cech and his 
coworkers established that the RNA spliced itself to precisely excise the 
intron. These remarkable experiments demonstrated that an RNA molecule 
can splice itself in the absence of protein. Indeed, the RNA alone is catalytic 
and, under certain conditions, is thus a ribozyme. More than 1500 similar 
introns have since been found in species as widely dispersed as bacteria and 
eukaryotes, though not in vertebrates. Collectively, they are referred to as 
group I introns. 

The self-splicing reaction in the group I intron requires an added guano- 
sine nucleotide. Nucleotides were originally included in the reaction mix- 
ture because it was thought that ATP or GTP might be needed as an energy 
source. Instead, the nucleotides were found to be necessary as cofactors. 
The required cofactor proved to be a guanosine unit, in the form of guano- 
sine, GMP, GDP, or GTP. G (denoting any one of these species) serves not 
as an energy source but as an attacking group that becomes transiently 
incorporated into the RNA (Figure 29.42). G binds to the RNA and then 
attacks the 5’ splice site to form a phosphodiester bond with the 5’ end of 


dl 


FIGURE 29.42 Self-splicing. A ribosomal RNA precursor from 
Tetrahymena, representative of the group | introns, splices itself 
in the presence of a guanosine cofactor (G, shown in green). A 
414-nucleotide intron (red) is released in the first splicing 

Spliced reaction. This intron then splices itself twice again to produce a 

el linear RNA that has lost a total of 19 nucleotides. This L19 RNA 
is catalytically active. [Information from T. Cech. RNA as an enzyme. 
Copyright © 1986 by Scientific American, Inc. All rights reserved.] 
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hk FIGURE 29.43 Structure of a self- 
splicing intron. The structure of a large 
fragment of the self-splicing intron from 
Tetrahymena reveals a complex folding 
pattern of helices and loops. Bases are 
shown in green, A; yellow, C; purple, G; 
and orange, U. [Drawn from 1GRZ.pdb]. 


the intron. This transesterification reaction generates a 3’-OH group at the 
end of the upstream exon. This newly attached 3’-OH group then attacks 
the 3’ splice site. This second transesterification reaction joins the two 
exons and leads to the release of the 414-nucleotide intron. 

Self-splicing depends on the structural integrity of the RNA precursor. 
Much of the group I intron is needed for self-splicing. This molecule, like 
many RNAs, has a folded structure formed by many double-helical stems 
and loops (Figure 29.43), with a well-defined pocket for binding the guano- 
sine. Examination of the three-dimensional structure of a catalytically active 
group I intron determined by x-ray crystallography reveals the coordination 
of magnesium ions in the active site analogous to that observed in protein 
enzymes such as DNA polymerase. 

Analysis of the base sequence of the rRNA precursor suggested that the 
splice sites are aligned with the catalytic residues by base-pairing between 
the internal guide sequence (IGS) in the intron and the 5’ and 3’ exons 
(Figure 29.44). The IGS first brings together the guanosine cofactor and the 
5’ splice site so that the 3’-OH group of G can nucleophilically attack the 
phosphorus atom at this splice site. The IGS then holds the downstream 
exon in position for attack by the newly formed 3’-OH group of the 
upstream exon. A phosphodiester bond is formed between the two exons, 
and the intron is released as a linear molecule. Like catalysis by protein 
enzymes, self-catalysis of bond formation and breakage in this rRNA pre- 
cursor is highly specific. 

The finding of enzymatic activity in the self-splicing intron and in the 
RNA component of RNase P has opened new areas of inquiry and changed 
the way in which we think about molecular evolution. As mentioned in an 
earlier chapter, the discovery that RNA can be a catalyst as well as an infor- 
mation carrier suggests that an RNA world may have existed early in the 
evolution of life, before the appearance of DNA and protein. 

Messenger RNA precursors in the mitochondria of yeast and fungi also 
undergo self-splicing, as do some RNA precursors in the chloroplasts of 
unicellular organisms such as Chlamydomonas. Self-splicing reactions can 
be classified according to the nature of the unit that attacks the upstream 
splice site. Group I self-splicing is mediated by a guanosine cofactor, as in 
Tetrahymena. The attacking moiety in group II splicing is the 2’-OH group 
of a specific adenylate of the intron (Figure 29.45). 

Group I and group II self-splicing resembles spliceosome-catalyzed 
splicing in two respects. First, in the initial step, a ribose hydroxyl group 
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FIGURE 29.44 Self-splicing mechanism. 
The catalytic mechanism of the group | 
intron includes a series of transesterification 
reactions. [Information from T. Cech. RNA as 
an enzyme. Copyright © 1986 by Scientific 
American, Inc. All rights reserved.] 
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FIGURE 29.45 Comparison of splicing pathways. The exons being joined are shown in 
blue and yellow and the attacking unit is shown in green. The catalytic site is formed by the 
intron itself (red) in group | and group II splicing. In contrast, the splicing of nuclear mRNA 
precursors is catalyzed by snRNAs and their associated proteins in the spliceosome. 
[Information from P. A. Sharp, Science 235:766-771, 1987.] 


attacks the 5’ splice site. The newly formed 3’-OH terminus of the upstream 
exon then attacks the 3’ splice site to form a phosphodiester bond with the 
downstream exon. Second, both reactions are transesterifications in which 
the phosphate moieties at each splice site are retained in the products. The 
number of phosphodiester bonds stays constant. Group II splicing is like 
the spliceosome-catalyzed splicing of mRNA precursors in several addi- 
tional ways. The attack at the 5’ splice site is carried out by a part of the 
intron itself (the 2’-OH group of adenosine) rather than by an external 
cofactor (G). In both cases, the intron is released in the form of a lariat. 
Moreover, in some instances, the group II intron is transcribed in pieces 
that assemble through hydrogen bonding to the catalytic intron, in a man- 
ner analogous to the assembly of the snRNAs in the spliceosome. 


Wa These similarities have led to the suggestion that the spliceosome- 
catalyzed splicing of mRNA precursors evolved from RNA-catalyzed 
self-splicing. Group II splicing may well be an intermediate between group 
I splicing and the splicing in the nuclei of higher eukaryotes. A major step in 
this transition was the transfer of catalytic power from the intron itself to other 
molecules. The formation of spliceosomes gave genes a new freedom because 
introns were no longer constrained to provide the catalytic center for splic- 
ing. Another advantage of external catalysts for splicing is that they can be 
more readily regulated. However, it is important to note that similarities do 


not establish ancestry. The similarities between group II introns and mRNA 
splicing may be a result of convergent evolution. Perhaps there are only a 
limited number of ways to carry out efficient, specific intron excision. The 
determination of whether these similarities stem from ancestry or from 
chemistry will require expanding our understanding of RNA biochemistry. 


SUMMARY 


29.1 


20.2 


RNA Polymerases Catalyze Transcription 

All cellular RNA molecules are synthesized by RNA polymerases 
according to instructions given by DNA templates. The activated 
monomer substrates are ribonucleoside triphosphates. The direction 
of RNA synthesis is 5’ —> 3’, asin DNA synthesis. RNA polymerases, 
unlike DNA polymerases, do not need a primer. 

RNA polymerase in E. coli is a multisubunit enzyme. The subunit 
composition of the ~500-kDa holoenzyme is a8’ wo and that of the 
core enzyme is a,88’w. Transcription is initiated at promoter sites 
consisting of two sequences, one centered near —10 and the other near 
—35; that is, 10 and 35 nucleotides away from the start site in the 5’ 
(upstream) direction. The consensus sequence of the —10 region is 
TATAAT. The o subunit enables the holoenzyme to recognize pro- 
moter sites. When the growth temperature is raised, E. coli expresses 
a special o subunit that selectively binds the distinctive promoter of 
heat-shock genes. RNA polymerase must unwind the template double 
helix for transcription to take place. Unwinding exposes some 17 bases 
on the template strand and sets the stage for the formation of the first 
phosphodiester bond. Newly synthesized RNA chains usually start 
with pppG or pppA. The o subunit usually dissociates from the holo- 
enzyme after the initiation of the new chain. 

Elongation takes place at transcription bubbles that move along the 
DNA template at a rate of about 50 nucleotides per second. RNA 
polymerase occasionally backtracks, a process that can facilitate proof- 
reading of the RNA transcript. The nascent RNA chain contains stop 
signals that end transcription. One stop signal is an RNA hairpin, 
which is followed by several U residues. A different stop signal is read 
by the rho protein, an ATPase. Some genes are regulated by ribo- 
switches, structures that form in RNA transcripts and bind specific 
metabolites. In E. coli, precursors of transfer RNA and ribosomal 
RNA are cleaved and chemically modified after transcription, whereas 
messenger RNA is used unchanged as a template for protein synthesis. 


Transcription in Eukaryotes Is Highly Regulated 

RNA synthesis in eukaryotes takes place in the nucleus, whereas protein 
synthesis takes place in the cytoplasm. There are three types of RNA 
polymerase in the nucleus: RNA polymerase I makes ribosomal RNA 
precursors, II makes messenger RNA precursors, and III makes transfer 
RNA precursors. Eukaryotic promoters are complex, being composed 
of several different elements. Promoters for RNA polymerase II may be 
located on the 5’ side or the 3’ side of the start site for transcription. A 
common type of eukaryotic promoter consists of a TATA box centered 
between —30 and —100 and paired with an initiator element. Eukaryotic 
promoter elements are recognized by proteins called transcription fac- 
tors rather than by RNA polymerase II. The saddle-shaped TATA- 
box-binding protein unwinds and sharply bends DNA at TATA-box 
sequences and serves as a focal point for the assembly of transcription 
complexes. The TATA-box-binding protein initiates the assembly of 
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KEY TERMS 


29.3 


29.4 


the active transcription complex. The activity of many promoters is 
greatly increased by enhancer sequences that have no promoter activity 
of their own. Enhancer sequences can act over distances of several kilo- 
bases, and they can be located either upstream or downstream of a gene. 


The Transcription Products of Eukaryotic Polymerases Are Processed 
The 5' ends of mRNA precursors become capped and methylated in 
the course of transcription. A 3’ poly(A) tail is added to most mRNA 
precursors after the nascent chain has been cleaved by an endonucle- 
ase. RNA editing alters the nucleotide sequence of some mRNAs, 
such as the one for apolipoprotein B. 

The splicing of mRNA precursors is carried out by spliceosomes, 
which consist of small nuclear ribonucleoprotein particles. Splice sites in 
mRNA precursors are specified by sequences at ends of introns and by 
branch sites near their 3’ ends. The 2’-OH group of an adenosine residue 
in the branch site attacks the 5’ splice site to form a lariat intermediate. 
The newly generated 3’-OH terminus of the upstream exon then attacks 
the 3’ splice site to become joined to the downstream exon. Splicing thus 
consists of two transesterification reactions, with the number of phos- 
phodiester bonds remaining constant during reactions. Small nuclear 
RNAs in spliceosomes catalyze the splicing of mRNA precursors. In 
particular, U2 and U6 snRNAs form the active centers of spliceosomes. 

The events in posttranscriptional processing of mRNA are con- 
trolled by the phosphorylation state of the carboxy-terminal domain, 
part of RNA polymerase II. 


The Discovery of Catalytic RNA Was Revealing in Regard to Both 
Mechanism and Evolution 

Some RNA molecules, such as those containing the group I intron, 
undergo self-splicing in the absence of protein. A self-modified version 
of this rRNA intron displays true catalytic activity and is thus a ribo- 
zyme. Spliceosome-catalyzed splicing may have evolved from self- 
splicing. The discovery of catalytic RNA has opened new vistas in our 
exploration of early stages of molecular evolution and the origins of life. 
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1. Complements. The sequence of part of an mRNA is 
5 '-AUGGGGAACAGCAAGAGUGGGGCCCUGUCCAAGGAG-3’ 
What is the sequence of the DNA coding strand? Of the 


DNA template strand? 


2. Checking for errors. Why is RNA synthesis not as care- 
fully monitored for errors as is DNA synthesis? 


3. Speed is not of the essence. Why is it advantageous for 
DNA synthesis to be more rapid than RNA synthesis? 


4. Active sites. The overall structures of RNA polymerase 
and DNA polymerase are very different, yet their active 
sites show considerable similarities. What do the similari- 
ties suggest about the evolutionary relationship between 
these two important enzymes? 


5. Potent inhibitor. Heparin inhibits transcription by bind- 
ing to RNA polymerase. What properties of heparin allow 
it to bind so effectively to RNA polymerase? 


6. A loose cannon. Sigma protein by itself does not bind to 
promoter sites. Predict the effect of a mutation enabling o 
to bind to the —10 region in the absence of other subunits 
of RNA polymerase. 


7. Stuck sigma. What would be the likely effect of a muta- 
tion that prevents o from dissociating from the RNA poly- 
merase core? 


8. Transcription time. What is the minimum length of time 
required for the synthesis by E. coli polymerase of an 
mRNA encoding a 100-kDa protein? 


9. Rapid search. RNA polymerase finds promoter sites 
very rapidly. The observed rate constant for the binding of 
the RNA polymerase holoenzyme to promoter sequences is 
10'° M~'s~!. The rate constant for two macromolecules 
encountering each other is typically 10° M~'s~!. Propose 
an explanation for the 100-fold larger rate for a protein 
finding a particular site along a DNA molecule. 


10. Where to begin? Identify the likely transcription start 
site in the following DNA sequence: 


5'-GCCGTTGACACCGTTCGGCGATCGATCCGCTATAATGTGTGGATCCGCTT-3' 
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(111) strands 2 and 3, (iv) strands 1, 2, and 3, and (v) strands 
1, 2, and 3 and core RNA polymerase. 


(b) What is the likely effect of rifampicin on RNA synthe- 
sis in this system? 


(c) Heparin blocks elongation of the RNA primer if it is 
added to core RNA polymerase before the onset of tran- 
scription but not if added after transcription starts. Account 
for this difference. 


(d) Suppose that synthesis is carried out in the presence of 
ATP, CTP, and UTP. Compare the length of the longest 
product obtained with that expected when all four ribonu- 
cleoside triphosphates are present. 


13. Proofreading marks. The major products of proofread- 
ing by RNA polymerase are dinucleotides rather than 
mononucleotides. Why? 


14. Abortive cycling. Di- and trinucleotides are occasionally 
released from RNA polymerase at the very start of tran- 
scription, a process called abortive cycling. This process 
requires the restart of transcription. Suggest a plausible 
explanation for abortive cycling. 


15. Polymerase inhibition. Cordycepin inhibits poly(A) 
synthesis at low concentrations and RNA synthesis at 
higher concentrations. 


NH> 
ZN 
CTS 

' = 


HO 


3’-CGGCAACTGTGGCAAGCCGCTAGCTAGGCGATATTACACACCTAGGCGAA-5' 


11. Between bubbles. How far apart are transcription bubbles 
on E. coli genes that are being transcribed at a maximal rate? 


12. A revealing bubble. Consider the synthetic RNA-DNA 
transcription bubble illustrated here. Refer to the coding 
DNA strand, the template strand, and the RNA strand as 
strands 1, 2, and 3, respectively. 


(1) DNA Coding strand acACGGc, 
G A 


5'-GGATACT TACAGCCAT MPRETCCATT.3 
3'-CCTATGAATGTCGGTACCTGTGCCGCT TATGAGGTAA...5’ 


G 


(2) Template strand A ai cai 


U 

U 
seu 
(3) RNA strand 


(a) Suppose that strand 3 is labeled with *?P at its 5’ end 
and that polyacrylamide gel electrophoresis is carried out 
under nondenaturing conditions. Predict the autoradio- 
graphic pattern for (i) strand 3 alone, (ii) strands 1 and 3, 


HOH 


Cordycepin (3'-deoxyadenosine) 


(a) What is the basis of inhibition by cordycepin? 


(b) Why is poly(A) synthesis more sensitive than the syn- 
thesis of other RNAs to the presence of cordycepin? 


(c) Does cordycepin need to be modified to exert its effect? 


16. Alternative splicing. A gene contains eight sites where 
alternative splicing is possible. Assuming that the splicing 
pattern at each site is independent of that at all other sites, 
how many splicing products are possible? 


17. Supercoiling. Negative supercoiling of DNA favors 
the transcription of genes because it facilitates unwinding. 
However, not all promoter sites are stimulated by nega- 
tive supercoiling. The promoter site for topoisomerase II 
itself is a noteworthy exception. Negative supercoiling 
decreases the rate of transcription of this gene. Propose a 
possible mechanism for this effect and suggest a reason 
why it may occur. 
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18. An extra piece. In one type of mutation leading to a 
form of thalassemia, the mutation of a single base (G to A) 
generates a new 3’ splice site (blue in the illustration below) 
akin to the normal one (yellow) but farther upstream. 


Normal 3’ end 


yr of intron 


5’ CCTATTGGTCTATTTTCCACCCTTAGGCTGCTG3' 


5’ CCTATTAGTCTATTTTCCACCCTTAGGCTGCTG3' 


What is the amino acid sequence of the extra segment of 
protein synthesized in a thalassemic patient having a muta- 
tion leading to aberrant splicing? The reading frame after 
the splice site begins with TCT. 


19. A long-tailed messenger. Another thalassemic patient 
had a mutation leading to the production of an mRNA for 
the B chain of hemoglobin that was 900 nucleotides longer 
than the normal one. The poly(A) tail of this mutant 
mRNA was located a few nucleotides after the only 
AAUAAA sequence in the additional sequence. Propose a 
mutation that would lead to the production of this altered 
mRNA. 


Mechanism Problem 


20. RNA editing. Many uridine molecules are inserted into 
some mitochondrial mRNAs in trypanosomes. The uridine 
residues come from the poly(U) tail of a donor strand. 
Nucleoside triphosphates do not participate in this reaction. 


Propose a reaction mechanism that accounts for these find- 
ings. (Hint: Relate RNA editing to RNA splicing.) 


Chapter Integration Problems 


21. Proteome complexity. What processes considered in this 
chapter make the proteome more complex than the genome? 
What processes might further enhance this complexity? 


22. Separation technique. Suggest a means by which you 
could separate mRNA from the other types of RNA in a 
eukaryotic cell. 


Data Interpretation Problems 


23. Run-off experiment. Nuclei were isolated from brain, 
liver, and muscle. The nuclei were then incubated with a- 
[°27P]UTP under conditions that allow RNA synthesis, 
except that an inhibitor of RNA initiation was present. The 


radioactive RNA was isolated and annealed to various 
DNA sequences that had been attached to a gene chip. 
In the adjoining graphs, the intensity of the shading indi- 
cates roughly how much mRNA was attached to each DNA 
sequence. 


Liver Muscle Brain 


(a) Why does the intensity of hybridization differ between 
genes? 


(b) What is the significance of the fact that some of the 
RNA molecules display different hybridization patterns in 
different tissues? 


(c) Some genes are expressed in all three tissues. What 
would you guess is the nature of these genes? 


(d) Suggest a reason why an initiation inhibitor was 
included in the reaction mixture. 


24. Christmas trees. The adjoining autoradiograph 
depicts several bacterial genes undergoing transcription. 
Identify the DNA. What are the strands of increasing 
length? Where is the beginning of transcription? The 
end of transcription? On the page, what is the direction 
of RNA synthesis? What can you conclude about the 
number of enzymes participating in RNA synthesis on a 
given gene? 


[Thomas Broker/Phototake.] 
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The ribosome, shown at the right, is a factory for the manufacture of polypeptides. Amino 
acids are carried into the ribosome, one at a time, connected to transfer RNA molecules. 
Each amino acid is joined to the growing polypeptide chain, which detaches from the 
ribosome only after the polypeptide has been completed. This assembly-line approach 
allows even very long polypeptide chains to be assembled rapidly and with impressive 
accuracy. [(Left) Images of Birmingham Premium/Alamy.] 


he genes that encode proteins are key components of the genetic informa- 

tion because proteins play most of the functional roles in cells. In 
Chapters 28 and 29, we examined how DNA is replicated and transcribed 
into RNA. We now turn to the mechanism of protein synthesis, a process 
called translation because the four-letter alphabet of nucleic acids is translated 
into the entirely different twenty-letter alphabet of proteins. Translation is a 
conceptually more complex process than either replication or transcription, 
both of which take place within the framework of a common base-pairing lan- 
guage. As befits its position linking the nucleic acid and protein languages, the 
process of protein synthesis critically depends on both nucleic acid and protein 
factors. Protein synthesis takes place on ribosomes—enormous complexes con- 
taining three large RNA molecules and more than 50 proteins. Interestingly, 
the ribosome is a ribozyme; that is, the RNA components catalyze protein syn- 
thesis. This observation strongly supports the notion that life evolved through 
an RNA world, and the ribosome is a surviving inhabitant of that world. 

Transfer RNA molecules (tRNAs) and messenger RNA (mRNA) also 
are key participants in protein synthesis. The link between amino acids and 
nucleic acids is first made by enzymes called aminoacyl-tRNA synthetases. 
By specifically linking a particular amino acid to each tRNA, these enzymes 
translate the genetic code. 

Although RNA is paramount in the process of translation, protein factors 
also are required for the efficient synthesis of a protein. Protein factors par- 
ticipate in the initiation, elongation, and termination of protein synthesis. 


OUTLINE 


30.1 Protein Synthesis Requires the 
Translation of Nucleotide 
Sequences into Amino Acid 
Sequences 


30.2 Aminoacyl Transfer RNA 
Synthetases Read the Genetic 
Code 


30.3 The Ribosome Is the Site of 
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FIGURE 30.1 Polypeptide-chain growth. 
Proteins are synthesized by the successive 
addition of amino acids to the carboxyl 
terminus. 


TABLE 30.1 Accuracy of protein synthesis 


Probability of synthesizing 
an error-free protein 


This chapter focuses primarily on protein synthesis in bacteria because it 
illustrates many general principles and is well understood. Some distinctive 
features of protein synthesis in eukaryotes also are presented. 


30.1 Protein Synthesis Requires the Translation of 
Nucleotide Sequences into Amino Acid Sequences 


The basics of protein synthesis are the same across all kingdoms of life— 
evidence that the protein-synthesis system arose very early in evolution. An 
mRNA is decoded, or read, in the 5’-to-3’ direction, one codon at a time, 
and the corresponding protein is synthesized in the amino-to-carboxyl 
direction by the sequential addition of amino acids to the carboxyl end of 
the growing peptide chain (Figure 30.1). The amino acids arrive at the 
growing chain in activated form as aminoacyl-tRNAs, created by joining 
the carboxyl group of an amino acid to the 3’ end of atRNA molecule. The 
linking of an amino acid to its corresponding tRNA is catalyzed by an 
aminoacyl-tRNA synthetase. For each amino acid, there is usually one acti- 
vating enzyme and at least one kind of tRNA. 


c—O 
/ => 
f * a 
tRNA, tRNA2 tRNA; 


The synthesis of long proteins requires a low error frequency 


The process of transcription is analogous to copying, word for word, a page of 
a book. There is no change of alphabet or vocabulary; so the likelihood of a 
change in meaning is small. Translating the base sequence of an mRNA mole- 
cule into a sequence of amino acids is analogous to translating the page of a book 
into another language. Translation is a complex process, entailing many steps 
and dozens of molecules. The potential for error exists at each step. The com- 
plexity of translation creates a conflict between two require- 
ments: the process must be both accurate and fast enough to 
meet a cell’s needs. In E. coli, translation can take place at a 
rate of 20 amino acids per second, a truly impressive speed 
considering the complexity of the process. 


are Number of amino acid residues How accurate must protein synthesis be? Let us con- 
Frequency of inserting ‘a Th. babili ve ‘ : “th 
an incorrect'amino acid 100 300 1000 sider error rates. 1 he probability o orming a protein wit 
no errors depends on the number of amino acid residues 
10-2 0.366 0.049 0.000 : : : : 
4 and on the frequency () of insertion of a wrong amino acid. 
10 0.905 0.741 0.368 ae 
ie au6n ane aane As Table 30.1 shows, an error frequency of 10~“ is intoler- 
= ee 
exe en oan 0.999 able, even for quite small proteins. An é value of 107° usu 


ally leads to the error-free synthesis of a 300-residue protein 


Note: The probability p of forming a protein with no errors depends on n, the (~33 kDa) but not of a 1000-residue protein (~ 110 kDa) 


number of amino acids, and ¢, the frequency of insertion of a wrong amino 


acid: p = (1 — e)”. 


Thus, the error frequency must not exceed approximately 


10~* to produce the larger proteins effectively. Lower error frequencies are 
conceivable; however, except for the largest proteins, they will not dramati- 
cally increase the percentage of proteins with accurate sequences. In addition, 
such lower error rates are likely to be possible only by a reduction in the rate 
of protein synthesis because additional time for proofreading is required. In 
fact, the observed values of & are close to 10~*. An error frequency of about 
10~* per amino acid residue was selected in the course of evolution to accu- 
rately produce proteins consisting of as many as 1000 amino acids while 
maintaining a remarkably rapid rate for protein synthesis. 


Transfer RNA molecules have a common design 


The fidelity of protein synthesis requires accurate recognition of three-base 
codons on messenger RNA. Recall that the genetic code relates each amino 
acid to a three-letter codon (Section 4.6). An amino acid cannot itself recog- 
nize a codon. Consequently, an amino acid is attached to a specific tRNA 
molecule that recognizes the codon by Watson—Crick base-pairing. Transfer 
RNA serves as the adapter molecule that binds to a specific codon and brings 
with it an amino acid for incorporation into the polypeptide chain. 

Consider yeast alanyl-tRNA, so called because it will carry the amino 
acid alanine. This adapter molecule is a single chain of 76 ribonucleotides 
(Figure 30.2). The 5’ terminus is phosphorylated (pG), whereas the 3’ ter- 
minus has a free hydroxyl group. The amino acid-attachment site is the 
3'-hydroxyl group of the adenosine residue at the 3’ terminus of the mole- 
cule. The sequence 5’-I[GC-3’ in the middle of the molecule is the anticodon, 
where I is the purine base inosine. It is complementary to 5’-GCC-3’, one 
of the codons for alanine. 

Thousands of tRNA sequences are known. The striking finding is that 
all of them can be arranged in a cloverleaf pattern in which about half the 
residues are base-paired (Figure 30.3). Hence, tRNA molecules have many 
common structural features. This finding is not unexpected, because all 
tRNA molecules must be able to interact in nearly the same way with the 
ribosomes, mRNAs, and protein factors that participate in translation. 
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FIGURE 30.2 Alanyl-tRNA sequence. The base sequence of yeast alanyl- 
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tRNA and the deduced cloverleaf secondary structure are shown. Modified loop 

nucleosides are abbreviated as follows: methylinosine (ml), dihydrouridine 

(UH,), ribothymidine (1), pseudouridine (i), methylguanosine (mG), and FIGURE 30.3 General structure of tRNA molecules. 
(dimethylguanosine (m,G). Inosine (|), another modified nucleoside, is Comparison of the base sequences of many tRNAs reveals 


part of the anticodon. a number of conserved features. 
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1. Each is a single chain containing between 73 and 93 ribonucleotides 
(~25 kDa). 


2. The molecule is L-shaped (Figure 30.4). 


3. They contain many unusual bases, typically between 7 and 15 per molecule. 


NH Some of these bases are methylated or dimethylated derivatives of A, U, C, 
CH; and G formed by enzymatic modification of a precursor tRNA. Some meth- 
N~ ylations prevent the formation of certain base pairs, thereby rendering such 
os | bases accessible for interactions with other bases. In addition, methylation 
fe) N H imparts a hydrophobic character to some regions of tRNAs, which may be 
| important for their interaction with synthetases and ribosomal proteins. Other 

ribose ribose : ‘ a ‘ . 

5-Methylcytidine piivarourtding modifications alter codon recognition, as will be described shortly. 
(mc) (UH) 


4. About half the nucleotides in tRNAs are base-paired to form double 
helices. The four helical regions are arranged to form two apparently con- 
tinuous segments of double helix. These segments are like A-form DNA, as 
expected for an RNA helix (Section 4.2). One helix, containing the 5’ and 
3’ ends, runs horizontally in the model shown in Figure 30.5. The other 
helix, which contains the anticodon and runs vertically in Figure 30.5, 
forms the other arm of the L. 


Five groups of bases are not base-paired in this way: the 3’ CCA terminal 
region, which is part of a region called the acceptor stem; the TysC loop, 
which acquired its name from the sequence ribothymine-pseudouracil- 
cytosine; the “extra arm,” which contains a variable number of residues; the 
DHU loop, which contains several dihydrouracil residues; and the anticodon 
loop. Most of the bases in the nonhelical regions participate in hydrogen- 
bonding interactions, even if the interactions are not like those in Watson— 
Crick base pairs. The structural diversity generated by this combination of 
helices and loops containing modified bases ensures that the tRNAs can be 
uniquely distinguished, though they are structurally similar overall. 


5. The 5’ end of a tRNA is phosphorylated. The 5’ terminal residue is 
usually pG. 


6. An activated amino acid is attached to a hydroxyl group of the adenosine 
residue in the amino acid-attachment site, located at the end of the 3’ CCA 


TyC loop 


terminus 


Anticodon 
loop 
hk FIGURE 30.4 Transfer RNA structure. Notice the 
c L-shaped structure revealed by this skeletal model of yeast FIGURE 30.5 Helix stacking in tRNA. 
Anticodon _ Phenylalanyl-tRNA. The CCA region is at the end of one The four double-stranded regions of the 
oa loop arm, and the anticodon loop is at the end of the tRNA (Figure 30.3) stack to form an L-shaped 
other. [Drawn from 1EHZ.pdb.] structure. [Drawn from 1EHZ.pdb.] 


component of the acceptor stem (Figure 30.6). This single-stranded region 
can change conformation in the course of amino acid activation and protein 
synthesis. 


7. The anticodon loop, which is present in a loop near the center of the 
sequence, is at the other end of the L, making accessible the three bases that 
constitute the anticodon. 


Thus, the architecture of the tRNA molecule is well suited to its role as 
adaptor: the anticodon is available to interact with an appropriate codon on 
mRNA while the end that is linked to an activated amino acid is well posi- 
tioned to participate in peptide-bond formation. 


Some transfer RNA molecules recognize more than one codon 
because of wobble in base-pairing 


What are the rules that govern the recognition of a codon by the anticodon 
of atRNA? A simple hypothesis is that each of the bases of the codon forms 
a Watson—Crick type of base pair with a complementary base on the anti- 
codon of the tRNA. The codon and anticodon would then be lined up in an 
antiparallel fashion. In the diagram in the margin, the prime denotes the 
complementary base. Thus X and X' would be either A and U (or U and A) 
or G and C (or C and G). According to this model, a particular anticodon 
can recognize only one codon. 

The facts are otherwise. As found experimentally, some tRNA molecules 
can recognize more than one codon. For example, the yeast alanyl-tRNA binds 
to three codons: GCU, GCC, and GCA. The first two bases of these codons 
are the same, whereas the third is different. Could it be that recognition of the 
third base of a codon is sometimes less discriminating than recognition of the 
other two? The pattern of degeneracy of the genetic code indicates that it 
might be so. XYU and XYC always encode the same amino acid; XYA and 
XYG usually do. These data suggest that the steric criteria might be less strin- 
gent for pairing of the third base than for the other two. In other words, there 
is some steric freedom (“wobble”) in the pairing of the third base of the codon. 

The wobble hypothesis is now firmly established (‘Table 30.2). The anti- 
codons of tRNAs of known sequence bind to the codons predicted by this 
hypothesis. For example, the anticodon of yeast alanyl-tRNA is IGC. This 
tRNA recognizes the codons GCU, GCC, and GCA. Recall that, by con- 
vention, nucleotide sequences are written in the 5’ —> 3’ direction unless 
otherwise noted. Hence, I (the 5’ base of this anticodon) pairs with U, C, or 
A (the 3’ base of the codon), as predicted. 
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FIGURE 30.6 Aminoacyl-tRNA. Amino 
acids are coupled to tRNAs through ester 
linkages to either the 2’- or the 3’-hydroxyl 
group of the 3’-adenosine residue. A 
linkage to the 3’-hydroxyl group is shown. 


TABLE 30.2 Allowed pairings at the 


third base of the codon 
according to the wobble 


hypothesis 
First base Third base 
of anticodon of codon 
Cc G 
A U 
U AorG 
G UorC 
I U,C,orA 
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16S RNA A1493 


Anticodon A36 Codon U1 


FIGURE 30.7 16S rRNA monitors base- 
pairing between the codon and the 
anticodon. Adenine 1493, one of three 
universally conserved bases in 16S rRNA, 
forms hydrogen bonds with the bases in 
both the codon and the anticodon only if 
the codon and anticodon are correctly 
paired. [Information from J. M. Ogle and 

V. Ramakrishnan, Annu. Rev. Biochem. 74: 
129-177, 2005, Fig. 2a.] 


Two generalizations concerning the codon—anticodon interaction can 
be made: 


1. The first two bases of a codon pair in the standard way. Recognition is 
precise. Hence, codons that differ in either of their first two bases must be 
recognized by different tRNAs. For example, both UUA and CUA encode 
leucine but are read by different tRNAs. 


2. The first base of an anticodon determines whether a particular tRNA 
molecule reads one, two, or three kinds of codons: C or A (one codon), U or 
G (two codons), or I (three codons). Thus, part of the degeneracy of the 
genetic code arises from imprecision (wobble) in the pairing of the third base of 
the codon with the first base of the anticodon. We see here a strong reason for 
the frequent appearance of inosine, one of the unusual nucleosides, in anti- 
codons. Inosine maximizes the number of codons that can be read by a particu- 
lar tRNA molecule. The inosine bases in tRNA are formed by the deamina- 
tion of adenosine after the synthesis of the primary transcript. 


Why is wobble tolerated in the third position of the codon but not in the 
first two? This question is answered by considering the interaction of the 
tRNA with the ribosome. As we will see, ribosomes are huge RNA~—protein 
complexes consisting of two subunits, the 305 and 505 subunits. The 305 
subunit has an RNA molecule, the 165 rRNA, that has three universally 
conserved bases—adenine 1492, adenine 1493, and guanine 530—that form 
hydrogen bonds on the minor-groove side but only with correctly formed 
base pairs of the codon—anticodon duplex (Figure 30.7). These interactions 
check whether Watson—Crick base pairs are present in the first two posi- 
tions of the codon—anticodon duplex, but not the third position, so more- 
varied base pairs are tolerated. Thus, the ribosome plays an active role in 
decoding the codon—anticodon interactions. 


30.2 Aminoacyl Transfer RNA Synthetases Read 
the Genetic Code 


Before codon and anticodon meet, the amino acids required for protein syn- 
thesis must first be attached to specific tRNA molecules, linkages that are 
crucial for two reasons. First, the attachment of a given amino acid to a par- 
ticular tRNA establishes the genetic code. When an amino acid has been 
linked to a tRNA, it will be incorporated into a growing polypeptide chain 
at a position dictated by the anticodon of the tRNA. Second, because the 
formation of a peptide bond between free amino acids is not thermodynamically 
favorable, the amino acid must first be activated in order for protein synthesis 
to proceed. The activated intermediates in protein synthesis are amino acid 
esters, in which the carboxyl group of an amino acid is linked to either the 
2'- or the 3'-hydroxyl group of the ribose unit at the 3’ end of tRNA. An 
amino acid ester of tRNA is called an aminoacyl-tRNA or sometimes a 
charged tRNA (Figure 30.6). For a specific amino acid attached to its cog- 
nate tRNA—for instance, threonine—the charged tRNA is designated 
Thr-tRNA™, 


Amino acids are first activated by adenylation 


The activation reaction is catalyzed by specific aminoacyl-tRNA synthetases. 
The first step is the formation of an aminoacyl adenylate from an amino acid 
and ATP. 


Amino acid + ATP == aminoacyl-AMP + PP; 


This activated species is a mixed anhydride in which the carboxyl group of 
the amino acid is linked to the phosphoryl group of AMP; hence, it is also 
known as aminoacyl-AMP. 


7 <P 
*H3N ~ v adenine 
af ow No O 
R 


HO OH 
Aminoacyl adenylate 


The next step is the transfer of the aminoacyl group of aminoacyl-AMP to 
a particular tRNA molecule to form aminoacyl-tRNA. 


Aminoacyl-AMP + tRNA = — aminoacyl-tRNA + AMP 
The sum of these activation and transfer steps is 
Amino acid + ATP + tRNA = — aminoacyl-tRNA + AMP + PP; 


The AG’ of this reaction is close to 0, because the free energy of hydrol- 
ysis of the ester bond of aminoacyl-tRNA is similar to that for the hydrolysis 
of ATP to AMP and PP.. As we have seen many times, the reaction is driven 
by the hydrolysis of pyrophosphate. The sum of these three reactions is 
highly exergonic: 


Amino acid + ATP + tRNA + H,O ——> 
aminoacyl-tRNA + AMP + 2 P. 


Thus, the equivalent of two molecules of ATP is consumed in the synthesis of 
each aminoacyl-tRNA. One of them is consumed in forming the ester link- 
age of aminoacyl-tRNA, whereas the other is consumed in driving the reac- 
tion forward. 

The activation and transfer steps for a particular amino acid are cata- 
lyzed by the same aminoacyl-tRNA synthetase. Indeed, the aminoacyl- 
AMP intermediate does not dissociate from the synthetase. Rather, it is tightly 
bound to the active site of the enzyme by noncovalent interactions. 

We have already encountered an acyl adenylate intermediate in fatty acid 
activation (Section 22.2). The major difference between these reactions is 
that the acceptor of the acyl group is CoA in fatty acid activation and tRNA 
in amino acid activation. The energetics of these biosyntheses are very simi- 
lar: both are made irreversible by the hydrolysis of pyrophosphate. 


Aminoacyl-tRNA synthetases have highly discriminating 
amino acid activation sites 


Each aminoacyl-tRNA synthetase is highly specific for a given amino acid. 
Indeed, a synthetase will incorporate the incorrect amino acid only once in 
10* or 10° reactions. How is this level of specificity achieved? Each amino- 
acyl-tRNA synthetase takes advantage of the properties of its amino acid 
substrate. Let us consider the challenge faced by threonyl-tRNA synthe- 
tase. Threonine is especially similar to two other amino acids—namely, 
valine and serine. Valine has almost exactly the same shape as that of threo- 
nine, except that valine has a methyl group in place of a hydroxyl group. 
Serine has a hydroxyl group, as does threonine, but lacks the methyl group. 
How can the threonyl-tRNA synthetase avoid coupling these incorrect 
amino acids to threonyl-tRNA? 
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Activation 
site 


hk FIGURE 30.9 Editing site. 
Mutagenesis studies revealed the position of 
the editing site (shown in green) in threonyl- 
tRNA synthetase. The activation site is shown 
in yellow. Only one subunit of the dimeric 
enzyme is shown here and in subsequent 
illustrations. [Drawn from 1QF6.pdb.] 


Editing site 


Activation site 


FIGURE 30.10 Editing of aminoacyl-tRNA. 
The flexible CCA arm of an aminoacyl-tRNA 
can move the amino acid between the 
activation site and the editing site. If the 
amino acid fits well into the editing site, the 
amino acid is removed by hydrolysis. 
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FIGURE 30.8 Active site of threonyl- ~ 
tRNA synthetase. Notice that the amino His 

acid-binding site includes a zinc ion (green 
ball) that binds threonine through its amino 
and hydroxyl groups. O 


The structure of the amino acid-binding site of threonyl-tRNA synthetase 
reveals how valine is avoided (Figure 30.8). The synthetase contains a zinc 
ion, bound to the enzyme by two histidine residues and one cysteine residue. 
The remaining coordination sites are available for substrate binding. 
Threonine binds to the zinc ion through its amino group and its side-chain 
hydroxyl group. The side-chain hydroxy] group is further recognized by an 
aspartate residue that hydrogen bonds to it. The methyl group present in 
valine in place of this hydroxyl group cannot participate in these interactions; 
it is excluded from this active site and, hence, does not become adenylated and 
transferred to threonyl-tRNA (abbreviated tRNA!" ). The use of a zinc ion 
appears to be unique to threonyl-tRNA synthetase; other aminoacyl-tRNA 
synthetases have different strategies for recognizing their cognate amino 
acids. The carboxylate group of the correctly positioned threonine is available 
to attack the a phosphoryl group of ATP to form the aminoacy] adenylate. 

The zinc site is less able to discriminate against serine because this 
amino acid does have a hydroxyl group that can bind to the zinc ion. Indeed, 
with only this mechanism available, threonyl-tRNA synthetase does mis- 
takenly couple serine to threonyl-tRNA at a rate 10-7 to 103 times that for 
threonine. As noted on page 894, this error rate is likely to lead to many 
translation errors. How is a higher level of specificity achieved? 


Proofreading by aminoacyl-tRNA synthetases increases 
the fidelity of protein synthesis 


Threonyl-tRNA synthetase can be incubated with tRNA™™ that has been 
covalently linked with serine (Ser-tRNA!"); the tRNA has been “mis- 
charged.” The reaction is immediate: a rapid hydrolysis of the aminoacyl- 
tRNA forms serine and free tRNA. In contrast, incubation with correctly 
charged Thr-tRNA!"* results in no reaction. Thus, threonyl-tRNA syn- 
thetase contains an additional functional site that hydrolyzes Ser-tRNA 
but not Thr-tRNA™". This editing site provides an opportunity for the 
synthetase to correct its mistakes and improve its fidelity to less than one 
mistake in 10+. The results of structural and mutagenesis studies revealed 
that the editing site is more than 20 A from the activation site (Figure 30.9). 
This editing site readily accepts and cleaves Ser-tRNA!*" but does not 
cleave Thr-tRNA™". The discrimination of serine from threonine is easy 
because threonine contains an extra methyl group; a site that conforms to 
the structure of serine will sterically exclude threonine. The structure of the 
complex between threonyl-tRNA synthetase and its substrate reveals that 
the aminoacylated CCA can swing out of the activation site and into the 
editing site (Figure 30.10). Thus, the aminoacyl-tRNA can be edited 


without dissociating from the synthetase. This proofreading 
depends on the conformational flexibility of a short stretch of 
polynucleotide sequence. 

Most aminoacyl-tRNA synthetases contain editing sites in 
addition to activation sites. These complementary pairs of sites 
function as a double sieve to ensure very high fidelity. In general, the 
acylation site rejects amino acids that are larger than the correct one 
because there is insufficient room for them, whereas the hydrolytic 
site cleaves activated species that are smaller than the correct one. 

A few synthetases achieve high accuracy without editing. For 
example, tyrosyl-tRNA synthetase has no difficulty discrimi- 
nating between tyrosine and phenylalanine; the hydroxyl group 
on the tyrosine ring enables tyrosine to bind to the enzyme 10+ 
times as strongly as phenylalanine. Proofreading has been selected 
in evolution only when fidelity must be enhanced beyond what can 
be obtained through an initial binding interaction. 


Synthetases recognize various features of transfer 
RNA molecules 


How do synthetases choose their tRNA partners? This enor- 
mously important step is the point at which “translation” takes 
place—at which the correlation between the amino acid and the 
nucleic acid worlds is made. In a sense, aminoacyl-tRNA synthe- 
tases are the only molecules in biology that “know” the genetic 
code. Their precise recognition of tRNAs is as important for 
high-fidelity protein synthesis as is the accurate selection of 
amino acids. In general, tRNA recognition by the synthetase is 
different for each synthetase and tRNA pair. Consequently, gen- 
eralities are difficult to make. 

Some synthetases recognize their tRNA partners primarily on the 
basis of their anticodons, although they may also recognize other 
aspects of tRNA structure that vary among different tRNAs. The 
most direct evidence comes from crystallographic studies of com- 
plexes formed between synthetases and their cognate tRNAs. 
Consider, for example, the structure of the complex between 
threonyl-tRNA synthetase and tRNA™ (Figure 30.11). As 
expected, the CCA arm extends into the zinc-containing activa- 
tion site, where it is well positioned to accept threonine from 
threonyl adenylate. The enzyme interacts extensively not only 
with the acceptor stem of the tRNA, but also with the anticodon 
loop. The interactions with the anticodon loop are particularly 
revealing. Each base within the sequence 5’-CGU-3’ of the anti- 
codon participates in hydrogen bonds with the enzyme; those 
with the second two bases (G and U) appear to be more important 
because the synthetase interacts just as efficiently with the antico- 
dons GGU and UGU. Although interactions between the enzyme 
and the anticodon are often crucial for correct recognition, Figure 
30.12 shows that many aspects of tRNA molecules are recognized 
by synthetases. Note that many of the recognition sites are loops 
rich in unusual bases that can provide structural identifiers. 


Aminoacyl-tRNA synthetases can be divided 
into two classes 


a At least one aminoacyl-tRNA synthetase exists for each 
amino acid. The diverse sizes, subunit composition, and 
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Anticodo 
loop 


& FIGURE 30.11 Threonyl-tRNA synthetase 
complex. The structure shows the complex between 
threonyl-tRNA synthetase (blue) and tRNA (red). Notice 
that the synthetase binds to both the acceptor stem and 
the anticodon loop. [Drawn from 1QF6.pdb.] 
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FIGURE 30.12 Recognition sites on tRNA. Circles 
represent nucleotides, and the sizes of the circles are 
proportional to the frequency with which they are used as 
recognition sites by aminoacyl-tRNA synthetases. The 
numbers indicate the positions of the nucleotides in the 
base sequence, beginning from the 5’ end of the tRNA 
molecule. [Information from M. lbba, and D. Sdll, Annu. Rev. 
Biochem. 69:617-650, 1981, p. 636.] 
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k FIGURE 30.13 Classes of 
aminoacyl-tRNA synthetases. Notice that 
class | and class II synthetases recognize 
different faces of the tRNA molecule. The 
CCA arm of tRNA adopts different 
conformations in complexes with the two 
classes of synthetase. Note that the CCA 
arm of the tRNA is turned toward the 
viewer (Figures 30.4 and 30.5). [Drawn from 
1EUY.pdb and 1QF6.pdb.] 


TABLE 30.3 Classification and 
subunit structure of 
aminoacyl-tRNA 
synthetases in E. coli 


Class I Class II 


Arg (a) Ala (a4) 
Cys (a) Asn (2) 
Gln (a) Asp (a2) 
Glu (a) Gly (@282) 
Ile (a) His (a2) 
Leu (a) Lys (a2) 
Met (a) Phe (#282) 
Trp (a2) Ser (a2) 
Tyr (a2) Pro (a2) 
Val (a) Thr (a2) 
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CLASS II 


tRNA Complex tRNA 


Complex 


sequences of these enzymes were bewildering for many years. Could it be 
that essentially all synthetases evolved independently? The determination 
of the three-dimensional structures of several synthetases followed by 
more-refined sequence comparisons revealed that different synthetases 
are, in fact, related. Specifically, synthetases fall into two classes, termed 
class I and class II, each of which includes enzymes specific for 10 of the 
20 amino acids (Table 30.3). Intriguingly, synthetases from the two classes 
bind to different faces of the tRNA molecule (Figure 30.13). The CCA arm 
of tRNA adopts different conformations to accommodate these interac- 
tions; the arm is in the helical conformation observed for free tRNA 
(Figures 30.4 and 30.5) for class I] enzymes and in a hairpin conformation 
for class I enzymes. These two classes also differ in other ways. 


1. Class I enzymes acylate the 2'-hydroxyl group of the terminal adenosine 
of tRNA, whereas class II enzymes (except the enzyme for Phe-tRNA) 
acylate the 3’-hydroxyl group. 


2. The two classes bind ATP in different conformations. 


3. Most class I enzymes are monomeric, whereas most class II enzymes 
are dimeric. 


Why did two distinct classes of aminoacyl-tRNA synthetases evolve? 
The observation that the two classes bind to distinct faces of tRNA suggests 
a possibility. Recognition sites on both faces of tRNA may have been 
required to allow the recognition of 20 different tRNAs. 


30.3 The Ribosome Is the Site of Protein Synthesis 


We turn now to ribosomes, the molecular machines that coordinate the 
interplay of aminoacyl-tRNAs, mRNA, and proteins that leads to protein 
synthesis. An E. coli ribosome is a ribonucleoprotein assembly with a mass 
of about 2500 kDa, a diameter of approximately 250 A, and a sedimentation 
coefficient (Section 3.1) of 70S. The 20,000 ribosomes in a bacterial cell 
constitute nearly a fourth of its mass. 

A ribosome can be dissociated into a large subunit (50S) and a small sub- 
unit (30S). These subunits can be further split into their constituent proteins 
and RNAs. The 305 subunit contains 21 different proteins (referred to as S1 
through $21) anda 16S RNA molecule. The 50S subunit contains 34 differ- 
ent proteins (L1 through L34) and two RNA molecules, a 23S and a 55 


50S subunit 70S ribosome 


species. A ribosome contains one copy of each RNA molecule, two copies 
each of the L7 and L12 proteins, and one copy of each of the other proteins. 
The L7 protein is identical with L12 except that its amino terminus is acety- 
lated (Section 10.3). Both the 305 and the 50S subunits can be reconstituted 
in vitro from their constituent proteins and RNA. This reconstitution is an 
outstanding example of the principle that supramolecular complexes can form 
spontaneously from their macromolecular constituents. 

The structures of both the 305 and the 505 subunits as well as the com- 
plete 705 ribosome have been determined (Figure 30.14). The features of 
these structures are in remarkable agreement with interpretations of less- 
direct experimental probes. These structures provide an invaluable frame- 
work for examining the mechanism of protein synthesis. 


Ribosomal RNAs (5S, 16S, and 23S rRNA) play a central 
role in protein synthesis 


The prefix ribo in the name ribosome is apt because RNA constitutes nearly 
two-thirds of the mass of these large molecular assemblies. The three RNAs 
present—55, 16S, and 235—are critical for ribosomal architecture and func- 
tion. They are formed by the cleavage of primary 30S transcripts and further 
processing. These molecules fold into structures that allow them to form 
internal base pairs. Their base-pairing patterns were deduced by comparing 
the nucleotide sequences of many species to detect conserved sequences as 
well as conserved base pairings. For instance, the 16S RNA of one species 
may have a G-C base pair, whereas another may have an A—-U base pair, but 
the location of the base pair is the same in both molecules. Chemical modi- 
fication and digestion experiments supported the structures deduced from 
sequence comparisons (Figure 30.15). The striking finding is that across all 
species, ribosomal RNAs (rRNAs) are folded into defined structures that have 
many short duplex regions. 


— For many years, ribosomal proteins were presumed to orchestrate pro- 
tein synthesis and ribosomal RNAs were presumed to serve primarily 
as structural scaffolding. The current view is almost the reverse. The discovery 
of catalytic RNA (Section 29.4) made biochemists receptive to the possibility 
that RNA plays a much more active role in ribosomal function. The detailed 
structures make it clear that the key sites in the ribosome, such as those that 
catalyze the formation of the peptide bond and interact with mRNA and 
tRNA, are composed almost entirely of RNA. Contributions from the pro- 
teins are minor. The almost inescapable conclusion is that the ribosome ini- 
tially consisted only of RNA and that the proteins were added later to fine-tune 


30S subunit 


hk FIGURE 30.14 The ribosome at high 
resolution. Detailed models of the 
ribosome based on the results of x-ray 
crystallographic studies of the 70S 
ribosome and the 30S and 50S subunits: 
(left) view of the part of the 50S subunit 
that interacts with the 30S subunit; (center) 
side view of the 70S ribosome; (right) view 
of the part of the 30S subunit that interacts 
with the 50S subunit. 28S RNA is shown in 
yellow, 5S RNA in orange, 16S RNA in 
green, proteins of the 50S subunit in red, 
and proteins of the 30S subunit in blue. 
Notice that the interface between the 50S 
and the 30S subunits consists entirely of 
RNA. [Drawn from 1GIX. pdb and 1GIY.pdb.] 
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hk FIGURE 30.15 Ribosomal RNA folding pattern. (A) The secondary structure of 16S 
ribosomal RNA deduced from sequence comparison and the results of chemical studies. 
(B) The tertiary structure of 16S RNA determined by x-ray crystallography. [(A) Courtesy of 
Dr. Bryn Weiser and Dr. Harry Noller; (B) drawn from 1FJG.pdb.] 
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its functional properties. This conclusion has the pleasing consequence of 
dodging a “chicken and egg” question: How can complex proteins be synthe- 
sized if complex proteins are required for protein synthesis? 


Ribosomes have three tRNA-binding sites that bridge 
the 30S and 50S subunits 


Three tRNA-binding sites in ribosomes are arranged to allow the formation 
of peptide bonds between amino acids encoded by the codons on mRNA 
(Figure 30.16). The mRNA fragment being translated at a given moment is 
bound within the 30S subunit. Each of the tRNA molecules is in contact 
with both the 305 subunit and the 50S subunit. At the 305 end, two of the 
three tRNA molecules are bound to the mRNA through anticodon—codon 
base pairs. These binding sites are called the A site (for aminoacyl) and the 
P site (for peptidyl). The third tRNA molecule is bound to an adjacent site 
called the E site (for exit). 

The other end of each tRNA molecule, the end without the anticodon, 
interacts with the 50S subunit. The acceptor stems of the tRNA molecules 
occupying the A site and the P site converge at a site where a peptide bond 
is formed. A tunnel connects this site to the back of the ribosome, through 
which the polypeptide chain passes during synthesis (Figure 30.17). 


The start signal is usually AUG preceded by several 
bases that pair with 16S rRNA 


How does protein synthesis start? The simplest possibility would be for the 
first 3 nucleotides of each mRNA to serve as the first codon; no special start 
signal would then be needed. However, experiments show that translation 
in bacteria does not begin immediately at the 5’ terminus of mRNA. 
Indeed, the first translated codon is nearly always more than 25 nucleotides 
away from the 5’ end. Furthermore, in bacteria, many mRNA molecules are 
polycistronic—that is, they encode two or more polypeptide chains. For 
example, a single mRNA molecule about 7000-nucleotides long specifies 
five enzymes in the biosynthetic pathway for tryptophan in E.coli. Each of 
these five proteins has its own start and stop signals on the mRNA. In fact, 
all known mRNA molecules contain signals that define the beginning and end 
of each encoded polypeptide chain. 

A clue to the mechanism of initiation was the finding that nearly half the 
amino-terminal residues of proteins in E. coli are methionine. In fact, the 


mRNA 
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k FIGURE 30.16 Transfer RNA-binding 
sites. (A) Three tRNA-binding sites are 
present on the 70S ribosome. They are 
called the A (for aminoacyl), P (for peptidyl), 
and E (for exit) sites. Each tRNA molecule 
contacts both the 30S and the 50S 
subunit. (B) The tRNA molecules in sites 
A and P are base-paired with mRNA. 

[(B) Drawn from 1JGP. pdb.] 
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FIGURE 30.17 An active ribosome. This 
schematic representation shows the 
relations among the key components of 
the translation machinery. 
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FIGURE 30.18 Initiation sites. 

Sequences of mRNA initiation sites for 
protein synthesis in some bacterial and viral 
mRNA molecules. Comparison of these 
sequences reveals some recurring features. 
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FIGURE 30.19 Formylation of methionyl- 
tRNA. Initiator tRNA (tRNA, is first charged 
with methionine, and then a formyl group is 
transferred to the methionyl-tRNA, from 
N'°-formyltetrahydrofolate. 
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AGCACGAGGEGAAAUCUGAUGGAACGCUAC E. coli trpA 
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initiating codon in mRNA is AUG (methionine) or, less frequently, GUG 
(valine) or, rarely UUG (leucine). What additional signals are necessary to 
specify a translation start site? The first step toward answering this question 
was the isolation of initiator regions from a number of mRNAs. This isolation 
was accomplished by using ribonuclease to digest mRNA-ribosome complexes 
(formed under conditions in which protein synthesis could begin but elonga- 
tion could not take place). As expected, each initiator region usually displays an 
AUG codon (Figure 30.18). In addition, each initiator region contains a purine- 
rich sequence centered about 10 nucleotides on the 5’ side of the initiator codon. 

The role of this purine-rich region, called the Shine—Dalgarno sequence, 
(named after John Shine and Lynn Dalgarno, who first described the 
sequence), became evident when the sequence of 165 rRNA was elucidated. 
The 3’ end of this rRNA component of the 305 subunit contains a sequence 
of several bases that is complementary to the purine-rich region in the ini- 
tiator sites of mRNA. Mutagenesis of the CCUCC sequence near the 3’ end 
of 165 rRNA to ACACA markedly interferes with the recognition of start 
sites in mRNA. This result and other evidence show that the initiator region 
of mRNA binds very near the 3’ end of the 165 rRNA. The number of base 
pairs linking mRNA and 16S rRNA ranges from three to nine. Thus, two 
kinds of interactions determine where protein synthesis starts: (1) the pairing of 
mRNA bases with the 3' end of 16S rRNA and (2) the pairing of the initiator 
codon on mRNA with the anticodon of an initiator tRNA molecule. 


Bacterial protein synthesis is initiated by formylmethionyl transfer RNA 


As stated earlier, methionine is the first amino acid in many E. coli proteins. 
However, the methionine residue found at the amino-terminal end of E. coli 
proteins is usually modified. In fact, protein synthesis in bacteria starts with 
the modified amino acid N-formylmethionine (f{Met). A special tRNA brings 
formylmethionine to the ribosome to initiate protein synthesis. This initia- 
tor tRNA (abbreviated as tRNA,) differs from the tRNA that inserts 
methionine in internal positions (abbreviated as tRNA, ). The subscript “f” 
indicates that methionine attached to the initiator tRNA can be formylated, 
whereas it cannot be formylated when attached to tRNA,,. Although virtu- 
ally all proteins synthesized in E. coli begin with formylmethionine, in 
approximately one-half of the proteins, N-formylmethionine is removed 
when the nascent chain is 10 amino acids long. 

Methionine is linked to these two kinds of tRNAs by the same aminoacy]- 
tRNA synthetase. A specific enzyme then formylates the amino group of the 
methionine molecule that is attached to tRNA, (Figure 30.19). The activated 
formyl donor in this reaction is N!°-formyltetrahydrofolate, a folate deriva- 
tive that carries activated one-carbon units (Section 24.2). Free methionine 
and methionyl-tRNA,,, are not substrates for this transformylase. 


Formylmethionyl-tRNA, is placed in the P site of the ribosome 30S ribosomal subunit 
in the formation of the 70S initiation complex 
if Initiation factors 


Messenger RNA and formylmethionyl-tRNA; must be brought to the ribo- 
some for protein synthesis to begin. How is this task accomplished? Three 
protein initiation factors (IF1, IF2, and IF3) are essential. The 305 ribosomal 


subunit first forms a complex with IF1 and IF3 (Figure 30.20). The binding of 30S -1F1-1F3 
these factors to the 305 subunit prevents it from prematurely joining the 505 ren 
subunit to form a dead-end 70S complex, devoid of mRNA and fMet-tRNA,. i pale evened 


IF'1 binds near the A site and directs the f{Met-tRNA, to the P site. Initiation 
factor 2, a member of the G-protein family, binds GTP, and the concomitant 
conformational change enables IF2 to associate with f{Met-tRNA,. The IF2— 
GTP-initiator-tRNA complex binds with mRNA (correctly positioned by 
the interaction of the Shine-Dalgarno sequence with the 165 rRNA) and the 
30S subunit to form the 30S initiation complex. Structural changes then lead to 
the ejection of IF1 and IF3. IF2 stimulates the association of the 505 subunit 
to the complex. The GTP bound to IF2 is hydrolyzed upon arrival of the 50S 
subunit, releasing IF2. The result is a 70S initiation complex. The formation of 
the 70S initiation complex is the rate-limiting step in protein synthesis. 


30S initiation complex 


When the 705 initiation complex has been formed, the ribosome is ready \ 
for the elongation phase of protein synthesis. The fMet-tRNA, molecule eles 
occupies the P site on the ribosome, positioned so that its anticodon pairs 50S subunit + H,O 
with the initiating codon on mRNA. The other two sites for tRNA molecules, 
the A site and the E site, are empty. This interaction establishes the reading IF2, CDP +P; 
frame for the translation of the entire mRNA. After the initiator codon has 


been located, groups of three nonoverlapping nucleotides are defined. 


Elongation factors deliver aminoacyl-tRNA to the ribosome 


At this point, f{Met-tRNA, occupies the P site, and the A site is vacant. The 
particular species inserted into the empty A site depends on the mRNA codon 
in the A site. However, the appropriate aminoacyl-tRNA does not simply leave 
the synthetase and diffuse to the A site. Rather, it is delivered to the A site in 
association with a 43-kDa protein called elongation factor Tu (EF-Tu), another 70S initiation complex 
member of the G-protein family. EF-Tu, the most abundant bacterial protein, mietine une Transietun initiation wi 
binds aminoacyl-tRNA only in its GTP form (Figure 30.21). The binding of bacteria. Initiation factors aid the assembly 
EF-Tu to aminoacyl-tRNA serves two functions. First, EF-Tu protects the first of the 30S initiation complex and then 
delicate ester linkage in aminoacyl-tRNA from hydrolysis. Second, EF-Tu of the 70S initiation complex. 
contributes to the accuracy of protein synthesis because GTP hydrolysis and 
expulsion of the EF-Tu-GDP complex from the ribosome occurs only if the 
pairing between the anticodon and the codon is correct. EF-Tu interacts with 
the 165 RNA, which monitors the accuracy of the base pairing of position 1 
and 2 of the codon (p. 898). Correct codon recognition induces structural 
changes in the 30S subunit that activate the GTPase activity of EF-Tu, releas- 
ing EF-Tu-GDP from the ribosome. These same structural changes also rotate 
the aminoacyl-tRNA in the A site so that the amino acid is brought into prox- 
imity with the aminoacyl-tRNA in the P site, a process called accommodation. 
Accommodation aligns the amino acids for peptide bond formation. 

Released EF-Tu is then reset to its GTP form by a second elongation 
factor, elongation factor Ts. EF-Ts induces the dissociation of GDP. GTP 
binds to EF-Tu, and EF-T's concomitantly departs. It is noteworthy that 


EF-Tu does not interact with fMet-tRNA,. Hence, this initiator tRNA is not 


Aminoacyl- 
tRNA 
FIGURE 30.21 Structure of elongation factor Tu. The structure of a complex between 
elongation factor Tu (EF-Tu) and an aminoacyl-tRNA. Notice the P-loop NTPase domain 
(purple shading) at the amino-terminal end of EF-Tu. This NTPase domain is similar to those 
in other G proteins. [Drawn from 1B23. pdb.] 
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FIGURE 30.22 Peptide-bond formation. 
(A) The amino group of the aminoacyl-tRNA 
attacks the carbonyl group of the ester 
linkage of the peptidyl-tRNA. (B) An eight- 
membered transition state is formed. 

Note: Not all atoms are shown and some 
bond lengths are exaggerated for clarity. 

(C) This transition state collapses to form 
the peptide bond and release the 
deacylated tRNA. 
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delivered to the A site. In contrast, Met-tRNA,,, like all other aminoacy]l- 
tRNAs, does bind to EF-Tu. These findings account for the fact that internal 
AUG codons are not read by the initiator tRNA. Conversely, IF2 recognizes 
fMet-tRNA, but no other tRNA. The cycle of elongation continues until a 
termination codon is met. 


a This GTP—GDP cycle of EF-Tu is reminiscent of those of the het- 
erotrimeric G proteins in signal transduction (Section 14.1) and the 
Ras proteins in growth control (Section 14.3). This similarity is due to their 
shared evolutionary heritage, seen in the homology of the amino-terminal 
domain of EF-Tu to the P-loop NTPase domains in the other G proteins. 
In all these related enzymes, the change in conformation between the GTP 
and the GDP forms leads to a change in interaction partners. A further 
similarity is the requirement that an additional protein catalyzes the 
exchange of GTP for GDP; EF-Ts catalyzes the exchange for EF-Tu, just 
as an activated receptor does for a heterotrimeric G protein. 


Peptidyl transferase catalyzes peptide-bond synthesis 


With both the P site and the A site occupied by aminoacyl-tRNA, the stage 
is set for the formation of a peptide bond: the formylmethionine molecule 
linked to the initiator tRNA will be transferred to the amino group of the 
amino acid in the A site. The formation of the peptide bond, one of the most 
important reactions in life, is a thermodynamically spontaneous reaction 
catalyzed by a site on the 235 rRNA of the 505 subunit called the peptidyl 
transferase center. This catalytic center is located deep in the 50S subunit 
near the tunnel that allows the nascent peptide to leave the ribosome. 

The ribosome, which enhances the rate of peptide bond synthesis by a 
factor of 10’ over the uncatalyzed reaction (~10~+ M7! s~!), derives much 
of its catalytic power from catalysis by proximity and orientation. The ribo- 
some positions and orients the two substrates so that they are situated to take 
advantage of the inherent reactivity of an amine group (on the aminoacyl- 
tRNA in the A site) with an ester (on the initiator tRNA in the P site). The 
amino group of the aminoacyl-tRNA in the A site, in its unprotonated state, 
makes a nucleophilic attack on the ester linkage between the initiator tRNA 
and the formylmethionine molecule in the P site (Figure 30.22A). The nature 
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of the transition state that follows the attack is not established and several 
models are plausible. One model proposes roles for the 2’OH of the adeno- 
sine of the tRNA in the P site and a molecule of water at the peptidyl trans- 
ferase center (Figure 30.22B). The nucleophilic attack of the a-amino group 
generates an eight-membered transition state in which three protons are 
shuttled about in a concerted manner. The proton of the attacking amino 
group hydrogen bonds to the 2’ oxygen of ribose of the tRNA. The hydrogen 
of 2'OH in turn interacts with the oxygen of the water molecule at the center, 
which then donates a proton to the carbonyl oxygen. Collapse of the transi- 
tion state with the formation of the peptide bond allows protonation of the 
3'OH of the now empty tRNA in the P site (Figure 30.22C). The stage is 
now set for translocation and formation of the next peptide bond. 


The formation of a peptide bond is followed by the GTP-driven 
translocation of tRNAs and mRNA 


With the formation of the peptide bond, the peptide chain is now attached 
to the tRNA whose anticodon is in the A site on the 305 subunit. The two 
subunits rotate with respect to one another, and this structural change 
places the CCA end of the same tRNA and its peptide in the P site of the 
large subunit (Figure 30.23). Another aminoacyl-tRNA arrives and binds 
at the A site (1). Again, peptide bond synthesis occurs (2). However, protein 
synthesis cannot continue without the translocation of the mRNA and the 
tRNAs within the ribosome. Elongation factor G (EF-G, also called translo- 
case) catalyzes the movement of mRNA, at the expense of GTP hydrolysis, 
by a distance of three nucleotides. Now, the next codon is positioned in the 
A site for interaction with the incoming aminoacyl-tRNA (3). The pepti- 
dyl-tRNA moves out of the A site into the P site on the 305 subunit and at 
the same time, the deacylated tRNA moves out of the P site into the E site 
and is subsequently released from the ribosome (4). The movement of the 
peptidyl-tRNA into the P site shifts the mRNA by one codon, exposing the 
next codon to be translated in the A site. 


Aminoacyl-tRNA 
binding 


Oo) 


FIGURE 30.23 Mechanism of protein 
synthesis. The cycle begins with peptidyl- 
tRNA in the P site. (1) An aminoacyl-tRNA 
binds in the A site. (2) With both sites 
occupied, a new peptide bond is formed. 
(8) The tRNAs and the mRNA are 
translocated through the action of 


deacylated tRNA to the E site. (4) Once 
there, the tRNA is free to dissociate to 


elongation factor G, which moves the oe 3 ae 5 


complete the cycle. @ 
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FIGURE 30.25 Polysomes. Transcription 
of a segment of DNA from E. coli generates 
mRNA molecules that are immediately 
translated by multiple ribosomes. [From 

O. L. Miller, Jr, B. A. Hamkalo, and C. A. 
Thomas, Jr. Science 169 (1970):392. 
Reprinted with permission from AAAS.] 
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FIGURE 30.24 Translocation mechanism. In the GTP form, EF-G binds to the A site on 
the 50S subunit. This binding stimulates GTP hydrolysis, inducing a conformational change 
in EF-G that forces the tRNAs and mRNA to move through the ribosome by a distance 
corresponding to one codon. 


The three-dimensional structure of the ribosome undergoes significant 
change during translocation, and evidence suggests that translocation may 
result from properties of the ribosome itself. However, EF-G accelerates the 
process. A possible mechanism for accelerating the translocation process is 
shown in Figure 30.24. First, EF-G in the GTP form binds to the ribosome 
near the A site, interacting with the 235 rRNA of the 50S subunit. The 
binding of EF-G to the ribosome stimulates the GTPase activity of EF-G. 
On GTP hydrolysis, EF-G undergoes a conformational change that dis- 
places the peptidyl-tRNA in the A site to the P site, which carries the 
mRNA and the deacylated tRNA with it. The dissociation of EF-G leaves 
the ribosome ready to accept the next aminoacyl-tRNA into the A site. 

Note that the peptide chain remains in the P site on the 50S subunit 
throughout this cycle, growing into the exit tunnel. This cycle is repeated, 
with mRNA translation taking place in the 5’ — 3’ direction, as new ami- 
noacyl-tRNAs move into the A site, allowing the polypeptide to be elon- 
gated until a stop signal is found. 

The direction of translation has important consequences. Recall that 
transcription also is in the 5’ — 3’ direction (Section 29.1). If the direction 
of translation were opposite that of transcription, only fully synthesized 
mRNA could be translated. In contrast, because the directions are the 
same, mRNA can be translated while it is being synthesized. In bacteria, 
almost no time is lost between transcription and translation. The 5’ end of 
mRNA interacts with ribosomes very soon after it is made, well before the 
3’ end of the mRNA molecule is finished. An important feature of bacterial 
gene expression is that translation and transcription are closely coupled in space 
and time. 

Many ribosomes can be translating an mRNA molecule simultaneously. 
The group of ribosomes bound to an mRNA molecule is called a polyribo- 
some or a polysome (Figure 30.25). Recent work shows that the ribosomes are 
arranged so as to protect the mRNA and to facilitate easy exchange of the 
substrates and products with the cytoplasm. The ribosomes in the polysome 
are in a helical array around the mRNA with the tRNA binding sites and 
peptide exit tunnel exposed to the cytoplasm. 


Protein synthesis is terminated by release factors that 
read stop codons 


The final phase of translation is termination. How does the synthesis of a 
polypeptide chain come to an end when a stop codon is encountered? No 
tRNAs with anticodons complementary to the stop codons—UAA, UGA, 
or UAG—exist in normal cells. Instead, these stop codons are recognized by 
proteins called release factors (RFs). One of these release factors, RF 1, rec- 
ognizes UAA or UAG. A second factor, RF2, recognizes UAA or UGA. 


from tRNA 
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FIGURE 30.26 Termination of protein synthesis. A release factor recognizes a stop codon 
in the A site and stimulates the release of the completed protein from the tRNA in the P site. 


A third factor, RF3, another GTPase, catalyzes the removal of RF1 or RF2 tRNA 
from the ribosome upon release of the newly synthesized protein. ‘ 

RF 1 and RF2 are compact proteins that, in eukaryotes, resemble atRNA oA 
molecule. When bound to the ribosome, the proteins unfold to bridge the gap O 
between the stop codon on the mRNA and the peptidy] transferase center on 
the 505 subunit (Figure 30.26). The RF interacts with the peptidyl transferase 
center using a loop containing a highly conserved glycine-glycine-glutamine 
(GGQ) sequence, with the glutamine methylated on the amide nitrogen atom 
of the R group. This modified glutamine (assisted by the peptidyl transferase) 
is crucial in promoting a water molecule’s attack on the ester linkage between 
the tRNA and the polypeptide chain, freeing the polypeptide chain. The 
detached polypeptide leaves the ribosome. Transfer RNA and messenger 
RNA remain briefly attached to the 705 ribosome until the entire complex is 
dissociated through the hydrolysis of GTP in response to the binding of 
EF-G and another factor, called the ribosome release factor (RRF). 
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30.4 Eukaryotic Protein Synthesis Differs from Bacterial oA 
Protein Synthesis Primarily in Translation Initiation oe O 


The basic plan of protein synthesis in eukaryotes and archaea is similar to 
that in bacteria. The major structural and mechanistic themes recur in all HO OH 
domains of life. However, eukaryotic protein synthesis entails more protein + 
components than does bacterial protein synthesis, and some steps are more 
intricate. Some noteworthy similarities and differences are as follows: 


1. Ribosomes. Eukaryotic ribosomes are larger. They consist of a 60S large 
subunit and a 40S small subunit, which come together to form an 805 particle 
having a mass of 4200 kDa, compared with 2700 kDa for the bacterial 70S adlyseniide” 
ribosome. The 405 subunit contains an 185 RNA that is homologous to the 

bacterial 165 RNA. The 605 subunit contains three RNAs: the 55 RNA, 

which is homologous to the bacterial 55 rRNA; the 285 RNA, which is 

homologous to the bacterial 235 molecules; and the 5.85 RNA, which is 

homologous to the 5’ end of the 235 RNA of bacteria. 


2. Initiator tRNA. In eukaryotes, the initiating amino acid is methionine 
rather than N-formylmethionine. However, as in bacteria, a special tRNA 
participates in initiation. This aminoacyl-tRNA is called Met-tRNA, (the 
subscript “1” stands for initiation). 


3. Initiation. The initiating codon in eukaryotes is always AUG. 
Eukaryotes, in contrast with bacteria, do not have a specific purine-rich 
sequence on the 5’ side to distinguish initiator AUGs from internal ones. 
911 
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FIGURE 30.27 Eukaryotic translation 
initiation. In eukaryotes, translation 
initiation starts with the assembly of a 
complex on the 5’ cap that includes the 
40S subunit and Met-tRNA,. Driven by ATP 
hydrolysis, this complex scans the mRNA 
until the first AUG is reached. The 60S 
subunit is then added to form the 80S 
initiation complex. 
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Instead, the AUG nearest the 5’ end of mRNA is usually selected as the 
start site. Eukaryotes utilize many more initiation factors than do bacteria, 
and their interplay is much more complex. The prefix eI F denotes a eukary- 
otic initiation factor. 

Initiation begins with the formation of a ternary complex consisting of 
the 40S ribosome and Met-tRNA, in association with e[F-2. The complex is 
called the 435 preinitiation complex (PIC). The PIC binds to the 5’ end of 
mRNA and begins searching for an AUG codon by moving step-by-step in 
the 3’ direction. Initiation factor elF-4E binds to the 5’-cap of the mRNA 
(Section 29.3) and facilitates binding of PIC to the mRNA (Figure 30.27). 
This scanning process is catalyzed by helicases that move along the mRNA 
powered by ATP hydrolysis. Pairing of the anticodon of Met-tRNA, with 
the AUG codon of mRNA signals that the target has been found. In 
almost all cases, eukaryotic mRNA has only one start site and hence is the 
template for only a single protein. In contrast, a bacterial mRNA can have 
multiple Shine-Dalgarno sequences and, hence, start sites, and it can serve 
as a template for the synthesis of several proteins. 

The difference in initiation mechanism between bacteria and eukaryotes 
is, 1n part, a consequence of the difference in RNA processing. The 5’ end 
of mRNA is readily available to ribosomes immediately after transcription 
in bacteria. In contrast, in eukaryotes pre-mRNA must be processed and 
transported to the cytoplasm before translation is initiated. The 5’ cap pro- 
vides an easily recognizable starting point. In addition, the complexity of 
eukaryotic translation initiation provides another mechanism for regulation 
of gene expression that we shall explore further in Chapter 31. 

Although most eukaryotic mRNA molecules rely on the 5’ cap to initi- 
ate protein synthesis, recent work has established that some mRNA mole- 
cules can recruit ribosomes for initiation without the use of a 5’-cap and 
cap-binding proteins. In these mRNAs, highly structured RNA sequences 
called internal ribosome entry sites (IRES) facilitate 405 ribosome binding 
to the mRNA. IRES were first discovered in the genomes of RNA viruses 
and have since been found in other viruses, as well as in a subset of cellular 
mRNA that appears to take part in development and cell stress. The molec- 
ular mechanism by which IRES function to initiate protein synthesis 
remains to be determined. 


4. The Structure of mRNA. Soon after the PIC binds the mRNA, eIF-4G 
links eIF-4E to a protein associated with the poly(A) tail, the poly(A)-binding 
protein I (PABPI; Figure 30.28). Cap and tail are thus brought together to 
form a circle of mRNA. The benefits of circularization of mRNA remain to 
be determined, but a requirement for circularization may prevent translation 
of mRNA molecules that have lost their poly(A) tails (Section 29.3). 


FIGURE 30.28 Protein interactions circularize eukaryotic MRNA. [Information from H. 
Lodish et al., Molecular Cell Biology, 5th ed. (W. H. Freeman and Company, 2004), Fig. 4.31.] 
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FIGURE 30.29 The effects of vanishing white matter disease. (A) In the normal brain, 
magnetic resonance imaging (MRI) visualizes the white matter as dark gray. (B) In the 
diseased brain, this MRI reveals that white matter is replaced by cerebrospinal fluid, seen 

as white. [(A) Macmillan Publishers Ltd: NATURE Ainsworth, Claire. “Molecular Medicine: Lost in 
Translation.” Nature 435 (2005): 5556-558, © 2005. (B) Reprinted from Lancet Neurology, 5, M. C. 
van der Knaap, M. S., Pronk, J. C., Scheper, G. C., “Vanishing white matter disease” Fig. B, p. 414 
© 2006, with permission from Elsevier.] 


5. Elongation and Termination. Eukaryotic elongation factors EF1a and 
EF1By are the counterparts of bacterial EF-Tu and EF-Ts. The GTP form of 
EF 1a delivers aminoacyl-tRNA to the A site of the ribosome, and EF1 By cata- 
lyzes the exchange of GTP for bound GDP. Eukaryotic EF2 mediates GTP- 
driven translocation in much the same way as does bacterial EF-G. Termination 
in eukaryotes is carried out by a single release factor, eRF1, compared with two 
in bacteria. Release factor elF-3 accelerates the activity of eRF-1. 


6. Organization. The components of the translation machinery in higher 
eukaryotes are organized into large complexes associated with the cytoskele- 
ton. This association is believed to facilitate the efficiency of protein synthesis. 
Recall that organization of elaborate biochemical processes into physical com- 
plexes is a reoccurring theme in biochemistry (Section 18.5 and Section 25.2). 


Mutations in initiation factor 2 cause a curious pathological condition 


e Mutations in eukaryotic initiation factor 2 result in a mysterious dis- 

ease, called vanishing white matter (VWM) disease, in which nerve cells 
in the brain disappear and are replaced by cerebrospinal fluid (Figure 30.29). 
The white matter of the brain consists predominantly of nerve axons that 
connect the gray matter of the brain to the rest of the body. Death, resulting 
from fever or extended coma, can occur anywhere from a few years to decades 
after the onset of the disease. Symptoms usually appear in young children but 
can develop shortly after birth or even in adulthood. An especially puzzling 
aspect of the disease is its tissue specificity. A mutation in a biochemical pro- 
cess as fundamental to life as protein-synthesis initiation would be predicted 
to be lethal or at least to affect all tissues of the body. Diseases such as VWM 
graphically show that, although much progress has been made in biochemis- 
try, much more research will be required to understand the complexities of 
health and disease. 


30.5 A Variety of Antibiotics and Toxins Can Inhibit 
Protein Synthesis 


Many chemicals that inhibit various aspects of protein synthesis have been 
identified. These chemicals are powerful experimental tools and clinically 
useful drugs. 
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TABLE 30.4 Antibiotic inhibitors of protein synthesis 


Antibiotic 


Action 


Streptomycin and other aminoglycosides Inhibit initiation and cause the misreading of mRNA (bacteria) 
Tetracycline Binds to the 308 subunit and inhibits the binding of aminoacyl-tRNAs (bacteria) 
Chloramphenicol Inhibits the peptidyl transferase activity of the 50S ribosomal subunit (bacteria) 


Cycloheximide Inhibits translocation (eukaryotes) 


Erythromycin Binds to the 508 subunit and inhibits translocation (bacteria) 


Puromycin Causes premature chain termination by acting as an analog of aminoacyl-tRNA (bacteria and eukaryotes) 


Streptomycin 


FIGURE 30.30 Antibiotic action of RO 
puromycin. Puromycin resembles the 
aminoacyl terminus of an aminoacyl-tRNA. 

Its amino group joins the carboxyl group of 

the growing polypeptide chain to form 
peptidyl-puromycin that dissociates from 

the ribosome. Peptidyl-puromycin is stable 
because puromycin has an amide (shown 

in red) rather than an ester linkage. 
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Some antibiotics inhibit protein synthesis 


e The differences between eukaryotic and bacterial ribosomes can be 

exploited for the development of antibiotics (Table 30.4). For exam- 
ple, the antibiotic streptomycin, a highly basic trisaccharide, interferes with 
the binding of fMet-tRNA to ribosomes in bacteria and thereby prevents the 
correct initiation of protein synthesis. Other aminoglycoside antibiotics such 
as neomycin, kanamycin, and gentamycin interfere with the interaction 
between tRNA and the 165 rRNA of the 305 subunit (p. 906) of bacterial 
ribosomes. Chloramphenicol acts by inhibiting peptidyl transferase activity. 
Erythromycin binds to the 50S subunit and blocks translocation. 

The antibiotic puromycin inhibits protein synthesis in both bacteria and 
eukaryotes by causing nascent polypeptide chains to be released before their 
synthesis is completed. Puromycin is an analog of the terminal part of ami- 
noacyl-tRNA (Figure 30.30). It binds to the A site on the ribosome and 
blocks the entry of aminoacyl-tRNA. Furthermore, puromycin contains an 
a-amino group. This amino group, like the one on aminoacyl-tRNA, forms 
a peptide bond with the carboxyl group of the growing peptide chain. The 
product, a peptide having a covalently attached puromycin residue at its 
carboxyl end, dissociates from the ribosome. No longer used medicinally, 
puromycin remains an experimental tool for the investigation of protein 
synthesis. Cycloheximide, another antibiotic, blocks translocation in eukary- 
otic ribosomes, making a useful laboratory tool for blocking protein synthe- 
sis in eukaryotic cells. 


Diphtheria toxin blocks protein synthesis in eukaryotes 
by inhibiting translocation 


e Many antibiotics, harvested from bacteria for medicinal purposes, 

are inhibitors of bacterial protein synthesis. However, some bacteria 
produce protein-synthesis inhibitors that inhibit eukaryotic protein synthe- 
sis, leading to diseases such as diphtheria, which was a major cause of death 
in childhood before the advent of effective immunization. Symptoms 
include painful sore throat, hoarseness, fever, and difficulty breathing. The 
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lethal effects of this disease are due mainly to a protein toxin pro- 
duced by a phage infecting Corynebacterium diphtheriae, a bacte- 
rium that grows in the upper respiratory tract of an infected 
person. A few micrograms of diphtheria toxin is usually lethal in 
an unimmunized person because it inhibits protein synthesis. The 
toxin consists of a single polypeptide chain that is cleaved into a 
21-kDa A fragment and a 40-kDa B fragment shortly after enter- 
ing the cell. The role of the B-fragment in the intact protein is to 
bind to the cell, enabling the toxin to enter the cytoplasm of its 


target cell. The A fragment catalyzes the covalent modification of Sok 
EF2, the elongation factor catalyzing translocation in eukaryotic O 


protein synthesis, resulting in protein synthesis inhibition and cell 
death. 

A single A fragment of the toxin in the cytoplasm can kill a 
cell. Why is it so lethal? EF2 contains diphthamide, an unusual 
amino acid residue that enhances the fidelity of codon shifting 
during translocation. Diphthamide is formed by a highly con- 
served complicated pathway that posttranslationally modifies 
histidine. The A fragment of the diphtheria toxin catalyzes the 
transfer of the ADP ribose unit of NAD* to the diphthamide ring 
(Figure 30.31). This ADP ribosylation of a single side chain of EF2 
blocks EF2’s capacity to carry out the translocation of the growing 
polypeptide chain. Protein synthesis ceases, accounting for the 
remarkable toxicity of diphtheria toxin. 


Ricin fatally modifies 28S ribosomal RNA 


e Ricin is a small protein (65 kDa) found in the seeds of the castor oil 
plant, Ricinus communis (Figure 30.32). It is indeed a deadly molecule 
because as little as 500 yg is lethal for an adult human being, and a single 
molecule can inhibit all protein synthesis in a cell, resulting in cell death. 
Ricin is a heterodimeric protein composed of a catalytic A chain joined 
by asingle disulfide bond to a B chain. The B chain allows the toxin to bind 
to the target cell, and this binding leads to an endocytotic uptake of the 
dimer and the eventual release of the A chain into the cytoplasm. The A 
chain, an N-glycoside hydrolyase, cleaves adenine from a particular adeno- 
sine nucleotide on the 285 rRNA that is found in all eukaryotic ribosomes. 
Removal of the adenine base completely inactivates the ribosome by pre- 
venting the binding of elongation factors. Thus, ricin and diphtheria toxin 
both act by inhibiting protein-synthesis elongation; ricin does so by cova- 
lently modifying rRNA, and diphtheria toxin does so by covalently modify- 
ing the elongation factor. 


30.6 Ribosomes Bound to the Endoplasmic Reticulum 
Manufacture Secretory and Membrane Proteins 


Not all newly synthesized proteins are destined to function in the cytoplasm. 
Eukaryotic cells can direct proteins to internal sites such as mitochondria, 
the nucleus and the endoplasmic reticulum, a process called protein targeting 
or protein sorting. How is sorting accomplished? There are two general 
mechanisms by which sorting takes place. In one mechanism, the protein is 
synthesized in the cytoplasm, and then the completed protein is delivered to 
its intracellular location posttranslationally. Proteins destined for the nucleus, 
chloroplasts, mitochondria, and peroxisomes are delivered by this general 
process. The other mechanism, termed the secretory pathway, directs pro- 
teins into the endoplasmic reticulum (ER), the extensive membrane system 
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FIGURE 30.31 Blocking of translocation by diphtheria 
toxin. Diphtheria toxin blocks protein synthesis in 
eukaryotes by catalyzing the transfer of an ADP-ribose 
unit from NAD* to diphthamide, a modified amino acid 
residue in EF2 (translocase). Diphthamide is formed by the 
posttranslational modification (blue) of a histidine residue. 


On September 7, 1978, an agent of the 
Soviet Union State Security Department 
(KGB), using a weapon built into an umbrella, 
embedded a small pellet containing ricin into 
the thigh of Bulgarian dissident Georgi 
Markov as he crossed Waterloo Bridge in 
London. Mr. Markov felt a sharp sting, as if 
from a bug bite. He died four days later. 


FIGURE 30.32 Castor beans. The seeds 
of castor beans from Ricinus communis are 
a rich source of oils with a wide variety of 
uses, including the production of biodiesel 
fuels. The seeds are also rich in the toxin 
ricin. [Ted Kinsman/Science Source.] 
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FIGURE 30.33 Ribosomes are bound 
to the endoplasmic reticulum. In this 
electron micrograph, ribosomes appear 
as small black dots binding to the 
cytoplasmic side of the endoplasmic 
reticulum to give a rough appearance. 

In contrast, the smooth endoplasmic 
reticulum is devoid of ribosomes. [Don W. 
Fawcett/Science Source] 


FIGURE 30.34 Amino-terminal signal 
sequences of some eukaryotic secretory 
and plasma-membrane proteins. The 
hydrophobic core (yellow) is preceded by 
basic residues (blue) and followed by a 
cleavage site (red) for signal peptidase. 
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that comprises about half the total membrane of a cell, cotranslationally— 
that is, while the protein is being synthesized. Approximately 30% of all 
proteins are sorted by the secretory pathway, including secreted proteins, 
residents of the ER, the Golgi complex, lysosomes, and integral membrane 
proteins of these organelles as well as integral plasma-membrane proteins. 
We will focus our attention on the secretory pathway only. 

In eukaryotic cells, a ribosome remains free in the cytoplasm unless it is 
directed to the ER. The region that binds ribosomes is called the rough ER 
because of its studded appearance in contrast with the smooth ER, which is 
devoid of ribosomes (Figure 30.33). 


Protein Synthesis Begins on Ribosomes That Are Free in the Cytoplasm 


The synthesis of proteins sorted by the secretory pathway begins on free 
ribosomes that soon associate with the ER. Free ribosomes that are synthe- 
sizing proteins for use in the cell are identical with those attached to the ER. 
What is the process that directs the ribosome synthesizing a protein des- 
tined to enter the ER to bind to the ER? 


Signal sequences mark proteins for translocation across 
the endoplasmic reticulum membrane 


The synthesis of proteins destined to leave the cell or become embedded in 
the plasma membrane begins on a free ribosome but, shortly after synthesis 
begins, it is halted until the ribosome is directed to the cytoplasmic side of 
the ER. When the ribosome docks with the ER membrane, protein synthe- 
sis begins again. As the newly forming peptide chain exits the ribosome, it 
is transported, cotranslationally, through the membrane into the lumen of 
the ER. The translocation consists of four components. 


1. The Signal Sequence. The signal sequence is a sequence of 9 to 12 hydro- 
phobic amino acid residues, sometimes containing positively charged amino 
acids (Figure 30.34). This sequence, which adopts an a-helical structure, is 
usually near the amino terminus of the nascent polypeptide chain. The 
presence of the signal sequence identifies the nascent peptide as one that 
must cross the ER membrane. Some signal sequences are maintained in the 
mature protein, whereas others are cleaved by a signal peptidase on the 
lumenal side of the ER membrane. 


2. The Signal-Recognition Particle (SRP). The signal-recognition particle 
recognizes the signal sequence and binds the sequence and the ribosome 
as soon as the signal sequence exits the ribosome. SRP then shepherds 


Cleavage 
site 


TSLLLAFGLLCLPWLQEGSA | FPT 
MALWMRLLPLLALLALWGPDPAAA | FVN 
MKWVTFISLLLFSSAYS | RGV 
MKVLSLLYLLTAIPHIMS | DVQ 

Chicken lysozyme MRSLLILVLCFLPKLAALG | KVF 
MKFLVNVALVFMVVYISYIYA| APE 
MK LLVVAVIACMLIGFADPASG | CKD 
MAAKIFCLIMLLGLSASAATA | SIF 
MLLOAFLFLLAGFAAKISA | SMT 
MKAKLLVLLYAFVAG | DQ 
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Zea maize protein 19 


Yeast invertase 


Human influenza virus A 


the ribosome and its nascent polypeptide chain to the ER membrane. SRP Binds ER 

is a ribonucleoprotein consisting of a 75 RNA and six different proteins ye Fenel sequence 
(Figure 30.35). One protein, SRP54, is a GTPase that is crucial for SRP 
function. SRP samples all ribosomes until it locates one exhibiting a signal 
sequence. After SRP is bound to the signal sequence, interactions between 
the ribosome and the SRP occlude the elongation-factor-binding site, 
thereby halting protein synthesis. 


3. The SRP Receptor (SR). The SRP-ribosome complex diffuses to the 
ER, where SRP binds the SRP receptor, an integral membrane protein con- 
sisting of two subunits, 5Ra and SRB. 5Ra is, like SRP54, a GTPase. 


4. The Translocon. The SRP-SR complex delivers the ribosome to the 
translocation machinery, called the translocon, a multisubunit assembly of 
integral and peripheral membrane proteins. The translocon is a protein- 
conducting channel. This channel opens when the translocon and ribosome 
bind to each other. Protein synthesis resumes with the growing polypeptide FIGURE 30.35 The signal-recognition 


chain passing through the translocon channel into the lumen of the ER. particle. The signal-recognition particle 
(SRP) consists of six proteins and one 


The interactions of the components of the translocation machinery are eo) nugeende AINA mbleculss ie AN 
has a complex structure with many double- 


shown in Figure 30.36. Both the SRP54 and the SRa subunits of SR must helical stretches punctuated by single- 
bind GTP to facilitate the formation of the SRP—SR complex. For the stranded regions, shown as circles. 
SRP-SR complex to then deliver the ribosome to the translocon, the two [Information from H. Lodish et al., Molecular 


GTP molecules—one in SRP and the other in SR—are aligned in what is Cell Biology, Sth ed. (W. H. Freeman and 
Company, 2004). Data from K. Strub et al., 
Mol. Cell Biol. 11:3949-8959, 1991, and 
S. High and B. Dobberstein, J. Cell Biol. 
113:229-233, 1991] 


SRP19 
> SRP68/SRP72 


FIGURE 30.36 The SRP targeting cycle. (1) Protein synthesis begins on free ribosomes. 

(2) After the signal sequence has exited the ribosome, it is bound by the SRP, and protein 
synthesis halts. (3) The SRP-ribosome complex docks with the SRP receptor in the ER 
membrane. (4) The SRP and the SRP receptor simultaneously hydrolyze bound GTPs. Protein 
synthesis resumes and the SRP is free to bind another signal sequence. (6) The signal peptidase 
may remove the signal sequence as it enters the lumen of the ER. (6) Protein synthesis 
continues as the protein is synthesized directly into the ER. (7) On completion of protein 
synthesis, the ribosome is released. (8) The protein tunnel in the translocon closes. [Information 
from H. Lodish et al., Molecular Cell Biology, 5th ed. (W. H. Freeman and Company, 2004), Fig. 16.6.] 
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essentially an active site shared by the two proteins. The formation of the 
alignment is catalyzed by the 75 RNA of the SRP. After the ribosome has 
been passed along to the translocon, the GTPs are hydrolyzed, SRP and SR 
dissociate, and SRP is free to search for another signal sequence to begin the 
cycle anew. Thus, SRP acts catalytically. The signal peptidase, which is 
associated with the translocon in the lumen of the ER, removes the signal 
sequence from most proteins. 


Transport vesicles carry cargo proteins to their final destination 


As the proteins are synthesized, they fold to form their three-dimensional 
structures in the lumen of the ER. Some proteins are modified by the 
attachment of N-linked carbohydrates (Section 11.3). Finally, the proteins 
must be sorted and transported to their final destinations. Regardless of the 
destination, the principles of transport are the same. Transport is mediated 
by transport vesicles that bud off the ER (Figure 30.37). Transport vesicles 
from the ER carry their cargo (the proteins) to the Golgi complex, where the 
vesicles fuse and deposit the cargo inside the complex. There the cargo pro- 
teins are further modified—for instance, by the attachment of O-linked 
carbohydrates. From the Golgi complex, transport vesicles carry the cargo 
proteins to their final destinations, as shown in Figure 30.37. 


Exterior 


Cytoplasm ty 


ye "protein inserted 


in plasma membrane 


Secretory 
granule 


» ) Cis-Golgi 
PS € a 
E) ranspon vesicle 


Rough ER 


FIGURE 30.37 Protein-sorting pathways. Newly synthesized proteins in the lumen of the ER 
are collected into membrane buds. These buds pinch off to form transport vesicles. The 
transport vesicles carry the cargo proteins to the Golgi complex, where the cargo proteins are 
modified. Transport vesicles then carry the cargo to the final destination as directed by the 
V-SNARE and t-SNARE proteins. 


How does a protein end up at the correct destination? A newly synthesized 
protein will float inside the ER lumen until it binds to an integral membrane 
protein called a cargo receptor. This binding sequesters the cargo protein into a 
small region of the membrane that can subsequently form a membrane bud. 
The bud will carry the protein to a specific destination—plasma membrane, 
lysosome, or cell exterior. The key to ensuring that the protein reaches the 
proper destination is that the protein must bind to a receptor in the ER region 
associated with the protein’s destination. To ensure the proper match of protein 
with ER region, cargo receptors recognize various characteristics of the cargo 
protein, such as a particular amino acid sequence or an added carbohydrate. 

The formation of buds is facilitated by the binding of coat proteins (COPs) 
to the cytoplasmic side of the bud. The coat proteins associate with one another 
to pinch off the vesicle. After the transport vesicle has formed and is released, 
the coat proteins are shed to reveal another integral protein called v-SNARE 
(“v” for vesicle). v-SNARE will bind to a particular t-SNARE (‘‘t” for target) 
in the target membrane. This binding leads to the fusion of the transport vesi- 
cle to the target membrane, and the cargo is delivered. Thus, the assignment of 
identical v-SNARE proteins to the same region of the ER membrane causes an 
ER region to be associated with a particular destination. 


SUMMARY 


30.1 Protein Synthesis Requires the Translation of Nucleotide Sequences 

into Amino Acid Sequences 

Protein synthesis is called translation because information present as a 
nucleic acid sequence is translated into a different language, the 
sequence of amino acids in a protein. This complex process is mediated 
by the coordinated interplay of more than a hundred macromolecules, 
including mRNA, rRNAs, tRNAs, aminoacyl-tRNA synthetases, 
and protein factors. Given that a protein typically comprises from 100 
to 1000 amino acids, the frequency at which an incorrect amino acid is 
incorporated in the course of protein synthesis must be less than 10~*. 
Transfer RNAs are the adaptors that make the link between a nucleic 
acid and an amino acid. These molecules, single chains of about 
80 nucleotides, have an L-shaped structure. 


30.2 Aminoacyl Transfer RNA Synthetases Read the Genetic Code 

Each amino acid is activated and linked toa specific transfer RNA by an 
enzyme called an aminoacyl-tRNA synthetase. The synthetase links the 
carboxyl group of an amino acid to the 2’- or 3'-hydroxyl group of the 
adenosine unit of a CCA sequence at the 3’ end of the tRNA by an ester 
linkage. There is at least one specific aminoacyl-tRNA synthetase and 
at least one specific tRNA for each amino acid. A synthetase utilizes 
both the functional groups and the shape of its cognate amino acid to 
prevent the attachment of an incorrect amino acid to a tRNA. Some 
synthetases have a separate active site at which incorrectly linked amino 
acids are removed by hydrolysis. A synthetase recognizes the anticodon, 
the acceptor stem, and sometimes other parts of its tRNA substrate. By 
specifically recognizing both amino acids and tRNAs, aminoacyl- 
tRNA synthetases implement the instructions of the genetic code. 

The codons of messenger RNA recognize the anticodons of transfer 
RNAs rather than the amino acids attached to the tRNAs. A codon on 
mRNA forms base pairs with the anticodon of the tRNA. Some 
tRNAs are recognized by more than one codon because pairing of 
the third base of a codon is less crucial than that of the other two (the 
wobble mechanism). There are two evolutionary distinct classes of 
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30.3 


30.4 


30.5 


30.6 


synthetases, each recognizing 10 amino acids. The two classes recog- 
nize opposite faces of tRNA molecules. 


The Ribosome Is the Site of Protein Synthesis 

Protein synthesis takes place on ribosomes—ribonucleoprotein particles 
(about two-thirds RNA and one-third protein) consisting of large and 
small subunits. In E. coli, the 70S ribosome (2500 kDa) is made up of 305 
and 505 subunits. The 305 subunit consists of 165 ribosomal RNA and 
21 different proteins; the 505 subunit consists of 238 and 55 rRNA and 
34 different proteins. The ribosome includes three sites for tRNA bind- 
ing called the A (aminoacy]) site, the P (peptidyl) site, and the E (exit) site. 

Protein synthesis takes place in three phases: initiation, elongation, 
and termination. In bacteria, mRNA, formylmethionyl-tRNA, (the 
special initiator tRNA that recognizes AUG), and a 305 ribosomal 
subunit come together with the assistance of initiation factors to form 
a 305 initiation complex. A 50S ribosomal subunit then joins this com- 
plex to form a 70S initiation complex, in which fMet-tRNA, occupies 
the P site of the ribosome. 

Elongation factor Tu delivers the appropriate aminoacyl-tRNA to the 
ribosome’s A site as an EF-Tu—aminoacyl-tRNA-GTP ternary com- 
plex. EF-Tu serves both to protect the aminoacyl-tRNA from premature 
cleavage and to increase the fidelity of protein synthesis by ensuring that 
the correct anticodon—codon pairing has taken place before hydrolyzing 
GTP and releasing aminoacyl-tRNA into the A site. A peptide bond is 
formed when the amino group of the aminoacyl-tRNA nucleophilically 
attacks the ester linkage of the peptidyl-tRNA. On peptide-bond forma- 
tion, the tRNAs and mRNA are translocated for the next cycle to begin. 
The deacylated tRNA moves to the E site and then leaves the ribosome, 
and the peptidyl-tRNA moves from the A site into the P site. Elongation 
factor G uses the free energy of GTP hydrolysis to drive translocation. 
Protein synthesis is terminated by release factors, which recognize the 
termination codons UAA, UGA, and UAG and cause the hydrolysis of 
the ester bond between the polypeptide and tRNA. 


Eukaryotic Protein Synthesis Differs from Bacterial Protein 

Synthesis Primarily in Translation Initiation 

The basic plan of protein synthesis in eukaryotes is similar to that in bac- 
teria, but there are some significant differences between them. Eukaryotic 
ribosomes (805) consist of a 405 small subunit and a 605 large subunit. 
The initiating amino acid is again methionine, but it is not formylated. 
The initiation of protein synthesis is more complex in eukaryotes than in 
bacteria. In eukaryotes, the AUG closest to the 5’ end of mRNA is nearly 
always the start site. The 40S ribosome finds this site by binding to the 5’ 
cap and then scanning the RNA until AUG is reached. The regulation of 
translation in eukaryotes provides a means for regulating gene expression. 


A Variety of Antibiotics and Toxins Can Inhibit Protein Synthesis 

Many clinically important antibiotics function by inhibiting protein 
synthesis. All steps of protein synthesis are susceptible to inhibition by 
one antibiotic or another. Diphtheria toxin inhibits protein synthesis 
by covalently modifying an elongation factor, thereby preventing elon- 
gation. Ricin, a toxin from castor beans, inhibits elongation by remov- 
ing acrucial adenine residue from rRNA. 


Ribosomes Bound to the Endoplasmic Reticulum Manufacture 
Secretory and Membrane Proteins 

Proteins contain signals that determine their ultimate destination. The 
synthesis of all proteins begins on free ribosomes in the cytoplasm. In 


eukaryotes, protein synthesis continues in the cytoplasm unless the 921 


nascent chain contains a signal sequence that directs the ribosome to 


Problems 


the endoplasmic reticulum. Amino-terminal signal sequences consist 
of a hydrophobic segment of 9 to 12 residues preceded by a positively 
charged amino acid. The signal-recognition particle, a ribonucleopro- 
tein assembly, recognizes signal sequences and brings ribosomes bear- 
ing them to the ER. A GTP—GDP cycle releases the signal sequence 
from SRP and then detaches SRP from its receptor. The nascent chain 
is then translocated across the ER membrane. Proteins are transported 


throughout the cell in transport vesicles. 


KEY TERMS 


translation (p. 893) 

ribosome (p. 893) 

aminoacyl-tRNA synthetase (p. 894) 
codon (p. 895) 

transfer RNA (tRNA) (p. 895) 
anticodon (p. 895) 

wobble hypothesis (p. 897) 

30S subunit (p. 902) 

505 subunit (p. 902) 
Shine-Dalgarno sequence (p. 906) 


(p. 909) 
polysome (p. 910) 


| PROBLEMS 


initiation factor (p. 907) 

elongation factor Tu (EF-Tu) (p. 907) 
accommodation (p. 907) 

elongation factor Ts (EF-Ts) (p. 907) 
peptidy]l transferase center (p. 908) 
elongation factor G (EF-G) (translocase) 


signal peptidase (p. 916) 
signal-recognition particle (SRP) (p. 916) 
SRP receptor (SR) (p. 917) 

translocon (p. 917) 

transport vesicle (p. 918) 

coat proteins (p. 919) 

v-SNARE (p. 919) 

t-SNARE (p. 919) 


release factor (RF) (p. 910) 
signal sequence (p. 916) 


1. Babel fish. Why is protein synthesis also called translation? 


2. Careful, but not too careful. Why is it crucial that protein 
synthesis has an error frequency of 1074? 


3. Commonalities. What features are common to all tRNA 
molecules? 


4. The ol’ two step. What two reaction steps are required for 
the formation of an aminoacyl-tRNA? 


5. The same but different. Why must tRNA molecules have 
both unique structural features and common structural 
features? 


6. Charge it. In the context of protein synthesis, what is 
meant by an activated amino acid? 


7. Synthetase mechanism. The formation of isoleucyl-tRNA 
proceeds through the reversible formation of an enzyme- 
bound Ile- AMP intermediate. Predict whether **P-labeled 
ATP is formed from 72PP , when each of the following sets of 
components is incubated with the specific activating enzyme: 
(a) ATP and 2PP. 

(b) tRNA, ATP, and **PP. 

(c) Isoleucine, ATP, and al 


8. 1 = 2, for sufficiently large values of 1. The energetic 
equivalent of two molecules of ATP is used to activate 
an amino acid, yet only one molecule of ATP is used. Explain. 


9. Sieves. Using threonyl-tRNA synthetase as an example, 
account for the specificity of threonyl-tRNA formation. 


10. Use all available information. Suggest a reason why 
there are two classes of aminoacyl-tRNA synthetases, with 
each class recognizing a different face of the tRNA. 


11. Going wobbly. Explain how it is possible that some 
tRNA molecules recognize more than one codon. 


12. Light and heavy ribosomes. Ribosomes were isolated 
from bacteria grown in a “heavy” medium (!3C and !5N) 
and from bacteria grown in a “light” medium (!2C and 
14N]). These 60S ribosomes were added to an in vitro system 
engaged in protein synthesis. An aliquot removed several 
hours later was analyzed by density-gradient centrifuga- 
tion. How many bands of 705 ribosomes would you expect 
to see in the density gradient? 


13. The price of protein synthesis. What is the smallest num- 
ber of molecules of ATP and GTP consumed in the synthe- 
sis of a 200-residue protein, starting from amino acids? 
Assume that the hydrolysis of PP, is equivalent to the 
hydrolysis of ATP for this calculation. 


14. Correct phasing. What is meant by the phrase reading 
frame? 


15. Suppressing frameshifts. The insertion of a base in a cod- 
ing sequence leads to a shift in the reading frame, which in 
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most cases produces a nonfunctional protein. Propose a 
mutation in atRNA that might suppress frameshifting. 


16. Tagging a ribosomal site. Design an affinity-labeling 
reagent for one of the tRNA-binding sites in E. coli ribosomes. 


17. Viral mutation. An mRNA transcript of a T7 phage 
gene contains the base sequence 


y 
5’ -AACUGCACGAGGUAACACAAGAUGGCU-3' 


Predict the effect of a mutation that changes the red G to A. 


18. A new translation. A transfer RNA with a UGU antico- 
don is enzymatically conjugated to !'*C-labeled cysteine. The 
cysteine unit is then chemically modified to alanine. The 
altered aminoacyl-tRNA is added to a protein-synthesizing 
system containing normal components except for thistRNA. 
The mRNA added to this mixture contains the following 
sequence: 


5’ -UUUUGCCAUGUUUGUGCU-3’ 


What is the sequence of the corresponding radiolabeled 
peptide? 


19. Two synthetic modes. Compare and contrast protein 
synthesis by ribosomes with protein synthesis by the solid- 
phase method (Section 3.4). 


20. Triggered GTP hydrolysis. Ribosomes markedly acceler- 
ate the hydrolysis of GTP bound to the complex of EF-Tu 
and aminoacyl-tRNA. What is the biological significance of 
this enhancement of GTPase activity by ribosomes? 


21. Blocking translation. Devise an experimental strategy for 
switching off the expression of a specific mRNA without 
changing the gene encoding the protein or the gene’s control 
elements. 


22. Directional problem. Suppose that you have a protein- 
synthesis system that is synthesizing a protein designated 
A. Furthermore, you know that protein A has four trypsin- 
sensitive sites, equally spaced in the protein, that, on diges- 
tion with trypsin, yield the peptides A,, Pigy Pigs Ags and A.. 
Peptide A, is the amino-terminal peptide, and A, is the 
carboxyl-terminal peptide. Finally, you know that your 
system requires 4 minutes to synthesize a complete pro- 
tein A. At t = 0, you add all 20 amino acids, each carrying 
a !4C label. 


(a) At t = 1 minute, you isolate intact protein A from the 
system, cleave it with trypsin, and isolate the five peptides. 
Which peptide is most heavily labeled? 


(b) At t = 3 minutes, what will be the order of the labeling 
of peptides from greatest to least? 


(c) What does this experiment tell you about the direction 
of protein synthesis? 


23. Translator. Aminoacyl-tRNA synthetases are the only 
component of gene expression that decodes the genetic 
code. Explain. 


24. A timing device. EF-Tu, a member of the G-protein 
family, plays a crucial role in the elongation process of 
translation. Suppose that a slowly hydrolyzable analog of 
GTP were added to an elongating system. What would be 
the effect on the rate of protein synthesis? 


25. Not just RNA. What are the roles of the protein factors 
required for protein synthesis? 


26. Membrane transport. What four components are 
required for the translocation of proteins across the endo- 
plasmic reticulum membrane? 


27. Push. Don’t pull. What is the energy source that powers 
the cotranslational movement of proteins across the endo- 
plasmic reticulum? 


28. You have to know where to look. Bacterial messenger 
RNAs usually contain many AUG codons. How does the 
ribosome identify the AUG specifying initiation? 


29. Fundamentally the same, yet ... List the differences 
between bacterial and eukaryotic protein synthesis. 


30. Like a border collie. What is the role of the signal- 
recognition particle in protein translocation? 


31. An assembly line. Why is the fact that protein synthesis 
takes place on polysomes advantageous? 


32. Match ‘em 
(a) Initiation . GTP 

. AUG 

. {Met 

. RRF 

IF2 

. Shine-Dalgarno 

. EF-Tu 

. Peptidyl transferase 
. UGA 


. Transformylase 


33. Wasted effort? Transfer RNA molecules are quite 
large, given that the anticodon consists of only three 
nucleotides. What is the purpose of the rest of the tRNA 
molecule? 


(b) Elongation 


(c) Termination 


SOD ON DM FwWN 
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Mechanism Problems 


34. Evolutionary amino acid choice. Ornithine is struc- 
turally similar to lysine except ornithine’s side chain is 
one methylene group shorter than that of lysine. Attempts 


to chemically synthesize and isolate ornithinyl-tRNA 
proved unsuccessful. Propose a mechanistic explanation. 
(Hint: Six-membered rings are more stable than seven- 
membered rings.) 


Chapter Integration Problems 


35. Contrasting modes of elongation. The two basic mecha- 
nisms for the elongation of biomolecules are represented in 
the adjoining illustration. In type 1, the activating group 
(X) is released from the growing chain. In type 2, the acti- 
vating group is released from the incoming unit as it is 
added to the growing chain. Indicate whether each of the 
following biosyntheses is by means of a type 1 or a type 2 
mechanism: 


oe + F® ®+ F® 
HI® + @ HH 1® + @ 
Type 1 Type 2 


a) Glycogen synthesis 
b) Fatty acid synthesis 


d) DNA synthesis 
e) RNA synthesis 
(f) Protein synthesis 


( 
( 
(c) C, > C,, > C,; in cholesterol synthesis 
( 
( 


36. Enhancing fidelity. Compare the accuracy of DNA rep- 
lication, RNA synthesis, and protein synthesis. Which 
mechanisms are used to ensure the fidelity of each of these 
processes? 


37. Déja vu. Which protein in G-protein cascades plays a 
role similar to that of elongation factor Ts? 


38. Family resemblance. Eukaryotic elongation factor 2 is 
inhibited by ADP ribosylation catalyzed by diphtheria 
toxin. What other G proteins are sensitive to this mode of 
inhibition? 

39. The exceptional E. coli. In contrast with E. coli, most 
bacteria do not have a full complement of aminoacyl-tRNA 
synthetases. For instance, Helicobacter pylori, the cause of 
stomach ulcers, has tRNAG!", but no Gln-tRNA synthe- 
tase. However, glutamine is a common amino acid in H. 
pylori proteins. Suggest a means by which glutamine can be 
incorporated into proteins in H. pylori. (Hint: Glu-tRNA 
synthetase can misacylate tRNA®". ) 


40. The final step. What aspect of primary structure allows 
the transfer of linear nucleic acid information into the func- 
tional three-dimensional structure of proteins? 
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Data Interpretation Problems 


41. Helicase helper. The initiation factor eIF-4 displays 
ATP-dependent RNA helicase activity. Another initiation 
factor, e[F-4H, has been proposed to assist the action of 
elF-4. Graph A shows some of the experimental results 
from an assay that can measure the activity of eIF'-4 helicase 
in the presence of elF-4H. 


30 elF-4A + elF-4H 
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(a) What are the effects on eIF-4 helicase activity in the 
presence of elF-4H? 


(b) Why did measuring the helicase activity of elF-4H 
alone serve as an important control? 


a 


Initial rate of helix 
unwinding (fmole minute-') 


ES SS ee ee 
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(B) elF-4H added (1M) 


(c) The initial rate of helicase activity of 0.2 uM of elF-4 
was then measured with varying amounts of eIF-4H 
(graph B). What ratio of eIF-4H to eIF-4 yielded optimal 
activity? 


2.0 — 


elF-4A + elF-4H 


elF-4A only 


In (initial rate of 
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[Data from N. J. Richter, G. W. Rodgers, Jr., J. O. Hensold, and 
W. C. Merrick. Further biochemical and kinetic characterization of human 
eukaryotic initiation factor 4H., J. Biol. Chem. 274:35415-35424, 1999.] 


(d) Next, the effect of RNA-RNA helix stability on the 
initial rate of unwinding in the presence and absence of 
elF-4H was tested (graph C). How does the effect of eIF- 
4H vary with helix stability? 


(e) How might eIF-4H affect the helicase activity of 
elF-4A? 


924 
CHAPTER 30 Protein Synthesis 


42. Size separation. The protein-synthesizing machinery was 
isolated from eukaryotic cells and briefly treated with a low 
concentration of RNase. The sample was then subjected to 
sucrose gradient centrifugation. The gradient was fractionated 
and the absorbance, or optical density (OD), at 254 nm was 
recorded for each fraction. The following plot was obtained. 
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(a) What do the three peaks of absorbance in graph A 
represent? 


The experiment was repeated except that, this time, the 
RNase treatment was omitted. 
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(b) Why is the centrifugation pattern in graph B more com- 
plex? What do the series of peaks near the bottom of the 
centrifuge tube represent? 


Before the isolation of the protein-synthesizing machinery, 
the cells were grown in low concentrations of oxygen 
(hypoxic conditions). Again the experiment was repeated 
without RNase treatment (graph C). 
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[Data from M. Koritzinsky et al., EMBO J. 25:1114-1125, 2006.] 


(c) What is the effect of growing cells under hypoxic 
conditions? 


The Control of Gene 
Expression in Prokaryotes 


CHAPTER 


Bacteria respond to changes in their environments. A micrograph of the light organ of a 
newly hatched squid (Euprymna scolopes) is shown on the left. The light spots are due to 
colonies of the bacteria Vibrio fischeri that live symbiotically within these organs. These 
bacteria become luminescent when they reach an appropriately high density. The density 
is sensed by the circuit shown on the right in which each bacterium releases a small 
molecule into the environment. The molecule is subsequently taken up by other bacterial 
cells, which start a signaling cascade that stimulates the expression of specific genes. 
[Spencer V. Nyholm, Eric V. Stabb, Edward G. Ruby, and Margaret J. McFall-Ngai, 
“Establishment of an animal-bacterial association: Recruiting symbiotic vibrios from the 
environment.” PNAS 97(2000): 10231-10235. Copyright 2000 National Academy of Sciences.] 


ven simple prokaryotic cells must respond to changes in their metabo- 

lism or in their environments. Much of this response takes place through 
changes in gene expression. A gene is expressed when it is transcribed into 
RNA and, for most genes, translated into proteins. Genomes comprise thou- 
sands of genes. Some of these genes are expressed all the time. These genes 
are subject to constitutive expression. Many other genes are expressed only 
under some circumstances—that is, under a particular set of physiological 
conditions. These genes are subject to regulated expression. For example, the 
level of expression of some genes in bacteria may vary more than a 1000-fold 
in response to the supply of nutrients or to environmental challenges. 

In this chapter, we will examine gene-regulation mechanisms in pro- 
karyotes, particularly FE. coli, because many of these processes were first 
discovered in this organism. In Chapter 32, we will turn to gene-regulation 
mechanisms in eukaryotes. We shall see both substantial similarities and 
fundamental differences in comparing gene-regulatory mechanisms of the 
two types of organisms. 

How is gene expression controlled? Gene activity is controlled first and fore- 
most at the level of transcription. Whether a gene is transcribed is determined 
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926 largely by the interplay between specific DNA sequences and certain proteins 

CHAPTER 31 The Control of Gene that bind to these sequences. Most often, these proteins repress the expression 

Expression in Prokaryotes of specific genes by blocking the access of RNA polymerase to their promot- 
ers. In some cases, however, the proteins can activate the expression of specific 
genes. We shall learn about several different strategies that allow the coordi- 
nated regulation of sets of genes. Some genes are also controlled at stages 
beyond the level of transcription and we shall examine several mechanisms at 
these stages. Finally, we will examine several important examples of how gene 
expression is regulated in response to changes in the concentrations of specific 
molecules in the environment of prokaryotic cells. 


31.1 Many DNA-Binding Proteins Recognize Specific 
DNA Sequences 


How do regulatory systems distinguish the genes that need to be activated 
or repressed from genes that are constitutive? After all, the DNA sequences 
of genes themselves do not have any distinguishing features that would 
allow regulatory systems to recognize them. Instead, gene regulation 
depends on other sequences in the genome. In prokaryotes, these regula- 
tory sites are close to the region of the DNA that is transcribed. Regulatory 
sites are usually binding sites for specific DNA-binding proteins, which 
can stimulate or repress gene expression. These regulatory sites were first 
identified in FE. coli in studies of changes in gene expression. In the pres- 
ence of the sugar lactose, the bacterium starts to express a gene encoding 
B-galactosidase, an enzyme that can process lactose for use as a carbon and 
energy source. The sequence of the regulatory site for this gene is shown in 
Figure 31.1. The nucleotide sequence of this site shows a nearly perfect 
inverted repeat, indicating that the DNA in this region has an approximate 
twofold axis of symmetry. Recall that cleavage sites for restriction enzymes 
such as EcoRV have similar symmetry properties (Section 9.3). Symmetry 
in such regulatory sites usually corresponds to symmetry in the protein 
FIGURE 31.1 Sequence of the lac that binds the site. Symmetry matching 1s a recurring theme in protein-DNA 


regulatory site. The nucleotide sequence interactions. 
of this regulatory site shows a nearly 
perfect inverted repeat, corresponding to 


twofold rotational symmetry in the DNA. ; 
Parts of the sequences that are related by 5’-... TGTGTGGAAT TGTGAGCGGATAACAATTTCACACA...3 


this symmetry are shown in the same color. 3'-...ACACACCT T AACACT CGCCT AAT GTT AAAGT GT GT ...5’ 


To understand these protein-DNA interactions in detail, scientists 
examined the structure of the complex between an oligonucleotide that 
includes this site and the DNA-binding unit that recognizes it (Figure 31.2). 


Operator DNA 


hk FIGURE 31.2 The /ac repressor- 
DNA complex. The DNA-binding 
domain from a gene-regulatory protein, 
the /ac repressor, binds to a DNA 
fragment containing its preferred 
binding site (referred to as operator 
DNA) by inserting an « helix into the 
major groove of operator DNA. Notice 
that a specific contact forms between 
an arginine residue of the repressor 


age ian lac repressor 
and a G-C base pair in the binding ot helix 


site. [Drawn from 1EFA.pdb.] 


lac repressor CAP 


trp repressor 


The DNA-binding unit comes from a protein called the lac repressor, which 
represses the expression of the lactose-processing gene. As expected, this 
DNA-binding unit binds as a dimer, and the twofold axis of symmetry of 
the dimer matches the symmetry of the DNA. An a helix from each mono- 
mer of the protein is inserted into the major groove of the DNA, where 
amino acid side chains make specific contacts with exposed edges of the 
base pairs. For example, the side chain of an arginine residue of the protein 
forms a pair of hydrogen bonds with a guanine residue of the DNA, which 
would not be possible with any other base. This interaction and similar ones 
allow the lac repressor to bind more tightly to this site than to the wide range 
of other sites present in the E. coli genome. 


The helix-turn-helix motif is common to many prokaryotic 
DNA-binding proteins 


Are similar strategies utilized by other prokaryotic DNA-binding proteins? 
The structures of many such proteins have now been determined, and 
amino acid sequences are known for many more. Strikingly, the DNA- 
binding surfaces of many, but not all, of these proteins consist of a pair of a 
helices separated by a tight turn (Figure 31.3). In complexes with DNA, the 
second of these two helices (often called the recognition helix) lies in the 
major groove, where amino acid side chains make contact with the edges of 
base pairs. In contrast, residues of the first helix participate primarily in 
contacts with the DNA backbone. Helix-turn-helix motifs are present on 
many proteins that bind DNA as dimers, and thus two of the units will be 
present, one on each monomer. 

Although the helix-turn-helix motif is the most commonly observed 
DNA-binding unit in prokaryotes, not all regulatory proteins bind DNA 
through such units. A striking example is provided by the E. coli methio- 
nine repressor (Figure 31.4). This protein binds DNA through the insertion 
of a pair of B strands into the major groove. 


31.2 Prokaryotic DNA-Binding Proteins Bind Specifically 
to Regulatory Sites in Operons 


A historically important example reveals many common principles of gene 
regulation by DNA-binding proteins. Bacteria such as E. coli usually rely 
on glucose as their source of carbon and energy, even when other sugars are 
available. However, when glucose is scarce, E. coli can use lactose as their 
carbon source, even though this disaccharide does not lie on any major 
metabolic pathways. An essential enzyme in the metabolism of lactose is 
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FIGURE 31.3 Helix-turn-helix motif. 
These structures show three sequence- 
specific DNA-binding proteins that interact 
with DNA through a helix-turn-helix motif 
(highlighted in yellow). Notice that, in each 
case, the helix-turn-helix units within a 
protein dimer are approximately 34 A apart, 
corresponding to one full turn of DNA. 
[Drawn from 1EFA, 1RUN, and 1TRO.pdb.] 


FIGURE 31.4 DNA recognition 
through 8 strands. A methionine repressor 
is shown bound to DNA. Notice that 
residues in 8 strands, rather than in a 
helices, participate in the crucial 
interactions between the protein and 
the DNA. [Drawn from 1CMA.pdb.] 
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FIGURE 31.6 B-Galactosidase induction. 
The addition of lactose to an E. coli culture 
causes the production of B-galactosidase 
to increase from very low amounts to much 
larger amounts. The increase in the amount 
of enzyme parallels the increase in the 
number of cells in the growing culture. 
G-Galactosidase constitutes 6.6% of the 
total protein synthesized in the presence 

of lactose. 


928 


H 
N 
> 
\ Spontaneous 
Br dimerization 
and oxidation 
HO 


cl 


5,5'-Dibromo-4,4’-dichloro-indigo 


FIGURE 31.5 Monitoring the B-galactosidase reaction. The galactoside substrate X-Gal 
produces a colored product on cleavage by B-galactosidase. The appearance of this colored 
product provides a convenient means for monitoring the amount of the enzyme both in vitro 
and in vivo. 


B-galactosidase, which hydrolyzes lactose into galactose and glucose. These 
products are then metabolized by pathways discussed in Chapter 16. 
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This reaction can be conveniently followed in the laboratory through the 
use of alternative galactoside substrates that form colored products such as 
X-Gal (Figure 31.5). An E. coli cell growing on a carbon source such as 
glucose or glycerol contains fewer than 10 molecules of B-galactosidase. In 
contrast, the same cell will contain several thousand molecules of the 
enzyme when grown on lactose (Figure 31.6). The presence of lactose in the 
culture medium induces a large increase in the amount of B-galactosidase 
by eliciting the synthesis of new enzyme molecules rather than by activating 
a preexisting but inactive precursor. 

A crucial clue to the mechanism of gene regulation was the observation 
that two other proteins are synthesized in concert with B-galactosidase— 
namely, galactoside permease and thiogalactoside transacetylase. The permease 
is required for the transport of lactose across the bacterial cell membrane 
(Section 13.3). The transacetylase is not essential for lactose metabolism but 
appears to play a role in the detoxification of compounds that also may be 
transported by the permease. Thus, the expression levels of a set of enzymes 
that all contribute to the adaptation to a given change in the environment change 
together. Such a coordinated unit of gene expression is called an operon. 


An operon consists of regulatory elements and 
protein-encoding genes 


The parallel regulation of B-galactosidase, the permease, and the trans- 
acetylase suggested that the expression of genes encoding these enzymes is 
controlled by a common mechanism. Francois Jacob and Jacques Monod 
proposed the operon model to account for this parallel regulation as well as 
the results of other genetic experiments. The genetic elements of the model 
are a regulator gene that encodes a regulatory protein, a regulatory DNA 
sequence called an operator site, and a set of structural genes (Figure 31.7). 
The regulator gene encodes a repressor protein that binds to the operator 
site. The binding of the repressor to the operator prevents transcription of the 
structural genes. The operator and its associated structural genes constitute 
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FIGURE 31.7 Operons. (A) The general structure of an operon as conceived by Jacob and 
Monod. (B) The structure of the lactose operon. In addition to the promoter, p, in the operon, a 
second promoter is present in front of the regulator gene, i, to drive the synthesis of the regulator. 


the operon. For the lactose (lac) operon, the i gene encodes the repressor, o is 
the operator site, and the z, y, and a genes are the structural genes for 
B-galactosidase, the permease, and the transacetylase, respectively. The 
operon also contains a promoter site (denoted by p), which directs the RNA 
polymerase to the correct transcription initiation site. The z, y, and a genes 
are transcribed to give a single mRNA molecule that encodes all three pro- 
teins. An mRNA molecule encoding more than one protein is known as a 
polygenic or polycistronic transcript. 


The Jac repressor protein in the absence of lactose binds to the 
operator and blocks transcription 


In the absence of lactose, the lactose operon is repressed. How does the lac 
repressor mediate this repression? The lac repressor exists as a tetramer of 
37-kDa subunits with two pairs of subunits coming together to form the 
DNA-binding unit previously discussed. In the absence of lactose, the 
repressor binds very tightly and rapidly to the operator. When the lac 
repressor is bound to DNA, the repressor prevents RNA polymerase from 
transcribing the protein-coding genes inasmuch as the operator site is 
directly adjacent to and downstream of the promoter site where the repres- 
sor would block the progress of RNA polymerase. 

How does the lac repressor locate the operator site in the E. coli chromo- 
some? The lac repressor binds 4 X 10° times as strongly to operator DNA 
as it does to random sites in the genome. This high degree of selectivity 
allows the repressor to find the operator efficiently even with a large excess 
(4.6 X 10°) of other sites within the E. coli genome. The dissociation con- 
stant for the repressor—operator complex is approximately 0.1 pM (107!3M). 
The rate constant for association (~ 10!° M~! s~!) is strikingly high, indi- 
cating that the repressor finds the operator primarily by diffusing along a 
DNA molecule (a one-dimensional search) rather than encountering it from 
the aqueous medium (a three-dimensional search). This diffusion has been 
confirmed by studies that monitored the behavior of fluorescently labeled 
single molecules of lac repressor inside living E. coli cells. 

Inspection of the complete E. coli genome sequence reveals two sites 
within 500 bp of the primary operator site that approximate the sequence of 
the operator. When one dimeric DNA-binding unit binds to the operator 
site, the other DNA-binding unit of the lac repressor tetramer can bind to 
one of these sites with similar sequences. The DNA between the two bound 
sites forms a loop. No other sites that closely match the sequence of the lac 
operator site are present in the rest of the E. coli genome sequence. Thus, the 
DNA-binding specificity of the lac repressor is sufficient to specify a nearly 
unique site within the E. coli genome. 

The three-dimensional structure of the lac repressor has been deter- 
mined in various forms. Each monomer consists of a small amino-terminal 
domain that binds DNA and a larger domain that mediates the formation of 
the dimeric DNA-binding unit and the tetramer (Figure 31.8). A pair of the 
amino-terminal domains come together to form the functional DNA- 
binding unit. Each monomer has a helix-turn-helix unit that interacts with 
the major groove of the bound DNA. 


dom 


k& FIGURE 31.8 Structure of the 
lac repressor. A lac repressor dimer is 
shown bound to DNA. Notice that the 
amino-terminal domain binds to DNA, 
whereas the carboxyl-terminal domain 
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forms a separate structure. A part of the 
structure that mediates the formation of lac 
repressor tetramers is not shown. [Drawn 


from 1EFA.pdb.] 
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FIGURE 31.9 Effects of IPTG on Jac repressor 
structure. The structure of the /ac repressor 
bound to the inducer isopropylthiogalactoside 
(IPTG), shown in orange, is superimposed on the 
structure of the /ac repressor bound to DNA, 
shown in purple. Notice that the binding of IPTG 
induces structural changes that alter the relation 
between the two DNA-binding domains so that 
they cannot interact effectively with DNA. The 
DNA-binding domains of the fac repressor bound 
to IPTG are not shown, because these regions are 
not well ordered in the crystals studied. 
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Ligand binding can induce structural changes 
in regulatory proteins 


In the situation just described, glucose is present and lactose is absent, and 
the lac operon is repressed. How does the presence of lactose trigger the 
relief of this repression and, hence, the expression of the lac operon? 
Interestingly, lactose itself does not have this effect; rather, allolactose, a 
combination of galactose and glucose with an a-1,6 rather than an a-1,4 
linkage, does. Allolactose is thus referred to as the inducer of the lac operon. 
Allolactose is a side product of the B-galactosidase reaction and is pro- 
duced at low levels by the few molecules of B-galactosidase that are present 
before induction. Some other B-galactosides such as isopropylthiogalacto- 
side (IPTG) are potent inducers of B-galactosidase expression, although 
they are not substrates of the enzyme. IPTG is useful in the laboratory as 
a tool for inducing gene expression in engineered bacterial strains. 

The inducer triggers gene expression by preventing the lac repressor 
from binding the operator. The inducer binds to the lac repressor and 
thereby greatly reduces the repressor’s affinity for operator DNA. An 
inducer molecule binds in the center of the large domain within each 
monomer. This binding leads to conformational changes that modify 
the relation between the two small DNA-binding domains (Figure 
31.9). These domains can no longer easily contact DNA simultane- 
ously, leading to a dramatic reduction in DNA-binding affinity. 

Let us recapitulate the processes that regulate gene expression in the 
lactose operon (Figure 31.10). In the absence of inducer, the lac repres- 
sor is bound to DNA in a manner that blocks RNA polymerase from 
transcribing the z, y, and a genes. Thus, very little B-galactosidase, 
permease, or transacetylase are produced. The addition of lactose to the 
environment leads to the formation of allolactose. This inducer binds to 
the lac repressor, leading to conformational changes and the release of 
DNA by the lac repressor. With the operator site unoccupied, RNA 
polymerase can then transcribe the other lac genes and the bacterium 
will produce the proteins necessary for the efficient use of lactose. 


Bg The structure of the large domain of the lac repressor is similar to 

those of a large class of proteins that are present in EF. coli and 
other bacteria. This family of homologous proteins binds ligands such 
as sugars and amino acids at their centers. Remarkably, domains of this 
family are utilized by eukaryotes in taste proteins and in neurotransmit- 
ter receptors, as will be discussed in Chapter 33. 


The operon is a common regulatory unit in prokaryotes 


Many other gene-regulatory networks function in ways analogous to 
those of the lac operon. For example, genes taking part in purine and, 
to a lesser degree, pyrimidine biosynthesis are repressed by the 
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pur repressor. This dimeric protein is 31% identical in sequence with the 
lac repressor and has a similar three-dimensional structure. However, the 
behavior of the pur repressor is opposite that of the lac repressor: whereas 
the lac repressor is released from DNA by binding to a small molecule, the 
pur repressor binds DNA specifically, blocking transcription, only when bound 
to a small molecule. Such a small molecule is called a corepressor. For the pur 
repressor, the corepressor can be either guanine or hypoxanthine. The 
dimeric pur repressor binds to inverted-repeat DNA sites of the form 
5'’-ANGCAANCGNTTNCNT-3’, in which the bases shown in bold- 
face type are particularly important. Examination of the E. coli genome 
sequence reveals the presence of more than 20 such sites, regulating 
19 operons and including more than 25 genes (Figure 31.11). 


a Because the DNA-binding sites for these regulatory proteins are 
short, it is likely that they evolved independently and are not related 
by divergence from an ancestral regulatory site. Once a ligand-regulated 
DNA-binding protein is present in a cell, binding sites for the protein may 
arise by mutation adjacent to additional genes. Binding sites for the pur 
repressor have evolved in the regulatory regions of a wide range of genes 
taking part in nucleotide biosynthesis. All such genes can then be regulated 
in a concerted manner. 

The organization of prokaryotic genes into operons is useful for the 
analysis of completed genome sequences. Sometimes a gene of unknown 
function is discovered to be part of an operon containing well-characterized 
genes. Such associations can provide powerful clues to the biochemical and 
physiological functions of the uncharacterized gene. 


Transcription can be stimulated by proteins that contact 
RNA polymerase 


All the DNA-binding proteins discussed thus far function by inhibiting 
transcription until some environmental condition, such as the presence of 
lactose, is met. There are also DNA-binding proteins that stimulate tran- 
scription. One particularly well studied example is a protein in E. coli that 
stimulates the expression of catabolic enzymes. 

E. coli grown on glucose, a preferred energy source, have very low levels 
of catabolic enzymes for metabolizing other sugars. Clearly, the synthesis of 
these enzymes when glucose is abundant would be wasteful. Glucose has an 
inhibitory effect on the genes encoding these enzymes, an effect called 
catabolite repression. It is due to the fact that glucose lowers the concentra- 
tion of cyclic AMP in E. coli. When the cAMP concentration is high, it 
stimulates the concerted transcription of many catabolic enzymes by acting 
through a protein called the catabolite activator protein (CAP), which is also 
known as the cAMP receptor protein (CRP). 

When bound to cAMP, CAP stimulates the transcription of lactose- 
and arabinose-catabolizing genes. CAP is a sequence-specific DNA- 
binding protein. Within the lac operon, CAP binds to an inverted repeat 
that is centered near position — 61 relative to the start site for transcription 
(Figure 31.12). This site is approximately 70 base pairs from the operator 
site. As expected from the symmetry of the binding site, CAP functions as 
a dimer of identical subunits. 

The CAP-cAMP complex stimulates the initiation of transcription by 
approximately a factor of 50. Energetically favorable contacts between CAP 
and RNA polymerase increase the likelihood that transcription will be initi- 
ated at sites to which the CAP-cAMP complex is bound (Figure 31.13). 
Thus, in regard to the lac operon, gene expression is maximal when the 
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pur operator lac operator 


FIGURE 31.11 Binding-site distributions. 
The E. coli genome contains only a single 
region that closely matches the sequence of 
the lac operator (shown in blue). In contrast, 
20 sites match the sequence of the pur 
operator (shown in red). Thus, the pur 
repressor regulates the expression of many 
more genes than does the /ac repressor. 


polymerase 


Start site 
for transcription 


FIGURE 31.12 Binding site for catabolite 
activator protein (CAP). This protein binds 
as a dimer to an inverted repeat that is at 
the position —61 relative to the start site of 
transcription. The CAP-binding site on DNA 
is adjacent to the position at which RNA 
polymerase binds. 
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kK FIGURE 31.13 Structure of a dimer 
of CAP bound to DNA. The residues 
shown in yellow in each CAP monomer have 
been implicated in direct interactions with 
RNA polymerase. [Drawn from 1RUN.pdb.] 


hk FIGURE 31.14 Structure of the A 
repressor bound to DNA. The X repressor 
binds to DNA as a dimer. The amino- 
terminal domain of one subunit is shown in 
red and the carboxyl-terminal domain is 
shown in blue. In the other subunit, both 
domains are shown in yellow. Notice how a 
helices on the amino-terminal domains fit 
into the major groove of the DNA. [Drawn 
from 3DBN.pdb.] 


binding of allolactose relieves the inhibition by the lac repressor and the 
CAP-cAMP complex stimulates the binding of RNA polymerase. 

The E. coli genome contains many CAP-binding sites in positions 
appropriate for interactions with RNA polymerase. Thus, an increase in 
the cAMP level inside an E. coli bacterium results in the formation of 
CAP-cAMP complexes that bind to many promoters and stimulate the 
transcription of genes encoding a variety of catabolic enzymes. 


31.3 Regulatory Circuits Can Result in Switching 
Between Patterns of Gene Expression 


The study of viruses that infect bacteria has led to significant advances in 
our understanding of the processes that control gene expression. Again, 
sequence-specific DNA-binding proteins play key roles in these processes. 
Investigations of bacteriophage \ have been particularly revealing. We 
examined the alternative infection modes of \ phage in Chapter 5. In the 
lytic pathway, most of the genes in the viral genome are transcribed, initiat- 
ing the production of many virus particles and leading to the eventual lysis 
of the bacterial cell with the concomitant release of approximately 100 virus 
particles. In the lysogenic pathway, the viral genome is incorporated into the 
bacterial DNA where most of the viral genes remain unexpressed, allowing 
the viral genome to be carried along as the bacteria replicate. Two key pro- 
teins and a set of regulatory sequences in the viral genome are responsible 
for the switch that determines which of these two pathways is followed. 


The X repressor regulates its own expression 


The first protein that we shall consider is the A repressor, sometimes known as 
the d cl protein. This protein is key because it blocks, either directly or indi- 
rectly, the transcription of almost all genes encoded by the virus. The one 
exception is the gene that encodes the A repressor itself. The \ repressor con- 
sists of an amino-terminal DNA-binding domain and a carboxyl-terminal 
domain that participates in protein oligomerization (Figure 31.14). This pro- 
tein binds to a number of key sites in the \ phage genome. The sites of greatest 
interest for our present discussion are in the so-called right operator 
(Figure 31.15). This region includes three binding sites for the \ repressor 
dimer, as well as two promoters within a region of approximately 80 base 
pairs. One promoter drives the expression of the gene for the \ repressor 
itself, whereas the other drives the expression of a number of other viral genes. 
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FIGURE 31.15 Sequence of the A right operator. The three operator sites (O_1, Op2, and 
O,3) are shaded yellow with their centers indicated. The start sites for the \ repressor MRNA 
and the Cro mRNA are indicated, as are their —-10 and —35 sequences. 


The X repressor does not have the same affinity for the three sites; it 
binds the site Op1 with the highest affinity. In addition, the binding to 
adjacent sites is cooperative so that, after a \ repressor dimer has bound at 
Opi, the likelihood that a protein will bind to the adjacent site Op2 
increases by approximately 25-fold. Thus, when \ repressor is present in 
the cell at moderate concentrations, the most likely configuration has \ 
repressor bound at Op! and Op2, but not at Op3. In this configuration, the 
\ repressor dimer bound at Op1 blocks access to the promoter on the right 
side of the operator sites, repressing transcription of the adjacent gene, 
which encodes a protein termed Cro (controller of repressor and others), 
while the repressor dimer at Op2 can be in contact with RNA polymerase 
and stimulate transcription of the promoter that controls the transcription 
of the gene that encodes the J repressor itself. 

Thus, the d repressor stimulates its own production. As the concentration 
of the \ repressor increases further, an additional repressor dimer can bind to 
the Op3 site, blocking the other promoter and repressing the production of 
additional repressor. Thus, the right operator serves to maintain the \ repres- 
sor in a narrow, stable concentration range (Figure 31.16). The A repressor 
also blocks other promoters in the \ phage genome so that the repressor is the 
only phage protein produced, which corresponds to the lysogenic state. 


A circuit based on the \ repressor and Cro forms a genetic switch 


What stimulates the switch to the lytic pathway? Changes such as DNA 
damage initiate the cleavage of the \ repressor at a specific bond between 
the DNA-binding and oligomerization domains. This process is mediated 
by the E. coli RecA protein (Section 28.5). After this cleavage has taken 
place, the affinity of the \ repressor for DNA is reduced. After the A repres- 
sor is no longer bound to the Op! site, the Cro gene can be transcribed. Cro 
is a small protein that binds to the same sites as the \ repressor does, but 
with a different order of affinity for the three sites in the right operator. In 
particular, Cro has the highest affinity for Op3. Cro bound in this site 
blocks the production of new repressor. The absence of repressor leads to 
the production of other phage genes, leading to the production of virus par- 
ticles and the eventual lysis of the host cells. Thus, this genetic circuit acts 
as a switch with two stable states: (1) repressor high, Cro low, corresponding 
to the lysogenic state, and (2) Cro high, repressor low, corresponding to the 
lytic state (Figure 31.17). Regulatory circuits with different DNA-binding 
proteins controlling the expression of each other’s genes constitute a com- 
mon motif for controlling gene expression. 


Many prokaryotic cells release chemical signals that regulate 
gene expression in other cells 


Prokaryotic cells have been traditionally viewed as solitary single cells. 
However, it is becoming increasingly clear that, in many circumstances, 
prokaryotic cells live in complex communities, interacting with other cells 
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FIGURE 31.16 The A repressor controls 
its own synthesis. (A) When \ repressor 
levels are relatively low, the repressor binds 
0 sites Op1 and Op2 and stimulates the 
ranscription of the gene that encodes the 
d repressor itself. (B) When \ repressor 
levels are higher, the repressor also binds 
0 site O,3, blocking access to its 
promoter and repressing transcription 

rom this gene. 
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FIGURE 31.17 The A repressor and Cro 
form a genetic circuit. The \ repressor 
blocks the production of Cro by binding 
most favorably to site O,1 whereas Cro 
blocks the production of the \ repressor by 
binding most favorably to site O,3. This 
circuit forms a switch that determines 
whether the lysogenic or the lytic pathway 
is followed. 
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Acyl-homoserine lactone (AHL) 


FIGURE 31.18 Autoinducer structure. 
The structure of the acyl-homoserine 
lactone N-3-oxo-hexanoyl homoserine 
lactone, the autoinducer from V. fischeri. 
The autoinducers from other bacterial 
species can have different acyl groups 
(shown in red). 


FIGURE 31.19 Quorum-sensing gene regulator. The structure of a 
homolog of LuxR (TraR from the bacterium Agrobacterium tumefaciens) is 


of their own and different species. These social interactions change the pat- 
terns of gene expression within the cells. 

An important type of interaction is called quorum sensing. This phenom- 
enon was discovered in the bacteria Vibrio fischeri, a species of bacterium 
that can live inside a specialized light organ in the bobtail squid. In this sym- 
biotic relation, the bacteria produce luciferase and bioluminesce, providing 
protection for the squid (by preventing being backlit by moonlight) in 
exchange for a safer place to live and reproduce. When these bacteria are 
grown in culture at low density, they are not bioluminescent. However, when 
the cell density reaches a critical level, the gene for luciferase is expressed and 
the cells bioluminesce. A key observation was that, when V. fischeri cells 
were transferred to a sterile medium in which other V. fischeri cells had been 
grown to high density, the cells became bioluminescent even at low cell den- 
sity. This experiment revealed that a chemical, subsequently shown to be 
N-3-oxo-hexanoyl homoserine lactone (hereafter AHL for acyl-homoserine 
lactone), had been released into the medium where it triggered the develop- 
ment of the bioluminescence (Figure 31.18). This compound and other 
compounds that play similar roles are termed autoinducers. 

Cells of V. fischeri release the autoinducer into their environment and 
other V. fischeri cells take up the chemical. V. fischeri cells express a DNA- 
binding protein LuxR that serves as the receptor for the autoinducer. LuxR 
comprises two domains, one of which binds AHL and the other of which 
binds DNA through a helix-turn-helix motif (Figure 31.19). After the AHL 
concentration inside the cell has increased to an appropriate level, a substan- 
tial fraction of the LuxR molecules bind AHL. When bound to AHL, LuxR 
dimers bind to specific sites on DNA and increase the rate of transcription 
initiation at specific genes. The target genes include an operon that contains 
LuxA and LuxB, which together encode the luciferase enzyme, and LuxI, 
which produces an enzyme that catalyzes the formation of more AHL. 

Because each cell produces only a small amount of the autoinducer, this 
regulatory system allows each V. fischeri cell to determine the density of the 
V. fischeri population in its environment—hence the term quorum sensing for 
this process. Studies of other prokaryotic cells are revealing an elaborate 
chemical language of different autoinducers (as well as autorepressors that 
repress specific genes). The “words” in this language include other acyl- 
homoserine lactone molecules with different acyl 
chain lengths and functionalities, as well as other 
distinct classes of molecules. 


Biofilms are complex communities of prokaryotes 


Many species of prokaryotes can be found in special- 
ized structures termed biofilms that can form on 
surfaces. Biofilms are of considerable medical impor- 
tance because organisms within them are often quite 
resistant to the immune response of the host as well 
as to antibiotics. Quorum sensing appears to play a 
major role in the formation of biofilms, in that cells 
are able to sense other cells in their environments 
and to promote the formation of communities with 
particular compositions. Some genes controlled by 
quorum-sensing mechanisms promote the forma- 
tion of specific molecules that serve as scaffolds for 
the biofilm. An intriguing recent discovery is that 


Autoinducer 


shown. Notice that the dimeric protein binds to DNA through an a-helical many. of the organisms that are present in biofilms on 
domain, whereas the autoinducer binds to a separate domain. or in our bodies (perhaps 95% or more) have not been 


grown in culture. Through DNA-sequencing methods, we are developing a 935 
better census of our microbiome and are working toward understanding the 31.4 Posttranscriptional Gene Regulation 
gene-regulatory mechanisms that support these complex communities. 


31.4 Gene Expression Can Be Controlled at 
Posttranscriptional Levels 


The modulation of the rate of transcription initiation is the most common 
mechanism of gene regulation. However, other stages of transcription also 
can be targets for regulation. In addition, the process of translation provides 
other points of intervention for regulating the level of a protein produced in 
a cell. In Chapter 29, we considered riboswitches that control transcription 
termination (Section 29.1). Other riboswitches control gene expression by 
other mechanisms such as the formation of structures that inhibit transla- 
tion. Additional mechanisms for posttranscriptional gene regulation have 
been discovered, one of which will be described here. 


Attenuation is a prokaryotic mechanism for regulating transcription 
through the modulation of nascent RNA secondary structure 


A means for regulating transcription in bacteria was discovered by Charles 
Yanofsky and his colleagues as a result of their studies of the tryptophan 
operon. This operon encodes five enzymes that convert chorismate into tryp- 
tophan. Analysis of the 5’ end of trp mRNA revealed the presence of a leader 
sequence of 162 nucleotides before the initiation codon of the first enzyme. The 
next striking observation was that bacteria produced a transcript consisting of 
only the first 130 nucleotides when the tryptophan level was high, but they 
produced a 7000-nucleotide trp mRNA, including the entire leader sequence, 


when tryptophan was scarce. Thus, when tryptophan is plentiful and the bio- RSE Sty beech een er ue 


mRNA. (A) The nucleotide sequence of the 


synthetic enzymes are not needed, transcription is abruptly broken off before 5? end of tro mRNA includes a short open 
any mRNA coding for the enzymes is produced. The site of termination is reading frame that encodes a peptide 
called the attenuator, and this mode of regulation is called attenuation. comprising 14 amino acids; the leader 
Attenuation depends on features at the 5’ end of the mRNA product encodes two tryptophan residues and has 
(Figure 31.20). The first part of the leader sequence encodes a 14-amino-acid Sel heb balesienenal uals! aoa e. 


‘ ; ‘ ; ‘ includes a region that can form a terminator 
leader peptide. Following the open reading frame for the peptide is the structure (tad and blue). (8 and’C) The 
attenuator region can adopt either of two 
distinct stem-loop structures. 
(A) 
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FIGURE 31.21 Attenuation. (A) In the presence of adequate concentrations of tryptophan 
(and, hence, Trp-tRNA), translation proceeds rapidly and an RNA structure forms that 
terminates transcription. (B) At low concentrations of tryptophan, translation stalls while 
awaiting Trp-tRNA, giving time for an alternative RNA structure to form that prevents the 
formation of the terminator and transcription can proceed. 


attenuator, a region of RNA that is capable of forming several alternative 
structures. Recall that transcription and translation are tightly coupled 
in bacteria. Thus, the translation of the trp mRNA begins soon after the 
ribosome-binding site has been synthesized. 

How does the level of tryptophan alter transcription of the trp operon? An 
important clue was the finding that the 14-amino-acid leader peptide 
includes two adjacent tryptophan residues. A ribosome is able to translate the 
leader region of the mRNA product only in the presence of adequate concen- 
trations of tryptophan. When enough tryptophan is present, a stem-loop 
structure forms in the attenuator region, which leads to the release of RNA 
polymerase from the DNA (Figure 31.21). However, when tryptophan is 
scarce, transcription is terminated less frequently. Little tryptophanyl-tRNA 
is present, and so the ribosome stalls at the tandem UGG codons encoding 
tryptophan. This delay leaves the adjacent region of the mRNA exposed as 
transcription continues. An alternative RNA structure that does not function 
as a terminator is formed, and transcription continues into and through the 
coding regions for the enzymes. Thus, attenuation provides an elegant means 
of sensing the supply of tryptophan required for protein synthesis. 

Several other operons for the biosynthesis of amino acids in E. coli also are 
regulated by attenuator sites. The leader peptide of each contains an abun- 
dance of the amino acid residues of the type synthesized by the operon 
(Figure 31.22). For example, the leader peptide for the phenylalanine 
operon includes 7 phenylalanine residues among 15 residues. The threonine 
operon encodes enzymes required for the synthesis of both threonine and 
isoleucine; the leader peptide contains 8 threonine and 4 isoleucine residues 
ina 16-residue sequence. The leader peptide for the histidine operon includes 


Met - Lys - Arg - lle - Ser - Thr - Thr - lle - Thr - Thr - Thr - lle - Thr - Ile - Thr - Thr - 
AUG AAA CGC AUU AGC ACC ACC AUU ACC ACC ACC AUC ACC AUU ACC ACA 3’ 


Met - Lys - His - lle - Pro - Phe - Phe - Phe - Ala - Phe - Phe - Phe - Thr - Phe - Pro - Stop 
AUG AAA CAC AUA CCG UUU UUC UUC GCA UUC UUU UUU ACC UUC CCC UGA 3’ 


Met - Thr - Arg - Val - Gln - Phe - Lys - His - His - His - His - His - His - His - Pro - Asp- 
AUG ACA CGC GUU CAA UUU AAA CAC CAC CAU CAU CAC CAU CAU CCU GAC 3’ 


FIGURE 31.22 Leader peptide sequences. Amino acid sequences and the corresponding 
mRNA nucleotide sequences of the (A) threonine operon, (B) phenylalanine operon, and 

(C) histidine operon. In each case, an abundance of one amino acid in the leader peptide 
sequence leads to attenuation. 


7 histidine residues in a row. In each case, low levels of the corresponding 
charged tRNA cause the ribosome to stall, trapping the nascent mRNA ina 
state that can form a structure that allows RNA polymerase to read through 
the attenuator site. Evolution has apparently converged on this strategy 
repeatedly as a mechanism for controlling amino acid biosynthesis. 


a The examples of prokaryotic gene regulation that we have been 

describing come from bacteria, as opposed to archaea. The transcrip- 
tional apparatus present in archaea shares many features with that found in 
eukaryotes. This commonality is frequently interpreted to suggest that 
eukaryotes evolved after a cell fusion event in which a bacterial cell was 
engulfed by an archaeal cell. Nonetheless, the key principles of gene regula- 
tion such as the occurrence of operons and the roles of DNA-binding pro- 
teins in directly blocking or stimulating RNA polymerase are the same in 
bacteria and archaea, perhaps because of similar genome sizes. As we shall 
see in the next chapter, eukaryotic cells with larger genomes and, often, 
many distinct cell types, utilize quite different strategies. 


SUMMARY 


31.1 Many DNA-Binding Proteins Recognize Specific DNA Sequences 

The regulation of gene expression depends on the interplay between 
specific sequences within the genome and proteins that bind specifi- 
cally to these sites. Specific DNA-binding proteins recognize regula- 
tory sites that usually lie adjacent to the genes whose transcription is 
regulated by these proteins. The proteins of the largest family contain 
a helix-turn-helix motif. The first helix of this motif inserts into the 
major groove of DNA and makes specific hydrogen-bonding and 
other contacts with the edges of the base pairs. 


31.2 Prokaryotic DNA-Binding Proteins Bind Specifically to 

Regulatory Sites in Operons 

In prokaryotes, many genes are clustered into operons, which are units 
of coordinated genetic expression. An operon consists of control sites 
(an operator and a promoter) and a set of structural genes. In addition, 
regulator genes encode proteins that interact with the operator and pro- 
moter sites to stimulate or inhibit transcription. The treatment of E. coli 
with lactose induces an increase in the production of B-galactosidase 
and two additional proteins that are encoded in the lactose operon. 
In the absence of lactose or a similar galactoside inducer, the lac repres- 
sor protein binds to an operator site on the DNA and blocks trans- 
cription. The binding of allolactose, a derivative of lactose, to the lac 
repressor induces a conformational change that leads to dissociation 
from DNA. RNA polymerase can then move through the operator to 
transcribe the lac operon. 

Some proteins activate transcription by directly contacting RNA 
polymerase. For example, cyclic AMP stimulates the transcription of 
many catabolic operons by binding to the catabolite activator protein. 
The binding of the cAMP-CAP complex to a specific site in the pro- 
moter region of an inducible catabolic operon enhances the binding of 
RNA polymerase and the initiation of transcription. 


31.3 Regulatory Circuits Can Result in Switching Between 
Patterns of Gene Expression 
The study of bacterial viruses, particularly bacteriophage A, has 
revealed key aspects of gene-regulatory networks. Bacteriophage \ can 


937 


Summary 


938 


CHAPTER 31 The Control of Gene 
Expression in Prokaryotes 


KEY TERMS 


31.4 


develop by either a lytic or a lysogenic pathway. A key regulatory 
protein, the \ repressor, regulates its own expression, promoting 
transcription of the gene that encodes the repressor when repressor 
levels are low and blocking transcription of the gene when levels are 
high. This behavior depends on the } right operator, which includes 
three sites to which J repressor dimers can bind. Cooperative binding 
of the X repressor to two of the sites stabilizes the state in which two X 
repressor dimers are bound. The Cro protein binds to the same sites as 
does the \ repressor, but with reversed affinities. When Cro is present 
at sufficient concentrations, it blocks the transcription of the gene for 
the \ repressor while allowing the transcription of its own gene. Thus, 
these two proteins and the operator form a genetic switch that can exist 
in either of two states. 

Some prokaryotic species participate in quorum sensing. This pro- 
cess includes the release of chemicals called autoinducers into the 
medium surrounding the cells. These autoinducers are often, but not 
always, acyl homoserine lactones. Autoinducers are taken up by sur- 
rounding cells. When the autoinducer concentration reaches an appro- 
priate level, it is bound by receptor proteins that activate the expres- 
sion of genes, including those that promote the synthesis of more 
autoinducer. These chemically mediated social interactions allow 
these prokaryotes to change their gene-expression patterns in response 
to the number of other cells in their environments. Biofilms are 
complex communities of prokaryotes that are promoted by quorum- 
sensing mechanisms. 


Gene Expression Can Be Controlled at Posttranscriptional Levels 

Gene expression can also be regulated at the level of translation. In 
prokaryotes, many operons important in amino acid biosynthesis are 
regulated by attenuation, a process that depends on the formation of 
alternative structures in mRNA, one of which favors the termination 
of transcription. Attenuation 1s mediated by the translation of a leader 
region of mRNA. A ribosome stalled by the absence of an aminoacyl- 
tRNA needed to translate the leader mRNA alters the structure of 
mRNA, allowing RNA polymerase to transcribe the operon beyond 
the attenuator site. 


helix-turn-helix motif (p. 927) 
B-galactosidase (p. 928) 
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repressor (p. 928) 
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lac operator (p. 929) 
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PROBLEMS 
1. Missing genes. Predict the effects of deleting the follow- 2. Minimal concentration. Calculate the concentration 
ing regions of DNA: of lac repressor, assuming that one molecule is present 


(a) The gene encoding lac repressor 
(b) The lac operator 
(c) The gene encoding CAP 


per cell. Assume that each EF. coli cell has a volume of 
10~!2 cm?. Would you expect the single molecule to be free 
or bound to DNA? 


3. Counting sites. Calculate the expected number of times 
that a given 8-base-pair DNA site should be present in 
the E. coli genome. Assume that all four bases are equally 
probable. Repeat for a 10-base-pair site and a 12-base- 
pair site. 


4. The same but not the same. The lac repressor and the pur 
repressor are homologous proteins with very similar three- 
dimensional structures, yet they have different effects on 
gene expression. Describe two important ways in which the 
gene-regulatory properties of these proteins differ. 


5. The opposite direction. Some compounds called anti- 
inducers bind to repressors such as the lac repressor and 
inhibit the action of inducers; that is, transcription is 
repressed and higher concentrations of inducer are required 
to induce transcription. Propose a mechanism of action for 
anti-inducers. 


6. Inverted repeats. Suppose that a nearly perfect 20-base- 
pair inverted repeat is observed in a DNA sequence. 
Provide two possible explanations. 


7. Broken operators. Consider a hypothetical mutation in 
Ox2 that blocks both \ repressor and Cro binding. How 
would this mutation affect the likelihood of bacteriophage 
entering the lytic phase? 


8. Promoters. Compare the —10 and —35 sequences for the 
\ repressor and Cro genes in the right operator. How many 
differences are there between these sequences? 


9. Positive and negative feedback. What is the effect of an 
increased Cro concentration on the expression of the gene 
for the \ repressor? Of an increased concentration of 
repressor on the expression of the Cro gene? Of an increased 
concentration of \ repressor on the expression of the d 
repressor gene? 


10. Leaderless. The mRNA for the \ repressor begins with 
5'-AUG-3', which encodes the methionine residue that 
begins the protein. What is unusual about this beginning? 
Would it cause the mRNA to translate efficiently or not? 


11. Quorum count. Suppose you have a series of compounds 
that you wish to test for the autoinducer activity in Vibrio 
fischeri. Propose a simple assay, assuming that you can grow 
V. fischeri cultures at low cell densities. 


12. Codon utilization. There are four codons that encode 
threonine. Consider the leader sequence in Figure 31.22A. 
What codons are used and with what frequency? 


Mechanism Problem 


13. Follow the stereochemistry. The hydrolysis of lactose is 
catalyzed by B-galactosidase. Does the overall reaction pro- 
ceed with retention or inversion of configuration? Given 
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Problems 


that each step likely proceeds with inversion of configura- 
tion, what does the overall change in stereochemistry sug- 
gest about the mechanism? A key residue in the reaction has 
been identified to be Glu 537. Propose an overall mecha- 
nism for the hydrolysis of lactose. 


HO 
OH 
OH 
O 
HO or? OH + H,0 ——> 
OH 
HO 
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OH 
oO 
HO oH + HOHO OH 
O 
OH 
HO 
Galactose Glucose 


Data Interpretation Problem 


14. Leaving tracks. A powerful method for examining 
protein-DNA interactions is called DNA footprinting. In 
this method, a DNA fragment containing a potential bind- 
ing site is radiolabeled on one end. The labeled DNA is 
then treated with a DNA-cleaving agent such as DNase I 
such that each DNA molecule within the population is cut 
only once. The same cleavage process is carried out in the 
presence of the DNA-binding protein. The bound protein 
protects some sites within the DNA from cleavage. The 
patterns of DNA fragments in the cleaved pool of DNA 
molecules are then examined by electrophoresis followed 
by autoradiography (a). 
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A repressor concentration 


This method is applied to a DNA fragment containing a 
single binding site for the \ repressor in the presence of dif- 
ferent concentrations of the \ repressor. The results are 


shown below (b): 
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Estimate the dissociation constant for the \ repressor— 
DNA complex and the standard free energy of binding. 


The Control of Gene 
Expression in Eukaryotes 


Complex biological processes often require coordinated control of the expression 
of many genes. The maturation of a tadpole into a frog is largely controlled by 
thyroid hormone. This hormone regulates gene expression by binding to a protein, 
the thyroid-hormone receptor, as shown at the right. In response to the hormone’s 
binding, this protein binds to specific DNA sites in the genome and modulates the 
expression of nearby genes. [(Left) Gary Meszaros/Getty Images.] 


M any of the most important and intriguing features in modern biology 
and medicine, such as the pathways crucial for the development of 
multicellular organisms, the changes that distinguish normal cells and 
cancer cells, and the evolutionary changes leading to new species, entail 
networks of gene-regulatory pathways. Gene regulation in eukaryotes is 
significantly more complex than in prokaryotes in several ways. First, the 
genomes being regulated are significantly larger. The FE. coli genome con- 
sists of a single, circular chromosome containing 4.6 megabases (Mb). 
This genome encodes approximately 2000 proteins. In comparison, one of 
the simplest eukaryotes, Saccharomyces cerevisiae (baker’s yeast), contains 
16 chromosomes ranging in size from 0.2 to 2.2 Mb (Figure 32.1). The 
yeast genome totals 12 Mb and encodes approximately 6000 proteins. The 
genome within a human cell contains 23 pairs of chromosomes ranging in 
size from 50 to 250 Mb. Approximately 21,000 genes are present within 
the 3000 Mb of human DNA. 

Second, whereas prokaryotic genomic DNA is relatively accessible, 
eukaryotic DNA is packaged into chromatin, a complex between the DNA 
and a special set of proteins (Figure 32.2). Although the principles for the 
construction of chromatin are relatively simple, the chromatin structure for 
a complete genome is quite complex. Importantly, in a given eukaryotic cell, 
some genes and their associated regulatory regions are relatively accessible 
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(if double stranded) or 10° bases (if single 
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FIGURE 32.1 Yeast chromosomes. 
Pulsed-field electrophoresis allows the 
separation of 16 yeast chromosomes. [From 
G. Chu, D. Vollrath, and R. W. Davis, Science 
234:1582-1585, 1986.] 


FIGURE 32.2 Chromatin structure. An 
electron micrograph of chromatin showing 
its “beads on a string” character. The 
beads correspond to DNA complexes with 
specific proteins. [Don W. Fawcett/Science 
Source.] 


for transcription and regulation, whereas other genes are tightly packaged 
and are thus rendered inactive. Eukaryotic gene regulation frequently 
requires the manipulation of chromatin structure. 

A manifestation of this complexity is the presence of many different cell 
types in most eukaryotes. A liver cell, a pancreatic cell, and an embryonic 
stem cell contain the same DNA sequences, yet the subset of genes highly 
expressed in cells from the pancreas, which secretes digestive enzymes, dif- 
fers markedly from the subset highly expressed in the liver, the site of lipid 
transport and energy transduction. Embryonic stem cells do not express any 
subset of genes at high levels; the most highly expressed genes are “house- 
keeping” genes involved in the cytoskeleton and processes such as transla- 
tion (Table 32.1). The existence of stable cell types is due to differences in 
the epigenome, differences in chromatin structure, and covalent modifica- 
tions of the DNA, not in the DNA sequence itself. Thus, different cell types 
share the same genome (DNA sequence) but differ in their epigenomes, the 
packaging and modification of this genome. 

In addition, eukaryotic genes are not generally organized into operons. 
Instead, genes that encode proteins for steps within a given pathway are 
often spread widely across the genome. This characteristic requires that 
other mechanisms function to regulate genes in a coordinated way. 

Despite these differences, some aspects of gene regulation in eukary- 
otes are quite similar to those in prokaryotes. In particular, activator and 
repressor proteins that recognize specific DNA sequences are central to 
many gene-regulatory processes. In this chapter, we will focus first on 
chromatin structure. We will then turn to transcription factors—DNA- 
binding proteins similar in many ways to the prokaryotic proteins that we 
encountered in the preceding chapter. Eukaryotic transcription factors 
can act directly by interacting with the transcriptional machinery or indi- 
rectly by influencing chromatin structure. Finally, we examine selected 


TABLE 32.1 Highly expressed protein-encoding genes of the pancreas, liver, and 
embryonic stem cells (as percentage of total mRNA pool) 


Proteins expressed Proteins expressed Proteins expressed 
Rank in pancreas % in liver % in stem cells % 
1 Procarboxypeptidase 7.6 Albumin 3.5 Glyceraldehyde-3- 0.7 
Al phosphate 
dehydrogenase 
2 Pancreatic 5.5 Apolipoprotein 2.8 Translation 0.6 
trypsinogen 2 A-I elongation 
factor 1 a1 
3 Chymotrypsinogen 4.4 Apolipoprotein 2.5 Tubulina 0.5 
C-I 
4 Pancreatic trypsin 1 3.7 Apolipoprotein 2.1. Translationally 0.5 
C-I1 controlled tumor 
protein 
5 Elastase IIIB 2.4 ATPase 6/8 1.5. Cyclophilin A 0.4 
6 Protease E 1.9 Cytochrome 1.1. Cofilin 0.4 
oxidase 3 
7 Pancreatic lipase 1.9 Cytochrome 1.1. Nucleophosmin 0.3 
oxidase 2 
8 Procarboxypeptidase 1.7 a,-Antitrypsin 1.0 Connexin 43 0.3 
B 
9 Pancreatic amylase 1.7. Cytochrome 0.9  Phosphoglycerate 0.2 
oxidase 1 mutase 


10 Bile-salt-stimulated 1.4 Apolipoprotein E 0.9 ‘Translation elongation 0.2 
lipase factor 1 B2 


Sources: Data for pancreas from V. E. Velculescu, L. Zhang, B. Vogelstein, and K. W. Kinzler, Science 
270:484—487, 1995. Data for liver from T. Yamashita, S. Hashimoto, S. Kaneko, S. Nagai, N. Toyoda, 

T. Suzuki, K. Kobayashi, and K. Matsushima, Biochem. Biophys. Res. Commun. 269:110-116, 2000. Data for 
stem cells from M. Richards, S. P. Tan, J. H. Tan, W. K. Chan, and A. Bongso, Stem Cells 22:51-64, 2004. 


posttranscriptional gene-regulatory mechanisms, including those based 
on microRNAs, an important class of gene-regulatory molecules discov- 
ered in recent years. 


32.1 Eukaryotic DNA Is Organized into Chromatin 


Eukaryotic DNA is tightly bound to a group of small basic proteins called 
histones. In fact, histones constitute half the mass of a eukaryotic chromo- 
some. The entire complex of a cell’s DNA and associated protein is called 
chromatin. Chromatin compacts and organizes eukaryotic DNA and its 
presence has dramatic consequences for gene regulation. 


Nucleosomes are complexes of DNA and histones 


Chromatin is made up of repeating units, each containing 200 bp of DNA and 
two copies each of four histone proteins H2A, H2B, H3, and H4. Histones 
have strikingly basic properties because a quarter of the residues in each his- 
tone are either arginine or lysine, positively charged amino acids that strongly 
interact with the negatively charged DNA. The protein complex is called the 
histone octamer. The repeating units of the histone octamer and the associated 
DNA are known as nucleosomes. Chromatin viewed with the electron micro- 
scope has the appearance of beads on a string (Figure 32.2); each bead has a 
diameter of approximately 100 A. Partial digestion of chromatin with DNase 
yields the isolated beads. These particles consist of fragments of DNA about 
200 bp in length bound to the histone octamer. More-extensive digestion 
yields a shorter DNA fragment of 145 bp bound to the octamer. The smaller 
complex formed by the histone octamer and the 145-bp DNA fragment is the 
nucleosome core particle. The DNA connecting core particles in undigested 
chromatin is called linker DNA. Histone H1 binds, in part, to the linker DNA. 


DNA wraps around histone octamers to form nucleosomes 


The overall structure of the nucleosome was revealed through electron 
microscopic and x-ray crystallographic studies pioneered by Aaron Klug 
and his colleagues. More recently, the three-dimensional structure of a 
reconstituted nucleosome core particle (Figure 32.3) was determined to 


Amino-terminal 


(A) (B) 


& FIGURE 32.3 Nucleosome core particle. The structure consists of a core of eight 
histone proteins surrounded by DNA. (A) A view showing the DNA wrapping around the 
histone core. (B) A 90-degree rotation of the view in part (A). Notice that the DNA forms a left- 
handed superhelix as it wraps around the core. (C) A schematic view. [Drawn from 1AOl.pdb.] 
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(36 nm) 


FIGURE 32.5 Higher-order chromatin 
structure. A proposed model for chromatin 
arranged in a helical array consisting of six 
nucleosomes per turn of helix. The DNA 
double helix (shown in red) is wound 
around each histone octamer (shown in 
blue). The actual structure in cells is likely 
more heterogeneous and dynamic than 
suggested by this model. [Information from 
J. T. Finch and A. Klug, Proc. Natl. Acad. Sci. 
U. S. A. 73:1897-1901, 1976.] 
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H2B H3 H4 


h FIGURE 32.4 Homologous histones. Histones H2A, H2B, H3, and H4 adopt a similar 
three-dimensional structure as a consequence of common ancestry. Some parts of the tails at 
the termini of the proteins are not shown. [Drawn from 1AOl.pdb.] 


higher resolution by x-ray diffraction methods. The four types of histone 
that make up the protein core are homologous and similar in structure 
(Figure 32.4). The eight histones in the core are arranged into a (H3),(H4), 
tetramer and a pair of H2A—H2B dimers. The tetramer and dimers come 
together to form a left-handed superhelical ramp around which the DNA 
wraps. In addition, each histone has an amino-terminal tail that extends out 
from the core structure. These tails are flexible and contain a number of 
lysine and arginine residues. As we shall see, covalent modifications of these 
tails play an essential role in regulating gene expression. 

The DNA forms a left-handed superhelix as it wraps around the outside 
of the histone octamer. The protein core forms contacts with the inner sur- 
face of the DNA superhelix at many points, particularly along the phospho- 
diester backbone and the minor groove. Nucleosomes will form on almost 
all DNA sites, although some sequences are preferred because the dinucleo- 
tide steps are properly spaced to favor bending around the histone core. 
A histone with a different structure from that of the others, called histone 
H1, seals off the nucleosome at the location at which the linker DNA enters 
and leaves. The amino acid sequences of histones, including their amino- 
terminal tails, are remarkably conserved from yeast to human beings. 

The winding of DNA around the nucleosome core contributes to the 
packing of DNA by decreasing its linear extent. An extended 200-bp 
stretch of DNA would have a length of about 680 A. Wrapping this DNA 
around the histone octamer reduces the length to approximately 100 A 
along the long dimension of the nucleosome. Thus the DNA is compacted 
by a factor of 7. However, human chromosomes in metaphase, which are 
highly condensed, are compacted by a factor of 104. Clearly, the nucleo- 
some is just the first step in DNA compaction. What is the next step? 
The nucleosomes themselves are arranged in a helical array approximately 
360 A across, forming a series of stacked layers approximately 110 A apart 
(Figure 32.5). The folding of these fibers of nucleosomes into loops fur- 
ther compacts DNA. 

The wrapping of DNA around the histone octamer as a left-handed 
helix also stores negative supercoils; if the DNA in a nucleosome is 
straightened out, the DNA will be underwound. This underwinding is 
exactly what is needed to separate the two DNA strands during replication 
and transcription. 


32.2 Transcription Factors Bind DNA and Regulate 
Transcription Initiation 


DNA-binding transcription factors are key to gene regulation in eukaryotes, 
just as they are in prokaryotes. However, the roles of eukaryotic transcription 
factors are different in several ways. First, whereas the DNA-binding sites 
crucial for the control of gene expression in prokaryotes are usually quite 
close to promoters, those in eukaryotes can be farther away from promoters 
and can exert their action at a distance. Second, most prokaryotic genes are 
regulated by single transcription factors, and multiple genes in a pathway are 
expressed in a coordinated fashion because such genes are often transcribed 
as part of a polycistronic mRNA. In eukaryotes, the expression of each gene 
is typically controlled by multiple transcription factors, 

and the coordinated expression of different genes depends 


on having similar transcription-factor-binding sites in each > 
7 


gene in the set. Third, in prokaryotes, transcription factors 
usually interact directly with RNA polymerase. In eukary- 
otes, while some transcription factors interact directly with 
RNA polymerase, many others act more indirectly, inter- 
acting with other proteins associated with RNA poly- 
merase or modifying chromatin structure. Let us now 
examine eukaryotic transcription factors in more detail. 
Eukaryotic transcription factors usually consist of several 
domains. The DNA-binding domain binds to regulatory 
sequences that can either be adjacent to the promoter or at 
some distance from it. Most commonly, transcription factors 
include additional domains that help activate transcription. 
When atranscription factor is bound to the DNA, its activa- 
tion domain promotes transcription by interacting with 


Homeodomain 
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FIGURE 32.6 Homeodomain structure. The structure of a 
heterodimer formed from two different DNA-binding domains, each 
based on a homeodomain. Notice that each homeodomain has a 


RNA polymerase I, by interacting with other associated helix-turn-helix motif with one helix inserted into the major groove 
proteins, or by modifying the local structure of chromatin. of DNA. [Drawn from 1AKH.pdb.] 


A range of DNA-binding structures are employed by 
eukaryotic DNA-binding proteins 


The structures of many eukaryotic DNA-binding proteins have been deter- 
mined and a range of structural motifs have been observed, but we will 
focus on three that reveal the common features and the diversity of these 
motifs. The first class of eukaryotic DNA-binding unit that we will con- 
sider is the homeodomain (Figure 32.6). The structure of the homeodomain 
and its mode of recognition of DNA are very similar to those of the prokary- 
otic helix-turn-helix proteins. In eukaryotes, homeodomain proteins often 
form heterodimeric structures, sometimes with other homeodomain pro- 
teins, that recognize asymmetric DNA sequences. 

The second class of eukaryotic DNA-binding unit comprises the basic- 
leucine zipper (bZip) proteins (Figure 32.7). This DNA-binding unit consists of 
a pair of long a helices. The first part of each a helix is a basic region that lies 
in the major groove of the DNA and makes contacts responsible for DNA-site 
recognition. The second part of each a helix forms a coiled-coil structure with 
its partner. Because these units are often stabilized by appropriately spaced 
leucine residues, these structures are often referred to as leucine zippers. 

The final class of eukaryotic DNA-binding units that we will consider here 
are the Cys,His, zinc-finger domains (Figure 32.8). A DNA-binding unit of 
this class comprises tandem sets of small domains, each of which binds a zinc 
ion through conserved sets of two cysteine and two histidine residues. These 
structures, often called zinc-finger domains, form a string that follows the major 


Leucine 
zipper 
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hk FIGURE 32.7 Basic-leucine zipper. 
This heterodimer comprises two basic- 
leucine zipper proteins. Notice that the 
basic region lies in the major groove of 
DNA. The leucine-zipper part stabilizes the 
protein dimer. [Drawn from 1FOS.pdb.] 


Kk FIGURE 32.8 Zinc-finger domains. 
A DNA-binding domain comprising three 
Cys,His, zinc-finger domains (shown in 
yellow, blue, and red) is shown in a complex 
with DNA. Each zinc-finger domain is 
stabilized by a bound zinc ion (shown in 
green) through interactions with two cysteine 
residues and two histidine residues. Notice 
how the protein wraps around the DNA in 
the major groove. [Drawn from 1AAY.pdb.] 


Mediator 


Transcription 
factor 


RNA polymerase II 


FIGURE 32.9 Mediator. Mediator, a large 
complex of protein subunits, acts as a 
bridge between transcription factors 
bearing activation domains and RNA 
polymerase II. These interactions help 
recruit and stabilize RNA polymerase II near 
specific genes that are then transcribed. 
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groove of DNA. An a helix from each domain makes 
specific contact with the edges of base pairs within the 
groove. Some proteins contain arrays of 10 or more 
zinc-finger domains, potentially enabling them to con- 
tact long stretches of DNA. The human genome 
encodes several hundred proteins that contain zinc- 
finger domains of this class. We will encounter another 
class of zinc-based DNA-binding domain when we 
consider nuclear hormone receptors in Section 32.3. 


Activation domains interact with other proteins 


The activation domains of transcription factors gen- 
erally recruit other proteins that promote transcrip- 
tion. In some cases, these activation domains interact 
directly with RNA polymerase II or closely associ- 
ated proteins. The activation domains act through 
intermediary proteins that bridge between the transcription factors and the 
polymerase. An important target of activators is mediator, a complex of 25 
to 30 subunits conserved from yeast to human beings, that acts as a bridge 
between transcription factors and promoter-bound RNA polymerase II 
(Figure 32.9). 

Activation domains are less conserved than DNA-binding domains. In 
fact, very little sequence similarity has been found. For example, they may be 
acidic, hydrophobic, glutamine rich, or proline rich. However, they have cer- 
tain features in common. First, they are often redundant; that is, a part of the 
activation domain can be deleted without loss of function. Second, they are 
modular and can activate transcription when paired with a variety of DNA- 
binding domains. Third, they can act synergistically: two activation domains 
acting together create a stronger effect than either acting separately. 

We have been considering the case in which gene control increases the 
expression level of a gene. In many cases, the expression of a gene must be 
decreased by blocking transcription. The agents in such cases are transcrip- 
tional repressors. Like activators, transcriptional repressors act in many 
cases by altering chromatin structure. 


Multiple transcription factors interact with eukaryotic regulatory regions 


The basal transcription complex described in Chapter 29 initiates transcrip- 
tion at a low frequency. Recall that several general transcription factors join 
with RNA polymerase II to form the basal transcription complex. Additional 
transcription factors must bind to other sites that can be near the promoter 
or quite distant for a gene to achieve a higher rate of mRNA synthesis. In 
contrast with the regulators of prokaryotic transcription, few eukaryotic 
transcription factors have any effect on their own. Instead, each factor 
recruits other proteins to build up large complexes that interact with the 
transcriptional machinery to activate transcription. 

A major advantage of this mode of regulation is that a given regulatory 
protein can have different effects, depending on what other proteins are 
present in the same cell. This phenomenon, called combinatorial control, is 
crucial to multicellular organisms that have many different cell types. Even 
in unicellular eukaryotes such as yeast, combinatorial control allows the 
generation of distinct cell types. 


Enhancers can stimulate transcription in specific cell types 


Transcription factors can often act even if their binding sites lie at a con- 
siderable distance from the promoter. These distant regulatory sites are 


called enhancers (Chapter 29). Enhancers function by serving as binding 
sites for specific transcription factors. An enhancer is effective only in 
the specific cell types in which appropriate regulatory proteins are 
expressed. In many cases, these DNA-binding proteins influence tran- 
scription initiation by perturbing the local chromatin structure to expose 
a gene or its regulatory sites rather than by direct interactions with RNA 
polymerase. This mechanism accounts for the ability of enhancers to act 
at a distance. 

The properties of enhancers are illustrated by studies of the enhancer 
controlling the muscle isoform of creatine kinase (Figure 32.10). The 
results of mutagenesis and other studies revealed the presence of an 
enhancer located between 1350 and 1050 base pairs upstream of the start 
site of the gene for this enzyme. Experimentally inserting this enhancer 
near a gene not normally expressed in muscle cells is sufficient to cause the 
gene to be expressed at high levels in muscle cells but not in other cells 
(Figure 32.11). 


FIGURE 32.11 An experimental 
demonstration of enhancer function. 

A promoter for muscle creatine kinase 
artificially drives the transcription of 
8-galactosidase in a zebrafish embryo. 
Only specific sets of muscle cells produce 
8-galactosidase, as visualized by the 
formation of the blue product on treatment 
of the embryo with X-Gal. [From F. Muller, 
D. W. Williamson, J. Kobolak, L. Gauvry, 

G. Goldspink, L. Orban, and N. MacLean, 
Mol. Reprod. Dev. 47:404-412, 1997.] 


Induced pluripotent stem cells can be generated by introducing 
four transcription factors into differentiated cells 


2 An important application illustrating the power of transcription 
factors is the development of induced pluripotent stem (iPS) cells. 
Pluripotent stem cells have the ability to differentiate into many different 
cell types on appropriate treatment. Previously isolated cells derived from 
embryos show a very high degree of pluripotency. Over time, researchers 
identified dozens of genes in embryonic stem cells that contributed to this 
pluripotency when expressed. In a remarkable experiment reported for 
mouse cells in 2006 and human cells in 2007, Shinya Yamanaka demon- 
strated that just four genes out of this entire set could induce pluripotency 
in already-differentiated skin cells. Yamanaka introduced genes encoding 
four transcription factors into skin cells called fibroblasts. The fibroblasts 
de-differentiated into cells that appeared to have characteristics very nearly 
identical with those of embryonic stem cells (Figure 32.12). 

These iPS cells represent powerful research tools and, potentially, a new 
class of therapeutic agents. The proposed concept is that a sample of a 
patient’s fibroblasts could be readily isolated and converted into iP5 cells. 
These iPS cells could be treated to differentiate into a desired cell type that 
could then be transplanted into the patient. For example, such an approach 
might be used to restore a particular class of nerve cells that had been 
depleted by a neurodegenerative disease. Although the field of iPS cell 
research is still evolving, it holds great promise as a possible approach to 
treat any common and difficult-to-treat diseases. 
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FIGURE 32.10 Enhancer binding sites. 

A schematic structure for the region 1 ko 
upstream of the start site for the muscle 
creatine kinase gene. One binding site of 
the form 5'-CAGCTG-3’ is present near the 
TATA box. The enhancer region farther 
upstream contains two binding sites for the 
same protein and two additional binding 
sites for other proteins. 


FIGURE 32.12 Induced pluripotent stem 
cells. A micrograph of human induced 
pluripotent stem cells stained green for a 
transcription factor that is characteristic of 
pluripotent cells. [From K. Takahashi et al., 
Cell 131:861-872, 2007; with permission 
from Elsevier, courtesy of Shinya Yamanaka, 
Kyoto University.] 
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hk FIGURE 32.13 GAL4 binding sites. 
The yeast transcription factor GAL4 
binds to DNA sequences of the form 
5’'-CGG(N), ;CCG-3". Two zinc-based 
domains are present in the DNA-binding 
region of this protein. Notice that these 
domains contact the 5'-CGG-3' 
sequences, leaving the center of the site 
uncontacted. [Drawn from 1D66.pdb.] 
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FIGURE 32.14 Chromatin 
immunoprecipitation. Cells or isolated 
nuclei are treated with formaldehyde to 
cross-link proteins to DNA. The cells are 
then lysed and the DNA is fragmented by 
sonication. DNA fragments bound to a 
particular protein are isolated through the 


use of an antibody specific for that protein. 


The cross-links are then reversed and the 
DNA fragments are characterized. 


32.3 The Control of Gene Expression Can Require 
Chromatin Remodeling 


Early observations suggested that chromatin structure plays a major role in 
controlling eukaryotic gene expression. DNA that is densely packaged into 
chromatin is less susceptible to cleavage by the nonspecific DNA-cleaving 
enzyme DNase I. Regions adjacent to genes that are being transcribed are 
more sensitive to cleavage by DNase I than are other sites in the genome, 
suggesting that the DNA in these regions is less compacted than it is else- 
where and more accessible to proteins. In addition, some sites, usually 
within 1 kb of the start site of an active gene, are exquisitely sensitive to 
DNase I and other nucleases. These hypersensitive sites correspond to 
regions that have few nucleosomes or contain nucleosomes in an altered 
conformation Hypersensitive sites are cell-type specific and developmentally 
regulated. For example, globin genes in the precursors of erythroid cells 
from 20-hour-old chicken embryos are insensitive to DNase I. However, 
when hemoglobin synthesis begins at 35 hours, regions adjacent to these 
genes become highly susceptible to digestion. In tissues such as the brain 
that produce no hemoglobin, the globin genes remain resistant to DNase I 
throughout development and into adulthood. These studies suggest that a 
prerequisite for gene expression is a relaxing of the chromatin structure. 
Recent experiments even more clearly revealed the role of chromatin struc- 
ture in regulating access to DNA-binding sites. Genes required for galactose 
utilization in yeast are activated by a transcription factor called GAL4, which 
recognizes DNA-binding sites with two 5’-CGG-3’ sequences on comple- 
mentary strands separated by 11 base pairs (Figure 32.13). Approximately 
4000 potential GAL4 binding sites of the form 5'-CGG(N),,CCG-3’ are 
present in the yeast genome, but only 10 of them regulate genes necessary for 
galactose metabolism. How is GAL4 targeted to only a small fraction of the 
potential binding sites? This question is addressed through the use of a tech- 
nique called chromatin immunopyrecipitation (ChIP; Figure 32.14). GAL4 is 
first cross-linked to its DNA-binding sites in chromatin. The DNA is then 
fragmented into small pieces, and antibodies to GAL4 are used to isolate the 
chromatin fragments containing GAL4. The cross-linking is reversed, and the 
DNA is isolated and characterized. The results of these studies reveal that only 
approximately 10 of the 4000 potential GAL4 sites are occupied by GAL4 
when the cells are growing on galactose; more than 99% of the sites appear 
to be blocked, presumably by the local chromatin structure. Thus, whereas 
in prokaryotes all sites appear to be equally accessible, chromatin structure 
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shields a large number of the potential binding sites in eukaryotic cells. GAL4 
is thereby prevented from binding to sites that are unimportant in galactose 
metabolism. These lines of evidence and others reveal that chromatin struc- 
ture is altered in active genes compared with inactive ones. 


The methylation of DNA can alter patterns of gene expression 


The degree of methylation of DNA provides another mechanism, in addi- 
tion to packaging with histones, for inhibiting gene expression inappropriate 
to a specific cell type. Carbon 5 of cytosine can be methylated by specific 
methyltransferases. About 70% of the 5'-CpG-3’ sequences (where “‘p” 
represents the phosphate residue in the DNA backbone) in mammalian 
genomes are methylated. However, the distribution of these methylated 
cytosines varies, depending on the cell type. Consider the B-globin gene. In 
cells that are actively expressing hemoglobin, the region from approximately 
1 kb upstream of the start site to approximately 100 bp downstream of the 
start site is less methylated than the corresponding region in cells that do not 
express this gene. The relative absence of 5-methylcytosines near the start 
site is referred to as hypomethylation. The methyl group of 5-methylcytosine 
protrudes into the major groove where it could easily interfere with the 
binding of proteins that stimulate transcription. 


a The distribution of CpG sequences in mammalian genomes is not 
uniform. Many CpG sequences have been converted into TpG 
through mutation by the deamination of 5-methylcytosine to thymine. 
However, sites near the 5’ ends of genes have been maintained because of 
their role in gene expression. Thus, most genes are found in CpG islands, 
regions of the genome that contain approximately four times as many CpG 
sequences as does the remainder of the genome. 


Steroids and related hydrophobic molecules pass through 
membranes and bind to DNA-binding receptors 


We next look at an example that illustrates how transcription factors can 
stimulate changes in chromatin structure that affect transcription. We will 
consider in some detail the system that detects and responds to estrogens. 
Synthesized and released by the ovaries, estrogens, such as estradiol, are 
cholesterol-derived steroid hormones (Section 26.4). They are required for 
the development of female secondary sex characteristics and, along with 
progesterone, participate in the ovarian cycle. 

Because they are hydrophobic molecules, estrogens easily diffuse across 
cell membranes. When inside a cell, estrogens bind to highly specific, solu- 
ble receptor proteins. Estrogen receptors are members of a large family of 
proteins that act as receptors for a wide range of hydrophobic molecules, 
including other steroid hormones, thyroid hormones, and retinoids. 
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All-trans-retinoic acid Thyroxine 
(a retinoid) (L-3,5,3’,5’-Tetraiodothyronine) 
(a thyroid hormone) 
The human genome encodes approximately 50 members of this fam- 
ily, often referred to as nuclear hormone receptors. The genomes of other 
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hk FIGURE 32.15 Structure of two 
nuclear hormone-receptor domains. 
Nuclear hormone receptors contain two 
crucial conserved domains: (1) a DNA- 
binding domain toward the center of the 
sequence and (2) a ligand-binding domain 
toward the carboxyl terminus. The structure 
of a dimer of the DNA-binding domain 
bound to DNA is shown, as is one 
monomer of the normally dimeric ligand- 
binding domain. [Drawn from 1HCQ and 
1LBD.pdb.] 


DNA-binding 
domain 


Ligand-binding 
domain 


multicellular eukaryotes encode similar numbers of nuclear hormone 
receptors, although they are absent in yeast. 

All these receptors have a similar mode of action. On binding of the 
signal molecule (called, generically, a ligand), the ligand-receptor complex 
modifies the expression of specific genes by binding to control elements in 
the DNA. Estrogen receptors bind to specific DNA sites (referred to as 
estrogen response elements or EREs) that contain the consensus sequence 
5'’-AGGTCANNNTGACCT-3’. As expected from the symmetry of 
this sequence, an estrogen receptor binds to such sites as a dimer. 

A comparison of the amino acid sequences of members of this family 
reveals two highly conserved domains: a DNA-binding domain and a 
ligand-binding domain (Figure 32.15). The DNA-binding domain lies 
toward the center of the molecule and consists of a set of zinc-based domains 
different from the Cys,His, zinc-finger proteins introduced in Section 32.2. 
These zinc-based domains bind to specific DNA sequences by virtue of an 
a helix that lies in the major groove in the specific DNA complexes formed 
by estrogen receptors. 


Nuclear hormone receptors regulate transcription by recruiting 
coactivators to the transcription complex 


The second highly conserved domain of the nuclear receptor proteins lies 
near the carboxyl terminus and is the ligand-binding site. This domain folds 
into a structure that consists almost entirely of a helices, arranged in three 
layers. The ligand binds in a hydrophobic pocket that lies in the center of 
this array of helices (Figure 32.16). This domain changes conformation 
when it binds its ligand, estrogen. How does ligand binding lead to changes 
in gene expression? The simplest model would have the binding of ligand 
alter the DNA-binding properties of the receptor, analogously to the lac 
repressor in prokaryotes. However, experiments with purified nuclear hor- 
mone receptors revealed that ligand binding does not significantly alter 
DNA-binding affinity and specificity. Another mechanism 1s operative. 
Investigators sought to determine whether specific proteins might bind 
to the nuclear hormone receptors only in the presence of ligand. Such 
searches led to the identification of several related proteins called coactiva- 
tors, such as SRC-1 (steroid receptor coactivator-1), GRIP-1 (glucocorticoid 
receptor interacting protein-1), and NcoA-1 (nuclear hormone receptor 
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Sg — Helix 12 


k FIGURE 32.16 Ligand binding to nuclear hormone receptor. The ligand lies 
completely surrounded within a pocket in the ligand-binding domain. Notice that the last 

a helix, helix 12 (shown in purple), folds into a groove on the side of the structure on ligand 
binding. [Drawn from 1LDB and 1ERE.pdb.] 


coactivator-1). These coactivators are referred to as the p160 family because 
of their size. The binding of ligand to the receptor induces a conformational 
change that allows the recruitment of a coactivator (Figure 32.17). In many 
cases, these coactivators are enzymes that catalyze reactions that lead to the 
modification of chromatin structure. 


Steroid-hormone receptors are targets for drugs 


= Molecules such as estradiol that bind to a receptor and trigger signaling 
pathways are called agonists. Athletes sometimes take natural and syn- 
thetic agonists of the androgen receptor, a member of the family of nuclear 
hormone receptors, because their binding to the androgen receptor stimulates 
the expression of genes that enhance the development of lean muscle mass. 


CH; O 
CH; 
0 
Androstendione Dianabol 
(a natural androgen) (methandrostenolone) 


(a synthetic androgen) 


Referred to as anabolic steroids, such compounds used in excess are not 
without side effects. In men, excessive use leads to a decrease in the secretion 
of testosterone, to testicular atrophy, and sometimes to breast enlargement 


FIGURE 32.17 Coactivator 


Estrogen : a ft 
(ligand) recruitment. The binding 
oe of ligand to a nuclear 


hormone receptor induces 
a conformational change in 
the ligand-binding domain. 
This change in conformation 
generates favorable sites for 
the binding of a coactivator. 


\ 


a helix 


Tamoxifen 


Ly FIGURE 32.18 Estrogen receptor— 
tamoxifen complex. Tamoxifen binds in 
the pocket normally occupied by estrogen. 
However, notice that part of the tamoxifen 
structure extends from this pocket, and so 
helix 12 cannot pack in its usual position. 
Instead, this helix blocks the coactivator- 
binding site. [Drawn from 3ERT.pdb.] 


Histone 
H3 tail 


hk FIGURE 32.19 Structure of histone 
acetyltransferase. The amino-terminal tail 
of histone H3 extends into a pocket in 
which a lysine side chain can accept an 
acetyl group from acetyl Coenzyme A 
bound in an adjacent site. [Drawn from 
1QSN.pdb.] 


Lysine in histone tail 
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(gynecomastia) if some of the excess androgen is converted into estrogen. In 
women, excess testosterone causes a decrease in ovulation and estrogen 
secretion; it also causes breast regression and growth of facial hair. 

Other molecules bind to nuclear hormone receptors but do not effec- 
tively trigger signaling pathways. Such compounds are called antagonists 
and are, in many ways, like competitive inhibitors of enzymes. Some impor- 
tant drugs are antagonists that target the estrogen receptor. For example, 
tamoxifen and raloxifene are used in the treatment and prevention of breast 
cancer, because some breast tumors rely on estrogen-mediated pathways for 
growth. Because some of these compounds have distinct effects on different 
forms of the estrogen receptor, they are referred to as selective estrogen recep- 


tor modulators (SERMs). 
SH 
es ( ¢ Cy 
o* — 
r m \ [ po 
oO _ J 
Tamoxifen Raloxifene 


The determination of the structures of complexes between the estrogen 
receptor and these drugs revealed the basis for their antagonist effect 
(Figure 32.18). Tamoxifen binds to the same site as estradiol does. However, 
tamoxifen has a group that extends out of the normal ligand-binding 
pocket, as do other antagonists. These groups block the normal conforma- 
tional changes induced by estrogen. Tamoxifen blocks the binding of coacti- 
vators and thus inhibits the activation of gene expression. 


Chromatin structure is modulated through covalent 
modifications of histone tails 


We have seen that nuclear receptors respond to signal molecules by recruit- 
ing coactivators. Now we can ask, How do coactivators modulate transcrip- 
tional activity? These proteins act to loosen the histone complex from the DNA, 
exposing additional DNA regions to the transcription machinery. 

Much of the effectiveness of coactivators appears to result from their 
ability to covalently modify the amino-terminal tails of histones as well as 
regions on other proteins. Some of the p160 coactivators and the proteins 
that they recruit catalyze the transfer of acetyl groups from acetyl CoA to 
specific lysine residues in these amino-terminal tails. 


+ CoA—SH + Ht 


Acetyl CoA 


Enzymes that catalyze such reactions are called histone acetyltransferases 
(HATs). The histone tails are readily extended; so they can fit into the HAT 
active site and become acetylated (Figure 32.19). 

What are the consequences of histone acetylation? Lysine bears a posi- 
tively charged ammonium group at neutral pH. The addition of an acetyl 
group generates an uncharged amide group. This change dramatically 
reduces the affinity of the tail for DNA and modestly decreases the affinity 


of the entire histone complex for DNA, loosening the histone complex from 
the DNA. 

In addition, the acetylated lysine residues interact with a specific acetylly- 
sine-binding domain that is present in many proteins that regulate eukaryotic 
transcription. This domain, termed a bromodomain, comprises approximately 
110 amino acids that form a four-helix bundle containing a peptide-binding 
site at one end (Figure 32.20). 

Bromodomain-containing proteins are components of two large complexes 
essential for transcription. One is a complex of more than 10 polypeptides that 
binds to the TATA-box-binding protein. Recall that the TATA-box-binding 
protein is an essential transcription factor for many genes (Section 29.2). 
Proteins that bind to the TATA-box-binding protein are called TAFs (for 
TATA-box-binding protein associated factors). In particular, TAF 1 contains 
a pair of bromodomains near its carboxyl terminus. The two domains are ori- 
ented such that each can bind one of two acetyllysine residues at positions 5 
and 12 in the histone H4 tail. Thus, acetylation of the histone tails provides a 
mechanism for recruiting other components of the transcriptional machinery. 

Bromodomains are also present in some components of large complexes 
known as chromatin-remodeling complexes or chromatin-remodeling engines. 
These complexes, which also contain domains homologous to those of heli- 
cases, utilize the free energy of ATP hydrolysis to shift the positions of 
nucleosomes along the DNA and to induce other conformational changes in 
chromatin (Figure 32.21). Histone acetylation can lead to a reorganization 
of the chromatin structure, potentially exposing binding sites for other fac- 
tors. Thus, histone acetylation can activate transcription through a combina- 
tion of three mechanisms: by reducing the affinity of the histones for DNA, by 
recruiting other components of the transcriptional machinery, and by initiating 
the remodeling of the chromatin structure. 

Nuclear hormone receptors also include regions that interact with com- 
ponents of the mediator complex. Thus, two mechanisms of gene regulation 
can work in concert. Modification of histones and chromatin remodeling 
can open up regions of chromatin into which the transcription complex can 
be recruited through protein-protein interactions. 


Histone deacetylases contribute to transcriptional repression 


Just as in prokaryotes, some changes in a cell’s environment lead to the 
repression of genes that had been active. The modification of histone tails 


aOR 


Transcription 
factor 


Coactivator 
Acetylated 
lysine residues 


Remodeling 
engine 


Exposed 
site 


Histone H4 tail Acetyllysine 


hk FIGURE 32.20 Structure of a 
bromodomain. This four-helix-bundle 
domain binds peptides containing 
acetyllysine. Notice that an acetylated 
peptide from histone H4 is bound in the 
structure. [Drawn from 1EGI.pdb.] 


FIGURE 32.21 Chromatin remodeling. 
Eukaryotic gene regulation begins with an 
activated transcription factor bound to a 
specific site on DNA. One scheme for the 
initiation of transcription by RNA polymerase 
ll requires five steps: (1) recruitment of a 
coactivator, (2) acetylation of lysine 
residues in the histone tails, (8) binding 

of a remodeling-engine complex to 

the acetylated lysine residues, (4) ATP- 
dependent remodeling of the chromatin 
structure to expose a binding site for 

RNA polymerase II or for other factors, and 
(5) recruitment of RNA polymerase II. Only 
two subunits are shown for each complex, 
although the actual complexes are much 
larger. Other schemes are possible. 


RNA 
polymerase II 
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TABLE 32.2 Selected histone modifications 


Modification Associated with 
H4 K8 acetylation Activation 

H3 K14 acetylation Activation 

H3 K27 monomethylation Activation 

H3 K27 trimethylation Repression 

H3 R17 methylation Activation 

H2B 814 phosphorylation DNA repair 

H2B K120 ubiquitination Activation 


k FIGURE 32.22 Structure of ferritin. 
(A) Twenty-four ferritin polypeptides form 
a nearly spherical shell. (B) A cutaway 
view reveals the core that stores iron as 
an iron oxide-hydroxide complex [Drawn 
from 1IES.pdb.] 
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again plays an important role. However, in repression, a key reaction 
appears to be the deacetylation of acetylated lysine, catalyzed by spe- 
cific histone deacetylase enzymes. 

In many ways, the acetylation and deacetylation of lysine residues in 
histone tails (and, likely, in other proteins) is analogous to the phos- 
phorylation and dephosphorylation of serine, threonine, and tyrosine 
residues in other stages of signaling processes. Like the addition of 
phosphory] groups, the addition of acetyl groups can induce conforma- 
tional changes and generate novel binding sites. Without a means of 
removal of these groups, however, these signaling switches will become 

stuck in one position and lose their effectiveness. Like phosphatases, 
deacetylases help reset the switches. 

Acetylation is not the only modification of histones and other proteins in 
gene-regulation processes. The methylation of specific lysine and arginine 
residues also can be important. Some of the more common modifications 
are shown in Table 32.2. 

The elucidation of the roles of these modifications is a very active area of 
research at present. The relation between various histone modifications and 
their roles in controlling gene expression is sometimes referred to as “the his- 
tone code.” Although important generalizations have been discovered, this 
code clearly is subtle and complicated rather than being a set of hard and fast 
rules. Nonetheless, the development and application of methods that allow 
probing of histone modifications on a genome-wide scale in a range of cell types 
has provided large data sets that can be examined to test and refine models. 


32.4 Eukaryotic Gene Expression Can Be Controlled at 
Posttranscriptional Levels 


Just as in prokaryotes, gene expression in eukaryotes can be regulated subse- 
quent to transcription. We shall consider two examples. The first is the regula- 
tion of genes taking part in iron metabolism through key features in RNA 
secondary structure, similar in many ways to prokaryotic posttranscriptional 
regulation (Section 31.4). The second entails an entirely new mechanism, first 
glimpsed with the discovery of RNA interference (p. 162). Certain small regu- 
latory RNA molecules allow the regulation of gene expression through interac- 
tion with arangeof mRNA molecules. Remarkably, this mechanism, discovered 
only recently, affects the expression of approximately 60% of all human genes. 


Genes associated with iron metabolism are translationally 
regulated in animals 


RNA secondary structure plays a role in the regulation of iron metabolism in 
eukaryotes. Iron is an essential nutrient, required for the synthesis of hemo- 
globin, cytochromes, and many other proteins. However, excess iron can be 
quite harmful because, untamed by a suitable protein environment, iron can 
initiate a range of free-radical reactions that damage proteins, lipids, and 
nucleic acids. Animals have evolved sophisticated systems for the accumula- 
tion of iron in times of scarcity and for the safe storage of excess iron for later 
use. Key proteins include transferrin, a transport protein that carries iron in 
the serum, transferrin receptor, a membrane protein that binds iron-loaded 
transferrin and initiates its entry into cells, and ferritin, an impressively effi- 
cient iron-storage protein found primarily in the liver and kidneys. Twenty- 
four ferritin polypeptides form a nearly spherical shell that encloses as many 
as 2400 iron atoms, a ratio of one iron atom per amino acid (Figure 32.22). 
Ferritin and transferrin-receptor expression levels are reciprocally related 
in their responses to changes in iron levels. When iron is scarce, the amount of 
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FIGURE 32.23 lron-response element. Ferritin mRNA includes a stem-loop structure, 
termed an iron-response element (IRE), in its 5’ untranslated region. The IRE binds a specific 
protein that blocks the translation of this mRNA under low iron conditions. 


transferrin receptor increases and little or no new ferritin is synthesized. 
Interestingly, the extent of mRNA synthesis for these proteins does not change 
correspondingly. Instead, regulation takes place at the level of translation. 

Consider ferritin first. Ferritin mRNA includes a stem-loop structure 
termed an iron-response element (IRE) in its 5’ untranslated region 
(Figure 32.23). This stem-loop binds a 90-kDa protein, called an IRE- 
binding protein (IRP), that blocks the initiation of translation as the IRE 
binds on the 5’ side of the coding region. When the iron level increases, the 
IRP binds iron as a 4Fe-45 cluster. The IRP bound to iron cannot bind 
RNA, because the binding sites for iron and RNA substantially overlap. 
Thus, in the presence of iron, ferritin mRNA is released from the IRP and 
translated to produce ferritin, which sequesters the excess iron. 

An examination of the nucleotide sequence of transferrin-receptor mRNA 
reveals the presence of several IRE-like regions. However, these regions are 
located in the 3’ untranslated region rather than in the 5’ untranslated region 
(Figure 32.24). Under low-iron conditions, IRP binds to these IREs. 
However, given the location of these binding sites, the transferrin-receptor 
mRNA can still be translated. What happens when the iron level increases 
and the IRP no longer binds transferrin-receptor mRNA? Freed from the 
IRP, transferrin-receptor mRNA is rapidly degraded. Thus, an increase in 
the cellular iron level leads to the destruction of transferrin-receptor mRNA 
and, hence, a reduction in the production of transferrin-receptor protein. 


Iron-response elements 


Coding region 


ny The purification of the IRP and the cloning of its cDNA were sources of 
truly remarkable insight into evolution. The IRP was found to be 
approximately 30% identical in amino acid sequence with the citric acid cycle 
enzyme aconitase from mitochondria. Further analysis revealed that the IRP is, 
in fact, an active aconitase enzyme; it is a cytoplasmic aconitase that had been 
known for a long time, but its function was not well understood (Figure 32.25). 
The iron-sulfur center at the active site of the IRP is rather unstable, and loss 
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32.4 Eukaryotic Posttranscriptional 
Gene Regulation 


FIGURE 32.24 Transferrin-receptor 
mRNA. This mRNA has a set of iron- 
response elements (IREs) in its 3’ untranslated 
region. The binding of the IRE-binding protein 
to these elements stabilizes the mRNA but 
does not interfere with translation. 


(B) of the iron triggers significant changes in protein conformation. 
Thus, this protein can serve as an iron-sensing factor. 

Other mRNAs, including those taking part in heme synthe- 
sis, have been found to contain IREs. Thus, genes encoding pro- 
teins required for iron metabolism acquired sequences that, when 
transcribed, provided binding sites for the iron-sensing protein. 

n environmental signal—the concentration of iron—controls 
the translation of proteins required for the metabolism of this 
metal. Thus, mutations in the untranslated region of mRNAs 
have been selected for beneficial regulation by iron levels. 


High-iron 
conditions 


( 


Low-iron 
conditions 


Small RNAs regulate the expression of many 
eukaryotic genes 


FIGURE 32.25 The IRP is an aconitase. (A) Aconitase 
contains an unstable 4Fe-4S cluster at its center. (B) Under 
conditions of low iron, the 4Fe-4S cluster dissociates and 
appropriate RNA molecules can bind in its place. [Drawn 


from 1C96.pdb.] 


RNA 
~~» substrate 


hk FIGURE 32.26 MicroRNA-Argonaute 
complex. The miRNA (shown in red) is 
bound by the Argonaute protein. Notice 
that the miRNA serves as a guide that 
binds the RNA substrate (shown in gray) 
through the formation of a double helix. 
Two magnesium ions are shown in 
green. [Drawn from 3HK2.pdb.] 
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Genetic studies of development in C. elegans revealed that a gene 
called lin-4 encodes an RNA molecule 61 nucleotides long that can 
regulate the expression of certain other genes. The 61-nucleotide 
lin-4¢ RNA does not encode a protein, but rather is cleaved into a 
22-nucleotide RNA that possesses the regulatory activity. This 
discovery was the first view of a large class of regulatory RNA 
molecules that are now called microRNAs or miRNAs. The key to the activity of 
microRNAs and their specificity for particular genes is their ability to form 
Watson—Crick base-pair-stabilized complexes with the mRNAs of those genes. 

These miRNAs do not function on their own. Instead, the miRNAs bind 
to members of a class of proteins called the Argonaute family (Figure 32.26). 
These Argonaute-miRNA complexes can then bind mRNAs that have 
sequences that are substantially complementary to the miRNAs. Once 
bound, such an mRNA can be cleaved by the Argonaute-miRNA complex 
through the action of a magnesium-based active site. Thus, the miRNAs serve 
as guide RNAs that determine the specificity of the Argonaute complex 
(Figure 32.27). Cleavage of mRNA by the Argonaute-miRNA complex is 
similar to the mechanism of RNA interference (p. 162). The small RNA mol- 
ecules participating in RNAi, however, come from a different source. In 
RNAi, double-stranded RNAs are cleaved into 21-nucleotide fragments, 
single-stranded components of which are bound by members of the Argonaute 
family to form an RNA-induced silencing complex (RISC) that cleaves com- 
plementary mRNAs. For miRNAs, the single-stranded RNAs are generated 
from larger genetically encoded precursors, as described in Chapter 29. 

The occurrence of gene regulation by miRNAs was originally thought to 
be limited to a relatively small number of species. However, subsequent 
studies have revealed that this mode of gene regulation is nearly ubiquitous 
in eukaryotes. Indeed, more than 700 miRNAs encoded by the human 
genome have been identified. Each miRNA can regulate many different 
genes because many different target sequences are present in each mRNA. 
An estimated 60% of all human genes are regulated by one or more 
miRNAs. As one example, consider the human miRNA called miR-206. 
This miRNA down-regulates the expression of one isoform of the estrogen 
receptor. In addition, it appears to down-regulate the expression of several 
different coactivators that interact with the estrogen receptor. Thus, this 
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FIGURE 32.27 MicroRNA action. MicroRNAs bind to members of the Argonaute family 
where they target specific mRNA molecules for cleavage. 


miRNA can mute the influence of estrogen by blocking the estrogen- 
initiated signaling pathway at several different steps. 


a The microRNA pathway has tremendous implications for the evolu- 

tion of gene-regulatory pathways. Most of the target sites for miRNAs 
are present in the 3’-untranslated regions of mRNAs. These sequences are 
quite free to mutate because they do not encode proteins and are not required 
to fold into any specific structures. Thus, in the context of a set of expressed 
miRNAs, mutations in this region of any particular gene could, in principle, 
increase or decrease the affinity for one or more miRNAs and, hence, alter 
the regulation of the gene. 


SUMMARY 


32.1 Eukaryotic DNA Is Organized into Chromatin 

Eukaryotic DNA is tightly bound to basic proteins called histones; the 
combination is called chromatin. DNA wraps twice around an octamer 
of core histones to form a nucleosome. The four core histones are 
homologous and fold into similar structures. Each core histone has an 
amino-terminal tail rich in lysine and arginine residues. Nucleosomes 
are the first stage of compaction of eukaryotic DNA. Chromatin 
blocks access to many potential DNA-binding sites. Changes in chro- 
matin structure play a major role in regulating gene expression. 


32.2 Transcription Factors Bind DNA and Regulate Transcription Initiation 
Most eukaryotic genes are not expressed unless they are activated by 
the binding of specific proteins, called transcription factors, to sites on 
the DNA. These specific DNA-binding proteins interact directly or 
indirectly with RNA polymerases or their associated proteins. 
Eukaryotic transcription factors are modular: they consist of separate 
DNA-binding and activation domains. Important classes of DNA- 
binding proteins include the homeodomains, the basic-leucine zipper 
proteins, and Cys,His, zinc-finger proteins. Each of these classes of 
proteins uses an a helix to make specific contacts with DNA. 
Activation domains interact with RNA polymerases or their associ- 
ated factors or with other protein complexes such as mediator. 
Enhancers are DNA elements that can modulate gene expression from 
more than 1000 bp away from the start site of transcription. Enhancers 
are often specific for certain cell types, depending on which DNA- 
binding proteins are present. The introduction of genes for a specific 
set of four transcription factors into fibroblasts can cause these cells to 
dedifferentiate into induced pluripotent stem cells. 


32.3 The Control of Gene Expression Can Require Chromatin Remodeling 
Chromatin structure is crucial to the control of gene expression; it is 
more open near the transcription start sites of actively transcribed genes. 
Steroids such as estrogens bind to eukaryotic transcription factors called 
nuclear hormone receptors. These proteins are capable of binding DNA 
whether or not ligands are bound. The binding of ligands induces a con- 
formational change that allows the recruitment of additional proteins 
called coactivators. Among the most important functions of coactivators 
is to catalyze the addition of acetyl groups to lysine residues in the tails 
of histone proteins. Histone acetylation decreases the affinity of the his- 
tones for DNA, making additional genes accessible for transcription. In 
addition, acetylated histones are targets for proteins containing specific 
binding units called bromodomains. Bromodomains are components of 
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CHAPTER 32 The Control of Gene 
Expression in Eukaryotes 


32.4 


two classes of large complexes: (1) chromatin-remodeling engines and 
(2) factors associated with RNA polymerase II. These complexes open 
up sites on chromatin and initiate transcription. 


Eukaryotic Gene Expression Can Be Controlled at 

Posttranscriptional Levels 

Genes encoding proteins that transport and store iron are regulated at 
the translational level. Iron-response elements, structures that are 
present in certain mRNAs, are bound by an IRE-binding protein 
when this protein is not binding iron. Whether the expression of a gene 
is stimulated or inhibited in response to changes in the iron status of a 
cell depends on the location of the IRE within the mRNA. The IRE- 
binding protein is a cytoplasmic aconitase that loses its iron-sulfur 
center under low-iron conditions. MicroRNAs are specialized RNA 
molecules encoded as parts of larger RNA precursors. They bind to 
proteins of the Argonaute family; the bound miRNAs function as 
guides that help bind specific mRNA molecules that are then cleaved. 


KEY TERMS 


cell type (p. 942) 
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nucleosome core particle (p. 943) 
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combinatorial control (p. 946) antagonist (p. 952) 
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PROBLEMS 


hypersensitive site (p. 948) 


chromatin immunoprecipitation 


hypomethylation (p. 949) 
CpG island (p. 949) 
nuclear hormone receptor (p. 949) 


estrogen response element 


anabolic steroid (p. 951) 


selective estrogen receptor modulator 


acetyllysine-binding domain (p. 953) 
bromodomain (p. 953) 


TATA-box-binding protein associated 
factor (TAF) (p. 953) 


chromatin-remodeling complex (p. 953) 
histone deacetylase (p. 954) 

transferrin (p. 954) 

transferrin receptor (p. 954) 

ferritin (p. 954) 

iron-response element (IRE) (p. 955) 
IRE-binding protein (IRP) (p. 955) 
microRNA (miRNA) (p. 956) 
Argonaute family proteins (p. 956) 


histone acetyltransferase (HAT) (p. 952) 


1. Charge neutralization. Given the histone amino acid 
sequences illustrated below, estimate the charge of a histone 
octamer at pH 7. Assume that histidine residues are 
uncharged at this pH. How does this charge compare with 
the charge on 150 base pairs of DNA? 


Histone H2A 
MSGRGKQGGKARAKAKTRSSRAGLQFPVGRVHRLLRKGNYSERVGAGAPVYLAAVLEYLTAEILELAGNA 


ARDNKKTRIIPRHLQLAIRNDEELNKLLGRVTIAQGGVLPNIQAVLLPKKTESHHKAKGK 


Histone H2B 
MPEPAKSAPAPKKGSKKAVTKAQKKDGKKRKRSRKESYSVYVYKVLKQVHPDTGISSKAMGIMNSFVNDI 


FERIAGEASRLAHYNKRSTITSREIQTAVRLLLPGELAKHAVSEGTKAVTKYTSSK 


Histone H3 
MARTKQTARKSTGGKAPRKQLATKAARKSAPSTGGVKKPHRYRPGTVALREIRRYQKSTELLIRKLPFQR 


LVREIAQDFKTDLRFQSAAIGALQEASEAYLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA 


Histone H4 
MSGRGKGGKGLGKGGAKRHRKVLRDNIQGITKPAIRRLARRGGVKRISGLIYEETRGVLKVFLENVIRDA 


VTYTEHAKRKTVTAMDWVYALKRQGRTLYGFGG 


2. Chromatin immunoprecipitation. You have used the tech- 
nique of chromatin immunoprecipitation to isolate DNA 
fragments containing a DNA-binding protein of interest. 
Suppose that you wish to know whether a particular known 
DNA fragment is present in the isolated mixture. How 
might you detect its presence? 


3. Around we go. Assuming that 145 base pairs of DNA 
wrap around the histone octamer 1.75 times, estimate the 
radius of the histone octamer. Assume 3.4 A per base pair 
and simplify the calculation by assuming that the wrapping 
is in two rather than three dimensions and neglecting the 


thickness of the DNA. 


4, Nitrogen substitution. Growth of mammalian cells in the 
presence of 5-azacytidine results in the activation of some 
normally inactive genes. Propose an explanation. 


NH> 


ee 

O = 
deoxyribose 
5-Azacytidine 


5. A new domain. A protein domain that recognizes 
5-methylcytosine in the context of double-stranded DNA 
has been characterized. What role might proteins contain- 
ing such a domain play in regulating gene expression? 
Where on a double-stranded DNA molecule would you 
expect such a domain to bind? 


6. Hybrid receptor. Through recombinant DNA methods, a 
modified steroid hormone receptor was prepared that consists 
of an estrogen receptor with its ligand-binding domain 
replaced by the ligand-binding domain from the progesterone 
receptor. Predict the expected responsiveness of gene expres- 
sion for cells treated with estrogen or with progesterone. 


7. Different modifications. What is the effect of acetylation 
of a lysine residue on the charge of a histone protein? Of 
lysine methylation? 


8. Transformer. The following amino acid sequence of one 
of the four transcription factors is used to generate iPS cells: 


HTCDYAGCGKTYTKSSHLKAHLRTHTGEKPYHCDWDGCGWKFARSDELTRHY RKHTGHRPFQCQKCD 
RAFSRSDHLALHMKRHF 


This transcription factor belongs to one of the three struc- 
tural classes discussed in Section 32.2. Identify the class. 


9. Coverage. What percentage of the DNA sites in yeast are 
accessible, assuming that the fraction of sites observed for 
GAL4 is typical? To how many base pairs of the 12-Mb 
yeast genome does this percentage correspond? 


10. Count the methyl groups. Examination of the histone 
modifications of a gene reveals an abundance of histone H3 
with lysine 27 modified with a single methyl group. Does 
this suggest that this gene is activated or repressed? How 
would your answer change if many lysine 27 residues were 
modified with three methyl groups each? 
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11. Iron regulation. What effect would you expect from the 
addition of an IRE to the 5’ end of a gene that is not nor- 
mally regulated by iron levels? To the 3’ end? 


12. Predicting microRNA regulation. Suppose that you 
have identified an miRNA that has the sequence 
5'-GCCUAGCCUUAGCAUUGAUUGG-3’. Propose a 
strategy for identifying mRNA that might be regulated by 
this miRNA, given the sequences of all mRNAs encoded 
by the human genome. 


Mechanism Problem 


13. Acetyltransferases. Propose a mechanism for the trans- 
fer of an acetyl group from acetyl CoA to the amino group 
of lysine. 


Data Interpretation Problem 


14. Limited restriction. The restriction enzyme Hpall is a 
powerful tool for analyzing DNA methylation. This enzyme 
cleaves sites of the form 5’-CCGG-3’ but will not cleave such 
sites if the DNA is methylated on any of the cytosine resi- 
dues. Genomic DNA from different organisms is treated 
with HpalI and the results are analyzed by gel electrophoresis 
(see the adjoining patterns). Provide an explanation for the 
observed patterns. 
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Sensory Systems 


Color perception requires specific photoreceptors. The photoreceptor rhodopsin (right), which absorbs 
light in the process of vision, consists of the protein opsin and a bound vitamin A derivative, retinal. The 
amino acids (shown in red) that surround the retinal determine the color of light that is most efficiently 
absorbed. A person lacking a light-absorbing photoreceptor for the color green will see a bowl of fruits of 
many colors (left) as similar shades of yellow-green (middle). [From Africa Studio/Shutterstock (left), modified 


by Jeremy Berg using Coblis (www.color-blindness.com/coblis-color-blindness-simulator/) (middle).] 


(a= senses provide us with means for detecting a diverse set of external 
signals, often with incredible sensitivity and specificity. For example, 
when fully adapted to a darkened room, our eyes allow us to sense very low 
levels of light, down to a limit of less than 10 photons. With more light, we 
are able to distinguish millions of colors. Through our senses of smell and 
taste, we are able to detect thousands of chemicals in our environment and 
sort them into categories: pleasant or unpleasant? healthful or toxic? Finally, 
we can perceive mechanical stimuli in the air and around us through our 
senses of hearing and touch. 

How do our sensory systems work? How are the initial stimuli detected? 
How are these initial biochemical events transformed into perceptions and 
experiences? We have already encountered systems that sense and respond 
to chemical signals—namely, receptors that bind to growth factors and 
hormones. Our knowledge of these receptors and their associated signal- 
transduction pathways provides us with concepts and tools for unraveling 
some of the workings of sensory systems. For example, 7I'M receptors 
(seven-transmembrane receptors; Section 14.1) play key roles in olfaction, 
taste, and vision. Ion channels that are sensitive to mechanical stress are 
essential for hearing and touch. 

In this chapter, we focus on the five major sensory systems found in 
human beings and other mammals: olfaction (the sense of smell—i.e., the 
detection of small molecules in the air), taste, or gustation (the detection of 
selected organic compounds and ions by the tongue), vision (the detection 
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FIGURE 33.1 Sensory connections to the brain. eee : 
Sensory nerves connect sensory organs to the brain compound 3-methylbutane-1 -thiol is a major component of the odor 


and spinal cord. 
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of light), hearing (the detection of sound, or pressure waves in the air), 
and touch (the detection of changes in pressure, temperature, and 
other factors by the skin). Each of these primary sensory systems con- 
tains specialized sensory neurons that transmit nerve impulses to the 
central nervous system (Figure 33.1). In the central nervous system, 
these signals are processed and combined with other information to 
yield a perception that may trigger a change in behavior. By these 
means, our senses allow us to detect changes in our environments and 
to adjust our behavior appropriately. 


33.1 A Wide Variety of Organic Compounds 
Are Detected by Olfaction 


Human beings can detect and distinguish thousands of different 
compounds by smell, often with considerable sensitivity and speci- 
ficity. Most odorants are small organic compounds with sufficient 
volatility that they can be carried as vapors into the nose. For exam- 
ple, a major component responsible for the odor of almonds is the 
simple aromatic compound benzaldehyde, whereas the sulfhydryl 


of skunks. 


3-Methylbutane-1-thiol Geraniol Zingiberene 
(Skunk) (Rose) (Ginger) 


What properties of these molecules are responsible for their odors? First, 
the shape of the molecule rather than its other physical properties is crucial. We 
can most clearly see the importance of shape by comparing molecules such 
as those responsible for the odors of spearmint and caraway. 

These compounds are identical in essentially all physical properties such 
as hydrophobicity because they are exact mirror images of one another. 
Thus, the odor produced by an odorant depends not on a physical property 
but on the compound’s interaction with a specific binding surface, most 
likely a protein receptor. Second, some human beings (and other animals) 
suffer from specific anosmias; that is, they are incapable of smelling specific 
compounds even though their olfactory systems are otherwise normal. Such 
anosmias are often inherited. These observations suggest that mutations in 
individual receptor genes lead to the loss of the ability to detect a small sub- 
set of compounds. 


Olfaction is mediated by an enormous family of seven- 
transmembrane-helix receptors 


Odorants are detected in a specific region of the nose, called the main olfac- 
tory epithelium, that lies at the top of the nasal cavity (Figure 33.2). 
Approximately 1 million sensory neurons line the surface of this region. 
Cilia containing the odorant-binding protein receptors project from these 
neurons into the mucous lining of the nasal cavity. 
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Biochemical studies in the late 1980s examined isolated cilia from rat olfac- 
tory epithelia that had been treated with odorants. Exposure to the odorants 
increased the cellular level of cyclic AMP, and this increase was observed 
only in the presence of GTP. On the basis of what was known about signal- 
transduction systems, the participation of cAMP and GTP strongly suggested 
the involvement of a G protein and, hence, 7TM receptors. Indeed, Randall 
Reed purified and cloned a G-protein a subunit, termed G of Which is 
uniquely expressed in olfactory cilia. The involvement of 7TM receptors sug- 
gested a strategy for identifying the olfactory receptors themselves. 
Complementary DNAs were sought that (1) were expressed primarily in the 
sensory neurons lining the nasal epithelium, (2) encoded members of the 
7TM-receptor family, and (3) were present as a large and diverse family to 
account for the range of odorants. Through the use of these criteria, cDNAs 
for odorant receptors from rats were identified in 1991 by Richard Axel and 
Linda Buck. 


a The odorant receptor (hereafter, OR) family is even 
larger than expected: more than 1000 OR genes are 
present in the mouse and the rat, whereas the human genome 
encodes approximately 350 ORs. In addition, the human 
genome includes approximately 500 OR pseudogenes 
containing mutations that prevent the generation of a 
full-length, proper odorant receptor. The OR family is 
thus one of the largest gene families in human beings. 
Further analysis of primate OR genes reveals that the 
fraction of pseudogenes is greater in species more closely 
related to human beings (Figure 33.3). Thus, we may 
have a glimpse at the evolutionary loss of acuity in the 
sense of smell as higher mammals presumably became 
less dependent on this sense for survival. For rodents that 
are highly dependent on their sense of smell, essentially 
all OR genes encode functional proteins. 

The OR proteins are typically 20% identical in 
sequence with the B-adrenergic receptor (Section 14.1) 
and from 30% to 60% identical with one another. Several 
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FIGURE 33.2 The main nasal epithelium. 
This region of the nose, which lies at the top 
of the nasal cavity, contains approximately 

1 million sensory neurons. Nerve impulses 
generated by odorant molecules binding to 
receptors on the cilia travel from the sensory 
neurons to the olfactory bulb. 


FIGURE 33.3 Evolution of odorant receptors. Odorant receptors 
appear to have lost function through conversion into pseudogenes 


specific sequence features are present in most ca all in the course of primate evolution. The percentage of OR genes that 
OR family members (Figure 33.4). The central region, appear to be functional for each species is given in parentheses. 


particularly transmembrane helices 4 and 5, is highly 
variable, suggesting that this region is the site of odorant 
binding. That site must be different in odorant receptors 
that bind distinct odorant molecules. 

What is the relation between OR gene expression and 
the individual neuron? Interestingly, each olfactory neuron 
expresses only a single OR gene, among hundreds available. 
Apparently, the precise OR gene expressed is determined 
largely at random. After one OR gene is expressed and a 
functional OR protein is produced, the expression of all 
other OR genes is suppressed by a feedback mechanism 
that remains to be fully elucidated. 

The binding of an odorant to an OR on the neuronal sur- 
face initiates a signal-transduction cascade that results in an 


FIGURE 33.4 Conserved and variant regions in odorant action potential (Figure 33.5). The ligand-bound OR acti- 
receptors. Odorant receptors are members of the 7TM-receptor vates G olf)? the specific G protein mentioned earlier. Grit) 1s 
family. The green cylinders represent the seven presumed initially in its GDP-bound form. When activated, it releases 


transmembrane helices. Strongly conserved residues characteristic GDP, binds GTP, and releases its associated Bry subunits. 


of this protein family are shown in blue, whereas highly variable 


residues are shown in red. 
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The a subunit then activates a specific adenylate cyclase, 
increasing the intracellular concentration of cAMP. The 
rise in the intracellular concentration of cAMP activates a 
nonspecific cation channel that allows calcium and other 
cations into the cell. The flow of cations through the channel depolarizes the 
neuronal membrane and initiates an action potential. This action potential, 
combined with those from other olfactory neurons, leads to the perception of 
a specific odor. 


Odorant cAMP- gated 


0 ei Adenylate Nat ion channel 


FIGURE 33.5 The olfactory signal-transduction cascade. The binding of odorant to the 
olfactory receptor activates a signaling pathway similar to those initiated in response to the 
binding of some hormones to their receptors. The final result is the opening of cCAMP-gated 
ion channels and the initiation of an action potential. 


Odorants are decoded by a combinatorial mechanism 


An obvious challenge presented to an investigator by the large size of the 
OR family is to match each OR with the one or more odorant molecules to 
which it binds. Exciting progress has been made in this regard. Initially, an 
OR was matched with odorants by overexpressing a single, specific OR 
gene in rats. This OR responded to straight-chain aldehydes, most favor- 
ably to n-octanal and less strongly to n-heptanal and n-hexanal. More- 
dramatic progress was made by taking advantage of our knowledge of the 
OR signal-transduction pathway and the power of the polymerase chain 
reaction (PCR). A section of nasal epithelium from a mouse was loaded with 
the calctum-sensitive dye Fura-2 (Section 14.1). The tissue was then treated 
with different odorants, one at a time, at a specific concentration. If the 
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FIGURE 33.6 Four series of odorants tested for olfactory-receptor 
activation. 


odorant had bound to an OR and activated it, that neu- 
ron could be detected under a microscope by the change 
in fluorescence caused by the influx of calcium that takes 
place as part of the signal-transduction process. To 
determine which OR was responsible for the response, 
cDNA was generated from mRNA that had been iso- 
lated from single identified neurons. The cDNA was 
then subjected to PCR with the use of primers that are 
effective in amplifying most or all OR genes. The 
sequence of the PCR product from each neuron was then 
determined and analyzed. 

Using this approach, investigators analyzed the 
responses of neurons to a series of compounds having 
varying chain lengths and terminal functional groups 
(Figure 33.6). The results of these experiments appear 
surprising at first glance (Figure 33.7). Importantly, 
there is not a simple 1:1 correspondence between odor- 
ants and receptors. Almost every odorant activates a num- 
ber of receptors (usually to different extents) and almost 
every receptor is activated by move than one odorant. Note, 
however, that each odorant activates a unique combina- 
tion of receptors. In principle, this combinatorial mecha- 
nism allows even a small array of receptors to distinguish 
a vast number of odorants. 

How is the information about which receptors have 
been activated transmitted to the brain? Recall that each 
neuron expresses only one OR and that the pattern of 
expression appears to be largely random. A substantial 
clue to the connections between receptors and the brain 
has been provided by the creation of mice that express a 
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FIGURE 33.7 Patterns of olfactory-receptor activation. Fourteen 
different receptors were tested for responsiveness to the compounds 
shown in Figure 33.6. A colored box indicates that the receptor at the 
top responded to the compound at the left. Darker colors indicate 
that the receptor was activated at a lower concentration of odorant. 


Odorant 


gene for an easily detectable colored marker in conjunction with a specific 
OR gene. Olfactory neurons that express the OR-marker-protein combina- 
tion were traced to their destination in the brain, a structure called the olfac- 
tory bulb (Figure 33.8). The processes from neurons that express the same 


OR gene were found to connect to the same location in the olfactory bulb. 
Moreover, this pattern of neuronal connection was found to be identical in all 
mice examined. Thus, neurons that express specific ORs are linked to specific 
sites in the brain. This property creates a spatial map of odorant-responsive 


neuronal activity within the olfactory bulb. 


Can such a combinatorial mechanism truly distinguish many different 
odorants? An electronic “nose” that functions by the same principles 


FIGURE 33.8 Converging olfactory neurons. This section of the nasal cavity is stained 
to reveal processes from sensory neurons expressing the same olfactory receptor. The 
processes converge to a single location in the olfactory bulb. [Reprinted from Cell, 87, 

P. Mombaerts, F. Wang, C. Dulac, S. K. Chao, A. Nemes, M. Mendelsohn, J. Edmondson, & 
R. Axel, Visualizing an Olfactory Sensory Map, p. 675-689, Copyright 1996, with permission 


from Elsevier.] 
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FIGURE 33.9 The Cyranose 320. The 
electronic nose may find uses in the food 
industry, animal husbandry, law enforcement, 
and medicine. [Courtesy of Smiths Medical.] 
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provides compelling evidence that it can (Figure 33.9). The receptors for the 
electronic nose are polymers that bind a range of small molecules. Each 
polymer binds every odorant, but to varying degrees. Importantly, the elec- 
trical properties of these polymers change on odorant binding. A set of 32 of 
these polymer sensors, wired together so that the pattern of responses can be 
evaluated, is capable of distinguishing individual compounds such as 
n-pentane and n-hexane as well as complex mixtures such as the odors of 
fresh and spoiled fruit. 


33.2 Taste Is a Combination of Senses That Function 
by Different Mechanisms 


The inability to taste food is a common complaint when nasal congestion 
reduces the sense of smell. Thus, smell greatly augments our sense of taste 
(also known as gustation), and taste is, in many ways, the sister sense to 
olfaction. Nevertheless, the two senses differ from each other in several 
important ways. First, we are able to sense several classes of compounds by 
taste that we are unable to detect by smell; salt and sugar have very little 
odor, yet they are primary stimuli of the gustatory system. Second, whereas 
we are able to discriminate thousands of odorants, discrimination by taste is 
much more modest. Five primary tastes are perceived: bitter, sweet, sour, 
salty, and umami (the taste of glutamate and aspartate from the Japanese 
word for “deliciousness’”). These five tastes serve to classify compounds 
into potentially nutritive and beneficial (sweet, salty, umami) or potentially 
harmful or toxic (bitter, sour). Tastants (the molecules sensed by taste) are 
quite distinct for the different groups (Figure 33.10). 


OH H 
Na* 
Sodium ion Glutamate Quinine Hydrogen ion 
(salty) (umami) (bitter) (sour) 


FIGURE 33.10 Examples of tastant molecules. Tastants fall into five groups: sweet, salty, 
umami, bitter, and sour. 


The simplest tastant, the hydrogen ion, is perceived as sour. Other 
simple ions, particularly sodium ion, are perceived as salty. The taste 
called umami is evoked by the amino acids glutamate and aspartate, the 
former often encountered as the flavor enhancer monosodium glutamate 
(MSG). In contrast, tastants perceived as sweet and, particularly, bitter are 
extremely diverse. Many bitter compounds are alkaloids or other plant 
products, many of which are toxic. However, they do not have any com- 
mon structural elements or other common properties. Carbohydrates such 
as glucose and sucrose are perceived as sweet, as are other compounds 
including some simple peptide derivatives, such as aspartame, and even 
some proteins. 


These differences in specificity among the five tastes are due to differences 
in their underlying biochemical mechanisms. The sense of taste is, in fact, a 
number of independent senses all utilizing the same organ, the tongue, for 
their expression. 

Tastants are detected by specialized structures called taste buds, which 
contain approximately 150 cells, including sensory neurons (Figure 33.11). 
Fingerlike projections called microvilli, which are rich in taste receptors, 
project from one end of each sensory neuron to the surface of the tongue. 
Nerve fibers at the opposite end of each neuron carry electrical impulses to 
the brain in response to stimulation by tastants. Structures called taste 
papillae contain numerous taste buds. 


Sequencing of the human genome led to the discovery of a large 
family of 7TM bitter receptors 


Just as in olfaction, a number of clues pointed to the involvement of G pro- 
teins and, hence, 7TM receptors in the detection of bitter and sweet tastes. 
The evidence included the isolation of a specific G-protein a subunit termed 
gustducin, which is expressed primarily in taste buds (Figure 33.12). How 
could the 7TM receptors be identified? The ability to detect some com- 
pounds depends on specific genetic loci in both human beings and mice. For 
instance, the ability to taste the bitter compound 6-n-propyl-2-thiouracil 
(PROP) was mapped to a region on human chromosome 5 by comparing 
DNA markers of persons who vary in sensitivity to this compound. 

This observation suggested that this region might encode a 7TM 
receptor that responded to PROP. Approximately 450 kilobases in this 
region had been sequenced early in the human genome project. This 
sequence was searched by computer for potential 7TM-receptor genes, 
and, indeed, one was detected and named T2R1. Additional database 
searches detected approximately 30 sequences similar to T2R1 in the 
human genome. The encoded proteins are between 30 and 70% identical 
with T2R1 (Figure 33.13). 

Are these proteins, in fact, bitter receptors? Several lines of evidence 
suggest that they are. First, their genes are expressed in taste-sensitive 
cells—in fact, in many of the same cells that express gustducin. Second, cells 
that express individual members of this family respond to specific bitter 
compounds. For example, cells that express a specific mouse receptor 
(mT2R5) responded when exposed specifically to cycloheximide. Third, 
mice that had been found unresponsive to cycloheximide were found to 
have point mutations in the gene encoding mT2R5. Finally, cycloheximide 


FIGURE 33.13 Conserved and variant regions in bitter receptors. The bitter receptors are 
members of the 7TM-receptor family. Strongly conserved residues characteristic of this protein 
family are shown in blue, and highly variable residues are shown in red. 
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FIGURE 33.11 A taste bud. Each taste 
bud contains sensory neurons that extend 
microvilli to the surface of the tongue, 
where they interact with tastants. 


6-n-Propyl-2-thiouracil 
(PROP) 


(A) 

(B) 

FIGURE 33.12 Expression of gustducin 
in the tongue. (A) A section of tongue 
stained with a fluorescent antibody reveals 
the position of the taste buds. (B) The same 
region stained with an antibody directed 
against gustducin reveals that this G protein 


is expressed in taste buds. [Courtesy of 
Dr. Charles S. Zuker.] 
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specifically stimulates the binding of GTP analogs to gustducin in the pres- 
ence of the mT2R5 protein (Figure 33.14). 

Importantly, each taste-receptor cell expresses many different members 
of the T2R family. This pattern of expression stands in sharp contrast to the 
pattern of one receptor type per cell that characterizes the olfactory system 
(Figure 33.15). The difference in expression patterns accounts for the much 
greater specificity of our perceptions of odors compared with tastes. We are 
able to distinguish among subtly different odors because each odorant stimulates 
a unique pattern of neurons. In contrast, many tastants stimulate the same 
neurons. Thus, we perceive only “bitter” without the ability to discriminate 
cycloheximide from quinine. 
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FIGURE 33.14 Evidence that T2R 
proteins are bitter taste receptors. 
Cycloheximide uniquely stimulates the 
binding of the GTP analog GTPyS to 
gustducin in the presence of the mT2R 
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FIGURE 33.15 Differing gene-expression and connection patterns in olfactory and bitter 
taste receptors. In olfaction, each neuron expresses a single OR gene, and the neurons 
expressing the same OR converge to specific sites in the brain, enabling specific perception 
of different odorants. In gustation, each neuron expresses many bitter receptor genes, and so 
the identity of the tastant is lost in transmission. 
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FIGURE 33.16 Evidence for a heterodimeric sweet 
receptor. The sensitivity to sweetness of mice with genes for 
T1R1 only, both T1R1 and T1R2, both T1R1 and T1R3, or both 
T1R2 and T1R3 was determined by observing the relative rates 
at which they licked solutions containing various amounts of 
sucrose. These studies revealed that both T1R2 and T1R3 were 
required for a full response to sucrose. [Data from G. Q. Zhao et 
al., Cell 115:255-266, 2003.] 
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A heterodimeric 7TM receptor responds to 
sweet compounds 


Most sweet compounds are carbohydrates, energy rich and 
easily digestible. Some noncarbohydrate compounds such 
as saccharin and aspartame also taste sweet. Members of a 
second family of 7TM receptors are expressed in taste- 
receptor cells sensitive to sweetness. The three members of 
this family, referred to as T1R1, T1R2, and T1R3, are dis- 
tinguished by their large extracellular domains compared 
with those of the bitter receptors. Studies in knockout mice 
have revealed that T1R2 and T1R3 are expressed simulta- 
neously in mice able to taste carbohydrates (Figure 33.16). 
Thus, T1R2 and T1R3 appear to form a specific heterodi- 
meric receptor responsible for mediating the response to 
sugars. This heterodimeric receptor also responds to artifi- 
cial sweeteners and to sweet-tasting proteins and therefore 
appears to be the receptor responsible for responses to all 
sweet tastants. Note that T1R2 and T1R3 do respond to 
sweet tastants individually but only at very high concentra- 
tions of tastant. 


The requirement for an oligomeric 7TM receptor for a fully functional 
response is surprising, considering our previous understanding of 7TM 
receptors. This discovery has at least two possible explanations. First, the 
sweet receptor could be a member of a small subset of the 7TM-receptor 
family that functions well only as oligomers. Alternatively, many 71M 
receptors may function as oligomers, but this notion is not clear, because 
these oligomers contain only one type of 7TM-receptor subunit. Further 
studies will be required to determine which of these explanations is correct. 


Umami, the taste of glutamate and aspartate, is mediated by 
a heterodimeric receptor related to the sweet receptor 


The family of receptors responsible for detecting sweetness is also respon- 
sible for detecting amino acids. In human beings, only glutamate and aspar- 
tate elicit a taste response. Studies similar to those for the sweet receptor 
revealed that the umami receptor consists of T1R1 and T1R3. Thus, this 
receptor has one subunit (T1R3) in common with the sweet receptor but 
has an additional subunit (T1R1) that does not participate in the sweet 
response. This conclusion is supported by the observation that mice in 
which the gene for T1R1 is disrupted do not respond to aspartate but do 
respond normally to sweet tastants; mice having disrupted genes for both 
T1R1 and T1R3 respond poorly to both umami and sweet tastants. 


Salty tastes are detected primarily by the passage of sodium 
ions through channels 


Salty tastants are not detected by 7TM receptors. Rather, they are detected 
directly by their passage through ion channels expressed on the surface of 
cells in the tongue. Evidence for the role of these ion channels comes from 
examining known properties of Na* channels characterized in other bio- 
logical contexts. One class of channels, characterized first for its role in salt 
reabsorption, is thought to be important in the detection of salty tastes 
because these channels are sensitive to the compound amiloride, which 
mutes the taste of salt and significantly lowers sensory-neuron activation in 
response to sodium. 

An amiloride-sensitive Na* channel comprises four subunits that may be 
either identical or distinct but in any case are homologous. An individual 
subunit ranges in length from 500 to 1000 amino acids and includes two 
presumed membrane-spanning helices as well as a large extracellular 
domain in between them (Figure 33.17). The extracellular region includes 
two (or, sometimes, three) distinct regions rich in cysteine residues (and, 
presumably, disulfide bonds). A region just ahead of the second membrane- 
spanning helix appears to form part of the pore in a manner analogous to 
that of the structurally characterized potassium channel. The members of 
the amiloride-sensitive Na‘ -channel family are numerous and diverse in 
their biological roles. We shall encounter them again in the context of the 
sense of touch. 

Sodium ions passing through these channels produce a significant trans- 
membrane current. Amiloride blocks this current, accounting for its effect 
on taste. However, about 20% of the response to sodium remains even in the 
presence of amiloride, suggesting that other ion channels also contribute to 
salt detection. 


Sour tastes arise from the effects of hydrogen ions (acids) on channels 


Like salty tastes, sour tastes are detected by direct interactions with ion channels, 
but the incoming ions are hydrogen ions (in high concentrations) rather than 
sodium ions. For example, in the absence of high concentrations of sodium, 
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FIGURE 33.17 Schematic structure 

of the amiloride-sensitive sodium 
channel. Only one of the four subunits 
that constitute the functional channel is 
illustrated. The amiloride-sensitive sodium 
channel belongs to a superfamily having 
common structural features, including two 
hydrophobic membrane-spanning regions, 
intracellular amino and carboxyl termini, 
and a large, extracellular region with 
conserved cysteine-rich domains. 
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hydrogen ion flow can induce substantial transmembrane currents through 
amiloride-sensitive Na‘ channels. However, hydrogen ions are also sensed by 
mechanisms other than their direct passage through membranes. Binding by 
hydrogen ions blocks some potassium ion channels and activates other types 
of channels. Together, these mechanisms lead to changes in membrane polar- 
ization in sensory neurons that produce the sensation of sour taste. We shall 
consider an additional receptor related to taste, one responsible for the “hot” 
taste of spicy food, when we examine mechanisms of touch perception. 


33.3 Photoreceptor Molecules in the Eye 
Detect Visible Light 


Vision is based on the absorption of light by photoreceptor cells in the eye. 
These cells are sensitive to light in a narrow region of the electromagnetic 
spectrum, the region with wavelengths between 390 and 
750 nm (Figure 33.18). Vertebrates have two kinds of pho- 
toreceptor cells, called rods and cones because of their dis- 
tinctive shapes. Cones function in bright light and are 


responsible for color vision, whereas rods function in dim 
| | | 


light but do not perceive color. A human retina contains 


—10 —7 —6 2, 
sia io a Ra rane about 3 million cones and 100 million rods. Remarkably, a 
Wavelenuttidn) rod cell can respond to a single photon, and the brain 
requires fewer than 10 such responses to register the sen- 
FIGURE 33.18 The electromagnetic spectrum. Visible light has sation of a flash of light. 


wavelengths between 390 and 750 nm. 


11-cis-Retinal 


Rhodopsin, a specialized 7TM receptor, absorbs visible light 


Rods are slender, elongated structures; the outer segment is specialized for 
photoreception (Figure 33.19). It contains a stack of about 1000 discs, 
which are membrane-enclosed sacs densely packed with photoreceptor 
molecules. The photosensitive molecule is often called a visual pigment 
because it is highly colored owing to its ability to absorb light. The photore- 
ceptor molecule in rods is rhodopsin (Section 14.1), which consists of the 
protein opsin linked to 11-cis-vetinal, a prosthetic group. 
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FIGURE 33.19 The rod cell. (Left) Scanning electron micrograph of retinal rod cells. 
(Right) Schematic representation of a rod cell. [Photograph courtesy of Dr. Deric Bownds.] 


Rhodopsin absorbs light very efficiently in the middle of the visible 
spectrum, its absorption being centered on 500 nm, which nicely matches 
the solar output (Figure 33.20). A rhodopsin molecule will absorb a high 
percentage of the photons of the correct wavelength that strike it, as indi- 
cated by the extinction coefficient of 40,000 M~'cm™! at 500 nm. The 
extinction coefficient for rhodopsin is more than an order of magnitude 
greater than that for tryptophan, the most efficient absorber in proteins that 
lack prosthetic groups. 

Opsin, the protein component of rhodopsin, is a member of the 
7TM-receptor family. Indeed, rhodopsin was the first member of this fam- 
ily to be purified, its gene was the first to be cloned and sequenced, and its 
three-dimensional structure was the first to be determined. The color of 
rhodopsin and its responsiveness to light depend on the presence of the 
light-absorbing group (chromophore) 11-cis-retinal. This compound is a 
powerful absorber of light because it is a polyene; its six alternating single 
and double bonds constitute a long, unsaturated electron network. Recall 
that alternating single and double bonds account for the chromophoric 
properties of chlorophyll (Section 19.2). The aldehyde group of 11-cis- 
retinal forms a Schiff base (Figure 33.21) with the e-amino group of lysine 
residue 296, which lies in the center of the seventh transmembrane helix. 
Free retinal absorbs maximally at 370 nm, and its unprotonated Schiff-base 
adduct absorbs at 380 nm, whereas the protonated Schiff base absorbs at 
440 nm or longer wavelengths. Thus, the 500-nm absorption maximum for 
rhodopsin strongly suggests that the Schiff base is protonated; additional inter- 
actions with opsin shift the absorption maximum farther toward the red. 
The positive charge of the protonated Schiff base is compensated by the 
negative charge of glutamate 113 located in helix 2; the glutamate residue 
closely approaches the lysine-retinal linkage in the three-dimensional 
structure of rhodopsin. 


Schiff base 


(11-cis-Retinal) Lysine 


Light absorption induces a specific isomerization of 
bound 11-cis-retinal 


How does the absorption of light by the retinal Schiff base generate a signal? 
George Wald and his coworkers discovered that light absorption results in the 
isomerization of the 11-cis-retinal group of rhodopsin to its all-trans form 
(Figure 33.22). This isomerization causes the Schiff-base nitrogen atom to 
move approximately 5 A, assuming that the cyclohexane ring of the retinal 
group remains fixed. In essence, the light energy of a photon is converted into 
atomic motion. The change in atomic positions, like the binding of a ligand 
to other 7TM receptors, sets in train a series of events that lead to the clos- 
ing of ion channels and the generation of a nerve impulse. 

The isomerization of the retinal Schiff base takes place within a few pico- 
seconds of a photon being absorbed. The initial product, termed bathorho- 
dopsin, contains a strained all-trans-retinal group. Within approximately 


40,000 500 


30,000 |— 


20,000 


10,000 —— 


Extinction coefficient (M-! cm~') 


0 | | | | 
300 400 500 600 700 


Wavelength (nm) 


FIGURE 33.20 Rhodopsin absorption 
spectrum. Almost all photons with 
wavelengths near 500 nm that strike 

a rhodopsin molecule are absorbed. 


FIGURE 33.21 Retinal-lysine linkage. 
Retinal is linked to lysine 296 in opsin by a 
Schiff-base linkage. In the resting state of 
rhodopsin, this Schiff base is protonated. 
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FIGURE 33.23 Analogous 7TM receptors. 
The conversion of rhodopsin into 
metarhodopsin II activates a signal- 
transduction pathway analogously to the 
activation induced by the binding of other 
7TM receptors to appropriate ligands. 
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FIGURE 33.22 Atomic motion in retinal. The Schiff-base nitrogen atom moves 5 Aasa 
consequence of the light-induced isomerization of 11-cis-retinal to all-trans-retinal by rotation 
about the bond shown in red. 


1 ms, this intermediate is converted through several additional intermedi- 
ates into metarhodopsin IT. In metarhodopsin II, the Schiff base is deproton- 
ated and the opsin protein has undergone significant reorganization. 

Metarhodopsin II (also referred to as R*) is analogous to the ligand- 
bound state of 7T’M receptors such as the B,-adrenergic receptor (Section 
14.1) and the odorant and tastant receptors discussed previously (Figure 
33.23). Like these receptors, this form of rhodopsin activates a heterotri- 
meric G protein that propagates the signal. The G protein associated with 
rhodopsin is called transducin. Metarhodopsin II triggers the exchange of 
GDP for GTP by the a subunit of transducin (Figure 33.24). On the bind- 
ing of GTP, the By subunits of transducin are released and the a subunit 
switches on a cGMP phosphodiesterase by binding to an inhibitory subunit 
and removing it. The activated phosphodiesterase is a potent enzyme that 
rapidly hydrolyzes cGMP to GMP. The reduction in cGMP concentration 
causes cGMP-gated ion channels to close, leading to the hyperpolarization of 
the membrane and neuronal signaling. At each step in this process, the initial 
signal—the absorption of a single photon—is amplified so that it leads to suf- 
ficient membrane hyperpolarization to result in signaling. 


Light-induced lowering of the calcium level coordinates recovery 


As we have seen, the visual system responds to changes in light and color 
within a few milliseconds, quickly enough that we are able to perceive con- 
tinuous motion at nearly 1000 frames per second. To achieve a rapid 
response, the signal must also be terminated rapidly and the system must be 
returned to its initial state. First, activated rhodopsin must be blocked from 
continuing to activate transducin. Rhodopsin kinase catalyzes the phos- 
phorylation of the carboxyl terminus of R* at multiple serine and threonine 
residues. Arrestin, an inhibitory protein then binds phosphorylated R* and 
prevents additional interaction with transducin. 


Light cGMP- gated 
: Phosphodiesterase —_ion channel Na* 


Rhodopsin 


A 


L 


FIGURE 33.24 Visual signal transduction. The light-induced activation of rhodopsin leads 
to the hydrolysis of CGMP, which in turn leads to ion-channel closing and the initiation of an 
action potential. 
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Second, the a subunit of transducin must be returned to its inactive state 973 
to prevent further signaling. Like other G proteins, the a subunit possesses 33.3 Vision 
built-in GTPase activity that hydrolyzes bound GTP to GDP. Hydrolysis 
takes place in less than a second when transducin is bound to the phospho- 
diesterase. The GDP form of transducin then leaves the phosphodiesterase 
and reassociates with the By subunits, and the phosphodiesterase returns to 
its inactive state. Third, the level of CGMP must be raised to reopen the 
cGMP-gated ion channels. The action of guanylate cyclase accomplishes this 
third step by synthesizing cGMP from GTP. 

Calcium ion plays an essential role in controlling guanylate cyclase 530 
because it markedly inhibits the activity of the enzyme. In the dark, Ca?* as 
well as Na* enter the rod outer segment through the cGMP-gated channels. 
Calcium ion influx is balanced by its efflux through an exchanger, a trans- 
port system that uses the thermodynamically favorable flow of four Na* 
ions into the cell and one K* ion out of the cell to extrude one Ca* ion. 
After illumination, the entry of Ca?* through the cGMP-gated channels 
stops, but its export through the exchanger continues. Thus, the cytoplas- 
mic Ca**t level drops from 500 nM to 50 nM after illumination. This drop 300 400 500 600 700 800 
markedly stimulates guanylate cyclase, rapidly restoring the concentration Wavelength (nm) 
of cGMP to reopen the cGMP-gated channels. 
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FIGURE 33.25 Cone-pigment absorption 
spectra. The absorption spectra of the cone 
visual pigment responsible for color vision. 
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By controlling the rate of CGMP synthesis, Ca?* levels govern the speed 
with which the system is restored to its initial state. 


Color vision is mediated by three cone receptors that 
are homologs of rhodopsin 


Cone cells, like rod cells, contain visual pigments. Like 
rhodopsin, these photoreceptor proteins are members of 
the 7TM-receptor family and use 11-cis-retinal as their 
chromophore. In human cone cells, there are three distinct 
photoreceptor proteins with absorption maxima at 426, 
530, and ~560 nm (Figure 33.25). These absorbances cor- 
respond to the blue, green, and yellow-green regions of the 
spectrum. Recall that the absorption maximum for rhodop- 
sin is 500 nm. The photoreceptor protein with its absorp- 
tion maximum near 560 nm absorbs red light (with 
wavelength >620 nm) whereas the other two photorecep- 
tors do not. For simplicity, we shall refer to the three pro- 
teins as the blue, green, and red photoreceptors. 

The amino acid sequences of the cone photoreceptors 
have been compared with one another and with rhodopsin. 
The result is striking. Each of the cone photoreceptors is 
approximately 40% identical in sequence with rhodopsin. 
Similarly, the blue photoreceptor is 40% identical with each 
of the green and red photoreceptors. The green and red 
photoreceptors, however, are >95% identical with each 
other, differing in only 15 of 364 positions (Figure 33.26). 


FIGURE 33.26 Comparison of the amino acid sequences of 

the green and red photoreceptors. Open circles correspond to 

. ae : identical residues, whereas colored circles mark residues that are 

ina These observations are sources of insight into different. The differences in the three black positions are responsible 
photoreceptor evolution. First, the green and red _for most of the difference in their absorption spectra. 
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FIGURE 33.27 Evolutionary relationships 
among visual pigments. Visual pigments 
have evolved by gene duplication along 
different branches of the animal evolutionary 
tree. The branch lengths of the “trees” 
correspond to the percentage of amino 
acid divergence. [Information from 

J. Nathans, Neuron 24:299-312, 1999; 

by permission of Cell Press.] 


Homologous recombination 

The exchange of DNA segments with 
substantial sequence similarity at equivalent 
positions between chromosomes. 


FIGURE 33.28 Recombination pathways 
leading to color blindness. Rearrangements 
in the course of DNA replication may lead to 
(A) the loss of visual pigment genes or (B) the 
formation of hybrid pigment genes that 
encode photoreceptors with anomalous 
absorption spectra. Because the amino acids 
most important for determining absorption 
spectra are in the carboxyl-terminal half of 
each photoreceptor protein, the part of the 
gene that encodes this region most strongly 
affects the absorption characteristics of hybrid 
receptors. [Information from J. Nathans, Neuron 
24:299-312, 1999; by permission of Cell Press.] 
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Mouse photoreceptors are clearly products of a recent evolution- 


600 
ary event (Figure 33.27). The green and red pigments 


appear to have diverged in the primate lineage approxi- 
mately 35 million years ago. Mammals, such as dogs and 
mice, that diverged from primates earlier have only two 
cone photoreceptors, blue and green. They are not sensi- 
tive to light as far toward the infrared region as we are, and 
they do not discriminate colors as well. In contrast, birds 
such as chickens have a total of six pigments: rhodopsin, 
4400 four cone pigments, and a pineal visual pigment called 
pinopsin. Birds have highly acute color perception. 

Second, the high level of similarity between the green 
and the red pigments has made the identification of the 
specific amino acid residues responsible for spectral tun- 
ing possible. Three residues (at positions 180, 277, and 
285) are responsible for most of the difference between the green and the red 
pigments. In the green pigment, these residues are alanine, phenylalanine, 
and alanine, respectively; in the red pigment, they are serine, tyrosine, and 
threonine. A hydroxyl group has been added to each amino acid in the red 
pigment. The hydroxyl groups can interact with the photoexcited state of 
retinal and lower its energy, leading to a shift toward the lower-energy (red) 
region of the spectrum. 
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Rearrangements in the genes for the green and red pigments 
lead to “color blindness” 


e The genes for the green and red pigments lie adjacent to each other on 
the human X chromosome. These genes are more than 98% identical 
in nucleotide sequence, including introns and untranslated regions as well 
as the protein-coding region. Regions with such high similarity are very 
susceptible to unequal homologous recombination. 

Recombination can take place either between or within transcribed 
regions of the gene (Figure 33.28). If recombination takes place between 
transcribed regions, the product chromosomes will differ in the number 
of pigment genes that they carry. One chromosome will lose a gene and 
thus may lack the gene for, say, the green pigment; the other chromosome 
will gain a gene. Consistent with this scenario, approximately 2% of 
human X chromosomes carry only a single color-pigment gene, approxi- 
mately 20% carry two, 50% carry three, 20% carry four, and 5% carry five 
or more. A person lacking the gene for the green pigment will have 
trouble distinguishing red and green color, characteristic of the most 
common form of color blindness. Approximately 5% of males have this 
form of color blindness. Recombination can also take place within the 
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transcription units, resulting in genes that encode hybrids of the green 
and red photoreceptors. The absorption maximum of such a hybrid lies 
between that of the red and green pigments. A person with such hybrid 
genes who also lacks either a functional red- or green-pigment gene does 
not discriminate color well. 


33.4 Hearing Depends on the Speedy Detection of 
Mechanical Stimuli 


Hearing and touch are based on the detection of mechanical stimuli. 
Although the proteins of these senses have not been as well characterized as 
those of the senses already discussed, anatomical, physiological, and bio- 
physical studies have elucidated the fundamental processes. A major clue to 
the mechanism of hearing is its speed. We hear frequencies ranging from 200 
to 20,000 Hz (cycles per second), corresponding to times of 5 to 0.05 ms. 
Furthermore, our ability to locate sound sources, one of the most important 
functions of hearing, depends on the ability to detect the time delay 
between the arrival of a sound at one ear and its arrival at the other. Given 
the separation of our ears and the speed of sound, we must be able to accu- 
rately sense time differences of 0.7 ms. In fact, human beings can locate 
sound sources associated with temporal delays as short as 0.02 ms. This 
high time resolution implies that hearing must employ direct transduction 
mechanisms that do not depend on second messengers. Recall that, in 
vision, for which speed also is important, the signal-transduction processes 
take place in milliseconds. 


Hair cells use a connected bundle of stereocilia to detect tiny motions 


Sound waves are detected inside the cochlea of the inner ear. The cochlea is 
a fluid-filled, membranous sac that is coiled like a snail shell. The primary 
detection is accomplished by specialized neurons inside the cochlea called 
hair cells (Figure 33.29). Each cochlea contains approximately 16,000 hair 
cells, and each hair cell contains a hexagonally shaped bundle of 20 to 300 
hairlike projections called stereocilia (Figure 33.30). These stereocilia are 
graded in length across the bundle. Mechanical deflection of the hair bun- 
dle, as takes place when a sound wave arrives at the ear, creates a change in 
the membrane potential of the hair cell. 

Micromanipulation experiments have directly probed the connection 
between mechanical stimulation and membrane potential. Displacement 
toward the direction of the tallest part of the hair bundle results in the depo- 
larization of the hair cell, whereas displacement in the opposite direction 
results in its hyperpolarization (Figure 33.31). Motion perpendicular to the 
hair-length gradient does not produce any change in resting potential. 


FIGURE 33.31 Micromanipulation of a 
aL hair cell. Movement toward the tallest part 
of the bundle depolarizes the cell as 
a l l | | l l measured by the microelectrode. Movement 
-200 0 200 toward the shortest part hyperpolarizes the 
Displacement toward cell. Lateral movement has no effect. [After 
tallest part (nm) A. J. Hudspeth, Nature 341:397-404, 1989.] 
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FIGURE 33.29 Hair cells, the sensory 
neurons crucial for hearing. These 
specialized neurons are capped with 
hairlike projections called stereocilia that 
are responsible for detecting very subtle 
vibrations. [Information from A. J. Hudspeth, 
Nature 341:397-404, 1989.] 


FIGURE 33.30 An electron micrograph 
of a hair bundle. [Courtesy of Dr. A. Jacobs 
and Dr. A. J. Hudspeth.] 
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FIGURE 33.32 Electron micrograph of tip 
links. The tip link between two hair fibers 

is marked by an arrow. [Courtesy of 

Dr. A. Jacobs and Dr. A. J. Hudspeth.] 
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FIGURE 33.33 Model for hair-cell 
transduction. When the hair bundle is 
tipped toward the tallest part, the tip link 
pulls on an ion channel and opens it. 
Movement in the opposite direction relaxes 
the tension in the tip link, increasing the 
probability that any open channels will 
close. [Information from A. J. Hudspeth, 
Nature 341:397-404, 1989.] 


Remarkably, displacement of the hair bundle by as little as 3 A (0.3 nm) results 
in a measurable (and functionally important) change in membrane potential. 
This motion of 0.001 degree corresponds to a 0.3 inch movement of the top 
of the Empire State Building. 

How does the motion of the hair bundle create a change in membrane 
potential? The rapid response, within microseconds, suggests that the 
movement of the hair bundle acts on ion channels directly. An important 
observation is that adjacent stereocilia are linked by individual filaments 
called tip links (Figure 33.32). 

The presence of these tip links suggests a simple mechanical model for 
transduction by hair cells (Figure 33.33). The tip links are coupled to ion chan- 
nels in the membranes of the stereocilia that are gated by mechanical stress. In 
the absence of a stimulus, approximately 15% of these channels are open. 
When the hair bundle is displaced toward its tallest part, the stereocilia slide 
across one another and the tension on the tip links increases, causing addi- 
tional channels to open. The flow of ions through the newly opened channels 
depolarizes the membrane. Conversely, if the displacement is in the opposite 
direction, the tension on the tip links decreases, the open channels close, and 
the membrane hyperpolarizes. Thus, the mechanical motion of the hair bundle 
is directly converted into current flow across the hair-cell membrane. 


Mechanosensory channels have been identified in 
Drosophila and vertebrates 


The search for ion channels that respond to mechanical impulses has been 
pursued in a variety of organisms. Drosophila have sensory bristles used for 
detecting small air currents. These bristles respond to mechanical displace- 
ment in ways similar to those of hair cells; displacement of a bristle in one 
direction leads to substantial transmembrane current. Strains of mutant 
fruit flies that show uncoordinated motion and clumsiness have been exam- 
ined for their electrophysiological responses to displacement of the sensory 
bristles. In one set of strains, transmembrane currents were dramatically 
reduced. The mutated gene in these strains was found to encode a protein of 
1619 amino acids, called NompC for no mechanoreceptor potential. 

The carboxyl-terminal 469 amino acids of NompC resemble a class of 
ion-channel proteins called TRP (transient receptor potential) channels. 
This region includes six putative transmembrane helices with a pore-like 
region between the fifth and the sixth helices. The amino-terminal 1150 
amino acids consist almost exclusively of 29 ankyrin repeats (Figure 33.34). 


h® FIGURE 33.34 Ankyrin repeat structure. One ankyrin domain is shown in red in this 
series of four ankyrin repeats. Notice the hairpin loop followed by a helix-turn-helix motif in the 
red-colored ankyrin unit. Ankyrin domains interact with other proteins, primarily through their 
loops. [Drawn from 1AWC.pdb.] 


Ankyrin repeats are structural motifs consisting of a hairpin loop followed 
by a helix-turn-helix. Importantly, in other proteins, regions with tandem 
arrays of these motifs mediate protein—protein interactions, suggesting that 
these arrays couple the motions of other proteins to the activity of the 
NompC channel. 

A candidate for at least one component of the mechanosensory channel 
taking part in hearing has been identified. The protein, TRPA1, is a mem- 
ber of the TRP channel family. The sequence of TRPA1 also includes 17 
ankyrin repeats. TRPA1 is expressed in hair cells, particularly near their 
tips. Further studies are under way to test and extend the exciting hypoth- 
esis that this protein is a component of the sought-after channel. 


33.5 Touch Includes the Sensing of Pressure, 
Temperature, and Other Factors 


Like taste, touch is a combination of sensory systems that are expressed ina 
common organ—in this case, the skin. The detection of pressure and the 
detection of temperature are two key components. Amiloride-sensitive Na* 
channels, homologous to those of taste, appear to play a role. Other systems 
are responsible for detecting painful stimuli such as high temperature, acid, 
or certain specific chemicals. Although our understanding of this sensory 
system is not as advanced as that of the others, recent work has revealed a 
fascinating relation between pain and taste sensation, a relation well known 
to anyone who has eaten “spicy” food. 


Studies of capsaicin reveal a receptor for sensing high 
temperatures and other painful stimuli 


Our sense of touch is intimately connected with the sensation of pain. 
Specialized neurons, termed nociceptors, transmit signals from skin to pain- 
processing centers in the spinal cord and brain in response to the onset of 
tissue damage. What is the molecular basis for the sensation of pain? An 
intriguing clue came from the realization that capsaicin, the chemical 
responsible for the “hot” taste of spicy food, activates nociceptors. 
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Early research suggested that capsaicin would act by opening ion chan- 
nels that are expressed in nociceptors. Thus, a cell that expresses the cap- 
saicin receptor should take up calcium on treatment with the molecule. 
This insight led to the isolation of the capsaicin receptor with the use of 
cDNA from cells expressing this receptor. Such cells had been detected by 
their fluorescence when loaded with the calcium-sensitive compound 
Fura-2 and then treated with capsaicin or related molecules. Cells express- 
ing the capsaicin receptor, which is called VR1 (for vanilloid receptor 1), 
respond to capsaicin below a concentration of 1 wM. The deduced 
838-residue sequence of VR1 revealed it to be a member of the TRP chan- 
nel family (Figure 33.35). The amino-terminal region of VR1 includes 
three ankyrin repeats. 
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FIGURE 33.35 The membrane topology 
deduced for VR1, the capsaicin receptor. 
The proposed site of the membrane pore 

is indicated in red, and the three ankyrin 

(A) repeats are shown in orange. The active 
receptor comprises four of these subunits. 
[Information from M. J. Caterina et al., Nature 
389:816-824, 1997.] 
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FIGURE 33.36 Response of the capsaicin receptor to pH and temperature. The ability of 
this receptor to respond to acid and to increased temperature helps detect potentially noxious 
situations. [Data from M. Tominaga et al., Neuron 21:531-548, 1998.] 


Currents through VR1 are also induced by temperatures above 40°C and 
by exposure to dilute acid, with a midpoint for activation at pH 5.4 (Figure 
33.36). Temperatures and acidity in these ranges are associated with infec- 
tion and cell injury. The responses to capsaicin, temperature, and acidity are 
not independent. The response to heat is greater at lower pH, for example. 
Thus, VR1 acts to integrate several noxious stimuli. We feel these responses 
as pain and act to prevent the potentially destructive conditions that cause 
the unpleasant sensation. Studies of mice that do not express VR1 suggest 
that this is the case; such mice do not mind food containing high concentra- 
tions of capsaicin and are, indeed, less responsive than control mice to nor- 
mally noxious heat. Plants such as chili peppers presumably gained the 
ability to synthesize capsaicin and other “hot” compounds to protect them- 
selves from being consumed by mammals. Birds, which play the beneficial 
role of spreading pepper seeds into new territory, do not appear to respond 
to capsaicin. 


e Because of its ability to simulate VR1, capsaicin is used in pain man- 

agement for arthritis, neuralgia, and other neuropathies. How can a 
compound that induces pain assist in its alleviation? Chronic exposure to 
capsaicin overstimulates pain-transmitting neurons, leading to their 
desensitization. 


SUMMARY 


Smell, taste, vision, hearing, and touch are based on signal- 
transduction pathways activated by signals from the environment. 
These sensory systems function similarly to the signal-transduction 
pathways for many hormones. These intercellular signaling pathways 
appear to have been appropriated and modified to process environ- 
mental information. 


33.1 A Wide Variety of Organic Compounds Are Detected by Olfaction 
The sense of smell, or olfaction, is remarkable in its specificity; it can, 
for example, discern stereoisomers of small organic compounds as 
distinct aromas. The 7TM receptors that detect these odorants operate 
in conjunction with Gi olf @ G protein that activates a cAMP cascade 
resulting in the opening of an ion channel and the generation of anerve 
impulse. An outstanding feature of the olfactory system is its ability to 
detect a vast array of odorants. Each olfactory neuron expresses only 
one type of receptor and connects to a particular region of the olfactory 
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bulb. Odors are decoded by a combinatorial mechanism: each odorant 
activates a number of receptors, each to a different extent, and most 
receptors are activated by more than one odorant. 


Taste Is a Combination of Senses That Function by 

Different Mechanisms 

We can detect only five tastes: bitter, sweet, salt, sour, and umami. 
The transduction pathways that detect taste are, however, diverse. 
Bitter, sweet, and umami tastants are experienced through 7TM 
receptors acting through a special G protein called gustducin. Salty 
and sour tastants act directly through membrane channels. Salty tas- 
tants are detected by passage though Na‘ channels, whereas sour taste 
results from the effects of hydrogen ions on a number of types of chan- 
nels. The end point is the same in all cases—membrane polarization 
that results in the transmission of a nerve impulse. 


Photoreceptor Molecules in the Eye Detect Visible Light 

Vision is perhaps the best understood of the senses. Two classes of 
photoreceptor cells exist: cones, which respond to bright lights and 
colors, and rods, which respond only to dim light. The photoreceptor 
in rods is rhodopsin, a 7T’M receptor that is a complex of the protein 
opsin and the chromophore 11-cis-retinal. The absorption of light by 
11-cis-retinal changes its structure into that of all-trans-retinal, setting 
in motion a signal-transduction pathway that leads to the breakdown 
of cGMP, to membrane hyperpolarization, and to a subsequent nerve 
impulse. Color vision is mediated by three distinct 7TM photorecep- 
tors that employ 11-cis-retinal as a chromophore and absorb light in 
the blue, green, and red parts of the spectrum. 


Hearing Depends on the Speedy Detection of Mechanical Stimuli 

The immediate receptors for hearing are found in the hair cells of the 
cochleae, which contain bundles of stereocilia. When the stereocilia 
move in response to sound waves, cation channels will open or close, 
depending on the direction of movement. The mechanical motion of 
the cilia is converted into current flow and then into a nerve impulse. 


Touch Includes the Sensing of Pressure, Temperature, 

and Other Factors 

Touch, detected by the skin, senses pressure, temperature, and pain. 
Specialized nerve cells called nociceptors transmit signals that are 
interpreted in the brain as pain. A receptor responsible for the percep- 
tion of pain has been isolated on the basis of its ability to bind capsa- 
icin, the molecule responsible for the hot taste of spicy food. The 
capsaicin receptor, also called VR1, functions as a cation channel that 
initiates a nerve impulse. 
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| PROBLEMS 


1. Olfaction in worms. Unlike the olfactory neurons in the 
mammalian systems discussed herein, olfactory neurons in 
the nematode C. elegans express multiple olfactory recep- 
tors. In particular, one neuron (called AWA) expresses 
receptors for compounds to which the nematode is attracted, 
whereas a different neuron (called AWB) expresses recep- 
tors for compounds that the nematode avoids. Suppose that 
a transgenic nematode is generated such that one of the 
receptors for an attractant is expressed in AWB rather than 
AWA. What behavior would you expect in the presence of 
the corresponding attractant? 


2. Odorant matching. A mixture of two of the compounds 
illustrated in Figure 33.6 is applied to a section of olfactory 
epithelium. Only receptors 3, 5, 9, 12, and 13 are activated, 
according to Figure 33.7. Identify the likely compounds in 
the mixture. 


3. Timing. Compare the aspects of taste (bitter, sweet, salty, 
sour) in regard to their potential for rapid time resolution. 


4. Two ears. Our ability to determine the direction from 
which a sound is coming is partly based on the difference 
in time at which our two ears detect the sound. Given the 
speed of sound (350 ms!) and the separation between our 
ears (0.15 m), what difference is expected in the times at 
which a sound arrives at our two ears? How does this dif- 
ference compare with the time resolution of the human 
hearing system? Would a sensory system that utilized 
7TM receptors and G proteins be capable of adequate time 
resolution? 


5. It’s better to be bitter. Some nontoxic plants taste very 
bitter to us. Suggest one or more explanations. 


6. Of mice and men. In human beings, the umami taste is 
triggered only by glutamate and aspartate. In contrast, mice 
respond to many more amino acids. Design an experiment 
to test which of the subunits (T1R1 or T1R3) determines 
the specificity of this response. Assume that all desired 
mouse strains can be readily produced. 


7. It’s not easy being green. Would you expect light with 
wavelength 530 nm to be absorbed only by the “green” 
photoreceptor? 


8. Color discrimination. Some women who have had male 
children who display color blindness have been demonstrated 


to distinguish between colors that are not readily distin- 
guished by other people. Propose an explanation. 


9. Combinatorial power. How many different odorants 
could a person distinguish if each odorant is bound by a 
single type of receptor and each receptor binds only a single 
odorant? How many if each odorant binds to two different 
odorant receptors? Three odorant receptors? 


10. Light action. Describe the effect of light absorption on 
11-cis-retinal bound within rhodopsin. 


11. Mirror, mirror. Compounds A and B typically evoke 
different odors, with compound A described as smelling 
like tropical fruit and compound B smelling oniony. 


Pe Cea 
ax, 


Explain how two such similar compounds can smell so 
different. 


12. Channeling. Provide an example of an ion channel in 
vision, taste, and hearing. 


Chapter Integration Problem 


13. Energy and information. The transmission of sensory 
information requires the input of free energy. For each sen- 
sory system (olfaction, gustation, vision, hearing, and 
touch), identify mechanisms for the input of free energy 
that allow the transmission of sensory information. 


Mechanism Problem 


14. Schiff-base formation. Propose a mechanism for the 
reaction between opsin and 11-cis-retinal. 


The Immune System 


CHAPTER 


Antibody 


Influenza 
hemagglutinin 


Just as medieval fortresses, such as the Dover Castle, used 
walls and fortifications to defend their territory, the immune 
system constantly battles against foreign invaders such as 
viruses, bacteria, and parasites to defend the organism. 
Antibody molecules provide a key element in the immune 
system’s defensive arsenal. For example, specific antibodies 
can bind to molecules on the surfaces of viruses and prevent 
the viruses from infecting cells. Above right, an antibody binds 
to one subunit on hemagglutinin from the surface of influenza 
virus. [Skyscan Photolibrary/Alamy.] 


Ve are constantly exposed to an incredible diversity of bacteria, 
viruses, and parasites, many of which would flourish in our cells or 
extracellular fluids were it not for our immune system. How does the 
immune system protect us? The human body has two lines of defense: an 
innate immune system that responds rapidly to features present in many 
pathogens, and an adaptive immune system that responds to specific fea- 
tures present only in a given pathogen. Both the innate and the adaptive 
immune systems first identify features on disease-causing organisms and then 
work to eliminate or neutralize those organisms. While a thorough descrip- 
tion of the immune system is certainly beyond the scope of this book, this 
chapter will focus on how biochemical concepts such as protein structure, 
receptor-ligand interactions, and signal transduction are applied to the 
identification of pathogens. 

The immune system must meet two tremendous challenges in the iden- 
tification of pathogens: (1) to produce a system of receptors diverse enough 
to recognize a wide array of potential pathogens and (2) to distinguish 
invaders and their disease-causing products from the organism’s own prod- 
ucts (1.e., self- versus nonself-recognition). To meet these challenges, the 
innate immune system evolved the ability to recognize structural elements, 
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Leucine-rich 


repeat Cysteine-rich 


FIGURE 34.1 Toll-like receptor. Each 
receptor is made up of a set of 18 or more 
leucine-rich repeat sequences, followed 

by a cysteine-rich domain, a single 
transmembrane helix, and an intracellular 
domain that functions in signal transduction. 


such as specific glycolipids or forms of nucleic acid, that are well conserved 
in pathogens but absent in the host organism. The repertoire of such ele- 
ments is limited, however, and so some pathogens have strategies to escape 
detection. The adaptive immune system has the remarkable ability to pro- 
duce more than 10° distinct proteins, called antibodies, that can recognize 
different foreign molecules, and more than 10!” receptors on immune cells, 
called T-cell receptors (TCRs), each of which presents a different surface 
with the potential to specifically bind a structure from a foreign organism. 
In producing this vast range of defensive molecules, however, the adaptive 
immune system has the potential to create antibodies and T-cells that rec- 
ognize and attack cells or molecules normally present in our bodies—a situ- 
ation that can result in autoimmune diseases. 

This chapter will examine these challenges, focusing first on the struc- 
tures of proteins that recognize foreign organisms and then on the mecha- 
nisms for protecting us from a specific pathogen once it has been recognized. 
The chapter will closely examine the modular construction of the proteins of 
the immune system—identifying structural motifs and considering how 
spectacular diversity can arise from modular construction. 


Innate immunity is an evolutionarily ancient defense system 


Innate immunity is an evolutionarily ancient defense system found, at least 
in some form, in all multicellular plants and animals. The innate immune 
system represents the first line of defense against foreign pathogens, relying 
on common features of invading organisms to identify and eliminate these 
threats. Components of the innate immune system include the epithelial 
lining that surrounds host cells and the specialized cells, called phagocytes, 
that can ingest and destroy pathogens without the aid of the adaptive 
immune system. 

The innate immune system also includes a family of receptors that can 
recognize specific features present in most pathogens and yet not respond 
to materials normally present in the host. The best-understood of these 
receptors are the Toll-like receptors (TLRs). The name “toll-like” is derived 
from a receptor known as Toll encoded in the Drosophila genome; Toll was 
first identified in a screen for genes important for Drosophila development 
and was subsequently shown to play a key role in the innate immune sys- 
tem. The TLRs have a common structure (Figure 34.1). Each receptor 
consists of a large extracellular domain built primarily from repeated amino 
acid sequences termed leucine-rich repeats (LRRs). Each LRR typically 
contains 20-30 residues, including 6 that are usually leucine. The human 
TLRs have from 18 to 27 LRR repeats that are capped by a cysteine-rich 
domain and followed by a sequence forming a single transmembrane helix 
and an intracellular signaling domain. This signaling domain is not a pro- 
tein kinase but acts as a docking site for other proteins. Most TLRs are 
expressed in the cell membrane for the detection of extracellular pathogens 
such as fungi and bacteria. Other TLRs are located in the membranes of 
internal compartments for the detection of intracellular pathogens such as 
viruses and some bacteria. 

Each TLR targets a specific molecular characteristic, often called a 
pathogen-associated molecular pattern (PAMP), found primarily on invading 
organisms (Table 34.1). Typically, a PAMP is a critical component of the 
pathogen’s function: mutations in these targets cannot easily block recogni- 
tion by the TLR without compromising the activity of the pathogen. One 
particularly important PAMP is lipopolysaccharide (LPS), also referred to 
as endotoxin, a specific class of glycolipid found in the cell walls of Gram- 
negative bacteria such as E. coli. LPS is recognized by TLR4. The response 


TABLE 34.1 Pathogen-associated molecular patterns (PAMPs) recognized 
by human TLRs 


Receptor PAMP Source of PAMP 

TLR1 (with TLR2) Triacyl lipoprotein Bacteria 

TLR2 Lipoprotein Bacteria, viruses, parasites 
TLR3 Double-stranded RNA (dsRNA) Viruses 

TLR4 Lipopolysaccharide (LPS) Bacteria, viruses 

TLRS5 Flagellin Bacteria 

TLR6 (with TLR2) Diacy] lipoprotein Bacteria, viruses 

TLR7 Single-stranded RNA (ssRNA) Viruses, bacteria 

TLR8 Single-stranded RNA (ssRNA) Viruses, bacteria 

TLR9O CpG-DNA motifs Viruses, bacteria, protozoa 
TLR10 Unknown Unknown 


Data from: O. Takeuchi and S. Akira, Cell 140:805—820, 2010. 


of the innate immune system to LPS can be easily demonstrated. Injection 
of less than 1 mg of LPS into a human being produces a fever and other 
signs of inflammation even though no living organisms are introduced. 

How do TLRs recognize PAMPs? The extracellular domain from human 
TLR3 has a remarkable structure that is representative of many other TLRs 
(Figure 34.2). Each LRR contributes a single B strand to a large parallel B 
sheet that lines the inside of a concave, hooklike structure. In most cases, the 
PAMP is recognized by the surface formed between two receptors. The 
PAMP for TLR3, double-stranded RNA, fits in the groove between two 
monomers, colored blue and yellow in Figure 34.3. Ligand-induced dimer- 
ization of the receptor enables the initiation of a signaling cascade within the 
cell. We encountered similar examples of signal transduction across a mem- 
brane through receptor dimerization in Chapter 14. 

Because the TLRs and other components of the innate immune system 
are always expressed, they provide the host organism with a rapid response 
to resist attack by pathogens. However, a number of pathogens have evolved 
the ability to escape detection by the innate immune system. For protection 
against such pathogens, the host relies on the adaptive immune system, 
which is able to target specific pathogens, even those that it has never 
encountered in the course of evolution. 
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hk FIGURE 34.2 Extracellular domain 
of the Toll-like receptor. (A) The structure 
of the leucine-rich repeat (LRR) domain 
from human TLRS. Notice that the LRR 
units come together to form a central 
parallel B sheet that curls to form a 
concave structure. (B) The structure of a 
single LRR showing the positions of the 
residues that are generally approximately 
conserved. Notice that the leucine residues 
come together to form a hydrophobic core 
with the single B strand on one side. [Drawn 
from 1ZIW.pdb]. 


The adaptive immune system responds by using 
the principles of evolution 


The adaptive immune system comprises two parallel 
but interrelated systems: humoral and cellular 
immune responses. In the humoral immune response, 
soluble proteins called antibodies (immunoglobulins) 
function as recognition elements that bind to foreign 
molecules and serve as markers signaling foreign 
invasion. Antibodies are secreted by plasma cells, 
which are derived from B lymphocytes (B cells) 
(Figure 34.4). A foreign macromolecule that binds 
selectively to an antibody is called an antigen. In a 
physiological context, if the binding of the foreign 
molecule stimulates an immune response, that mol- 
ecule is called an immunogen. The specific affinity of 
an antibody is not for the entire macromolecular 
antigen but for a particular site on the antigen called 
the epitope or antigenic determinant. Each B cell pro- 
duces just one type of antibody that can recognize a 
single epitope. 

In the cellular immune response, cells called cyto- 
toxic T lymphocytes (also commonly called killer T 
cells) destroy cells that have been invaded by a patho- 
gen. Because intracellular pathogens do not leave 
markings on the exteriors of infected cells, vertebrates 
have evolved a mechanism to mark the exterior of 
cells with a sample of the interior contents, both self 
and foreign. Some of the internal proteins are broken 
into peptides, which are then bound to a complex of 


FIGURE 34.3 Recognition of a PAMP by a Toll-like receptor. The 


structure of TLR3 bound to its PAMP, a fragment of double-stranded integral membrane proteins encoded by the major 
RNA, as seen from the side (top) and from above (bottom). Notice that histocompatibility complex (MHC). T cells continually 
the PAMP induces receptor dimerization by binding the surfaces on the scan the bound peptides to find and kill cells that 


side of each of the extracellular domains. [Drawn from 3CIY.pdb]. display forei en motifs on their surfaces. Another class 


of T cells called helper T lymphocytes (helper T cells) 
contributes to both the humoral and the cellular 
immune responses by stimulating the differentiation and proliferation of 
appropriate B cells and cytotoxic T cells. The cellular immune response is 
mediated by specific receptors that are expressed on the surfaces of the T cells. 

The remarkable ability of the immune system to adapt to an essentially 
limitless set of potential pathogens requires a powerful system for trans- 
forming the immune cells and molecules in response to the presence of 
pathogens. This adaptive system operates through the principles of evolution, 
including reproduction with variation followed by selection of the most well 
suited members of a population. 

If the human genome contains, by the latest estimates, only 21,000 
genes, how can the immune system generate more than 10° different anti- 
body proteins and 10!? T-cell receptors? The answer is found in a novel 
mechanism for generating a highly diverse set of genes from a limited set of 
genetic building blocks. Linking different sets of DNA regions in a combi- 
natorial manner produces many distinct protein-encoding genes that are 
not present in the genome. A rigorous selection process then leaves for pro- 

i : liferation only cells that synthesize proteins determined to be useful in the 
production. An electron micrograph of a : : ; 
biarnd.collehowe the hay develobed immune response. The subsequent reproduction of these cells without 
rough endoplasmic reticulum necessary for additional recombination enriches the cell population with members 
antibody secretion. [Courtesy of Lynne Mercer] expressing particular protein species. 
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FIGURE 34.4 Immunoglobulin 


Critical to the development of the immune response is the selection 
process, which determines which immune cells will reproduce. The proc- 
ess comprises several stages. In the early stages of immune cell develop- 
ment, cells expressing molecules that bind tightly to self-molecules are 
destroyed or silenced, whereas cells expressing molecules that do not 
bind strongly to self-molecules and that have the potential for binding 
strongly to foreign molecules are preserved. The appearance of an 
immunogenic invader at a later time will stimulate the reproduction of 
cells expressing antibodies or T-cell receptors that bind specifically to 
elements of that pathogen—in evolutionary terms, such cells are posi- 
tively selected. Thus, the immune response is based on the selection of 
cells expressing molecules that are specifically effective against a par- 
ticular invader; the response evolves from a population with wide-rang- 
ing specificities to a more-focused collection of cells and molecules that 
are well suited to defend the host when confronted with that particular 
challenge. 

Not only are antibodies and T-cell receptors a result of genetic diver- 
sity and recombination, but antibodies have highly diverse structures as 
well. Antibodies require many different structural solutions for binding 
many different antigens, each of which has a different form. T-cell recep- 
tors, in contrast, are not structurally diverse, because they have coevolved 
with the MHC. The docking mode of a T-cell receptor to the peptide 
bound to MHC is similar for all structures. As a consequence of this 
coevolution, every T-cell receptor has an inherent reactivity with every 
MHC. The coevolution ensures that all T-cell receptors can scan all pep- 
tide-MHC complexes on all tissues. The genetic diversity of the 10! 
different T-cell receptors is concentrated in a highly diverse set of resi- 
dues in the center of the MHC groove. This localized diversity allows the 
T-cell receptor to recognize the many different foreign peptides bound to 
the MHC. T-cell receptors must survey many different MHC —peptide 
complexes with rapid turnover. Therefore, the binding affinities between 
T-cell receptors and the MHC are weaker than those between antibody 
and antigen. 


34.1 Antibodies Possess Distinct Antigen-Binding 
and Effector Units 


Antibodies are central molecular players in the immune response. In 
Chapter 3, we discussed the application of antibodies as tools to study 
proteins; let us now consider the native structures and functions of these 
remarkable molecules. In 1959, Rodney Porter showed that immunoglobu- 
lin G (IgG), the major antibody in serum, can be cleaved into three 50-kDa 
fragments by the limited proteolytic action of papain. Two of these frag- 
ments bind antigen. They are called F., (F stands for fragment, ab for 
antigen binding). The other fragment, called F, because it crystallizes 
readily, does not bind antigen, but it has other important biological activi- 
ties, including the mediation of responses termed effector functions. These 
functions include the initiation of the complement cascade, a process that 
leads to the lysis of target cells. Although such effector functions are cru- 
cial to the functioning of the immune system, they will not be considered 
further here. 

How do these fragments relate to the three-dimensional structure of 
whole IgG molecules? Immunoglobulin G consists of two kinds of poly- 
peptide chains, a 25-kDa light (L) chain and a 50-kDa heavy (H) chain 
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FIGURE 34.5 Immunoglobulin 
G structure. (A) The three-dimensional 
structure of an IgG molecule showing the 
light chains in yellow and the heavy chains 
in blue. (B) A schematic view of an IgG 
molecule indicating the positions of the 
interchain disulfide bonds. Abbreviations: 
N, amino terminus; C, carboxyl 
terminus. [Drawn from 1IGT.pdb.] 
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FIGURE 34.6 Immunoglobulin G cleavage. 


Treatment of intact IgG molecules with the 
protease papain results in the formation of 
three large fragments: two F,,, fragments that 
retain antigen-binding capability and one F, 
fragment that does not. 
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(Figure 34.5). The subunit composition is L,H,. Each L chain is linked to 
an H chain by a disulfide bond, and the H chains are linked to each other by 
at least one disulfide bond. Examination of the amino acid sequences and 
three-dimensional structures of IgG molecules reveals that each L chain 
comprises two homologous domains, termed immunoglobulin domains, to be 
described in detail in Section 34.2. Each H chain has four immunoglobulin 
domains. Overall, the molecule adopts a conformation that resembles the 
letter Y. The stem of this Y, corresponding to the F’, fragment obtained by 
cleavage with papain, consists of the two carboxyl-terminal immunoglobu- 
lin domains of each H chain. The two arms of the Y, corresponding to the 
two F’,, fragments, are formed by the two amino-terminal domains of each 
H chain and the two domains of each L chain. The linkers between the 
stem and the two arms consist of extended polypeptide regions within the 
H chains and are quite flexible. 

Papain cleaves the H chains on the carboxyl-terminal side of the disul- 
fide bond that links each L and H chain (Figure 34.6). Thus, each Fy, con- 
sists of an entire L chain and the amino-terminal half of an H chain, whereas 
F, consists of the carboxyl-terminal halves of both H chains. Each F,,, 
contains a single antigen-binding site. Because an intact IgG molecule con- 
tains two F’,, components and therefore has two binding sites, it can cross- 
link multiple antigens (Figure 34.7). Furthermore, the F, and the two F,,, 
units of the intact IgG are joined by flexible polypeptide regions that allow 


FIGURE 34.7 Antigen cross-linking. Because 
IgG molecules include two antigen-binding sites, 
antibodies can cross-link multivalent antigens such 
as viral surfaces. 


TABLE 34.2 Properties of immunoglobulin classes 


Serum 

concentration Light Heavy 
Class (mg ml~!) Mass (kDa) chains chains Chain structure 
IgG 12 150 Kord Y Kyo OF Ayo 
IgA 3 180-500 Kory a (K,05), OF (A,&),, 
IgM 1 950 Kord p (KMy)5 Or (AjMy)5 
IgD 0.1 175 Kor’ rc) K,5, or N48, 
IgE 0.001 200 KorA € Ky€y OF AE, 


Note: n = 1, 2, or 3. IgM and oligomers of IgA also contain J chains that connect immunoglobulin molecules. 
IgA in secretions has an additional component. 


facile variation in the angle between the F,,, units 
through a wide range (Figure 34.8). This kind of 
mobility, called segmental flexibility, can enhance 
the formation of an antibody—antigen complex by yi; 


enabling both recognition sites on an antibody to Hinge Hinge 


bind an antigen that possesses multiple binding 
sites, such as a viral coat composed of repeating 
identical monomers. The sites at the tips of the F’,, 
units simply move to match the distance between 
specific determinants on the antigen. 
Immunoglobulin G is the antibody present in 
highest concentration in the serum, but other 
classes of immunoglobulin also are present 
(Table 34.2). Each class includes an L chain (either « or \) and a distinct H 
chain (Figure 34.9). The heavy chains in IgG are called y chains, whereas 
those in immunoglobulins A, M, D, and E are called a, w, 5, and e, respec- 
tively. Immunoglobulin M (IgM) 1s the first class of antibody to appear in the 
serum after exposure to an antigen. The presence of 10 antigen recognition 
sites enables IgM to bind especially tightly to antigens containing multiple 
identical epitopes. The strength of an interaction comprising multiple inde- 
pendent binding interactions between partners is termed avidity rather than 
affinity, which denotes the binding strength of a single binding site. 
Immunoglobulin A (IgA) is the major class of antibody in external secre- 
tions, such as saliva, tears, bronchial mucus, and intestinal mucus. Thus, IgA 
serves as a first line of defense against bacterial and viral antigens. The role of 
immunoglobulin D (IgD) has long been a mystery, but recent studies suggest 
IgD plays a role in the activation of basophils, white blood cells that have 
antiparasitic functions. Immunoglobulin E (IgE) is important in conferring 
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FIGURE 34.8 Segmental flexibility. The linkages between the F,,, and the F, 
regions of an IgG molecule are flexible, allowing the two antigen-binding sites 
to adopt a range of orientations with respect to each other. This flexibility 
allows effective interactions with a multivalent antigen without requiring that 
the epitopes on the target be a precise distance apart. 


FIGURE 34.9 Classes of immunoglobulin. 
Each of five classes of immunoglobulin has 
the same light chain (shown in yellow) 
combined with a different heavy chain (y, a, 
p, 8, or e). Disulfide bonds are indicated by 
green lines. The IgA dimer and the IgM 
pentamer have a small polypeptide chain in 
addition to the light and heavy chains. 
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protection against parasites, but IgE also participates in allergic reactions. 
IgE—antigen complexes form cross-links with receptors on the surfaces of 
mast cells to trigger a cascade that leads to the release of granules containing 
pharmacologically active molecules. Histamine, one of the agents released, 
induces smooth-muscle contraction and stimulates the secretion of mucus. 


34.2 Antibodies Bind Specific Molecules Through 
Hypervariable Loops 


A comparison of the amino acid sequences of different IgG antibodies from 

human beings or mice shows that the carboxyl-terminal half of the L chains 

and the carboxyl-terminal three-quarters of the H chains are very similar in 
all of the antibodies. Importantly, the amino-terminal 
domain of each chain is more variable, including three 
stretches of approximately 7 to 12 amino acids within each 
chain that are hypervariable, as shown for the H chain in 
Figure 34.10. The amino-terminal immunoglobulin domain 
of each chain is thus referred to as the variable region, 
whereas the remaining immunoglobulin domains are much 
more similar in all antibodies and are referred to as constant 
regions (Figure 34.11). 


The immunoglobulin fold consists of a beta-sandwich 
framework with hypervariable loops 


An IgG molecule consists of a total of 12 immunoglobulin 
domains. These domains have many sequence features in 
common and adopt a common structure, the immunoglob- 
ulin fold (Figure 34.12). Remarkably, this same structural 
domain is found in many other proteins that play key roles 
in both immune and nonimmune functions. 


100 120 


FIGURE 34.10 Immunoglobulin sequence diversity. A plot of The immunoglobulin fold consists of a pair of B sheets, 


sequence variability as a function of position along the sequence 
of the amino-terminal immunoglobulin domain of the H chain of 
human IgG molecules. Three regions (in red) show remarkably 


each built of antiparallel B strands, that surround a central 
hydrophobic core. A single disulfide bond bridges the two 


high levels of variability. These hypervariable regions correspond sheets. Two aspects of this structure are particularly 
to three loops in the immunoglobulin domain structure. [Data from important for its function. First, three loops present at one 
R. A. Goldsby, T. J. Kindt, and B. A. Osborne, Kuby Immunology, end of the structure form a potential binding surface. 


4th ed. (W. H. Freeman and Company, 2000), p. 91.] 


FIGURE 34.11 Variable and constant 
regions. Each L and H chain includes one 
immunoglobulin domain at its amino 
terminus that is quite variable from one 
antibody to another. These domains are 
referred to as V, and V,,. The remaining 
domains are more constant from one 
antibody to another and are referred to as 
constant domains (C, 1, C1, C42, and C,,3). 


These loops contain the hypervariable sequences present 

in antibodies and in T-cell receptors. Variation of the 
amino acid sequences of these loops provides the major mechanism for the 
generation of the vastly diverse set of antibodies and T-cell receptors 
expressed by the immune system. These loops are referred to as hypervari- 
able loops or complementarity-determining regions (CDRs). Second, the amino 
terminus and the carboxyl terminus are at opposite ends of the structure, 
which allows structural domains to be strung together to form chains, as in 
the L and H chains of antibodies. Such chains are present in several other 
key molecules in the immune system. 


Me The immunoglobulin fold is one of the most prevalent domains 
encoded by the human genome: more than 750 genes encode proteins 
with at least one immunoglobulin fold recognizable at the level of amino 
acid sequence. Such domains are also common in other multicellular ani- 
mals such as flies and nematodes. However, from inspection of amino acid 
sequence alone, immunoglobulin-fold domains do not appear to be present 
in yeast or plants, although these organisms possess other structurally simi- 
lar domains, including the key photosynthetic electron-transport protein 


plastocyanin in plants (Section 19.3). Thus, the immunoglobulin-fold 
family appears to have expanded greatly along evolutionary branches lead- 
ing to animals—particularly, vertebrates. 


X-ray analyses have revealed how antibodies bind antigens 


For each class of antibody, the variable domains at the amino-terminal ends 
of the L and H chains (designated V,; and V,,;) come together to form a 
binding surface. The positions of the CDRs are striking. These hypervari- 
able sequences, present in three loops of each domain, come together so that 
all six loops form a single surface at the end of each arm (Figure 34.13). 
Because virtually any V, can pair with any Vj,, a very large number of differ- 
ent binding sites can be constructed by their combinatorial association. 

The results of x-ray crystallographic studies of several hundred large 
and small antigens bound to F,,, molecules have been sources of much 
insight into the structural basis of antibody specificity. The binding of anti- 
gens to antibodies is governed by the same principles that govern the bind- 
ing of substrates to enzymes. The interaction between complementary 
shapes results in numerous contacts between amino acids at the binding 
surfaces of both molecules. Many hydrogen bonds, electrostatic interac- 
tions, and van der Waals interactions, reinforced by hydrophobic interac- 
tions, combine to give specific and strong binding. 

A few aspects of antibody binding merit specific attention, inasmuch as 
they relate directly to the structure of immunoglobulins. The binding site on 
the antibody incorporates some or all of the CDRs in the variable domains 
of the antibody. Small molecules are likely to make contact with fewer 
CDRs, with perhaps 15 residues of the antibody participating in the binding 
interaction. Macromolecules often make more extensive contact, sometimes 
interacting with all six CDRs and 20 or more residues of the antibody. Small 
molecules often bind inacleft oftheantigen-binding region. Macromolecules, 
such as globular proteins, tend to interact across larger, fairly flat apposed 
surfaces bearing complementary protrusions and depressions. 

A well-studied case of small-molecule binding is seen in an example of 
phosphorylcholine bound to F,,. Crystallographic analysis revealed phos- 
phorylcholine bound to a cavity lined by residues from five CDRs—two 
from the L chain and three from the H chain (Figure 34.14). The positively 
charged trimethylammonium group of phosphorylcholine is buried inside 
the wedge-shaped cavity, where it interacts electrostatically with two nega- 
tively charged residues, a glutamate and an aspartate. The negatively 
charged phosphoryl group of phosphorylcholine binds to the positively 
charged guanidinium group of an arginine residue at the mouth of the crev- 
ice and is hydrogen bonded to the side chain of a nearby tyrosine residue. 


Glu H35 = Tyr H33 


FIGURE 34.14 Binding of a small antigen. The structure of a complex between an F, 
fragment of an antibody (blue and yellow) and its target—in this case, phosphorylcholine. 
Notice that residues from the antibody interact with phosphorylcholine through hydrogen 
bonding and electrostatic interactions. [Drawn from 2MCP.pdb.] 
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FIGURE 34.12 Immunoglobulin 
fold. An immunoglobulin domain consists 
of a pair of B sheets linked by a disulfide 
bond and hydrophobic interactions. Notice 
that three hypervariable loops lie at one end 
of the structure. [Drawn from 1DQu.pdb.] 


k& FIGURE 34.13 Variable domains. 
A side view of the variable domains of the 
L chain (yellow) and the H chain (blue); the 
complementarity-determining regions (CDRs) 
are shown in red. Notice that the six CDRs 
come together to form a binding surface. 
The specificity of the surface is determined 
by the sequences and structures of the 
CDRs. [Drawn from 1DQJ.pdb.] 
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hk FIGURE 34.15 Antibodies against 
lysozyme. The structures of three 
complexes (i, ii, iii) between F.,, fragments 
(blue and yellow) and hen egg-white 
lysozyme (red) shown with lysozyme in the 
same orientation in each case. Notice that 
the three antibodies recognize completely 
different epitopes on the lysozyme molecule. 
[Drawn from SHFL, 1DQJ, and 1FDL.pdb.] 


L chain Gln 121 


hk FIGURE 34.16 Antibody-protein interactions. 
Notice that a single residue of lysozyme, glutamine 121, 


penetrates more deeply into the antibody binding 
site. [Drawn from 1FDL.pdb.] 


Lysozyme 


Numerous van der Waals interactions, such as those made by a tryptophan 
side chain, also stabilize this complex. 

Residues from five CDRs participate in the binding of phosphorylcholine 
to human F,,,. This binding does not significantly change the structure of 
the antibody, yet induced fit plays a role in the formation of many antibody— 
antigen complexes. A malleable binding site can accommodate many more 
kinds of ligands than can a rigid one. Thus, induced fit increases the reper- 
toire of antibody specificities. 


Large antigens bind antibodies with numerous interactions 


How do large antigens interact with antibodies? A large collection of anti- 
bodies against hen egg-white lysozyme has been structurally characterized 
in great detail (Figure 34.15). Each different antibody binds to a distinct 
surface of lysozyme. Let us examine the interactions in one of these com- 
plexes (complex ii in Figure 34.15) in detail. This antibody binds two poly- 
peptide segments of lysozyme that are widely separated in the 
primary structure (Figure 34.16). 

All six CDRs of the antibody make contact with this epitope. 
The region of contact is quite extensive (about 30 X 20 A). The 
apposed surfaces are rather flat. The only exception is the side 
chain of glutamine 121 of lysozyme, which penetrates deeply into 
the antibody’s binding site, where it forms a hydrogen bond with a 
main-chain carbonyl] oxygen atom and is surrounded by three aro- 
matic side chains. The formation of 12 hydrogen bonds and 
numerous van der Waals interactions contributes to the high affin- 
ity (Ky = 20nM) of this antibody—antigen interaction. Examination 
of the F.,, molecule without bound protein reveals that the struc- 
tures of the V; and V,, domains change little on binding, although 
they slide 1 A apart to allow more intimate contact with lysozyme. 


34.3 Diversity Is Generated by Gene Rearrangements 


A mammal such as a mouse or a human being can synthesize large amounts 
of specific antibody against virtually any foreign determinant within a matter 
of days after exposure. We have seen that antibody specificity is determined 
by the amino acid sequences of the variable regions of both light and heavy 
chains, which brings us to the key question: How are different variable- 
region sequences generated? 

The discovery of distinct variable and constant regions in the L and H 
chains raised the possibility that the genes that encode immunoglobulins have 
an unusual architecture that facilitates the generation of a diverse set of poly- 
peptide products. In 1965, William Dreyer and Claude Bennett proposed 
that multiple V (variable) genes are separate from a single C (constant) gene in 
embryonic (germ-line) DNA. According to their model, one of these V genes 
becomes joined to the C gene in the course of differentiation of the antibody- 
producing cell. A critical test of this novel hypothesis had to await the isola- 
tion of pure immunoglobulin mRNA and the development of techniques for 
analyzing mammalian genomes. Twenty years later, Susumu Tonegawa 
found that V and C genes are indeed far apart in embryonic DNA but are 
closely associated in the DNA of antibody-producing cells. Thus, immuno- 
globulin genes are rearranged in the differentiation of lymphocytes. 


J (joining) genes and D (diversity) genes increase antibody diversity 


Sequencing studies carried out by Susumu Tonegawa, Philip Leder, and 
Leroy Hood revealed that V genes in embryonic cells do not encode the 
entire variable region of L and H chains. Consider, 

for example, the region that encodes the k light-chain V, V5 
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encodes approximately the first 97 residues of the 
variable domain of the L chain, is present on human 
chromosome 2 (Figure 34.17). 

However, the variable region of the L chain contains 110 residues. 
Where is the DNA that encodes the last 13 residues of the variable domain? 
For L chains in undifferentiated cells, this stretch of DNA 1s located in an 
unexpected place: near the C gene. It is called the J gene because it joins the 
V and C gene segments in a differentiated cell. In fact, a tandem array of five 
J gene segments is located near the C gene in embryonic cells. In the dif- 
ferentiation of an antibody-producing cell, a V gene segment is joined to a 
J gene segment to form a complete gene for the variable region (Figure 34.18). 
RNA splicing (Section 29.3) generates an mRNA molecule for the complete 


FIGURE 34.17 The « light-chain locus. 
This part of human chromosome 2 includes 
an array of 40 segments that encode the 
variable (V) region (approximately residues 
1-97) of the light chain, an array of 5 
segments that encode the joining (J) region 
(residues 98-110), and a single region that 
encodes the constant (C) region. 
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FIGURE 34.18 VJ recombination. A single V gene (in this case, V,) is linked to a J gene 
(here, J,) to form an intact VJ region. The intervening DNA is released in a circular form. 
Because the V and J regions are selected at random and the joint between them is not 
always in exactly the same place, many VJ combinations can be generated by this process. 
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FIGURE 34.19 Light-chain expression. 


Th 


e light-chain protein is expressed by the 


transcription of the rearranged gene to 
produce a pre-RNA molecule with the 


VJ 


and C regions separated. RNA splicing 


removes the intervening sequences to 
produce an MRNA molecule with the 


VJ 


and C regions linked. Translation of the 


mRNA and processing of the initial protein 
product produce the light chain. 


FIGURE 34.20 V(D)J recombination. The 


heavy-chain locus includes an array of 


51 


V segments, 27 D segments, and 6 J 


segments. Gene rearrangement begins 
with D-d joining, followed by further 
rearrangement to link the V segment to 
the DJ segment. 
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L chain by linking the coding regions for the rearranged VJ unit with that for 
the C unit (Figure 34.19). 

J gene segments are important contributors to antibody diversity 
because they encode part of the last hypervariable segment (CDR3). In 
forming a continuous variable-region gene, any of the 40 V gene segments 
can become linked to any of 5 J gene segments. Thus, somatic recombina- 
tion of these gene segments amplifies the diversity already present in the 
germ line. The linkage between V and J is not precisely controlled. 
Recombination between these genes can take place at one of several bases 
near the codon for residue 95, generating additional diversity. A similar 
array of V and J gene segments encoding the X light chain is present on 
human chromosome 22. This region includes 30 VA gene segments and four 
JX gene segments. In addition, this region includes four distinct C genes, in 
contrast with the single C gene in the k locus. 

In humans, the genes encoding the heavy chain are present on chromo- 
some 14. Remarkably, the variable domain of heavy chains is assembled 
from three rather than two segments. In addition to V;,; gene segments that 
encode residues 1 through 94 and J,, gene segments that encode residues 98 
through 113, this chromosomal region includes a distinct set of segments 
that encode residues 95 through 97 (Figure 34.20). These gene segments are 
called D (for diversity). Some 27 D gene segments lie between 51 V,, and 6 
Jz gene segments. The recombination process first joins a D gene segment 
to a J;; gene segment; a V,, gene segment is then joined to DJ,;. A greater 
variety of antigen-binding patches and clefts can be formed by the H chain 
than by the L chain because the H chain is encoded by three rather than two 
gene segments. Moreover, CDR3 of the H chain is diversified by the action 
of terminal deoxyribonucleotidyl transferase, a special DNA polymerase 
that requires no template. This enzyme inserts extra nucleotides between 
V,, and D. The V(D)J recombination of both the L and the H chains is exe- 
cuted by specific enzymes present in immune cells. These proteins, called 
RAG-1 and RAG-2, recognize specific DNA sequences called recombination 
signal sequences (RSSs) adjacent to the V, D, and J gene segments and facili- 
tate the cleavage and religation of the DNA segments. 


More than 108 antibodies can be formed by combinatorial 
association and somatic mutation 


Let us recapitulate the sources of antibody diversity. The germ line contains 
a rather large repertoire of variable-region genes. For k light chains, there 
are about 40 V-segment genes and 5 J-segment genes. Hence, a total of 
40 X 5 = 200 kinds of complete V,. genes can be formed by the combinations 
of V and J. A similar analysis suggests that at least 120 different \ light chains 
can be generated. A larger number of heavy-chain genes can be formed 
because of the role of the D segments. For 51 V, 27 D, and 6 J gene segments, 

the number of complete V;, genes that can be formed is 8262. The association 
of 320 kinds of L chains with 8262 kinds of H chains would yield 2.6 x 10° 
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different antibodies. Variability in the exact points of segment joining and 
other mechanisms increases this value by at least two orders of magnitude. 

Even more diversity is introduced into antibody chains by somatic muta- 
tion—that is, by the introduction of mutations into the recombined genes. 
In fact, a 1000-fold increase in binding affinity is seen in the course of a 
typical humoral immune response, arising from somatic mutation, a process 
called affinity maturation. The generation of an expanded repertoire leads to 
the selection of antibodies that more precisely fit the antigen. Thus, nature 
draws on each of three sources of diversity—a germ-line repertoire, somatic 
recombination, and somatic mutation—to form the rich variety of antibod- 
ies that protect an organism from foreign incursions. 


The oligomerization of antibodies expressed on the surfaces 
of immature B cells triggers antibody secretion 


The processes heretofore described generate a highly diverse set of antibody 
molecules—a key first step in the generation of an immune response. The 
next stage is the selection of a particular set of antibodies directed against a 
specific invader. How is this selection accomplished? Each immature B cell, 
produced in the bone marrow, expresses a specific monomeric form of IgM 
attached to its surface (Figure 34.21). Each cell expresses approximately 10° 
IgM molecules, but all of these molecules are identical in amino acid sequence 
and, hence, in antigen-binding specificity. Thus, the selection of a particular 
immature B cell for growth will lead to the amplification of an antibody with 
a unique specificity. The selection process begins with the binding of an 
antigen to the membrane-bound antibody. 

Associated with each membrane-linked IgM molecule are two molecules 
of a heterodimeric membrane protein called Ig-a—Ig-B (Figure 34.21). 
Examination of the amino acid sequences of Ig-a and Ig-8 is highly instruc- 
tive. The amino terminus of each protein lies outside the cell and corre- 
sponds to a single immunoglobulin fold, and the carboxyl 
terminus, which lies inside the cell, includes a sequence of 
18 amino acids called an immunoveceptor tyrosine-based 
activation motif (ITAM; Figure 34.21). As its name sug- 
gests, each ITAM includes key tyrosine residues, which 
are subject to phosphorylation by particular protein 
kinases present in immune-system cells. 

A fundamental observation linking the binding of 
antigen to membrane-bound antibody with the subse- 
quent initiation of an immune response is that oligomer- 
ization, or clustering, of the antibody molecules is required 
(Figure 34.22). The requirement for oligomerization is 
reminiscent of the dimerization of receptors triggered by 
epidermal growth factor and insulin encountered in 
Section 14.3; indeed, the associated signaling mechanisms 
appear to be quite similar. The oligomerization of the 
membrane-bound antibodies results in the phosphoryla- 


FIGURE 34.21 B-cell receptor. This 
complex consists of a membrane-bound 
IgM molecule noncovalently bound to two 
Ig-a-Ig-B heterodimers. The intracellular 
domains of each of the Ig-a and Ig-B 
chains include an immunoreceptor tyrosine- 
based activation motif (ITAM). 


tion of the tyrosine residues within the ITAMs by protein 
tyrosine kinases including Lyn, a homolog of Src (Section 
14.5). The phosphorylated ITAMs serve as docking sites 
for a protein kinase termed spleen tyrosine kinase (Syk), 
which has two SH2 domains that interact with the pair of 
phosphorylated tyrosine residues in each ITAM. Docked 
Syk adopts an active confirmation, which then phosphor- 
ylates other signal-transduction proteins, including an 
inhibitory subunit of the transcription factor NF-kB and 


Phosphorylates targets including an inhibitor 
of the transcription factor NF-KB 


Kinase 
domain 


FIGURE 34.22 B-cell activation. The binding of a multivalent 
antigen such as a bacterial or viral surface links membrane-bound 
IgM molecules. This oligomerization triggers the phosphorylation of 
tyrosine residues in the ITAM sequences by protein tyrosine kinases 
such as Lyn. After phosphorylation, the ITAMs serve as docking 
sites for Syk, a protein kinase that phosphorylates a number of 
targets, including transcription factors. 
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an isoform of phospholipase C. The signaling processes continue down- 
stream to activate gene expression, leading to the stimulation of cell growth 
and initiating further B-cell differentiation. 


cs Drugs that modulate the immune system have served as sources of 

insight into immune-system signaling pathways. For example, cyclo- 
sporin, a powerful suppressor of the immune system, inhibits calcineurin, a 
phosphatase which normally activates a transcription factor called NF-AT 
by dephosphorylating it. 


Cyclosporin A 


The potent inhibition of the immune system, or immunosuppression, 
induced by cyclosporin reveals how crucial the activity of NF-AT is to the 
development of an immune response. Without such drugs, organ trans- 
plantation would be extremely difficult because transplanted tissue 
expresses a wide range of foreign antigens, causing the host immune system 
to reject the new tissue. 

The role of oligomerization in the B-cell signaling pathway is illumi- 
nated when we consider the nature of many antigens presented by patho- 
gens. The surfaces of many viruses, bacteria, and parasites are characterized 
by arrays of identical membrane proteins or membrane-linked carbohy- 
drates. Thus, most pathogens present multiple binding surfaces that will 
naturally cause membrane-associated antibodies to oligomerize as they 
bind adjacent epitopes. In addition, the mechanism accounts for the obser- 
vation that most small molecules do not induce an immune response; how- 
ever, coupling multiple copies of a small molecule to a large oligomeric 
protein such as keyhole limpet hemocyanin (KLH), which has a molecular 
mass of 1 million daltons or more, promotes antibody oligomerization and, 
hence, the production of antibodies against the small-molecule epitope. 
The large protein is called the carrier of the attached chemical group, which 
is called a haptenic determinant. The small foreign molecule by itself is 
called a hapten. Antibodies elicited by attached haptens will bind unat- 
tached haptens as well. 


Different classes of antibodies are formed by the hopping of V,, genes 


The development of an effective antibody-based immune response depends 
on the secretion into the blood of antibodies that have appropriate effector 
functions. At the beginning of this response, an alternative mRNA-splicing 
pathway is activated so that the production of membrane-linked IgM is 
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supplanted by the synthesis of secreted IgM. As noted in Section 34.1, 
secreted IgM is pentameric and has a high avidity for antigens containing 
multiple identical epitopes. Later, the antibody-producing cell makes 
either IgG, IgA, IgD, or IgE of the same specificity as that of the initially 
secreted IgM. In this switch, the light chain and the variable region of the 
heavy chain are unchanged. Only the constant region of the heavy chain 
changes. This step in the differentiation of an antibody-producing cell is 
called class switching (Figure 34.23). In undifferentiated cells, the genes 
for the constant region of each class of heavy chain, called C,,, Cs, Ca Ce 
and C,, are next to one another. There are eight in all, inclueine four @ genes 
for the constant regions of y chains. A complete transcript for the heavy 
chain of an IgM antibody is formed by the translocation of a V;, gene seg- 
ment to a DJ,, gene segment within the genomic DNA (Figure 34.20), fol- 
lowed by RNA splicing of the V,,DJ;,; segment to the oF segment, 
analogous to what is shown in Figure 34.19. 

How are other heavy chains formed? Class switching is mediated by a 
gene-rearrangement process that moves a VDJ gene from a site near one C 
gene to a site near another C gene. Importantly, the antigen-binding specific- 
ity is conserved in class switching because the entire V,,D],, gene is translocated 
in an intact form. For example, the antigen-combining specificity of IgA 
produced by a particular cell is the same as that of IgM synthesized at an 
earlier stage of its development. The biological significance of C,, switching 
is that a whole recognition domain (the variable domain) is shifted from the 
early constant region (C,) to one of several other constant regions that 
mediate different effector functions. 


34.4 Major-Histocompatibility-Complex Proteins 
Present Peptide Antigens on Cell Surfaces 
for Recognition by T-Cell Receptors 


Soluble antibodies are highly effective against extracellular patho- 
gens, but they confer little protection against microorganisms 
that are predominantly intracellular, such as some viruses and 
mycobacteria (which cause tuberculosis and leprosy). These 
pathogens are shielded from antibodies by the host-cell mem- 
brane (Figure 34.24). A different and more subtle strategy, cell- 
mediated immunity, evolved to cope with intracellular pathogens. 
T cells continually scan the surfaces of all cells and kill those that 
exhibit foreign markings. The task is not simple; intracellular 
microorganisms are not so obliging as to intentionally leave tell- 
tale traces on the surface of their host. Quite the contrary, suc- 
cessful pathogens are masters of the art of camouflage. Vertebrates 
have evolved an ingenious mechanism—cut and display—to 
reveal the presence of stealthy intruders. Nearly all vertebrate 


FIGURE 34.23 Class switching. Further 
rearrangement of the heavy-chain locus 
results in the generation of genes for 
antibody classes other than IgM. In the 
case shown, rearrangement places the VDJ 
region next to the Cy1 region, resulting in 
the production of IgG1. Note that no further 
rearrangement of the VDJ region takes 
place, and so the specificity of the antibody 
is not affected. 


FIGURE 34.24 Intracellular pathogen. An electron 


cells exhibit on their surfaces a sample of peptides derived from micrograph showing mycobacteria (arrows) inside an 
the digestion of proteins in their cytoplasm. These peptides are _ infected macrophage. [Courtesy of Dr. Stanley Falkow,] 
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FIGURE 34.26 Class | MHC protein. 
A protein of this class consists of two 
chains. Notice that the a chain begins with 
two domains (a,, e,) that include a helices 
and continues with an immunoglobulin 
domain (a,), a transmembrane domain, 
and a cytoplasmic tail. The second chain, 
B,-microglobulin, adopts an immunoglobulin 
fold. [Drawn from 1HHK.pdb.] 
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FIGURE 34.25 Presentation of peptides 
from cytoplasmic proteins. Class | MHC 
proteins on the surfaces of most cells 
display peptides that are derived from 
cytoplasmic proteins by proteolysis. 


displayed by integral membrane proteins that are encoded by the major his- 
tocompatibility complex (MHC). Specifically, peptides derived from cyto- 
plasmic proteins are bound to and displayed by class I MHC proteins. The 
dendritic cells of the innate immune system that subject pathogens to phago- 
cytosis migrate to lymphatic tissue, where they use an MHC-like mecha- 
nism to present foreign peptides or lipid components to T cells—thus linking 
the innate and adaptive immune responses to pathogens. 

How are these peptides generated and delivered to the plasma mem- 
brane? The process starts in the cytoplasm with the degradation of 
proteins—self-proteins as well as those of pathogens (Figure 34.25). 
Digestion is carried out by proteasomes (Section 23.2). The resulting pep- 
tide fragments are transported from the cytoplasm into the lumen of the 
endoplasmic reticulum by the TAP protein (for transporter associated with 
antigen processing), a member of the ABC transporter family of ATP- 
driven pumps (Section 13.2). In the ER, peptides combine with nascent 
class I MHC proteins; these complexes are then targeted to the plasma 
membrane. 

MHC proteins embedded in the plasma membrane tenaciously grip 
their bound peptides so that they can be touched and scrutinized by T-cell 
receptors on the surface of a killer cell. Foreign peptides bound to class I 
MHC proteins signal that a cell is infected and mark it for destruction by 
cytotoxic T cells. An assembly consisting of the foreign peptide —MHC 
complex, the T-cell receptor, and numerous accessory proteins—triggers a 
cascade that induces apoptosis in the infected cell. Strictly speaking, 
infected cells are not killed but, instead, are triggered to commit suicide to 
benefit the host. 


Peptides presented by MHC proteins occupy a deep groove 
flanked by alpha helices 


The three-dimensional structure of a large extracellular fragment of a 
human MHC class I protein, human leukocyte antigen A2 (HLA-A2), was 
solved in 1987 by Don Wiley and Pamela Bjorkman. Class I MHC proteins 
consist of a 44-kDa a chain noncovalently bound to a 12-kDa polypeptide 
called 6 ,-microglobulin. The a chain has three extracellular domains (a,, 5, 
and a), a transmembrane segment, and a tail that extends into the cyto- 
plasm (Figure 34.26). The 8,-microglobulin and the a, domains have 
immunoglobulin folds, although the pairing of the two domains differs 


Top view Side view 


Peptide 


FIGURE 34.27 Class | MHC peptide-binding site. The a, and a, domains come 
together to form a groove in which peptides are displayed. Notice that that the peptide is 
surrounded on three sides by a 8 sheet and two a helices, but it is accessible from the top of 
the structure. [Drawn from 1HHK.pdb.] 


from that in antibodies. The a, and a, domains exhibit a novel and remark- 
able architecture. They associate intimately to form a deep groove that 
serves as the peptide-binding site (Figure 34.27). The floor of the groove, 
which is about 25 A long and 10 A wide, is formed by eight 8 strands, four 
from each domain. A long helix contributed by the a, domain forms one 
side, and a helix contributed by the a, domain forms the other side. This 
groove is the binding site for the presentation of peptides. 

The groove can be filled by a peptide from 8 to 10 residues long in an 
extended conformation. As we shall see (p. 1002), MHC proteins are remark- 
ably diverse in the human population; each person expresses as many as six 
distinct class | MHC proteins, and many different forms are present in dif- 
ferent people. The first structure determined, HLA-A2, binds peptides 
that almost always have leucine in the second position and valine in the last 
position (Figure 34.28). Side chains from the MHC molecule interact with 
the amino and carboxyl termini and with the side chains in these two key 
positions. These two residues are often referred to as the anchor residues. 
The other residues are highly variable. Thus, many millions of different 
peptides can be presented by this particular class I MHC protein; the 
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FIGURE 34.28 Anchor residues. (A) The 
amino acid sequences of three peptides 
that bind to the class | MHC protein HLA- 
A2 are shown. Each of these peptides 
has leucine in the second position and 
valine in the carboxyl-terminal position. 
(B) Comparison of the structures of these 
peptides reveals that the amino and 
carboxyl termini, as well as the side chains 
of the leucine and valine residues, are in 
essentially the same positions in each 
peptide, whereas the remainder of the 
structures are quite different. 


Valine 
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FIGURE 34.29 T-cell receptor. This 
protein consists of an a chain and a B chain, 
linked by a disulfide bond. Notice that each 
chain consists of two immunoglobulin 
domains on the cell surface, a membrane- 
spanning domain, and a short cytoplasmic 
tail. [Drawn from 1BD2.pdb.] 


T-cell receptor 


MHC class | 


hk FIGURE 34.30 T-cell receptor-class | 
MHC complex. The T-cell receptor binds 
to aclass | MHC protein containing a 
bound peptide (red). Notice that the T-cell 
receptor contacts both the MHC protein 
and the peptide. [Drawn from 1BD2.pdb.] 


identities of only two of the nine residues are crucial for binding. Each class 
of MHC molecules requires a unique set of anchor residues. Thus, a tre- 
mendous range of peptides can be presented by these molecules. Note that 
one face of the bound peptide is exposed to solution, where it can be examined 
by other molecules, particularly T-cell receptors. An additional remarkable 
feature of MHC-peptide complexes is their kinetic stability; once bound, a 
peptide is not released, even after a period of days. 


T-cell receptors are antibody-like proteins containing 
variable and constant regions 


We are now ready to consider the receptor that recognizes peptides displayed 
by MHC proteins on target cells. The T-cell receptor consists of a 43-kDa 
a chain joined by a disulfide bond to a 43-kDa B chain (Figure 34.29). Each 
chain spans the plasma membrane and has a short carboxyl-terminal region 
on the cytoplasmic side. A small proportion of T cells express a receptor 
consisting of ‘y and 6 chains in place of a and 8. The a and £ chains of the 
T-cell receptor, like immunoglobulin L and H chains, consist of variable and 
constant regions. Indeed, these domains of the T-cell receptor are homologous 
to the V and C domains of immunoglobulins. Furthermore, hypervariable 
sequences present in the V regions of the a and B chains of the T-cell recep- 
tor form the binding site for the epitope. 

The genetic architecture of these proteins is similar to that of immuno- 
globulins, though the antibody genetic diversity is distributed over all the 
CDR loops, whereas T-cell-receptor genetic diversity is concentrated in the 
CDR3 loop that interacts with the peptide bound to the MHC. The variable 
region of the T-cell receptor a chain is encoded by about 50 V gene seg- 
ments and 70 J gene segments. The T-cell receptor B chain is encoded by 
two D gene segments in addition to 57 V gene segments and 13 J gene seg- 
ments. Again, the diversity of gene segments and the use of slightly impre- 
cise modes of joining them increase the number of distinct proteins formed. 
At least 10! different specificities could arise from combinations of this reper- 
toire of genes. Thus, T-cell receptors, like immunoglobulins, can recognize a 
very large number of different epitopes. All the receptors on a particular T 
cell have the same specificity. 

How do T cells recognize their targets? The variable regions of the a and 
B chains of the T-cell receptor form a binding site that recognizes a com- 
bined epitope—foreign peptide bound to an MHC protein (Figure 34.30). 
Neither the foreign peptide alone nor the MHC protein alone forms a com- 
plex with the T-cell receptor. Thus, fragments of an intracellular pathogen 
are presented in a context that allows their detection, leading to the initia- 
tion of an appropriate response. 


CD8 on cytotoxic T cells acts in concert with T-cell receptors 


The T-cell receptor does not act alone in recognizing and mediating the fate 
of target cells. Cytotoxic T cells also express a protein termed CD8 on their 
surfaces that is crucial for the recognition of the class I MHC-—peptide com- 
plex. The abbreviation CD stands for cluster of differentiation, referring to a 
cell-surface marker that is used to identify a lineage or stage of differentia- 
tion. Antibodies specific for particular CD proteins have been invaluable in 
following the development of leukocytes and in discovering new interac- 
tions between specific cell types. 

Each chain in the CD8 dimer contains a domain that resembles an 
immunoglobulin variable domain (Figure 34.31). CD8 interacts primarily 
with the constant «, domain of class I MHC proteins. This interaction fur- 
ther stabilizes the interactions between the T cell and its target. 


kh FIGURE 34.31 The coreceptor CD8. 
This dimeric protein extends from the surface 
of a cytotoxic T cell and binds to class | MHC 
molecules that are expressed on the surface 
of the cell that is bound to the T cell. The 
dashed lines represent extended polypeptide 
chains that link the immunoglobulin domains 
of CD8 to the membrane. Notice that the 
coreceptor interacts primarily with the 
constant a3 domain of the class | MHC 
domain. [Drawn from 1AKJ.pdb.] 


The cytoplasmic tail of CD8 contains a docking site for Lck, a cytoplasmic 
tyrosine kinase akin to Src. The T-cell receptor itself is associated with six 
polypeptides that form the CD3 complex (Figure 34.32). The y, 6, and 
chains of CD3 are homologous to Ig-a and Ig-B associated with the B-cell 
receptor (Figure 34.21); each chain consists of an extracellular immuno- 
globulin domain and an intracellular [TAM region. These chains associate 
into CD3-ye and CD3-8e heterodimers. An additional component, the 
CD3-¢ chain, has only a small extracellular domain and a larger intracellu- 
lar domain containing three ITAM sequences. 

On the basis of these components, a model for T-cell activation can 
be envisaged that is closely parallel to the pathway for B-cell activation 
(Section 34.3; Figure 34.33). The binding of the T-cell receptor to the class 
I MHC-—peptide complex and the concomitant binding of CD8 from the 
T cell to the MHC molecule link the kinase Lck to the ITAM substrates of 
the components of the CD3 complex. The phosphorylation of the tyrosine 
residues in the ITAM sequences generates docking sites for a protein kinase 
called ZAP-70 (for 70-kDa zeta-associated protein) that is homologous to 
Syk in B cells. Docked by its two SH2 domains, ZAP-70 phosphorylates 


FIGURE 34.33 T-cell 
activation. The interaction 
between the T-cell receptor and 
a class | MHC-peptide complex 
results in the binding of CD8 to 
the MHC protein, the recruitment 
of the protein tyrosine kinase 
Lck, and the phosphorylation of 
tyrosine residues in the ITAM 
sequences of the CD3 chains. 
After phosphorylation, the ITAM 
: regions serve as docking sites for 
ie Phosphorylates targets proteins the protein kinase ZAP-70, which 
Kinase phosphorylates protein targets to 
domain transmit the signal. 
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FIGURE 34.32 T-cell receptor complex. 
The T-cell receptor is associated with six CD3 
molecules: a CD3-y-CD8-« heterodimer, a 
CD3-8-CD8-« heterodimer, and two chains 
of CD8-¢. Single ITAM sequences are present 
in the cytoplasmic domains of CD8-y, 

CD3-8, and CD8-e, whereas three such 
sequences are found in each CD3-¢ chain. 
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FIGURE 34.34 Presentation of peptides from 


internalized proteins. Antigen-presenting cells bind and 
internalize foreign proteins and display peptides that are 
formed from the digestion of these proteins in class II 


MHC proteins. 


downstream targets in the signaling cascade. Additional molecules, includ- 
ing a membrane-bound protein phosphatase called CD45 and a cell-surface 
protein called CD28, play ancillary roles in this process. 

T-cell activation has two important consequences. First, the activation 
of cytotoxic T cells results in the secretion of several proteins, including 
perforin and granzymes. Perforin is a 70-kDa protein that destabilizes the 
plasma membrane of the target cell, enabling the entry of granzymes into the 
cytoplasm of the target cell. Granzymes are serine proteases (Section 9.1) 
that initiate the pathway of apoptosis, leading to the death of the target cell 
and the fragmentation of its DNA, including any viral DNA that may be 
present. Second, after it has stimulated its target cell to commit suicide, the 
activated T cell disengages and is stimulated to reproduce. Thus, additional 
T cells that express the same T-cell receptor are generated to continue the 
battle against the invader after these T cells have been identified as a suit- 
able weapon. 


Helper T cells stimulate cells that display foreign peptides 
bound to class II MHC proteins 


Not all T cells are cytotoxic. Helper T cells stimulate the proliferation of 
specific B lymphocytes and cytotoxic T cells and thereby serve as partners 
in determining the immune responses that are produced. The importance of 
helper T cells is graphically revealed by the devastation wrought by AIDS, 
a condition that destroys these cells. Helper T cells, like cytotoxic T cells, 
detect foreign peptides that are presented on cell surfaces by MHC proteins. 
However, the source of the peptides, the MHC proteins that bind them, and 
the transport pathway are different. 

Helper T cells recognize peptides bound to MHC molecules 
referred to as class II. Their helping action is focused on B cells, 
macrophages, and dendritic cells. Class IT MHC pvroteins are 
expressed only by these antigen-presenting cells, unlike class I MHC 
proteins, which are expressed on nearly all cells. The peptides 
presented by class IT MHC proteins do not come from the cyto- 
plasm. Rather, they arise from the degradation of proteins that have 
been internalized by endocytosis. Consider, for example, a virus 
particle that is captured by membrane-bound immunoglobulins 
on the surface of a B cell (Figure 34.34). This complex is delivered 
to an endosome, a membrane-enclosed acidic compartment, 
where it is digested. The resulting peptides become associated 
with class IJ MHC proteins, which move to the cell surface. 
Peptides from the cytoplasm cannot reach class II proteins, 
whereas peptides from endosomal compartments cannot reach 
class I proteins. This segregation of displayed peptides is biologi- 
cally critical. The association of a foreign peptide with a class II 
MHC protein signals that a cell has encountered a pathogen and 
serves as a call for help. In contrast, association with a class I 
MHC protein signals that a cell has succumbed to a pathogen and 
is a call for destruction. 


Helper T cells rely on the T-cell receptor and CD4 to recognize 
foreign peptides on antigen-presenting cells 


The overall structure of a class II MHC molecule is remarkably similar to 
that of a class I molecule. Class II molecules consist of a 33-kDa a chain and 
a noncovalently bound 30-kDa B chain (Figure 34.35). Each contains two 
extracellular domains, a transmembrane segment, and a short cytoplasmic 
tail. The peptide-binding site is formed by the a, and 8, domains, each of 


Side view 


Top view 


which contributes a long helix and part of a B sheet. Thus, the same struc- 
tural elements are present in class I and class I! MHC molecules, but they 
are combined into polypeptide chains in different ways. The peptide- 
binding site of a class II molecule is open at both ends, and so this groove 
can accommodate longer peptides than can be bound by class I molecules; 
typically, peptides between 13 and 18 residues long are bound. The peptide- 
binding specificity of each class II molecule depends on binding pockets 
that recognize particular amino acids, also known as anchor residues, in 
specific positions along the sequence. 

Helper T cells express T-cell receptors that are produced from the same 
genes as those on cytotoxic T cells. These T-cell receptors interact with 
class IT MHC molecules in a manner that is analogous to T-cell-receptor 
interaction with class I MHC molecules. Nonetheless, helper T cells and 
cytotoxic T cells are distinguished by other proteins that they express on 
their surfaces. In particular, helper T cells express a protein called CD4 
instead of expressing CD8. CD4 consists of four immunoglobulin domains 
that extend from the T-cell surface, as well as a small cytoplasmic region 
(Figure 34.36). The amino-terminal immunoglobulin domains of CD4 
interact with the base on the class II MHC molecule. Thus, helper T cells 
bind cells expressing class II MHC specifically because of the interactions 
with CD4 (Figure 34.37). 


(A) Cell presenting cytoplasmically (B) 
derived foreign peptide in class | MHC 


Antigen-presenting cell 


Cytotoxic T cell 


Helper T cell 


® FIGURE 34.35 Class II MHC 

protein. A class Il MHC protein consists of 
homologous a and B chains, each of which 
has an amino-terminal domain that 
constitutes half of the peptide-binding 
structure, as well as a carboxyl-terminal 
immunoglobulin domain. Notice the trough- 
like peptide-binding site, which is similar to 
that in class | MHC proteins except that it is 
open at both ends, allowing class Il MHC 
proteins to bind longer peptides than those 
bound by class |. [Drawn from 1DLH.pdb.] 


Cc 


hk FIGURE 34.36 Coreceptor CD4. 
This protein comprises four tandem 
immunoglobulin domains that extend from 
the surface of a helper T cell. [Drawn from 
1WIO.pdb.] 


FIGURE 34.37 Variations on a theme. 
(A) Cytotoxic T cells recognize foreign 
peptides presented in class | MHC proteins 
with the aid of the coreceptor CD8. 
(B) Helper T cells recognize peptides 
presented in class Il MHC proteins by 
specialized antigen-presenting cells with 
the aid of the coreceptor CD4. 
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FIGURE 34.38 Helper-T-cell action. The 
engagement of the T-cell receptor in helper 
T cells results in the secretion of cytokines. 
These cytokines bind to cytokine receptors 
expressed on the surface of the antigen- 
presenting cell, stimulating cell growth, 
differentiation, and, in regard to a B cell, 
antibody secretion. 
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When a helper T cell binds to an antigen-presenting cell expressing an 
appropriate class II MHC-—peptide complex, signaling pathways analogous 
to those in cytotoxic T cells are initiated by the action of the kinase Lck on 
ITAMs in the CD3 molecules associated with the T-cell receptor. However, 
rather than triggering events leading to the death of the attached cell, these 
signaling pathways result in the secretion of cytokines from the helper cell. 
Cytokines are a family of molecules that include, among others, interleukin-2 
and interferon-y. Cytokines bind to specific receptors on the antigen- 
presenting cell and stimulate its growth and differentiation (Figure 34.38). 
For example, these cytokines stimulate the differentiation of B cells into 
plasma cells, which specialize in the production of large amounts of antibody. 
Thus, the internalization and presentation of parts of a foreign pathogen help 
to generate a local environment in which cells taking part in the defense 
against this pathogen can flourish through the action of helper T cells. 


MHC proteins are highly diverse 


e MHC class I and II proteins, the presenters of peptides to T cells, 

were discovered because of their role in transplantation rejection. A tis- 
sue transplanted from one person to another or from one mouse to another is 
usually rejected by the immune system. In contrast, tissues transplanted 
from one identical twin to another or between mice of an inbred strain are 
accepted. Genetic analyses revealed that rejection occurs when tissues are 
transplanted between individual organisms having different genes in the 
major histocompatibility complex, a cluster of more than 75 genes playing 
key roles in immunity. The 3500-kb span of the MHC is nearly the length of 
the entire E. coli chromosome. The MHC encodes class I proteins (present- 
ers to cytotoxic T cells) and class II proteins (presenters to helper T cells), as 
well as class III proteins (components of the complement cascade) and many 
other proteins that play key roles in immunity. 

Human beings express six different class I genes (three from each par- 
ent) and six different class II genes. The three loci for class I genes are called 
HLA-A, -B, and -C; those for class II genes are called HLA-DP, -DQ, and 
-DR. These loci are highly polymorphic: many alleles of each are present in 


the population. For example, more than 50 each of HLA-A, -B, and -C 
alleles are known; the numbers discovered increase each year. Hence, the 
likelihood that two unrelated persons have identical class J and II proteins is 
very small (<10~*), accounting for transplantation rejection unless the 
genotypes of donor and acceptor are closely matched in advance. 
Differences between class I proteins are located mainly in the a, and a 

domains, which form the peptide-binding site (Figure 34.39). The a, 
domain, which interacts with a constant 8,-microglobulin, is largely con- 
served. Similarly, the differences between class II proteins cluster near the 
peptide-binding groove. Why are MHC proteins so highly variable? Their 
diversity makes the presentation of a very wide range of peptides to T cells pos- 
sible. A particular class I or class II molecule may not be able to bind any of 
the peptide fragments of a viral protein. The likelihood of a fit is markedly 
increased by having several kinds (usually six) of each class of presenters in 
each individual organism. If all members of a species had identical class I or 
class IT molecules, the population would be much more vulnerable to dev- 
astation by a pathogen that had mutated and thereby evaded presentation. 
The evolution of the diverse human MHC repertoire has been driven by the 
selection for individual members of the species who resist infections to 
which other members of the population may be susceptible. 


Human immunodeficiency viruses subvert the immune 
system by destroying helper T cells 


ey In 1981, the first cases of a new disease now called acquired immune 

deficiency syndrome (AIDS) were recognized. The victims died of rare 
infections because their immune systems were crippled. The cause was 
identified 2 years later by Luc Montagnier and coworkers. AIDS is pro- 
duced by human immunodeficiency virus (HIV), of which two major classes 
are known: HIV-1 and the much less common HIV-2. Like other retrovi- 
ruses, HIV contains a single-stranded RNA genome that is replicated 
through a double-stranded DNA intermediate. This viral DNA becomes 
integrated into the genome of the host cell. In fact, viral genes are tran- 
scribed only when they are integrated into the host DNA. 

The HIV virion is enveloped by a lipid-bilayer membrane containing 
two glycoproteins: gp41 spans the membrane and is associated with gp120, 
which is located on the external face (Figure 34.40). The core of the virus 
contains two copies of the RNA genome and associated transfer RNAs, as 
well as several molecules of reverse transcriptase. They are surrounded by 
many copies of two proteins called p17 and p24. The host cell for HIV is the 
helper T cell. The gp120 molecules on the membrane of HIV bind to CD4 
molecules on the surface of the helper T cell (Figure 34.41). This interaction 
allows the associated viral gp41 to insert its amino-terminal head into the 
host-cell membrane. The viral membrane and the helper-T-cell membrane 
fuse, and the viral core is released directly into the cytoplasm. Infection by 


CD4 
(N-terminal domains) 


K FIGURE 34.41 HIV receptor. 
A complex between a modified 
form of the envelope glycoprotein 
gp120 from HIV and a peptide 
corresponding to the two amino- 
terminal domains from the helper- 
T-cell protein CD4 reveals how 
viral infection of helper T cells is 
initiated. [Drawn from 1GC1.pdb.] 


k FIGURE 34.39 Polymorphism in 
class | MHC proteins. Notice that the 
positions of sites with a high degree of 
polymorphism in the human population are 
displayed as red spheres on the structure 
of the amino-terminal part of a class | MHC 
protein. [Drawn from 1HHK.pdb.] 


Genomic 


Reverse 
transcriptase 


FIGURE 34.40 Human immunodeficiency 
virus. A schematic representation of 

HIV reveals its proteins and nucleic acid 
components. [Information from G. B. Karlsson 
Hedestam et al., Nat. Rev. Microbiol. 
6:143-155, 2008, Fig. 2a.] 
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FIGURE 34.42 T cell selection. 

A population of thymocytes is subjected 
first to positive selection to remove cells 
that express T-cell receptors that will not 
bind to MHC proteins expressed by the 
individual organism. The surviving cells are 
then subjected to negative selection to 
remove cells that bind strongly to MHC 
complexes bound to self-peptides. 


Positive selection 
—— 
Only cells that 
bind to some MHC 
molecule survive 


HIV leads to the destruction of helper T cells because the permeability of 
the host plasma membrane is markedly increased by the insertion of viral 
glycoproteins and the budding of virus particles. The influx of ions and 
water disrupts the ionic balance, causing osmotic lysis. 


34.5 The Immune System Contributes to the Prevention 
and the Development of Human Diseases 


The primary function of the immune system is to protect the host from 
invasion by foreign organisms. But how does the immune system prevent 
itself from mounting attacks against its host? In other words, how does the 
immune system distinguish between self and nonself? Clearly, proteins 
from the organism itself do not bear some special tag identifying them. 
Instead, selection processes early in the developmental pathways for 
immune cells kill or suppress those immune cells that react strongly with 
self-antigens. The evolutionary paradigm still applies; immune cells that 
recognize self-antigens are generated, but selective mechanisms eliminate 
such cells in the course of development. 


T cells are subjected to positive and negative selection in the thymus 


T cells derive their name from the location of their production—the thy- 
mus, a small organ situated just above the heart. Examination of the devel- 
opmental pathways leading to the production of mature cytotoxic and 
helper T cells reveals the selection mechanisms that are crucial for distin- 
guishing self from nonself. These selection criteria are quite stringent; 
approximately 98% of the thymocytes, the precursors of T cells, die before 
the completion of the maturation process. 

Thymocytes produced in the bone marrow do not express the T-cell 
receptor complex, CD4, or CD8. On relocation to the thymus and rear- 
rangement of the T-cell-receptor genes, the immature thymocyte expresses 
all of these molecules. These cells are first subjected to positive selection 
(Figure 34.42). Cells for which the T-cell receptor can bind with reasonable 
affinity to either class I or class II MHC molecules survive this selection; 
those for which the T-cell receptor does not participate in such an interac- 
tion undergo apoptosis and die. The role of the positive selection step is to 
prevent the production of T cells that will not bind to any MHC complex pres- 
ent, regardless of the peptide bound. 

The cell population that survives positive selection is subjected to a sec- 
ond step, negative selection. Here, T cells that bind with high affinity to 
MHC complexes bound to self-peptides expressed on the surfaces of 
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FIGURE 34.43 Consequences of autoimmunity. Photomicrographs of an islet of 
Langerhans (A) in the pancreas of a normal mouse and (B) in the pancreas of a mouse with 
an immune response against pancreatic B cells, which results in a disease resembling insulin- 
dependent diabetes mellitus in human beings. Notice that the relatively pale cellular area in 
the center of part A is populated with the dark nuclei of inflammatory cells in part B. [From 

M. A. Atkinson and N. K. Maclaren. What causes diabetes? Scientific American, 1990. Courtesy 
Mark Atkinson.] 


antigen-presenting cells in the thymus undergo apoptosis or are otherwise 
suppressed. Those that do not bind too avidly to any such MHC complex 
complete development and become mature cytotoxic T cells (which express 
only CD8) or helper T cells (which express only CD4). The negative selec- 
tion step leads to self-tolerance; cells that bind an MHC-self-peptide com- 
plex are removed from the T-cell population. Similar mechanisms apply to 
developing B cells: maturing B cells that express antibodies that interact 
strongly with self-antigens are suppressed. 


Autoimmune diseases result from the generation of 
immune responses against self-antigens 


e Although thymic selection is remarkably efficient in suppressing the 
immune response to self-antigens, failures do occur. Such failures 
result in autoimmune diseases. These diseases include common illnesses such 
as insulin-dependent diabetes mellitus, multiple sclerosis, and rheumatoid 
arthritis. In these illnesses, immune responses against self-antigens result in 
damage to selective tissues that express the antigen (Figure 34.43). 
In many cases, the cause of the generation of self-reactive antibodies or 
T cells is unclear. However, in other cases, infectious organisms such as bac- 
teria or viruses may play a role. Infection leads to the generation of antibodies 
and T cells that react with many different epitopes from the infectious organ- 
ism. If one of these antigens closely resembles a self-antigen, an autoimmune 
response can result. For example, Streptococcus infections sometimes lead to 
rheumatic fever owing to the production of antibodies to streptococcal anti- 
gens that cross-react with exposed epitopes in heart muscle. 


The immune system plays a role in cancer prevention 


e The development of immune responses against proteins encoded by 
our own genomes can be beneficial in some circumstances. Cancer 
cells have undergone significant changes that often result in the production 
of proteins that are not normally expressed. For example, the mutation of 
genes can generate proteins that do not correspond in amino acid sequence 
to any normal protein. Such proteins may be recognized as foreign, and an 
immune response will be generated specifically against the cancer cell. 
Alternatively, cancer cells often produce proteins that are expressed during 
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embryonic development but are not expressed or are expressed at very low 
levels after birth. For example, a membrane glycoprotein called carcinoem- 
bryonic antigen (CEA) appears in the gastrointestinal cells of developing 
fetuses but is not normally expressed at significant levels after birth. More 
than 50% of patients with colorectal cancer have elevated serum levels of 
CEA. Immune cells recognizing epitopes from such proteins will not be 
subjected to negative selection and, hence, will be present in the adult 
immune repertoire. These cells may play a cancer surveillance role, killing 
cells that overexpress antigens such as CEA and preventing genetically 
damaged cells from developing into tumors. 


Vaccines are a powerful means to prevent and eradicate disease 


ee The discovery and utilization of vaccines, biological preparations that 

stimulate immunity to a particular pathogen or disease, represents a 
significant milestone in modern medical history. The pioneering work of 
Edward Jenner and Louis Pasteur in the eighteenth and nineteenth centu- 
ries revealed that inoculation with inactivated forms of a pathogen could 
protect against subsequent infection with the active pathogen. Treatment of 
a person with a vaccine, or immunization, leads to the development of immu- 
nological memory, the ability of the immune system to respond more rapidly 
and effectively to pathogens that have been encountered previously. At the 
cellular level, memory arises because positively selected B cells and T cells 
that have a high affinity for their target persist within the organism as 
memory B cells and memory T cells. 

In order for a vaccine to be effective, it must generate a sustained protec- 
tive response appropriate for the targeted pathogen. For example, patho- 
gens such as Clostridium tetani, the causative agent of tetanus, cause disease 
by the release of destructive proteins called toxins. Hence, an effective teta- 
nus vaccine should generate a strong B-cell-mediated response. In contrast, 
a vaccine designed to prevent disease caused by an intracellular pathogen 
should generate a strong T-cell-mediated response. The active component 
of a vaccine that is responsible for stimulating this immune response can 
take many forms: 


Killed or inactivated vaccines contain pathogens that have been rendered 
harmless by treatment with chemicals or high heat. 


Live attenuated vaccines contain live pathogens that have accumulated 
mutations so that they are no longer virulent to human cells. These vaccines 
are most commonly generated by repeated infection of the pathogen in cul- 
tured cells until its virulence is lost. 


Subunit vaccines contain a purified protein component of the pathogen. 
Such proteins can be either isolated from infected material (such as blood 
from chronically infected patients) or generated by recombinant methods. 


Toxoid vaccines are used against pathogens that employ an extracellular 
toxin to cause disease. These vaccines contain a form of the toxin that has 
been inactivated by treatment with chemicals or high heat. 


The effectiveness of immunization efforts has made the global eradica- 
tion of certain diseases possible. In 1966, 20 million cases of smallpox were 
reported worldwide. Just 14 years later, after coordinated vaccination efforts 
in regions of endemic disease, the World Health Assembly declared that 
smallpox had been eradicated. Nevertheless, despite the success of vaccines 
in the prevention of many devastating diseases, several pathogens have 
posed a significant challenge to vaccine development. For example, the 


development of an effective HIV vaccine has been complicated by the anti- 
genic diversity of HIV strains. Because its mechanism for replication is 
prone to error, a population of HIV presents an ever-changing array of coat 
proteins. Indeed, the mutation rate of HIV is more than 65 times higher 
than that of influenza virus. Nevertheless, the identification of protective 
antibodies in individuals resistant to HIV and advancements in vaccine 
design suggest that the development of an effective HIV vaccine remains a 
real possibility. 
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Two lines of defense against pathogens are the innate immune system 
and the adaptive immune system. The innate immune system targets 
features common to many different pathogens but misses those patho- 
gens lacking these features. The adaptive immune system 1s both more 
specific and wide-reaching. To respond effectively to a vast array of 
pathogens, this type of immune system must be tremendously flexi- 
ble. The adaptive immune system follows the principles of evolution: 
an enormously diverse set of potentially useful proteins is generated; 
these proteins are then subjected to intense selection so that only cells 
that express useful proteins flourish and continue development, until 
an effective immune response to a specific invader is generated. 


Antibodies Possess Distinct Antigen-Binding and Effector Units 

The major immunoglobulin in the serum is immunoglobulin G. An IgG 
protein is a heterotetramer with two heavy chains and two light chains. 
Treatment of IgG molecules with proteases such as papain produces 
three fragments: two F’,,, fragments that retain antigen-binding activity 
and an F’, fragment that retains the ability to activate effector functions 
such as the initiation of the complement cascade. The F.,,, fragments 
include the L chain and the amino-terminal half of the H chain; the F : 
domain is a dimer consisting of the carboxyl-terminal halves of two H 
chains. Five different classes of antibody—IgG, IgM, IgA, IgD, and 
IgE—differ in their heavy chains and, hence, in their effector functions. 


Antibodies Bind Specific Molecules Through Hypervariable Loops 

One particular protein fold is found in many of the key proteins of the 
immune system. The immunoglobulin fold consists of a pair of B 
sheets that pack against each other, linked by a single disulfide bond. 
Loops projecting from one end of the structure form a binding surface 
that can be varied by changing the amino acid sequences within the 
loops. Domains with immunoglobulin folds are linked to form anti- 
bodies and other classes of proteins in the immune system, including 
T-cell receptors. Two chains come together to form the binding surface 
of an antibody. Three loops from each domain, the complementarity- 
determining regions, form an essentially continuous surface that can 
vary tremendously in shape, charge, and other characteristics to allow 
particular antibodies to bind to molecules ranging from small mole- 
cules to large protein surfaces. 


Diversity Is Generated by Gene Rearrangements 

The tremendous diversity of the amino acid sequences of antibodies is 
generated by segmental rearrangements of genes. For antibody k light 
chains, 1 of 40 variable regions is linked to 1 of 5 joining regions. The 
combined VJ unit is then linked to the constant region. Thousands of 
different genes can be generated in this manner. Similar arrays are 
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rearranged to form the genes for the heavy chains, but an additional 
region called the diversity region lies between the V and the J regions. 
The combination of L and H chains, each obtained through such rear- 
ranged genes, can produce more than 10° distinct antibodies. Different 
classes of antibodies are also generated by gene rearrangements that 
lead to class switching. Oligomerization of membrane-bound anti- 
body molecules initiates a signal-transduction cascade inside B cells. 
Key steps in this signaling process include the phosphorylation of 
specific tyrosine residues in sequences termed immunoreceptor tyro- 
sine-based activation motifs, present in proteins that associate with 
the membrane-bound antibodies. 


Major-Histocompatibility-Complex Proteins Present Peptide Antigens 
on Cell Surfaces for Recognition by T-Cell Receptors 

Intracellular pathogens such as viruses and mycobacteria cannot be easily 
detected. Intracellular proteins are constantly being cut into small pep- 
tides by proteasomes and displayed in class I major-histocompatibility- 
complex proteins on cell surfaces. Such peptides lie in a groove defined 
by two helices in the class I MHC proteins. The combination of MHC 
protein and peptide can be bound by an appropriate T-cell receptor. 
T-cell receptors resemble the antigen-binding domains of antibodies in 
structure, and diversity in T-cell-receptor sequence is generated by V(D) 
J gene rearrangements. The T-cell receptor recognizes features of both 
the peptide and the MHC molecule that presents it. 

Cytotoxic T cells initiate apoptosis in cells to which they bind 
through interactions between T-cell receptors and class | MHC-— 
peptide complexes aided by interactions with the coreceptor molecule 
CD8. Helper T cells recognize peptides presented in class IT MHC 
proteins, a distinct type of MHC protein expressed only on antigen- 
presenting cells, such as B cells and macrophages. Helper T cells 
express the coreceptor CD4 rather than CD8. CD4 interacts with class 
Il MHC proteins present on antigen-presenting cells. Signaling pathways, 
analogous to those in B cells, are initiated by interactions between MHC-— 
peptide complexes and T-cell receptors and the CD8 and CD4 core- 
ceptors. Human immunodeficiency virus damages the immune system 
by infecting and killing cells that express CD4, such as helper T cells. 


The Immune System Contributes to the Prevention and the 
Development of Human Diseases 

In principle, the immune system is capable of generating antibodies and 
T-cell receptors that bind to self-molecules—that is, molecules that are 
normally present in a healthy and uninfected individual organism. 
Selection mechanisms prevent such self-directed molecules from being 
expressed at high levels. The selection process includes both positive 
selection, to enrich the population of cells that express molecules that 
have the potential to bind foreign antigens in an appropriate context, 
and negative selection, which eliminates cells that express molecules 
with too high an affinity for self-antigens. Autoimmune diseases such 
as insulin-dependent diabetes mellitus can result from the amplifica- 
tion of a response against a self-antigen. Vaccines stimulate immuno- 
logical memory so as to prevent, and even eradicate, disease. 
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PROBLEMS 


1. First things first. Distinguish between the innate and 
adaptive immune systems. 


2. Antibody diversity. What are the mechanisms used by 
B cells to generate antibody diversity? 


3. Hang in there. Explain the difference between affinity 
and avidity. For which immunoglobulin class might avidity 
be particularly important in antigen recognition? 


4. Innate abilities. A strain of mice has been identified that 
does not respond to LPS. This lack of response is due to a 
single amino acid change in the intracellular domain of mouse 
TLR4. Propose an explanation for the lack of response. 


5. TLR ligands. The PAMP recognized by TLR3 is double- 
stranded RNA (dsRNA). Against which pathogens would 
TLR3 be an effective immune receptor? 


6. Energetics and kinetics. Suppose that the dissociation 
constant of an F, hapten complex is 3 X 1077 M at 25°C. 


(a) What is the standard free energy of binding? 


(b) Immunologists often speak of affinity (K,), the recipro- 
cal of the dissociation constant, in comparing antibodies. 
What is the affinity of this F.,? 


(c) The rate constant for the release of hapten from the com- 
plex is 120 s-'. What is the rate constant for association? 
What does the magnitude of this value imply about the extent 
of structural change in the antibody on binding hapten? 


7. A brilliant emitter. Certain naphthalene derivatives, such 
as the dansyl group, exhibit a weak yellow fluorescence 
when they are in a highly polar environment (such as water) 
and an intense blue fluorescence when they are in a mark- 
edly nonpolar environment (such as hexane). The binding 
of e-dansyl-lysine to specific antibody is accompanied by a 
marked increase in its fluorescence intensity and a shift in 
color from yellow to blue. What does this finding reveal 
about the hapten—antibody complex? 


8. Miniantibody. The F,,, fragment of an antibody molecule 
has essentially the same affinity for a monovalent hapten as 
does intact IgG. 


(a) What is the smallest unit of an antibody that can retain 
the specificity and binding affinity of the whole protein? 


(b) Design a compact single-chain protein that is likely to 
specifically bind antigen with high affinity. 


9. Turning on B cells. B lymphocytes, the precursors of 
plasma cells, are triggered to proliferate by the binding of 
multivalent antigens to receptors on their surfaces. The cell- 
surface receptors are transmembrane immunoglobulins. 
Univalent antigens, in contrast, do not activate B cells. 


(a) What do these findings reveal about the mechanism of 
B-cell activation? 


(b) How might antibodies be used to activate B cells? 


10. An ingenious cloning strategy. In the cloning of the gene 
for the a chain of the T-cell receptor, T-cell cDNAs were 
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hybridized with B-cell mRNAs. What was the purpose of 
this hybridization step? Can the principle be applied generally? 


11. Pathogen susceptibility. Patients carrying specific muta- 
tions in the gene encoding the TLR4 protein are susceptible 
to infections from Gram-negative bacteria. Why are these 
patients vulnerable to this particular type of pathogen? 


12. Matchmaker, matchmaker. Why is it important to 
match HLA alleles between donor and recipient in organ 
transplantation? 


13. Instruction. Before the mechanism for generating anti- 
body diversity had been established, a mechanism based on 
protein folding around an antigen was proposed, primarily 
by Linus Pauling. In this model, antibodies that had differ- 
ent specificities had the same amino acid sequence but were 
folded in different ways. Propose a test of this model. 


14. Dealing with nonsense. Cells, including immune cells, 
degrade mRNA molecules in which no long open reading 
frame is present. The process is called nonsense-mediated 
RNA decay. Suggest a role for this process in immune cells. 


15. Down, but not out. To understand the genes responsible 
for growth and infectivity in a disease-causing bacterial 
strain, you perform chemical mutagenesis on a culture of 
these bacteria. In the course of your investigation into the 
properties of the resulting mutants, you identify a set of 
mutant bacteria that is still viable but their virulence is sig- 
nificantly impaired. How might these mutants be useful for 
vaccine development? 


16. Presentation. The amino acid sequence of a small protein is 


MSRLASKNLIRSDHAGGLLOATYSAVSS-— 
IKNTMSFGAWSNAALNDSRDA 


Predict the most likely peptide to be presented by the class I 
MHC molecule HLA-A2. 


Mechanism Problem 


17. Catalytic antibody. Antibody is generated against a 
transition state for the hydrolysis of the following ester. 


CcOoO- 


Ester 


CcOoO- 


: CH, 
Qo 
yi LY 
OoN No 


Transition-state analog 


Some of these antibodies catalyze the hydrolysis of the ester. 
What amino acid residue might you expect to find in the 
binding site on the antibody? 


Chapter Integration Problem 


18. Signaling. Protein tyrosine phosphatases, such as the 
molecule CD45 expressed in both B cells and T cells, play 
important roles in activating such protein tyrosine kinases 
as Fyn and Lck, which are quite similar to Src. Suggest a 
mechanism for the activation of such protein kinases by 
the removal of a phosphoryl group from a phosphotyro- 
sine residue. 


Data Interpretation Problem 


19. Affinity maturation. A mouse is immunized with an 
oligomeric human protein. Shortly after immunization, a 
cell line that expresses a single type of antibody molecule 
(antibody A) is derived. The ability of antibody A to bind 
the human protein is assayed, with the results shown in the 
graph below. 
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After repeated immunizations with the same protein, 
another cell line is derived that expresses a different anti- 
body (antibody B). The results of analyzing the binding of 
antibody B to the protein also are shown. From these data, 
estimate 


(a) the dissociation constant (K,) for the complex between 
the protein and antibody A. 


(b) the dissociation constant for the complex between the 
protein and antibody B. 


Comparison of the amino acid sequences of antibody A and 
antibody B reveals them to be identical except for a single 
amino acid. What does this finding suggest about the 
mechanism by which the gene encoding antibody B was 
generated? 


CHAPTER 


Molecular Motors 


The impressive muscles of the horse, like the muscles of all animals, are 
powered by the molecular-motor protein myosin. A part of myosin moves 
dramatically (as shown above) in response to ATP binding, hydrolysis, and 
product release, propelling myosin along an actin filament. This molecular 
movement is translated into movement of the entire animal, vividly depicted 
in da Vinci's rearing horse. [(Left) The Royal Collection © 2014 Her Majesty 
Queen Elizabeth II/Bridgeman Images.] 


OUTLINE 
6 ere from human beings to bacteria, move to adapt to changes in 
their environments, navigating toward food and away from danger. 35.1 Most Molecular-Motor Proteins 
Cells themselves are not static but are bustling assemblies of moving pro- Are Members of the P-Loop 
teins, nucleic acids, and organelles. This motion is enabled by two elements: NTPase Superfamily 


molecular-motor proteins and complex networks of filamentous proteins 
termed the cytoskeleton (Figure 35.1). The dynamic networks that deter- 
mine the shape and mobility of calls are among the most active areas of 
investigation in modern cell biology. Remarkably, the fundamental bio- 
chemical mechanisms that produce contractions in our muscles are the same 
as those that propel organelles along the cytoskeleton. In fact, many of the 35.4 A Rotary Motor Drives 
proteins that play key roles in converting chemical energy into kinetic Bacterial Motion 
energy are members of the same protein family, the P-loop NTPases, the 
hugely important group of proteins that we first examined in Chapter 9. 
These molecular motors are homologous to proteins that we have encoun- 
tered in other contexts, including the G proteins in protein synthesis, sig- 
naling, and other processes. Once again, we see the economy of evolution in 
adapting existing proteins to perform new functions. 

Molecular motors operate by small increments, converting changes in 
protein conformation into directed motion. Orderly motion across distances 
requires a track that steers the motion of the motor assembly. Indeed, we have 


35.2 Myosins Move Along 
Actin Filaments 


35.3 Kinesin and Dynein Move 
Along Microtubules 
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FIGURE 35.1 Motion within cells. This 
high-voltage electron micrograph shows 
the mitotic apparatus in a metaphase 
mammalian cell. The large cylindrical 
objects are chromosomes, and the 
threadlike structures stretched across the 
center are microtubules, key components 
of the skeleton. Microtubules serve as 
tracks for the molecular motors that move 
chromosomes. Many processes, including 
chromosome segregation in mitosis, 
depend on the action of molecular-motor 
proteins. [Courtesy of Dr. J. R. Mclntosh.] 
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already encountered a class of molecular motors that utilize mechanisms that 
we will examine here—namely, the helicases that move along DNA during 
DNA replication (Section 28.1). The proteins on which we will focus in this 
chapter move along actin and microtubules—protein filaments composed of 
repeating subunits. The motor proteins cycle between forms having high or 
low affinity for the filament tracks in response to ATP binding and hydroly- 
sis, enabling a bind, pull, and release mechanism that generates motion. 

We will also consider a completely different strategy for generating 
motion, one used by bacteria such as E. coli. A set of flagella act as propellers, 
rotated by a motor in the bacterial cell membrane. This rotary motor is driven 
by a proton gradient across the membrane, rather than by ATP hydrolysis. 
The mechanism for coupling the proton gradient to rotatory motion is analo- 
gous to that used by the F) subunit of ATP synthase. Thus, both of the major 
modes for storing biochemical energy—namely, ATP and ion gradients— 
have been harnessed by evolution to drive organized molecular motion. 


35.1 Most Molecular-Motor Proteins Are Members 
of the P-Loop NTPase Superfamily 


Eukaryotic cells contain three major families of motor proteins: myosins, 
kinesins, and dyneins. Members of each of these classes move along compo- 
nents of the cytoskeleton, but, at first glance, these protein families appear 
to be quite different from one another. Myosin, first characterized on the 
basis of its role in muscle, moves along filaments of the protein actin. Each 
molecule of muscle myosin consists of two copies each of a heavy chain with 
a molecular mass of 220 kDa, an essential light chain, and a regulatory light 
chain. The human genome encodes more than 40 distinct myosins; some 
function in muscle contraction, and others participate in a variety of other 
processes. Kinesins, which have roles in protein, mRNA, and vesicle trans- 
port as well as construction of the mitotic spindle and chromosome segrega- 
tion, are generally dimers of two polypeptides. The human genome encodes 
more than 40 kinesins. Dyneins power the motion of cilia and flagella, and a 
general cytoplasmic dynein contributes to a variety of motions in all cells, 
including vesicle transport and various transport events in mitosis. Dyneins 
are enormous, with heavy chains of molecular mass greater than 500 kDa. 
The human genome encodes approximately 10 dyneins. 

Initially, comparison of the amino acid sequences of myosins, kinesins, 
and dyneins did not reveal significant relationships between these protein 
families but, after their three-dimensional structures were determined, 
members of the myosin and kinesin families were found to have remarkable 
similarities. In particular, both myosin and kinesin contain P-loop NTPase 
cores homologous to those found in G proteins. Sequence analysis of the 
dynein heavy chain reveals it to be a member of the AAA subfamily of 
P-loop NTPases that we encountered in the context of the 19S proteasome 
(Section 23.2). Dynein has six sequences encoding such P-loop NTPase 
domains arrayed along its length, although only four actually appear to bind 
nucleotides. Thus, we expect similarities in the mechanisms of action, and 
we can draw on our knowledge of P-loop NTPases in general as we analyze 
the mechanisms of action of these motor proteins. 


Molecular motors are generally oligomeric proteins with 
an ATPase core and an extended structure 


Let us first consider the structure of myosin, which we examined briefly in 
Chapter 9. The results of electron microscopic studies of skeletal-muscle 
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Light meromyosin (LMM) 


Heavy meromyosin (HMM) 


FIGURE 35.2 Myosin dissection. Treatment of muscle myosin with proteases forms stable 
fragments, including subfragments S1 and S2 and light meromyosin. Each S1 fragment 
includes a head (shown in yellow or purple) from the heavy chain and one copy of each light 
chain (shown in blue and orange). 


myosin show it to be a two-headed structure linked to a long stalk (Figure 
9.42). The treatment of myosin with trypsin and papain results in the forma- 
tion of four fragments: two 51 fragments; heavy meromyosin (HMM) which 
consists of the S1 fragments and an additional region termed 82; and a frag- 
ment called light meromyosin (LMM; Figure 35.2). Each S1 fragment corre- 
sponds to one of the heads of the intact structure and includes 850 
amino-terminal amino acids from one of the two heavy chains as well as one 
copy of each of the light chains. Examination of the structure ofan 51 fragment 
at high resolution reveals the presence of the P-loop NTPase-domain core that 
is the site of ATP binding and hydrolysis (Figure 35.3). We examined the 
structure and mechanism of action of this motor domain in Chapter 9. 


Essential light chain 


hk FIGURE 35.3 Myosin structure at 
high resolution. The structure of the S1 
fragment from muscle myosin reveals the 
presence of a P-loop NTPase domain 
(shaded in purple). Notice that an a helix 
extending from this domain is the binding 
site for the two light chains. [Drawn from 
1DFL.pdb.] 


Nucleotide- 


Actin- binding site 


binding site 


Extending away from this structure is a long a helix from the heavy chain. 
This helix is the binding site for the two light chains. The light chains are mem- 
bers of the EF-hand family, similar to calmodulin (Figure 14.17), although 
most of the EF hands in light chains do not bind metal ions (Figure 35.4). 
Like calmodulin, these proteins wrap around an « helix, serving to thicken and 


Essential light chain Regulatory light chain Calmodulin 
Mg?* 


Ca2* 


FIGURE 35.4 Myosin light chains. 
The structures of the essential and regulatory 
light chains of muscle myosin are compared 
with the structure of calmodulin. Notice the 
similarities in the structures that allow each 
of these homologous proteins to bind an 
a helix (not shown) by wrapping around it. 
[Drawn from 1DFL.pdb and 1CM1.pdb.] 


FIGURE 35.5 Myosin two-stranded 
coiled coil. The two a helices form left- 
handed supercoiled structures that spiral 
around each other. Such structures are 
stabilized by hydrophobic residues at the 
contact points between the two 
helices. [Drawn from 2TMA.pdb.] 


Nucleotide- 
binding site 


hk FIGURE 35.7 Structure of head 
domain of kinesin at high resolution. 
Notice that the head domain of kinesin 
has the structure of a P-loop NTPase core 
(indicated by purple shading). [Drawn from 
116l.odb.] 
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stiffen it. The remaining fragments of myosin—S2 and light meromyosin—are 
largely a helical, forming two-stranded coiled coils created by the remaining 
lengths of the two heavy chains wrapping around each other (Figure 35.5). 
These structures, together extending approximately 1700 A, link the myosin 
heads to other structures. In muscle myosin, several LMM domains come 
together to form higher-order bundles. 

Conventional kinesin (kinesin 1), the first kinesin discovered, has several 
structural features in common with myosin. The dimeric protein has two 
heads connected by an extended structure (Figure 35.6). The size of the 
head domain is approximately one-third that of myosin. Determination of 
the three-dimensional structure of a kinesin fragment revealed that the head 
domain also is built around a P-loop NTPase core (Figure 35.7). The myo- 
sin domain is so much larger than that of kinesin because of two large inser- 
tions in the myosin domain that bind to actin filaments. For conventional 
kinesin, a region of approximately 500 amino acids extends from the head 
domain. Like the corresponding region in myosin, the extended part of 
kinesin forms an a-helical coiled coil. Conventional kinesin also has light 
chains, but, unlike those of myosin, these light chains bind near the car- 
boxyl terminus of the heavy chain and are thought to link the motor to 
intracellular cargo. 


Light chains “/) 
wf hf Neck linker 
pte —S— SS ee —— 
ie Coiled coil | 


FIGURE 35.6 Structure of kinesin. The elongated structure has the head domains at one 
end and the cargo-binding domains at the other, linked by a long coiled-coil region. 


Dynein has a significantly different structure than myosin or kinesin. As 
noted earlier, the dynein heavy chain includes six regions that are homolo- 
gous to the AAA subfamily of ATPase domains. A model for the structure 
of the motor domain of dynein was generated based on the structures of 
other AAA ATPases. This model has been confirmed and extended by 
more recent structure determinations of dynein itself (Figure 35.8). The 
head domain is appended to a region of approximately 1300 amino acids 
that forms an extended structure that links dynein units together to form 
oligomers and interacts with other proteins. 

Although the structures of these three classes of molecular motors have 
significant differences, some common features emerge. Almost all struc- 
tures are dimeric with two head domains, have regions of extended but quite 
rigid structures, and have regions for interacting with other proteins. As we 
shall see, these structures are suitable for actions that resemble climbing a 
rope, hand over hand. The regions for interacting with other proteins repre- 
sent the grasping hands, the extended structures represent the arms that act 
as levers to promote larger-scale motion, and the head domains are the 
engines that provide the necessary mechanical energy. 


ATP binding and hydrolysis induce changes in the conformation 
and binding affinity of motor proteins 


In Chapter 9, we examined the conformational changes that take place in the 
myosin ATPase domain from the slime mold Dictyostelium. The structures 
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35.1 Molecular-Motor Proteins 


_— AAA domains 


eo 


Coiled coil Associated chains 


Linker 


FIGURE 35.8 Dynein structure. (A) The overall structure of dynein. (B) A model of the 
motor domain of dynein. Notice the six P-loop NTPase domains, some of which bind to and 
hydrolyze ATP. [Drawn from 1HN5.pdb.] 


of myosin ATP domains from other sources have been elucidated in a variety 
of forms as well. The 51 fragment of myosin from scallop muscle provides a 
striking example of the changes observed (Figure 35.9). The structure of this 
51 fragment has been determined in a number of forms including that with- 
out bound nucleotide and that bound to a complex formed of ADP and 
vanadate (VO,°—), which, as mentioned in Chapter 9, is an analog of the 
ATP-hydrolysis transition state. The long helix that binds the light chains 
(hereafter referred to as the lever arm) protrudes outward from the head 
domain. Comparison of the structures reveals that the lever arm has rotated 
by nearly 90 degrees in the ADP-VO,?~ complex compared with its posi- 
tion in the nucleotide-free form. How does the species in the nucleotide- 
binding site cause this dramatic transition? Two regions around the 
nucleotide-binding site (termed switch I and switch I) tightly conform to the 
shape of the y-phosphoryl group of the ATP analog and adopt a looser 


(A) Myosin-ADP-VO,>- complex (B) Myosin-ADP complex 


kh FIGURE 35.9 Lever-arm motion. Two forms of the S1 fragment of scallop-muscle 
myosin. Notice the dramatic conformational changes when the identity of the bound nucleotide 
changes from the ADP-VO,3— complex (A) to the nucleotide-free form (B) or vice versa, including 
a nearly 90-degree reorientation of the lever arm. [Drawn from 1DFL.pdb and 1SR6.pdb.] 


Position of conformation when the y-phosphoryl group is absent 
lever arm when 
> no nucleotide is bound 


(Figure 35.10). This conformational change allows a long 
« helix (termed the relay helix) to adjust its position. The 
carboxyl-terminal end of the relay helix interacts with 
structures at the base of the lever arm, and so a change in 
Q the position of the relay helix leads to a reorientation of 

the lever arm. . 
Bociionot Analogous conformational changes take place in 
lever arm when kinesin. The kinesins also have a relay helix that can 
ADP-VO,*" is bound adopt different configurations when kinesin binds dif- 
ferent nucleotides. Kinesin lacks an a-helical lever arm, 
however. Instead, a relatively short segment termed the 
neck linker changes conformation in response to nucleo- 
tide binding (Figure 35.11). The neck linker binds to the 
Switch II head domain of kinesin when ATP is bound but is 
on released when the nucleotide-binding site is vacant or 
Switch | occupied by ADP. 


FIGURE 35.10 Relay helix. A superposition of key elements in two 
forms of scallop myosin reveals the structural changes that are 
transmitted by the relay helix from the switch | and switch Il loops to the 


base of the lever arm. Notice that the switch | and switch II loops 35.2 Myosins Move Along Actin Filaments 
interact with VO,3— in the position that would be occupied by the ‘ : : : . 

‘y-phosphoryl group of ATP. The structure of the myosin-ADP-VO,3- Myosins, kinesins, and dyneins move by cycling between 
complex is shown in lighter colors. [Drawn from 1DFL.pdb and 1SR6.pdb.] states with different affinities for the long, polymeric 


macromolecules that serve as tracks along which they 

move. For myosin, the molecular track is a polymeric 
form of actin, a 42-kDa protein that is among the most abundant in cells, 
typically accounting for as much as 10% of the protein in eukaryotic cells. 
We begin with a general discussion of the polymeric structure of actin and 
its assembly. We then examine the interactions between myosin and actin, 
including both structure and the dynamic interactions between these two 
proteins. Finally, we turn to the structure of muscle and the roles of myosin 


KR FIGURE 35.11 Neck linker. A and actin in muscle contraction. 


comparison of the structures of a kinesin i : : 
bound to ADP and bound to an ATP analog. Actin is a polar, self-assembling, dynamic polymer 


Neotel INe EGS WS OMEN ee) waniCH The structure of the actin monomer has been determined to atomic resolu- 
connects the head domain to the remainder 


Gi the nes mic odie 1 bound tothe tion by x-ray crystallography and has been used to interpret the structure of 
head domain in the presence of the ATP actin filaments, already somewhat understood through electron microscopy 


analog but is free in the presence of ADP studies at lower resolution. Each actin monomer comprises four domains 
only. [Drawn from 1I6l.pdb and 115S.pdb.] 


Kinesin-ATP complex ) | Kinesin-ADP complex 


Relay helix ' * Neck linker 


and switch II 
1016 


Pointed end 


Nucleotide- 
binding site 


Barbed end 


(Figure 35.12). These domains come together to surround a bound nucleo- 
tide, either ATP or ADP. The ATP form can be converted into the ADP 
form by hydrolysis. 

Actin monomers (often called G-actin for globular) come together to 
form actin filaments (often called F-actin; see Figure 35.12). F-actin has a 
helical structure; each monomer is related to the preceding one by a transla- 
tion of 27.5 A and a rotation of 166 degrees around the helical axis. Because 
the rotation is nearly 180 degrees, F-actin resembles a two-stranded cable. 
Note that each actin monomer is oriented in the same direction along the 
F-actin filament, and so the structure is polar, with discernibly different 
ends. One end is called the barbed (plus) end, and the other is called the 
pointed (minus) end. The names “barbed” and “pointed” refer to the 
appearance of an actin filament when myosin S1 fragments are bound to it. 

How are actin filaments formed? Like many biological structures, actin 
filaments self-assemble; that is, under appropriate conditions, actin mono- 
mers will come together to form well-structured, polar filaments. The 
aggregation of the first two or three monomers to form a filament is highly 
unfavorable. Thus, specialized protein complexes, including one called 
Arp2/3, serve as nuclei for actin assembly in cells. Once such a filament 
nucleus exists, the addition of subunits is more favorable. Let us consider 
the polymerization reaction in more detail. We designate an actin filament 
with n subunits A,. This filament can bind an additional actin monomer, 
A, to form A, ,- 
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Kk FIGURE 35.12 Actin structure. (Left) 
Schematic view of actin monomers (one in 
blue) of an actin filament. (Right) The 
domains in the four-domain structure of an 
actin monomer are identified by different 
shades of blue. Notice the nucleotide- 
binding site at the center of the 
structure. [Drawn from 1J6Z.pdb.] 
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The dissociation constant, K ai for this reaction, defines the monomer 
concentrations at which the polymerization reaction will take place, because 
the concentration of polymers of length n + 1 will be essentially equal to 
that for polymers of length n. Thus, 


[An]~[An+1] and Kg = ~~ [A] 
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FIGURE 35.13 The structure of myosin 
bound to actin. (A) The gray surface 
represents the structure observed by 
cryoelectron microscopy, with the green 
space-filling model representing one actin 
subunit. The ribbon diagram shows the 
structure of the S1 fragment of myosin 
docked into the cryoelectron microscopic 
structure. Notice that some of the myosin 
structure lies outside the gray surface. 

(B) The structure after the myosin S1 
fragment has been allowed to adjust to 
more closely match the structure observed 
by cryoelectron microscopy. Notice that the 
myosin structure now more closely matches 
the gray surface. [M. Lorenz and K. Holmes. 
PNAS 2010 (107) 12529-12534. Copyright 
2010 National Academy of Sciences.] 


In other words, the polymerization reaction will proceed until the monomer 
concentration is reduced to the value of K4. If the monomer concentration 
is below the value of Kj, the polymerization reaction will not proceed at all; 
indeed, existing filaments will depolymerize until the monomer concentra- 
tion reaches the value of Kj. Because of these phenomena, K, is referred to 
as the critical concentration for the polymer. Recall that actin contains a 
nucleotide-binding site that can contain either ATP or ADP. The critical 
concentration for the actin—ATP complex is approximately 20-fold lower 
than that for the actin—ADP complex; actin-ATP polymerizes more readily 
than does actin—ADP. 

Actin filaments inside cells are highly dynamic structures that are contin- 
ually gaining and losing monomers. Nucleation by complexes such as Arp2/3 
can initiate the polymerization of actin-ATP. In contrast, the hydrolysis of 
bound ATP to ADP favors actin depolymerization. This reaction acts as a 
timer to make actin filaments kinetically unstable as ATP hydrolyzes over 
time, making the filaments more unstable. Proteins that bind actin monomers 
or promote the severing of actin filaments also play roles. Polymerization 
reactions can exert force, pushing or pulling on cell membranes. Regulated 
actin polymerization is central to the changes in cell shape associated with cell 
motility in amebae as well as in human cells such as macrophages. 


Myosin head domains bind to actin filaments 


It has not been possible to determine the in vivo structure of a complex 
between actin and myosin at sufficiently high resolution to discern molecu- 
lar details. However, treatment of actin filaments with myosin 51 fragments 
in the absence of ATP results in a complex referred to as decorated actin for 
which the structure has been determined by cryoelectron microscopy to 
a resolution of 13 A. Although a structure at this resolution alone would 
not be adequate to observe molecular details, superimposition of the high- 
resolution structures of actin monomers and the myosin 51 fragment on the 
structure of decorated actin can be a source of insight into the details of its 
structure (Figure 35.13). The myosin head domain is in a conformation 
close to that observed for the nucleotide-free form. This structure also 
reveals the interaction surfaces between myosin and actin. The modeling 
suggests that the myosin head-domain conformation changes somewhat to 
increase its interaction with the actin filament. These conformational 
changes result in a slight opening of the nucleotide-binding site in myosin. 
This observation has implications for the mechanism by which myosin 
moves along actin filaments. 


Motions of single motor proteins can be directly observed 


Now that we understand the conformational changes behind myosin’s 
action, we can explore how myosin “walks” along its actin track. Studies of 
single myosin molecules moving relative to actin filaments have been sources 
of deep insight into the mechanisms underlying muscle contraction and 
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other complex processes. A powerful tool for these studies, called an optical 
trap, relies on highly focused laser beams (Figure 35.14). Small beads can be 
caught in these traps and held in place in solution. 

The position of the beads can be monitored with nanometer precision. 
James Spudich and his coworkers designed an experimental arrangement 
consisting of an actin filament that had a bead attached to each end. Each 
bead could be caught in an optical trap (one at each end of the filament) and 
the actin filament could be pulled taut over a microscope slide containing 
other beads that had been coated with fragments of myosin such as the 
heavy meromyosin fragment (Figure 35.14). On the addition of ATP, tran- 
sient displacements of the actin filament were observed along its long axis. 
The size of the displacement steps was fairly uniform with an average size 
of 11 nm (110 A). 

The results of these studies, performed in the presence of varying con- 
centrations of ATP, are interpreted as showing that individual myosin 
heads bind the actin filament and undergo a conformational change (the 
power stroke) that pulls the actin filament, leading to the displacement of the 
beads. After a period of time, the myosin head releases the actin, which then 
snaps back into place. 


Phosphate release triggers the myosin power stroke 


How does ATP hydrolysis drive the power stroke? A key observation is that 
the addition of ATP toa complex of myosin and actin results in the dissocia- 
tion of the complex. Thus, ATP binding and hydrolysis cannot be directly 
responsible for the power stroke. We can combine this fact with the struc- 
tural observations described earlier to construct a mechanism for the motion 
of myosin along actin (Figure 35.15). Let us begin with nucleotide-free 
myosin bound to actin. The binding of ATP to actin results in the dissocia- 
tion of myosin from actin. With ATP bound and free of actin, the myosin 
domain can undergo the conformational change associated with the forma- 
tion of the transition state for ATP hydrolysis. This conformational change 
results in the reorientation of the lever arm. In this form, the myosin head 
can dock onto the actin filament; phosphate is released with concomitant 
motion of the lever arm. This conformational change represents the power 
stroke and moves the body of the myosin molecule relative to the actin filament 
by approximately 110 A. The release of ADP completes the cycle. 


Muscle is a complex of myosin and actin 


The mechanism of moving a single myosin molecule relative to an actin fila- 
ment explains how muscles contract. Vertebrate muscle that is under voluntary 
control, such as the biceps and triceps in your upper arm, has a banded (stri- 
ated) appearance when examined under a light microscope. It consists of 
multinucleated cells that are bounded by an electrically excitable plasma 
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FIGURE 35.14 Watching a single motor 
protein in action. (A) An actin filament 
blue) is placed above a heavy meromyosin 
HMM) fragment (yellow) that projects from 
a bead on a glass slide. A bead attached 
0 each end of the actin filament is held in 
an optical trap produced by a focused, 
intense infrared laser beam (orange). The 
position of these beads can be measured 
with nanometer precision. (B) Recording of 
he displacement of an actin filament due 
© a myosin derivative attached to a bead, 
influenced by the addition of ATP. Note the 
airly uniform step sizes that are 

observed. [(A) Information from J. T. Finer, 

R. M. Simmons, and J. A. Spudich, Nature 
368:113-119, 1994. (B) Data from R. S. 
Rock, M. Rief, A. D. Metra, and J. A. Spudich, 
Methods 22:378-381, 2000.] 
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FIGURE 35.15 Myosin motion 
along actin. A myosin head (yellow) 
in the apo form is bound to an actin 
filament (blue). The binding of ATP (1) 
results in the release of myosin from 
actin. The reversible hydrolysis of 
ATP bound to myosin (2) can result 
in the reorientation of the lever arm. 
With ATP hydrolyzed but still bound 
to actin, myosin can bind actin (8). 
The release of P; (4) results in the 
reorientation of the lever arm and the 
concomitant motion of actin relative 
to myosin. The release of ADP (6) 
completes the cycle. 
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membrane. A muscle cell contains many parallel myo- 
fibrils, each about 1 pm in diameter. The functional 
unit, called a sarcomere, typically repeats every 2.3 wm 
(23,000 A) along the fibril axis in relaxed muscle 
(Figure 35.16). A dark A band and a light I band alter- 
nate regularly. The central region of the A band, termed 
the H zone, is less dense than the rest of the band. The 
I band is bisected by a very dense, narrow Z line. 

The underlying molecular plan of a sarcomere is 
revealed by cross sections of a myofibril. These cross 
sections show the presence of two kinds of interact- 
ing protein filaments. The thick filaments have diam- 
eters of about 15 nm (150 A) and consist primarily of 
myosin. The thin filaments have diameters of approx- 
imately 8 nm (80 A) and consist of actin as well as 
tropomyosin and the troponin complex. Muscle con- 
traction is achieved through the sliding of the thin 
filaments along the length of the thick filaments, 
driven by the hydrolysis of ATP (Figure 35.17). 

To form the thick filaments, myosin molecules 
self-assemble into thick bipolar structures, with the 
myosin heads protruding at both ends of a bare 
region in the center (Figure 35.18A). Approximately 
500 head domains line the surface of each thick fila- 
ment. Each head-rich region associates with two 
actin filaments, one on each side of the myosin mol- 
ecules (Figure 35.18B). The interaction of individual 
myosin heads with actin units creates the sliding 
force that gives rise to muscle contraction. 


FIGURE 35.16 Sarcomere. (A) Structure of muscle cell and 
myofibril containing sarcomeres. (B) Electron micrograph of a 
longitudinal section of a skeletal-muscle myofibril showing a single 
sarcomere. (C) Schematic representation of the sarcomere 
corresponding to the regions in the micrograph. [(B) Courtesy of 
Dr. Hugh Huxley.] 


Tropomyosin and the troponin complex regulate this sliding in response 
to nerve impulses. Under resting conditions, tropomyosin blocks the inti- 
mate interaction between myosin and actin. A nerve impulse leads to an 
increase in calcium ion concentration within the muscle cell. A component 
of the troponin complex senses the increase in Ca*t and, in response, 
relieves the inhibition of myosin—actin interactions by tropomyosin. 

How does the myosin reaction cycle apply to muscle contraction? Recall 
that hundreds of head domains project from the ends of each thick filament. 
The head domains are paired in myosin dimers, but the two heads within 
each dimer act independently. Actin filaments associate with each head-rich 
region, with the barbed ends of actin toward the Z line. In the presence of 
normal levels of ATP, most of the myosin heads are detached from actin. 
Each head can independently hydrolyze ATP, bind to actin, release P., and 
undergo its power stroke. Because few other heads are attached, the actin 
filament is relatively free to slide. Each head cycles approximately five 
times per second with a movement of 110 A per cycle. However, when hun- 
dreds of heads are interacting with the same actin filament, the overall rate 
of movement of myosin relative to the actin filament may reach 80,000 A 
per second, allowing a sarcomere to contract from its fully relaxed to its fully 
contracted form rapidly. Having many myosin heads briefly and indepen- 
dently attaching and moving an actin filament allows for much greater 
speed than could be achieved by a single motor protein. 
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FIGURE 35.18 Thick filament. (A) An electron micrograph of a reconstituted thick filament 
reveals the presence of myosin head domains at each end and a relatively narrow central 
region. A schematic view below shows how myosin molecules come together to form the 
thick filament. (B) A diagram showing the interaction of thick and thin filaments in skeletal- 
muscle contraction. [(A, top) Courtesy of Dr. Hugh Huxley.] 
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35.2 Myosin and Actin 


FIGURE 35.17 Sliding-filament model. 
Muscle contraction depends on the motion 
of thin filaments (blue) relative to thick 
filaments (red). [Information from H. E. Huxley. 
The mechanism of muscular contraction. 
Copyright © 1965 by Scientific American, 

Inc. All rights reserved.] 
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Number of light-chain binding sites 


FIGURE 35.19 Myosin lever-arm 

length. Examination of the rates of actin 
movement supported by a set of myosin 
mutants with different numbers of light- 
chain binding sites revealed a linear 
relation; the greater the number of light- 
chain binding sites (and, hence, the longer 
the lever arm), the faster the sliding 
velocity. [Data from T. Q. P. Uyeda, 

P. D. Abramson, and J. A. Spudich, Proc. Natl. 
Acad. Sci. U. S. A. 93:4459-4464, 1996.] 
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FIGURE 35.20 Microtubule structure. 
Schematic views of the helical structure of 
a microtubule. a-Tubulin is shown in dark 
red and B-tubulin in light red. (A) Top view. 
(B) Side view. 
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The length of the lever arm determines motor velocity 


A key feature of myosin motors is the role of the lever arm as an amplifier. 
The lever arm amplifies small structural changes at the nucleotide-binding 
site to achieve the 110-A movement along the actin filament that takes place 
in each ATP hydrolysis cycle. A strong prediction of the mechanism pro- 
posed for the movement of myosin along actin is that the length traveled per 
cycle should depend on the length of this lever arm. Thus, the length of the 
lever arm should influence the overall rate at which actin moves relative to 
a collection of myosin heads. 

This prediction was tested with the use of mutated forms of myosin with 
lever arms of different lengths. The lever arm in muscle myosin includes 
binding sites for two light chains (Section 35.1). Thus, investigators short- 
ened the lever arm by deleting the sequences that correspond to one or both 
of these binding sites. They then examined the rates at which actin filaments 
were transported along collections of these mutated myosins (Figure 35.19). 
As predicted, the rate decreased as the lever arm was shortened. A mutated 
form of myosin with an unusually long lever arm was generated by inserting 
23 amino acids corresponding to the binding site for an additional regulatory 
light chain. Remarkably, this form was found to support actin movement that 
was faster than that of the wild-type protein. These results strongly support the 
proposed role of the lever arm in contributing to myosin motor activity. 


35.3 Kinesin and Dynein Move Along Microtubules 


In addition to actin, the cytoskeleton includes other components, notably 
intermediate filaments and microtubules. Microtubules serve as tracks for 
two classes of motor proteins—namely, kinesins and dyneins. Kinesins 
moving along microtubules usually carry cargo such as organelles and vesi- 
cles from the center of a cell to its periphery. Dyneins are important in slid- 
ing microtubules relative to one another during the beating of cilia and 
flagella on the surfaces of some eukaryotic cells. Additionally, dynein car- 
ries cargo from the cell periphery to the cell center. 


ee Some members of the kinesin family are crucial to the transport of 
organelles and other cargo to nerve endings at the peripheries of neu- 
rons. It is not surprising, then, that mutations in these kinesins can lead to 
nervous system disorders. For example, mutations in a kinesin called KIF1B 
can lead to the most common peripheral neuropathy (weakness and pain in 
the hands and feet), Charcot-Marie-Tooth disease, which affects 1 in 2500 
people. A glutamine-to-leucine mutation in the P-loop of the motor domain 
of this kinesin has been found in some affected persons. Knockout mice with 
a disruption of the orthologous gene have been generated. Mice heterozygous 
for the disruption show symptoms similar to those observed in human 
beings; homozygotes die shortly after birth. Mutations in other kinesin genes 
have been linked to human spastic paraplegia. In these disorders, defects in 
kinesin-linked transport may impair nerve function directly, and the decrease 
in the activity of specific neurons may lead to other degenerative processes. 


Microtubules are hollow cylindrical polymers 


Microtubules are a major component of the cytoskeleton. They are built 
from two kinds of homologous 50-kDa subunits, a- and B-tubulin, which 
assemble in a helical array of alternating tubulin types to form the wall of a 
hollow cylinder (Figure 35.20). Alternatively, a microtubule can be regarded 
as 13 protofilaments that run parallel to its long axis. The outer diameter of 


a microtubule is 30 nm, much larger than that of actin (8 nm). Like actin, 
microtubules are polar structures. The minus end of a microtubule is 
anchored near the center of a cell, whereas the plus end extends toward the 
cell surface. 

Microtubules are also key components of cilia and flagella present on 
some eukaryotic cells. For example, sperm propel themselves through the 
motion of flagella containing microtubules. The microtubules present in 
these structures adopt a common architecture (Figure 35.21). A bundle of 
microtubules called an axoneme is surrounded by a membrane contiguous 
with the plasma membrane. The axoneme is composed of a peripheral 
group of nine microtubule pairs surrounding two singlet microtubules. This 
recurring motif is often called a 9 + 2 array. Dynein drives the motion of one 
member of each outer pair relative to the other, causing the overall structure 
to bend. 

Microtubules are important in determining the shapes of cells and in 
separating daughter chromosomes in mitosis. They are highly dynamic 
structures that grow through the addition of a- and B-tubulin to the ends of 
existing structures. Like actin, tubulins bind and hydrolyze nucleoside tri- 
phosphates, although, for tubulin, the nucleotide is GTP rather than ATP. 
The critical concentration for the polymerization of the GTP forms of tubu- 
lin is lower than that for the GDP forms. Thus, a newly formed microtubule 
consists primarily of GTP-tubulins. Through time, the GTP is hydrolyzed 
to GDP. The GDP-tubulin subunits in the interior length of a microtubule 
remain stably polymerized, whereas GDP subunits exposed at an end have 
a strong tendency to dissociate. Marc Kirschner and Tim Mitchison found 
that some microtubules in a population lengthen while others simultane- 
ously shorten. This property, called dynamic instability, arises from random 
fluctuations in the number of GTP- or GDP-tubulin subunits at the plus 
end of the polymer. The dynamic character of microtubules is crucial for 
processes such as mitosis, which require the assembly and disassembly of 
elaborate microtubule-based structures such as spindle fibers. 


The structure of a tubulin dimer was determined 

at high resolution by electron crystallographic 
methods (Figure 35.22). As expected from their 40% 
sequence identity, a- and B-tubulin have very similar 
three- dimensional structures. Further analysis revealed 
that the tubulins are members of the P-loop NTPase 
family and contain a nucleotide-binding site adjacent to 
the P-loop. Tubulins are present only in eukaryotes, 
although a prokaryotic homolog has been found. 
Sequence analysis identified a prokaryotic protein called 
FtsZ (for filamentous temperature-sensitive mutant Z) 
that is quite similar to the tubulins. The homology was 
confirmed when the structure was determined by x-ray 
crystallography. Interestingly, this protein participates in 
bacterial cell division, forming ring-shaped structures at 
the constriction that arises when a cell divides. These 
observations suggest that tubulins may have evolved 
from an ancient cell-division protein. 
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FIGURE 35.21 Microtubule arrangement. 
Electron micrograph of a cross section of a 
flagellar axoneme shows nine microtubule 
doublets surrounding two singlets. 
[Courtesy of Dr. Joel Rosenbaum.] 
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=e The continual lengthening and shortening of 
microtubules is essential to their role in cell divi- 
sion. Taxol, a compound isolated from the bark of the 
Pacific yew tree, was discovered through its ability to 
interfere with cell proliferation. 


h& FIGURE 35.22 Tubulin. Microtubules can be viewed as an 
assembly of a-tubulin—B-tubulin dimers. The structures of a-tubulin 
and B-tubulin are quite similar. Notice that each includes a P-loop 
NTPase domain (shown in purple) and a bound guanine 
nucleotide. [Drawn from 1JFF.pdb.] 
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FIGURE 35.23 Monitoring movements 
mediated by kinesin. (A) The movement 
of beads or vesicles, carried by individual 
kinesin dimers along a microtubule, can be 
directly observed. (B) A trace shows the 
displacement of a bead carried by a kinesin 
molecule. Multiple steps are taken in the 
6-s interval. The average step size is about 
8 nm (80 A). [(B) Data from K. Svoboda et al., 


Nature 365:721-727, 1993.] 
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Taxol 


Taxol binds to microtubules and stabilizes the polymerized form. Taxol 
and its derivatives have been developed as anticancer agents because they 
preferentially affect rapidly dividing cells, such as those in tumors. 


Kinesin motion is highly processive 


Kinesins are motor proteins that move along microtubules. We have seen 
that muscle myosin moves along actin filaments by a process in which actin 
is released in each cycle; a myosin head group acting independently dissoci- 
ates from actin after every power stroke. In contrast, when a kinesin mole- 
cule moves along a microtubule, the two head groups of the kinesin molecule 
operate in tandem: one binds, and then the next one does. A kinesin mole- 
cule may take many steps before both head groups are dissociated at the 
same time. In other words, the motion of kinesin is highly processive. 
Single-molecule measurements allow processive motion to be observed 
(Figure 35.23). A single kinesin molecule will typically take 100 or more 
steps toward the plus end of a microtubule in a period of seconds before the 
molecule becomes detached from the microtubule. These measurements 
also revealed that the average step size is approximately 80 A, a value that 
corresponds to the distance between consecutive a- or B-tubulin subunits 
along each protofilament. 

An additional fact is crucial to the development of a mechanism for kine- 
sin motion—namely, that the addition of ATP strongly increases the affinity 
of kinesin for microtubules. This behavior stands in contrast with the 
behavior of myosin; ATP binding to myosin promotes its dissociation from 
actin. Do these differences imply that kinesin and myosin operate by com- 
pletely different mechanisms? Indeed not. Kinesin-generated movement 
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appears to proceed by a mechanism that is quite similar to that used by 
myosin (Figure 35.24). Let us begin with a two-headed kinesin molecule in 
its ADP form, dissociated from a microtubule. Recall that the neck linker 
binds the head domain when ATP is bound and is released when ADP is 
bound. The initial interaction of one of the head domains with a tubulin 
dimer on a microtubule stimulates the release of ADP from this head 
domain and the subsequent binding of ATP. The binding of ATP triggers 
a conformational change in the head domain that leads to two important 
events. First, the affinity of the head domain for the microtubule increases, 
essentially locking this head domain in place. Second, the neck linker binds 
to the head domain. This change, transmitted through the coiled-coil 
domain that connects the two kinesin monomers, repositions the other 
head domain. In its new position, the second head domain 1s close to a sec- 
ond tubulin dimer, 80 A along the microtubule in the direction of the plus 
end. Meanwhile, the intrinsic ATPase activity of the first head domain 
hydrolyzes the ATP to ADP and P.. When the second head domain binds 
to the microtubule, the first head releases ADP and binds ATP. Again, 
ATP binding favors a conformational change that pulls the first domain 
forward. This process can continue for many cycles until, by chance, both 
head domains are in the ADP form simultaneously and kinesin dissociates 
from the microtubule. Because of the relative rates of the component 
reactions, a simultaneous dissociation takes place approximately every 
100 cycles. Thus, kinesin “walks” hand over hand along the microtubule 
for approximately 100 steps before both “hands” simultaneously let go. 
Although recent studies have led to a structural model for the dynein power 
stroke, the mechanism by which it works is unknown and an interesting 
research frontier. 

Kinesin hydrolyzes ATP at a rate of approximately 80 molecules per sec- 
ond. Thus, given the step size of 80 A per molecule of ATP, kinesin moves 
along a microtubule at a speed of 6400 A per second. This rate is considerably 
lower than the maximum rate for myosin, which moves relative to actin at 
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FIGURE 35.24 Kinesin moving along a 
microtubule. (1) One head of a two- 
headed kinesin molecule, initially with both 
heads in the ADP form, binds to a 
microtubule. (2) The release of ADP and the 
binding of ATP results in a conformational 
change that locks the head to the 
microtubule and pulls the neck linker 
(orange) to the head domain, throwing the 
second domain toward the plus end of the 
microtubule. (8) ATP undergoes hydrolysis 
while the second head interacts with the 
microtubule. (4) The exchange of ATP for 
ADP in the second head pulls the first head 
off the microtubule, releasing P; and moving 
the first domain along the microtubule. 

(5) The cycle repeats, moving the kinesin 
dimer farther down the microtubule. 
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FIGURE 35.25 Bacterial flagella. Electron 
micrograph of Salmonella typhimurium 
shows flagella directed backwards, where 
they can form a bundle under appropriate 
circumstances. [Science Source.] 


hk FIGURE 35.26 Structure of flagellin. 
A bacterial flagellum is a helical polymer of 
the protein flagellin. Notice that each subunit 
corresponds to a bent structure with a 
relatively flat surface facing the hollow core 
of the flagellum. [Drawn from 1101 .pdb.] 


80,000 A per second. Recall, however, that myosin movement depends on the 
independent action of hundreds of different head domains working along the 
same actin filament, whereas the movement of kinesin is driven by the proces- 
sive action of kinesin head groups working in pairs. Muscle myosin evolved to 
maximize the speed of the motion, whereas kinesin functions to achieve 
steady, but slower, transport in one direction along a filament. 


35.4 A Rotary Motor Drives Bacterial Motion 


In 1s, a motile bacterium can move approximately 25 jm, or about 10 body 
lengths. A human being sprinting at a proportional rate would complete 
the 100-meter dash in slightly more than 5 s. The motors that power this 
impressive motion are strikingly different from the eukaryotic motors that 
we have seen so far. In the bacterial motor, an element spins around a central 
axis rather than moving along a polymeric track. The direction of rotation 
can change rapidly, a feature that is central to chemotaxis, the process by 
which bacteria swim preferentially toward an increasing concentration of 
certain useful compounds and away from potentially harmful ones. One 
type of flagellar motor, powered by a Na* gradient, turns at a rate of 
200,000 revolutions per minute. 


Bacteria swim by rotating their flagella 


Bacteria such as Escherichia coli and Salmonella typhimurium swim by rotat- 
ing flagella that lie on their surfaces (Figure 35.25). When the flagella rotate 
in a counterclockwise direction (viewed from outside a bacterium), the 
separate flagella form a bundle that very efficiently propels the bacterium 
through solution. 

Bacterial flagella are polymers approximately 15 nm in diameter and as 
much as 15 wm in length, composed of 53-kDa subunits of a protein called 
flagellin (Figure 35.26). These subunits associate into a helical structure that 
has 5.5 subunits per turn, giving the appearance of 11 protofilaments. Each 
flagellum has a hollow core. Remarkably, flagella form not by growing at 
the base adjacent to the cell body but, instead, by the addition of new 
subunits that pass through the hollow core and add to the free end. Each 
flagellum is intrinsically twisted in a left-handed sense. At its base, each 
flagellum has a rotory motor. 


Flagellum Flagellin 


Proton flow drives bacterial flagellar rotation 


Early experiments by Julius Adler demonstrated that ATP is not required 
for flagellar motion. What powers these rotary motors? The necessary free 
energy is derived from the proton gradient that exists across the plasma 


membrane. The flagellar motor is quite complex, containing as 
many as 40 distinct proteins (Figure 35.27). Five components 
particularly crucial to motor function have been identified through 
genetic studies. MotA is a membrane protein that appears to have 
four transmembrane helices as well as a cytoplasmic domain. 
MotB is another membrane protein with a single trans-membrane 
helix and a large periplasmic domain. Approximately 11 MotA— 
MotB pairs form a ring around the base of the flagellum. The 
proteins FliG, FliM, and FIliN are part of a disc-like structure 
called the MS (membrane and supramembrane) ring, with approx- 
imately 30 FliG subunits coming together to form the ring. The 
three-dimensional structure of the carboxyl-terminal half of FliG 
reveals a wedge-shaped domain with a set of charged amino acids, 
conserved among many species, lying along the thick edge of the 
wedge (Figure 35.28). 

The MotA—MotB pair and FliG combine to create a proton 
channel that drives the rotation of the flagellum. How can pro- 
ton flow across a membrane drive mechanical rotation? We have 
seen such a process earlier in regard to ATP synthase (Section 
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FIGURE 35.27 Flagellar motor. A schematic view of the 
flagellar motor, a complex structure containing as many 
as 40 distinct types of protein. The approximate positions 
of the proteins MotA and MotB (red), FliG (orange), FIiN 
(yellow), and FIiM (green) are shown. 


18.4). Recall that the key to driving the rotation of the y subunit 

of ATP synthase is the a subunit of the F, fragment. This sub- 

unit appears to have two half-channels; protons can move across the 
membrane only by moving into the half-channel from the side of the 
membrane with the higher local proton concentration, binding to a disc- 
like structure formed by the c subunits, riding on this structure as it 
rotates to the opening of the other half-channel, and exiting to the side 
with the lower local proton concentration. Could a similar mechanism 
apply to flagellar rotation? Indeed, such a mechanism was first proposed 
by Howard Berg to explain flagellar rotation before the rotary mechanism 
of ATP synthase was elucidated. Each MotA—MotB pair is conjectured to 
form a structure that has two half-channels; FliG serves as the rotating 
proton carrier, perhaps with the participation of some of the charged resi- 
dues identified in crystallographic studies (Figure 35.29). In this scenario, 
a proton from the periplasmic space passes into the outer half-channel and 
is transferred to an FliG subunit. The MS ring rotates, rotating the flagel- 
lum with it and allowing the proton to pass into the inner half-channel and 
into the cell. Ongoing structural and mutagenesis studies are testing and 
refining this hypothesis. 


hk FIGURE 35.28 Flagellar motor components. Approximately 30 subunits of FliG 
assemble to form part of the MS ring. The ring is surrounded by approximately 11 structures 
consisting of MotA and MotB. Notice that the carboxyl-terminal domain of FliG includes a ridge 
lined with charged residues that may participate in proton transport. [Drawn from 1QC7.pdb.] 
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(A) Outer 
half-channel 


Inner 
half-channel 

FIGURE 35.29 Proton-transport-coupled 
rotation of the flagellum. (A) MotA-MotB 
may form a structure having two half- 
channels. (B) One model for the 
mechanism of coupling rotation to a proton 
gradient requires protons to be taken up 
into the outer half-channel and transferred 
to the MS ring. The MS ring rotates in a 
counterclockwise direction, and the protons 
are released into the inner half-channel. The 
flagellum is linked to the MS ring and so 
the flagellum rotates as well. 
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FIGURE 35.30 Charting a course. This 
projection of the track of an E. coli bacterium 
was obtained with a microscope that 
automatically follows bacterial motion in three 
dimensions. The points show the locations of 
the bacterium at 80-ms intervals. [Information 
from H. C. Berg, Nature 254:389-392, 1975.] 


1028 


Counterclockwise 
rotation of MS ring Yt 


Ht 
Proton uptake through Proton release through 
outer half-channel inner half-channel 


Bacterial chemotaxis depends on reversal of the 
direction of flagellar rotation 


Many species of bacteria respond to changes in their environments by 
adjusting their swimming behavior. Examination of the paths taken is 
highly revealing (Figure 35.30). The bacteria swim in one direction for some 
length of time (typically about a second), tumble briefly, and then set off in 
a new direction. The tumbling is caused by a brief reversal in the direction 
of the flagellar motor. When the flagella rotate counterclockwise, the helical 
filaments form a coherent bundle favored by the intrinsic shape of each fila- 
ment, and the bacterium swims smoothly. When the rotation reverses, the 
bundle flies apart because the screw sense of the helical flagella does not 
match the direction of rotation. Each flagellum then pulls in a different 
direction and the cell tumbles. 

In the presence of a gradient of certain substances such as glucose, bacte- 
ria swim preferentially toward the direction of the higher concentration of 
the substance. Such compounds are referred to as chemoattractants. Bacteria 
also swim preferentially away from potentially harmful compounds such as 
phenol, a chemorepellant. The process of moving in specific directions in 
response to environmental cues is called chemotaxis. In the presence of a 
gradient of a chemoattractant, bacteria swim for longer periods of time with- 
out tumbling when moving toward higher concentrations of the chemoat- 
tractant. In contrast, they tumble more frequently when moving toward 
lower concentrations of the chemoattractant. This behavior is reversed for 
chemorepellants. The result of these actions is a biased random walk that 
facilitates net motion toward conditions more favorable to the bacterium. 

Chemotaxis depends on a signaling pathway that terminates at the fla- 
gellar motor. The signaling pathway begins with the binding of molecules 
to receptors in the plasma membrane (Figure 35.31). In their unoccupied 
forms, these receptors initiate a pathway leading eventually to the phos- 
phorylation of a specific aspartate residue on a soluble protein called CheY. 
In its phosphorylated form, CheY binds to the base on the flagellar motor. 
When bound to phosphorylated CheY, the flagellar motor rotates in a 
clockwise rather than a counterclockwise direction, causing tumbling. 

The binding of a chemoattractant to a surface receptor blocks the signal- 
ing pathway leading to CheY phosphorylation. Phosphorylated CheY 
spontaneously hydrolyzes and releases its phosphoryl group in a process 
accelerated by another protein, CheZ. The concentration of phosphorylated 
CheY drops, and the flagella are less likely to rotate in a clockwise direction. 
Under these conditions, bacteria swim smoothly without tumbling. Thus, 
the reversible rotary flagellar motor and a phosphorylation-based signaling 
pathway work together to generate an effective means for responding to 
environmental conditions. 


Unoccupied 
——> Clockwise 
rotation 
(tumbling) 
Attractant 


Repellant 


Counterclockwise 
rotation 

(smooth 
swimming) 


CheY 


Clockwise 
rotation 
(tumbling) 


—> 


Bacteria sense spatial gradients of chemoattractants by measurements 
separated in time. A bacterium sets off in a random direction and, if the 
concentration of the chemoattractant has increased after the bacterium 
has been swimming for a period of time, the likelihood of tumbling 
decreases and the bacterium continues in roughly the same direction. If 
the concentration has decreased, the tumbling frequency increases and 
the bacterium tests other random directions. The success of this mecha- 
nism once again reveals the power of evolutionary problem solving: many 
possible solutions are tried at random, and those that are beneficial are 
selected and exploited. 


SUMMARY 


35.1 


35.2 


Most Molecular-Motor Proteins Are Members of the P-Loop NTPase 
Superfamily 

Eukaryotic cells contain three families of molecular-motor proteins: 
myosins, kinesins, and dyneins. These proteins move along tracks 
defined by the actin and microtubule cytoskeletons of eukaryotic 
cells, contributing to cell and organismal movement and to the intra- 
cellular transport of proteins, vesicles, and organelles. Despite con- 
siderable differences in size and a lack of similarity detectable at the 
level of amino acid sequence, these proteins are homologous, con- 
taining core structures of the P-loop NTPase family. The ability of 
these core structures to change conformations in response to nucleo- 
side triphosphate binding and hydrolysis is key to molecular-motor 
function. Motor proteins consist of motor domains attached to 
extended structures that serve to amplify the conformational changes 
in the core domains and to link the core domains to one another or to 
other structures. 


Myosins Move Along Actin Filaments 

The motile structure of muscle consists of a complex of myosin and 
actin, along with accessory proteins. Actin, a highly abundant 42-kDa 
protein, polymerizes to form long filaments. Each actin monomer can 
bind either ATP or ADP. A myosin motor domain moves along actin 
filaments in a cyclic manner, beginning with myosin free of bound 
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FIGURE 35.31 Chemotaxis signaling 
pathway. Receptors in the plasma 
membrane initiate a signaling pathway 
leading to the phosphorylation of the CheY 
protein. Phosphorylated CheY binds to the 
flagellar motor and favors clockwise 
rotation. When an attractant binds to the 
receptor, this pathway is blocked, and 
counterclockwise flagellar rotation and, 
hence, smooth swimming result. When a 
repellant binds, the pathway is stimulated, 
leading to an increased concentration of 
phosphorylated CheY and, hence, more- 
frequent clockwise rotation and tumbling. 
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KEY TERMS 


35.3 


35.4 


nucleotides bound to actin: (1) ATP binds to myosin and the myosin 
is released from actin; (2) a reversible conformational change associ- 
ated with the hydrolysis of ATP while still bound to myosin leads to a 
large motion of a lever arm that extends from the motor domain; 
(3) myosin with bound ADP and P. binds to actin; (4) P, is released 
from myosin, resulting in resetting the position of the lever arm and 
moving actin relative to myosin; and (5) the release of ADP returns 
the motor domain to its initial state. The length of the lever arm deter- 
mines the size of the step taken along actin in each cycle. The ability to 
monitor single molecular-motor proteins has provided key tests for 
hypotheses concerning motor function. Muscle contraction entails the 
rapid sliding of thin filaments, composed of actin, relative to thick 
filaments, composed of myosin. The thick filaments consist of multi- 
ple myosin molecules bound together. Each myosin molecule has two 
heads that can bind to actin and move it relative to myosin, driven by 
the hydrolysis of ATP by myosin. Muscle contraction is regulated by 
tropomyosin and the troponin complex. These proteins prevent actin 
and myosin from productively interacting until an increase in calcium 
concentration associated with a nerve impulse results in calcium- 
induced changes in troponin and tropomyosin. 


Kinesin and Dynein Move Along Microtubules 

Kinesin and dynein move along microtubules rather than actin. 
Microtubules are polymeric structures composed of a- and B-tubulin, 
two very similar guanine-nucleotide-binding proteins. Each microtu- 
bule comprises 13 protofilaments with alternating a- and B-tubulin 
subunits. Kinesins move along microtubules by a mechanism quite 
similar to that used by myosin to move along actin but with several 
important differences. First, ATP binding to kinesin favors motor- 
domain binding rather than dissociation. Second, the power stroke is 
triggered by the binding of ATP rather than the release of P.. Finally, 
kinesin motion is processive. The two heads of a kinesin dimer work 
together, taking turns binding and releasing the microtubule, and 
many steps are taken along a microtubule before both heads dissociate. 
Most kinesins move toward the plus end of microtubules. 


A Rotary Motor Drives Bacterial Motion 

Many motile bacteria use rotating flagella to propel themselves. When 
rotating counterclockwise, multiple flagella on the surface of a bactert'um 
come together to form a bundle that effectively propels the bacterium 
through solution. A proton gradient across the plasma membrane, 
rather than ATP hydrolysis, powers the flagellar motor. The mechanism 
for coupling transmembrane proton transport to macromolecular rota- 
tion appears to be similar to that used by ATP synthase. When rotating 
clockwise, the flagella fly apart and the bacterium tumbles. Bacteria 
swim preferentially toward chemoattractants in a process called chemo- 
taxis. When bacteria are swimming in the direction of an increasing 
concentration of a chemoattractant, clockwise flagellar motion predomi- 
nates and tumbling is suppressed, leading to a biased random walk in the 
direction of increasing chemoattractant concentration. 
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chemotaxis (p. 1028) 

CheY (p. 1028) 


1. Diverse motors. Skeletal muscle, eukaryotic cilia, and 
bacterial flagella use different strategies for the conversion 
of free energy into coherent motion. Compare and contrast 
these motility systems with respect to (a) the free-energy 
source and (b) the number of essential components and 
their identity. 


2. You call that slow? At maximum speed, a kinesin mole- 
cule moves at a rate of 6400 A per second. Given the dimen- 
sions of the motor region of a kinesin dimer of approximately 
80 A, calculate its speed in “body lengths” per second. To 
what speed does this body-length speed correspond for an 
automobile 10 feet long? 


3. Heavy lifting. A single myosin motor domain can gener- 
ate a force of approximately 4 piconewtons (4 pN). How 
many times its “body weight” can a myosin motor domain 
lift? Note that 1 newton = 0.22 pounds (100 g). Assume a 
molecular mass of 100 kDa for the motor domain. 


4. Compare and contrast. Describe two similarities and two 
differences between actin filaments and microtubules. 


5. Lighten up. What is the primary role of the light chains 
in myosin? In kinesin? 


6. Rigor mortis. Propose an explanation for the fact that the 
body stiffens after death. 


7. Now you see it, now you don’t. Under certain stable con- 
centration conditions, actin monomers in their ATP form 
will polymerize to form filaments that disperse again into 
free actin monomers over time. Explain. 


8. Helicases as motors. Helicases can use single-stranded 
DNA as tracks. Consider a helicase that moves one base in 
the 3’ — 5’ direction in each cycle. Assuming that the heli- 
case can hydrolyze ATP at a rate of 50 molecules per second 
in the presence of a single-stranded DNA template, calcu- 
late the velocity of the helicase in micrometers per second. 
How does this velocity compare with that of kinesin? 


9. New moves. When bacteria such as E. coli are starved to 
a sufficient extent, they become nonmotile. However, when 
such bacteria are placed in an acidic solution, they resume 
swimming. Explain. 


10. Going straight. Suppose you measure the mean distance 
that an E. coli bacterium moves along a straight path before 
it tumbles over a period of time. Would you expect this 
distance to change in the presence of a gradient of a che- 
moattractant? Would it be longer or shorter? 


11. Hauling a load. Consider the action of a single kinesin 
molecule in moving a vesicle along a microtubule track. The 
force required to drag a spherical particle of radius a at a 
velocity v in a medium having a viscosity 7 is 


F = 67nav 


Suppose that a 2-jm-diameter bead is carried at a velocity 
of 0.6 ums! in an aqueous medium (7n = 0.01 poise = 
0.01gcm!s~1). 


(a) What is the magnitude of the force exerted by the kinesin 
molecule? Express the value in dynes (1 dyne = 1 gems”). 


(b) How much work is performed in 1 s? Express the value 
in ergs (1 erg = 1 dyne cm). 


(c) A kinesin motor hydrolyzes approximately 80 mole- 
cules of ATP per second. What is the energy associated 
with the hydrolysis of this much ATP in ergs? Compare this 
value with the actual work performed. 


12. Unusual strides. A publication describes a kinesin mol- 
ecule that is claimed to move along microtubules with a step 
size of 6 nm. You are skeptical. Why? 


13. The sound of one hand clapping. KIF1A is a motor pro- 
tein that moves toward the plus end of microtubules as a 
monomer. KIF1A has only a single motor domain. What 
additional structural elements would you expect to find in 
the KIF1A structure? 


14. Building blocks. Actin filaments, microtubules, and 
bacterial flagella are all built from small subunits. Describe 
three advantages of assembling long filamentous structures 
from subunits rather than from single, long proteins. 


Mechanism Problem 


15. Backward rotation. On the basis of the proposed struc- 
ture in Figure 35.29 for the bacterial flagellar motor, suggest 
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a pathway for transmembrane proton flow when the flagel- 
lar motor is rotating clockwise rather than counterclockwise. 


Chapter Integration Problem 


16. Smooth muscle. Smooth muscle, in contrast with ske- 
letal muscle, is not regulated by a tropomyosin—-troponin 
mechanism. Instead, vertebrate smooth-muscle contraction 
is controlled by the degree of phosphorylation of its light 
chains. Phosphorylation induces contraction, and dephos- 
phorylation leads to relaxation. Like that of skeletal muscle, 
smooth-muscle contraction is triggered by an increase in 
the cytoplasmic calcium ion level. Propose a mechanism for 
this action of calcium ion on the basis of your knowledge of 
other signal-transduction processes. 


Data Interpretation Problem 


17. Myosin V. An abundant myosin-family member, myo- 
sin V is isolated from brain tissue. This myosin has a number 
of unusual properties. First, on the basis of its amino acid 
sequence, each heavy chain has six tandem binding sites for 
calmodulin-like light chains. Second, it forms dimers but 
not higher-order oligomers. Finally, unlike almost all other 
myosin-family members, myosin V is highly processive. 

The rate of ATP hydrolysis by myosin has been examined 
as a function of ATP concentration, as shown in graph A. 
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(a) Estimate the values of k,,, and K,, for ATP. 


With the use of optical-trap measurements, the motion 
of single myosin V dimers could be followed, as shown in 
graph B. 
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[Based on M. Rief et al., Proc. Natl. Acad. Sci. U. S. A. 97:9482-9486, 
2000.] 


(b) Estimate the step size for myosin V. 


The rate of ADP release from myosin V is found to be 
approximately 13 molecules per second. 


(c) Combine the observations about the amino acid 
sequence of myosin, the observed step size, and the kinetics 
results to propose a mechanism for the processive motion of 
myosin V. 


CHAPTER 


Drug Development 


COOH 


Many drugs are based on natural products. Aspirin (above) 
is a molecule related to a compound isolated from willow 
bark (near left). Extracts of willow bark had long been 
known to have medicinal properties. The active compound 
was isolated, modified, and, beginning in 1899, packaged 
for consumers (far left). [(Far left) AP Images/Bayer. (Near 
left) H. Mark Weidman Photography/Alamy.] 
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OUTLINE 
he development of drugs is among the most important interfaces between 
biochemistry and medicine. Throughout the previous chapters in this 36.1 The Development of Drugs 
book, we have encountered a number of types of proteins—enzymes, recep- Presents Huge Challenges 


tors, and transporters—that serve as targets for a majority of the drugs in 26.9 Dig Candidaiée Gane 
clinical use (Figure 36.1). In most cases, drugs act by binding to these pro- Discovered by Serendipity, 
teins and inhibiting or otherwise modulating their activities. Thus, knowl- Screening, or Design 
edge of these molecules and the pathways in which they participate is crucial 
to drug development. An effective drug is much more than a potent modula- 
tor of its target, however. Drugs must be readily administered to patients, 


36.3 Analyses of Genomes Hold Great 
Promise for Drug Discovery 


preferably as small tablets taken orally, and must survive within the body 36.4 The Clinical Development of 
long enough to reach their targets. Furthermore, to prevent unwanted physi- Drugs Proceeds Through 
ological effects, drugs must not modulate the properties of biomolecules Several Phases 


other than the intended targets. These requirements tremendously limit the 

number of compounds that have the potential to be clinically useful drugs. 
Drugs have been discovered by two fundamentally distinct approaches 
(Figure 36.2). The first approach identifies a substance that has a desirable 
physiological consequence when administered to a human being, to an 
appropriate animal, or to cells. Such substances can be discovered by seren- 
dipity, by the fractionation of plants or other materials known to have 
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Pharmacology 

The science that deals with the discovery, 
chemistry, composition, identification, 
biological and physiological effects, uses, 
and manufacture of drugs. 


Other protein 


targets 
8 Enzymes 


» (Chapters 8 and others) 


Other transporters 


Solute carriers 
(Section 13.3) 


Ligand-gated ion channels 
(Section 13.4) 


Voltage-gated ion channels 
(Section 13.4) 


Other receptors 


G-protein-coupled receptors 


Nuclear receptors (Section 14.1) 


(Section 32.3) 


FIGURE 36.1 The targets of drugs in current use. In this pie chart, currently-used drugs 
that modulate human proteins are sorted by target type: enzymes in green, receptors in blue, 
transporters in red, and other targets in gray. [Data from M. Rask-Andersen et al., Nat. Rev. 
Drug Discov. 10:579-590, 2011] 


medicinal properties, or by screening natural products or other “libraries” 
of compounds. In this approach, a biological effect is known before the 
molecular target is identified. The mode of action of the substance is only 
later identified after substantial additional work. The second approach 
begins with a known molecular target. Compounds are sought, either by 
screening or by designing molecules with desired properties, that bind to 
the target molecule and modulate its properties. After such compounds are 
available, scientists can explore their effects on appropriate cells or organ- 
isms. Many unexpected results may be encountered in this process as the 
complexity of biological systems reveals itself. 

In this chapter, we explore the science of pharmacology. We examine a 
number of case histories that illustrate drug development—including many of 
its concepts, methods, and challenges. We then see how the concepts and tools 
from genomics are influencing approaches to drug discovery. The chapter ends 
with a summary of the clinical trial phases required for drug development. 
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Physiological Molecular 
——_—> 
Compound effect target 


(B) 


Molecular . Compound . Physiological 


target effect 


FIGURE 36.2 Two paths to drug discovery. (A) A compound is discovered to have a 
desirable physiological effect. The molecular target can be identified in a separate step as 
needed. (B) A molecular target is selected first. Drug candidates that bind to the target are 
identified and then examined for their physiological effects. 


36.1 The Development of Drugs Presents Huge Challenges 


Many compounds have significant effects when taken into the body, but 
only a very small fraction of them have the potential to be useful drugs. 
A foreign compound, not adapted to its role in the cell through long evolu- 
tion, must have a range of special properties to function effectively without 


causing serious harm. Let us consider some of the challenges faced by 
drug developers. 


Drug candidates must be potent and selective modulators 
of their targets 


Most drugs bind to specific proteins, usually receptors or enzymes, within 
the body. To be effective, a drug should bind a sufficient number of its 
target proteins when taken at a reasonable dose. One factor in determining 
drug effectiveness is the strength of the interaction between the drug and its 
target. A molecule that binds to some target protein is often referred to as a 
ligand. A ligand-binding curve is shown in Figure 36.3. Ligand molecules 
occupy progressively more target binding sites as ligand concentration 
increases until essentially all of the available sites are occupied. The ten- 
dency of a ligand to bind to its target is measured by the dissociation con- 
stant, K4, defined by the expression 


Ka = [R][L]/[RE] 


where [R] is the concentration of the free receptor, [L] is the concentration of 
the free ligand, and [RL] is the concentration of the receptor—ligand complex. 
The dissociation constant is a measure of the strength of the interaction 
between the drug candidate and the target; the lower the value, the stronger the 
interaction. The concentration of free ligand at which one-half of the binding 
sites are occupied equals the dissociation constant, as long as the concentration 
of binding sites is substantially less than the dissociation constant. 

In many cases, biological assays using living cells or tissues (rather than 
direct enzyme or binding assays) are employed to examine the potency of 
drug candidates. For example, the fraction of bacteria killed by a drug might 
indicate the potency of a potential antibiotic. In these cases, values such as 
EC,, are used. EC,, is the concentration of the drug candidate required to 
elicit 50% of the maximal biological response (Figure 36.4). Similarly, ECg, 
is the concentration required to achieve 90% of the maximal response. In the 
example of an antibiotic, EC,, would be the concentration required to kill 
90% of bacteria exposed to the drug. For drug candidates that are inhibitors, 
the corresponding terms IC, and IC,, are often used to describe the con- 
centrations of the inhibitor required to reduce a response by 50% or 90%, 
relative to its value in the absence of inhibitor, respectively. 

Values such as the IC, and EC., are measures of the potency of a drug 
candidate in modulating the activity of the desired biological target. To 
prevent unwanted effects, often called side effects, ideal drug candidates 
should also be selective. That is, they should not bind biomolecules other 
than the target to any significant extent. Developing such a drug can be 
quite challenging, particularly if the drug target is a member of a large fam- 
ily of evolutionarily related proteins. The degree of selectivity can be 
described in terms of the ratio of the K, values for the binding of the drug 
candidate to any other molecules to the K, value for the binding of the drug 
candidate to the desired target. 

Under physiological conditions, achieving sufficient binding of a target 
site by a drug can be quite challenging. Most drug targets also bind ligands 
normally present in tissues; often, the drug and these ligands will compete for 
binding sites on the target. We encountered this scenario when we considered 
competitive inhibitors in Chapter 8. Suppose that the drug target is an enzyme 
and the drug candidate is competitive with respect to the enzyme’s natural 
substrate. The concentration of the drug candidate necessary to inhibit the 
enzyme will depend on the physiological concentration of the normal sub- 
strate (Figure 36.5). Biochemists Yung-Chi Cheng and William Prusoff 
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FIGURE 36.3 Ligand binding. The titration 
of a receptor, R, with a ligand, L, results 

in the formation of the complex RL. In 
uncomplicated cases, the binding reaction 
follows a simple saturation curve. Half of 
the receptors are bound to ligand when the 
ligand concentration equals the dissociation 
constant, Ky, for the RL complex. 
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FIGURE 36.4 Effective concentrations. 
The concentration of a ligand required to 
elicit a biological response can be quantified 
in terms of ECs, the concentration required 
to give 50% of the maximum response, 

and ECgg, the concentration required to 
give 90% of the maximum response. 
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FIGURE 36.5 Inhibitors compete with 

substrates for enzyme active sites. 

The measured IC,, of a competitive 

inhibitor to its target enzyme depends on 

the concentration of substrate present. 
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FIGURE 36.6 Absorption, distribution, 
metabolism, and excretion (ADME). The 
concentration of a compound at its target 
site (yellow) is affected by the extents 

and rates of absorption, distribution, 
metabolism, and excretion. 
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described the relation between the IC,, of an enzyme inhibitor and its inhibi- 
tion constant K; (analogous to the dissociation constant, K,, of a ligand): 


ICso = Kj (1 + [S]/Km) 


This relation, referred to as the Cheng—Prusoff equation, demonstrates that 
the IC... of a competitive inhibitor will depend on the concentration and the 
Michaelis constant (K,,) for the substrate 5. The higher the concentration 
of the natural substrate, the higher the concentration of drug needed to 
inhibit the enzyme. 


Drugs must have suitable properties to reach their targets 


Thus far, we have focused on the ability of molecules to act on specific tar- 
get molecules. However, an effective drug must also have other characteris- 
tics. It must be easily administered and must reach its target at sufficient 
concentration to be effective. After a drug molecule has entered the body, it 
is acted upon by several processes—absorption, distribution, metabolism, 
and excretion—that will determine the effective concentration of this mol- 
ecule over time (Figure 36.6). A drug’s response to these processes is 
referred to as its ADME (pronounced “add-me’’) properties. 


Administration and absorption. Ideally, a drug can be taken orally as a small 
tablet. An orally administered active compound must be able to survive the 
acidic conditions in the gut and then be absorbed through the intestinal epi- 
thelium. Thus, the compound must be able to pass through cell membranes at 
a significant rate. Larger molecules such as proteins cannot be administered 
orally, because they often cannot survive the acidic conditions in the stomach 
and, if they do, are not readily absorbed. Even many small molecules are not 
absorbed well; they may be too polar and not pass through cell membranes 
easily, for example. The ability to be absorbed is often quantified in terms of 
the oral bioavailability. This quantity is defined as the ratio of the peak concen- 
tration of a compound given orally to the peak concentration of the same dose 
injected directly into the bloodstream. Bioavailability can vary considerably 
from species to species, and so results from animal studies may be difficult to 
apply to human beings. Despite this variability, some useful generalizations 
have been made. One powerful set consists of Lipinski’s rules. 
Lipinski’s rules tell us that poor absorption is likely when 


1. the molecular weight is greater than 500. 


2. the number of hydrogen-bond donors is greater than 5. 


3. the number of hydrogen-bond acceptors is greater than 10. 


4. the partition coefficient [measured as log(P)] is greater than 5. 


The partition coefficient is a way to measure the tendency of a 
molecule to dissolve in membranes, which correlates with its 
ability to dissolve in organic solvents. It is determined by allow- 
ing a compound to equilibrate between water and an organic 
phase, n-octanol. The log(P) value is defined as log,, of the ratio 
of the concentration of a compound in n-octanol to the concen- 
tration of the compound in water. For example, if the concentra- 
tion of the compound in the n-octanol phase is 100 times that in 
the aqueous phase, then log(P) is 2. Although the ability of a 
drug to partition in organic solvents is ideal, because it implies 
that the compound can penetrate membranes, a log(P) value that 
is too high suggests that the molecule may be poorly soluble in an 
aqueous environment. 

Morphine, for example, satisfies all of Lipinski’s rules and has moderate 
bioavailability (Figure 36.7). A drug that violates one or more of these rules 
may still have satisfactory bioavailability. Nonetheless, these rules serve as 
guiding principles for evaluating new drug candidates. 


bond 


Distribution. Compounds taken up by intestinal epithelial cells can 
pass into the bloodstream. However, hydrophobic compounds and 
many others do not freely dissolve in the bloodstream. These com- 
pounds bind to proteins, such as albumin (Figure 36.8), that are 
abundant in the blood serum and by this means are carried through- 
out the circulatory system. 

When a compound has reached the bloodstream, it is distributed 
to different fluids and tissues, which are often referred to as compart- 
ments. Some compounds are highly concentrated in their target com- 
partments, either by binding to the target molecules themselves or by 
other mechanisms. Other compounds are distributed more widely 
(Figure 36.9). An effective drug will reach the target compartment in 
sufficient quantity; the concentration of the compound in the target 
compartment is reduced whenever the compound is distributed into 
other compartments. 

Some target compartments are particularly hard to reach. Many 
compounds are excluded from the central nervous system by the 
blood-brain barrier, the tight junctions between endothelial cells that 
line blood vessels within the brain and spinal cord. 


Metabolism. A final challenge to a potential drug molecule is to evade the 
body’s defenses against foreign compounds. Many such compounds (called 
xenobiotic compounds) are released from the body in the urine or stool, often 
after having been metabolized to aid in excretion. This drug metabolism poses 
a considerable threat to drug effectiveness because the concentration of the 
desired compound decreases as it is metabolized. Thus, a rapidly metabo- 
lized compound must be administered more frequently or at higher doses. 
Two of the most common pathways in xenobiotic metabolism are oxi- 
dation and conjugation. Oxidation reactions can aid excretion in at least 
two ways: by increasing water solubility, and thus ease of transport, and 
by introducing functional groups that participate in subsequent metabolic 
steps. These reactions are often promoted by cytochrome P450 enzymes in 
the liver (Section 26.4). The human genome encodes more than 50 differ- 
ent P450 isozymes, many of which participate in xenobiotic metabolism. 


Two hydrogen- 


donors 


Four hydrogen- 
<— bond 
acceptors 


Morphine (C,7H1903N) 


Molecular weight = 285 
log(P) = 1.27 


FIGURE 36.7 Lipinski’s rules applied to 
morphine. Morphine satisfies all of Lipinski’s 
rules and has an oral bioavailability in human 
beings of 33%. 


FIGURE 36.8 Structure of the drug carrier 
human serum albumin. Seven hydrophobic 
molecules (in red) are shown bound to the 
molecule. [Drawn from 1BKE.pdb.] 
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FIGURE 36.9 Distribution of the drug 
fluconazole. After having been taken in, 
compounds distribute themselves to various 
organs within the body. The distribution of 
he antifungal agent fluconazole has been 
monitored through the use of positron 
emission tomography (PET) scanning. These 
images were taken of a healthy human 
volunteer 90 minutes after injection of a 
dose of 5 mg kg~" of fluconazole containing 
race amounts of fluconazole labeled with 
he positron-emitting isotope '8F. [Reprinted 
by permission from Macmillan Publishers Ltd: 
Nature Reviews Drug Discovery, Rudin, M., 
and Weissleder, R., “Molecular imaging in drug 
discovery and development,” 2: 2, p. 123-131, 
Copyright 2003.] 
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A typical reaction catalyzed by a P450 isozyme is the hydroxylation of 
ibuprofen (Figure 36.10). 
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FIGURE 36.10 P450 conversion of 

Pte ESOC) pee EOF me Conjugation is the addition of particular groups to the xenobiotic com- 

primarily in the liver, catalyze xenobiotic : f A 

metabolic reactions such as hydroxylation. pound. Common groups added are glutathione (Section 20.5), glucuronic 

The reaction introduces an oxygen atom acid, and sulfate (Figure 36.11). These additions often increase water solu- 

derived from molecular oxygen. bility and provide labels that can be recognized to target excretion. Examples 
of conjugation include the addition of glutathione to the anticancer drug 
cyclophosphamide, the addition of glucuronidate to the analgesic morphine, 
and the addition of a sulfate group to the hair-growth stimulator minoxidil. 
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Cyclophosphamide-glutathione conjugate Morphine glucuronidate Minoxidil sulfate 


Interestingly, the sulfation of minoxidil produces a compound that is more 
active in stimulating hair growth than is the unmodified compound. Thus, 
the metabolic products of a drug, though usually less active than the drug, 
can sometimes be more active. 
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3’-Phosphoadenosine-5’-phosphosulfate (PAPS) 


Note that an oxidation reaction often precedes conjugation because the 
oxidation reaction can generate modifications, such as the hydroxyl group, to 
which groups such as glucuronic acid can be attached. The oxidation reactions 
of xenobiotic compounds are often referred to as phase I transformations, and 
the conjugation reactions are referred to as phase II transformations. These 
reactions take place primarily in the liver. Because blood flows from the intes- 
tine directly to the liver through the portal vein, xenobiotic metabolism often 
alters drug compounds before they ever reach full circulation. This first-pass 
metabolism can substantially limit the availability of compounds taken orally. 


Excretion. After compounds have entered the bloodstream, they can be 
removed from circulation and excreted from the body by two primary path- 
ways. First, they can be absorbed through the kidneys and excreted in the urine. 
In this process, the blood passes through glomeruli, networks of fine capillaries 
in the kidney that act as filters. Compounds with molecular weights less than 
approximately 60,000 pass though the glomeruli. Many of the water molecules, 
glucose molecules, nucleotides, and other low-molecular-weight compounds 
that pass through the glomeruli are reabsorbed into the bloodstream, either by 
transporters that have broad specificities or by the passive transfer of hydro- 
phobic molecules through membranes. Drugs and metabolites that pass 
through the first filtration step and are not reabsorbed are excreted. 

Second, compounds can be actively transported into bile, a process that 
takes place in the liver. After concentration in the gall bladder, bile flows 
into the intestine. In the intestine, the drugs and metabolites can be excreted 
through the stool, reabsorbed into the bloodstream, or further degraded by 


FIGURE 36.11 Conjugation reactions. 
Compounds that have appropriate groups 
are often modified by conjugation reactions. 
Such reactions include the addition of 
glutathione (top), glucuronic acid (middle), 
or sulfate (bottom). The conjugated 

product is shown boxed. 
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FIGURE 36.12 Enterohepatic cycling. 
Some drugs can move from the blood 
circulation to the liver, into the bile, into 
the intestine, to the liver, and back into 
circulation. This cycling decreases the rate 
of drug excretion. 
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FIGURE 36.13 Half-life of drug excretion. 
In the case shown, the concentration of a 
drug in the bloodstream decreases to one- 
half of its value in a period of time, t, 5, 
referred to as its half-life. 
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Cytochrome P450 
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digestive enzymes. Sometimes, compounds are recycled from the blood- 
stream into the intestine and back into the bloodstream, a process referred 
to as enterohepatic cycling (Figure 36.12). This process can significantly 
decrease the rate of excretion of some compounds because they escape from 
an excretory pathway and reenter the circulation. 

The kinetics of compound excretion is often complex. In some cases, a 
fixed percentage of the remaining compound is excreted over a given period of 
time (Figure 36.13). This pattern of excretion results in exponential loss of the 
compound from the bloodstream that can be characterized by a half-life (t, ,,). 
The half-life is the fixed period of time required to eliminate 50% of the 
remaining compound. It is a measure of how long an effective concentration of 
the compound remains in the system after administration. As such, the half- 
life is a major factor in determining how often a drug must be taken. A drug 
with a long half-life might need to be taken only once per day, whereas a drug 
with a short half-life might need to be taken three or four times per day. 


Toxicity can limit drug effectiveness 


An effective drug must not be so toxic that it seriously harms the person who 
takes it. A drug may be toxic for any of several reasons. First, it may modu- 
late the target molecule itself too effectively. For example, the presence of too 
much of the anticoagulant drug coumadin can result in dangerous, uncon- 
trolled bleeding and death. Second, the compound may modulate the prop- 
erties of proteins that are distinct from, but related to, the target molecule 
itself. Compounds that are directed to one member of a family of enzymes or 
receptors often bind to other family members. For example, an antiviral 
drug directed against viral proteases may be toxic if it also inhibits proteases 
normally present in the body such as those that regulate blood pressure. 

A compound may also be toxic if it modulates the activity of a protein unre- 
lated to its intended target. For example, many compounds block ion channels 
such as the potassium channel hERG (Section 13.4), causing potentially life- 
threatening disturbances of cardiac rhythm. To prevent cardiac side effects, 
many compounds are screened for their ability to block such channels. 

Finally, even if a compound is not itself toxic, its metabolic by-products may 
be. Phase I metabolic processes can generate damaging reactive groups in prod- 
ucts. An important example is liver toxicity observed with large doses of the 
common pain reliever acetaminophen (Figure 36.14). A particular cytochrome 
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FIGURE 36.14 Acetaminophen toxicity. A minor metabolic product of acetaminophen is 
N-acetyl-p-benzoquinone imine. This metabolite is conjugated to glutathione. Large doses of 
acetaminophen can deplete liver glutathione stores. 


P450 isozyme oxidizes acetaminophen to N-acetyl-p-benzoquinone imine. 1041 
The resulting compound is conjugated to glutathione. With large doses, how- 36.2 Discovery of Drug Candidates 
ever, the liver concentration of glutathione drops dramatically, and the liver 
is no longer able to protect itself from this reactive compound and others. 
Initial symptoms of excessive acetaminophen include nausea and vomiting. 
Within 24 to 48 hours, symptoms of liver failure may appear. Acetaminophen 
poisoning accounts for about 35% of cases of severe liver failure in the United 
States. A liver transplant is often the only effective treatment. 

The toxicity of a drug candidate can be described in terms of the therapeu- 
tic index. This measure of toxicity is determined through animal tests, usually 
with mice or rats. The therapeutic index is defined as the ratio of the dose of 
a compound that is required to kill one-half of the animals (referred to as the 
LD.,, for “lethal dose’’) to a comparable measure of the effective dose, usually 
the EC,,. Thus, if the therapeutic index is 1000, then lethality is significant 
only when 1000 times the effective dose is administered. Analogous indices 
can provide measures of toxicity less severe than lethality. 

Many compounds have favorable properties in vitro, yet fail when 
administered to a living organism because of difficulties with ADME and 
toxicity. Expensive and time-consuming animal studies are required to 
verify that a drug candidate is not toxic, yet differences between animal spe- 
cies in their response can confound decisions about moving forward with a 
compound toward human studies. One hope is that, with more understand- 
ing of the biochemistry of these processes, scientists can develop computer- 
based models to replace or augment animal tests. Such models would need 
to accurately predict the fate of a compound inside a living organism from 
its molecular structure or other properties that are easily measured in the 
laboratory without the use of animals. 


36.2 Drug Candidates Can Be Discovered by Serendipity, 
Screening, or Design 


Traditionally, many drugs were discovered by serendipity, or chance observa- 
tion. More recently, drugs have been discovered by screening collections of 
natural products or other compounds for molecules that have desired medici- 
nal properties. Alternatively, scientists have designed specific drug candidates 
by using their knowledge about a preselected molecular target. We will exam- 
ine several examples of each of these pathways to reveal common principles. 


Serendipitous observations can drive drug development 


Perhaps the most well-known observation in the history of drug develop- 

ment is Alexander Fleming’s chance observation in 1928 that colonies of the 

bacterium Staphylococcus aureus died when they were adjacent to colonies of 

the mold Penicillium notatum. Spores of the mold had landed accidentally on 

plates growing the bacteria. Fleming soon realized that the mold produced oO 
a substance that could kill disease-causing bacteria. This discovery led to a R it 


fundamentally new approach to the treatment of bacterial infections. HN a S 

Howard Florey and Ernest Chain developed a powdered form of the sub- ] its 
Sa : ia cae B-Lactam tn, 

stance, termed penicillin, that became a widely used antibiotic in the 1940s. ring N “CHs 

When the structure of penicillin was elucidated in 1945, it was found to O . 

contain a four-membered B-lactam ring. This unusual feature is key to the coo™ 

antibacterial function of penicillin, as noted earlier (Section 8.5). Penicillin 


Three steps were crucial to fully capitalize on Fleming’s discovery. First, an 
industrial process was developed for the production of penicillin from Penicillium 
mold on a large scale. Second, penicillin and its derivatives were chemically 
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FIGURE 36.15 Mechanism of cell-wall 
biosynthesis disrupted by penicillin. 

A transpeptidase enzyme catalyzes 

he formation of cross-links between 
peptidoglycan groups. In the case shown, 
he transpeptidase catalyzes the linkage of 
b-alanine at the end of one peptide chain 
o the amino acid diaminopimelic acid 
DAP) on another peptide chain. The 
diaminopimelic acid linkage (bottom left) is 
ound in Gram-negative bacteria such as 
E. coli. Linkages of glycine-rich peptides are 
ound in Gram-positive bacteria. Penicillin 
inhibits the action of the transpeptidase; so 
bacteria exposed to the drug have weak 
cell walls that are susceptible to lysis. 
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synthesized. The availability of synthetic penicillin derivatives opened the way 
for scientists to explore the relations between structure and function. Many 
such penicillin derivatives have found widespread use in medicine. Finally, in 
1965, Jack Strominger and James Park independently determined that penicil- 
lin exerts its antibiotic activity by blocking a critical transpeptidase reaction in 
bacterial cell-wall biosynthesis (Figure 36.15), as introduced in Section 8.5. 

Many other drugs have been discovered by serendipitous observations. 
Chlorpromazine (Thorazine), a drug used to treat psychosis, was discovered 
in the course of investigations directed toward the treatment of shock in sur- 
gical patients. In 1952, French surgeon Henri Laborit noticed that, after 
taking the compound, his patients were remarkably calm. This observation 
suggested that chlorpromazine could benefit psychiatric patients, and, 
indeed, the drug has been used for many years to treat patients with schizo- 
phrenia and bipolar disorder. The drug does have significant side effects, and 
its use has been largely superseded by more recently developed drugs. 

Chlorpromazine acts by binding to receptors for the neurotransmitter dopa- 
mine and blocking them. Dopamine D2 receptors are the targets of many other 
psychoactive drugs. In the search for drugs with more-limited side effects, stud- 
ies have been performed to correlate drug effects with biochemical parameters 
such as dissociation constants and binding and release rate constants. 

A more recent example of a drug discovered by chance observation is 
sildenafil (Viagra). This compound was developed as an inhibitor of phos- 
phodiesterase 5 (PDE5), an enzyme that catalyzes the hydrolysis of cGMP to 
GMP (Figure 36.16). The compound was intended as a treatment for hyper- 
tension and angina because cGMP plays a central role in the relaxation of 
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smooth-muscle cells in blood vessels (Figure 36.17). The inhibition of PDE5 
was expected to increase the concentration of cGMP by blocking the path- 
way for its degradation. In the course of early clinical trials in Wales, some 
men reported unusual penile erections. Whether this chance observation by 
a few men was due to the compound or to other effects was unclear. However, 
the observation made some biochemical sense because smooth-muscle relax- 
ation due to increased cGMP levels had been discovered to play a role in 
penile erection. Subsequent clinical trials directed toward the evaluation of 
sildenafil for erectile dysfunction were successful. This account testifies to 
the importance of collecting comprehensive information from clinical-trial 
participants. In this case, incidental observations led to a new treatment for 
erectile dysfunction and a multibillion-dollar-per-year drug market. 


Natural products are a valuable source of drugs and drug leads 


No drug is as widely used as aspirin. Observers at least as far back as 
Hippocrates (~400 B.C.) have noted the use of extracts from the bark and 
leaves of the willow tree for pain relief. In 1829, a mixture called salicin was 
isolated from willow bark. Subsequent analysis identified salicylic acid as the 
active component of this mixture. Salicylic acid was formerly used to treat pain, 
but this compound often irritated the stomach. Several investigators attempted 
to find a means to neutralize salicylic acid. Felix Hoffmann, a chemist working 
at the German company Bayer, developed a less-irritating derivative by treat- 
ing salicylic acid with a base and acety] chloride. This derivative, acetylsalicylic 
acid, was named aspirin from “a” for acetyl chloride, “spir” for Spiraea ulmaria 
(meadowsweet, a flowering plant that also contains salicylic acid), and “in” 
(a common ending for drugs). Each year, approximately 35,000 tons of aspirin 
are taken worldwide, nearly the weight of the Titanic. 

As discussed in Chapter 12, the acetyl group in aspirin is transferred to the 
side chain of a serine residue that lies along the path to the active site of the 
cyclooxygenase component of prostaglandin H, synthase (Figure 12.24). In 
this position, the acetyl group blocks access to the active site. Thus, even though 
aspirin binds in the same pocket on the enzyme as salicylic acid, the acetyl 
group of aspirin dramatically increases its effectiveness as a drug. This account 
illustrates the value of screening extracts from plants and other materials that 
are believed to have medicinal properties for active compounds. The large num- 
ber of herbal and folk medicines are a treasure trove of new drug leads. 

Let us consider another example, which has also had a significant impact 
on the clinical management of individuals with cardiovascular disease. More 
than 100 years ago, a fatty, yellowish material was discovered on the arterial 
walls of patients who had died of vascular disease. The presence of the material 
was termed atheroma from the Greek word for porridge. This material proved 
to be cholesterol. The Framingham heart study, initiated in 1948, documented 


FIGURE 36.16 Sildenafil, a mimic of 
cGMP. Sildenafil was designed to resemble 
cGMP, the substrate of phosphodiesterase 
5 (PDES). 
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FIGURE 36.17 Muscle-relaxation 
pathway. Increases in nitric oxide levels 
stimulate guanylate cyclase, which 
produces CGMP. The increased cGMP 
concentration promotes smooth-muscle 
relaxation. PDE5 hydrolyzes cGMP, which 
lowers the cGMP concentration. The 
inhibition of PDES by sildenafil maintains 
elevated levels of cGMP. 
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FIGURE 36.18 Synthetic statins. 
Atorvastatin (Lipitor) and rosuvastatin 
(Crestor) are completely synthetic drugs 
that inhibit HMG-CoA reductase. 
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a correlation between high blood-cholesterol levels and high mortality rates 
from heart disease. This observation led to the notion that blocking cholesterol 
synthesis might lower blood-cholesterol levels and, in turn, lower the risk of 
heart disease. Initial attempts at blocking cholesterol synthesis focused on 
steps near the end of the pathway. However, these efforts were abandoned 
because the accumulation of the insoluble substrate for the inhibited enzyme 
led to the development of cataracts and other side effects. Investigators even- 
tually identified a more-favorable target—namely, the enzyme HMG-CoA 
reductase (Section 26.2). This enzyme acts on a substrate, HMG-CoA 
(3-hydroxy-3-methylglutaryl coenzyme A), that does not accumulate, because 
it is water-soluble and can be utilized by other pathways. 

A promising natural product, compactin, was discovered in a screen of 
compounds from a fermentation broth from Penicillium citrinum in a search 
for antibacterial agents. In some, but not all, animal studies, compactin was 
found to inhibit HMG-CoA reductase and to lower serum cholesterol lev- 
els. In 1982, a new HMG-CoA reductase inhibitor was discovered in a 
fermentation broth from Aspergillus cereus. This compound, now called 
lovastatin, was found to be structurally very similar to compactin, bearing 
one additional methyl group. 

In clinical trials, lovastatin significantly reduced serum cholesterol levels 
with few side effects. Most side effects could be prevented by treatment 
with mevalonate (the product of HMG-CoA reductase), indicating that the 
side effects were likely due to the highly effective blocking of HMG-CoA 
reductase. One notable side effect is muscle pain or weakness (termed 
myopathy), although its cause remains to be fully established. After many 
studies, the Food and Drug Administration (FDA) approved lovastatin for 
treating high serum cholesterol levels. 

A structurally related HMG-CoA reductase inhibitor was later shown to 
cause a statistically significant decrease in deaths due to coronary heart dis- 
ease. This result validated the benefits of lowering serum cholesterol levels. 
Further mechanistic analysis revealed that the HMG-CoA reductase inhibi- 
tor acts not only by lowering the rate of cholesterol biosynthesis, but also by 
inducing the expression of the low-density-lipoprotein (LDL) receptor 
(Section 26.3). Cells with such receptors remove LDL particles from the 
bloodstream, and so these particles cannot contribute to atheroma. 


Screening libraries of synthetic compounds expands the 
opportunity for identification of drug leads 


Lovastatin and its relatives are either natural products or readily derived 
from natural products. After the discovery of these compounds, totally syn- 
thetic molecules were developed that are more-potent inhibitors of HMG- 
CoA reductase (Figure 36.18). These compounds were found to be effective 
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at lower dose levels, reducing side effects. The original HMG-CoA reduc- 
tase inhibitors or their precursors were found by screening libraries of natu- 
ral products. More recently, drug developers have tried screening large 
libraries of both natural products and purely synthetic compounds prepared 
in the course of many drug-development programs. Under favorable cir- 
cumstances, hundreds of thousands or even millions of compounds can be 
tested in this process, termed high-throughput screening. Compounds in these 
libraries can be synthesized one at a time for testing. An alternative approach 
is to synthesize a large number of structurally related compounds that differ 
from one another at only one or a few positions all at once. This approach is 
often termed combinatorial chemistry. Here, compounds are synthesized with 
the use of the same chemical reactions but a variable set of reactants. Suppose 
that a molecular scaffold is constructed with two reactive sites and that 20 
reactants can be used in the first site and 40 reactants can be used in the sec- 
ond site. A total of 20 X 40 = 800 possible compounds can be produced. 

A key method in combinatorial chemistry is split-pool synthesis 
(Figure 36.19). The method depends on solid-phase synthetic methods, first 
developed for the synthesis of peptides (Section 3.5). Compounds are syn- 
thesized on small beads. Beads containing an appropriate starting scaffold are 
produced and divided (split) into n sets, with n corresponding to the number 
of building blocks to be used at one site. Reactions adding the reactants at the 
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FIGURE 36.19 Split-pool synthesis. 
Reactions are performed on beads. Each of 
the reactions with the first set of reactants 
is performed on a separate set of beads. 
The beads are then pooled, mixed, and 
split into sets. The second set of reactants 
is then added. Many different compounds 
will be produced, but all of the compounds 
on a single bead will be identical. 


FIGURE 36.20 Screening a library of 
synthesized carbohydrates. A small 
combinatorial library of carbohydrates 
synthesized on the surface of 130-mm 
beads is screened for carbohydrates that 
are bound tightly by a lectin from peanuts. 
Beads that have such carbohydrates are 
darkly stained through the action of an 
enzyme linked to the lectin. [From Figure 3 
in Liang et al., “Polyvalent binding to 
carbohydrates immobilized on an insoluble 
resin,” Proceedings of the National Academy 


of Sciences, USA, vol. 94, pp. 10554-10559, 


September 1997. Copyright 2004 National 
Academy of Sciences, USA.] 


FIGURE 36.21 Initial design of an HIV 
protease inhibitor. This compound was 
designed by combining part of one 
compound with good inhibition activity but 
poor solubility (shown in red) with part of 
another compound with better solubility 
(shown in blue). 
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first site are run, and the beads are isolated by filtration. The n sets of beads 
are then combined (pooled), mixed, and split again into m sets, with m cor- 
responding to the number of reactants to be used at the second site. Reactions 
adding these m reactants are run, and the beads are again isolated. The 
important result is that each bead contains only one compound, even though 
the entire library of beads contains many. Furthermore, although only 
n + m reactions were run, n X m compounds are produced. With the preced- 
ing values for n and m, 20 + 40 = 60 reactions produce 20 X 40 = 800 
compounds. In some cases, assays can be performed directly with the com- 
pounds still attached to the bead to find compounds with desired properties 
(Figure 36.20). Alternatively, each bead can be isolated and the compound 
can be cleaved from the bead to produce free compounds for analysis. After 
an interesting compound has been identified, analytical methods of various 
types must be used to identify which of the n X m compounds is present. 

Note that the “universe” of druglike compounds is vast. More than an 
estimated 104? compounds are possible with molecular weights less than 
750. Thus, even with “large” libraries of millions of compounds, only a tiny 
fraction of the chemical possibilities are present for study. 


Drugs can be designed on the basis of three-dimensional 
structural information about their targets 


Many drugs bind to their targets in a manner reminiscent of Emil Fischer’s 
lock and key (Figure 8.8). Therefore, we should be able to design a key, given 
enough knowledge about the shape and chemical composition of the lock. In 
the idealized case, we would like to design a small molecule that is comple- 
mentary in shape and electronic structure to a target protein so that it binds 
effectively to the targeted site. Despite our ability to determine three- 
dimensional structures rapidly, the achievement of this goal remains in the 
future. Designing stable compounds from scratch that have the correct shape 
and other properties to fit precisely into a binding site is difficult because pre- 
dicting the structure that will best fit into a binding site is difficult. Prediction 
of binding affinity requires a detailed understanding of the interactions 
between a compound and its binding partner and of the interactions between 
the compound and the solvent when the compound is free in solution. 
Nonetheless, structure-based drug design has proved to be a powerful 
tool in drug development. Among its most prominent successes has been 
the development of drugs that inhibit the protease from the human immu- 
nodeficiency virus (HIV; Section 34.4). Consider the development of the 
protease inhibitor indinavir (Crixivan; Figure 9.19). Two sets of promising 
inhibitors that had high potency but poor solubility and bioavailability 
were discovered. X-ray crystallographic analysis and molecular-modeling 
findings suggested that a hybrid of these two inhibitor sets might have 
both high potency and improved bioavailability (Figure 36.21). The syn- 
thesized hybrid compound did show improvements but required 
further optimization. The structural data suggested one point 
where modifications could be tolerated. A series of compounds 
were prepared and tested for their ability to inhibit the protease 
(Figure 36.22). These data are a demonstration of a structure— 
: activity relationship (SAR); they provide an opportunity to corre- 
late structure with function and guide design of further molecules. 
The most-active compound showed poor bioavailability, but one of 
the other compounds (highlighted in yellow in Figure 36.22) 
showed good bioavailability and acceptable activity. The maxi- 
mum serum concentration available through oral administration 
was significantly higher than the levels required to suppress 
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replication of the virus. This drug, as well as other protease inhibitors 
developed at about the same time, has been used in combination with other 
drugs to treat AIDS with much more encouraging results than had been 
obtained previously (Figure 36.23). 

Aspirin targets the cyclooxygenase site in prostaglandin H, synthase, as 
discussed earlier. Animal studies suggested that mammals contain not one 
but two distinct cyclooxygenase enzymes, both of which are targeted by 
aspirin. The more recently discovered enzyme, cyclooxygenase 2 (COX2), 
is expressed primarily as part of the inflammatory response, whereas cyclo- 
oxygenase 1 (COX1) is expressed more generally. These observations sug- 
gested that a cyclooxygenase inhibitor that was specific for COX2 might be 
able to reduce inflammation in conditions such as arthritis without produc- 
ing the gastric and other side effects associated with aspirin. 

The amino acid sequences of COX1 and COX2 were deduced from 
cDNA cloning studies. These sequences are more than 60% identical, clearly 
indicating that the enzymes have the same overall structure. Nevertheless, 
there are some differences in the residues around the aspirin-binding site. 
X-ray crystallography revealed that an extension of the binding pocket was 
present in COX2, but absent in COX1. This structural difference suggested 
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FIGURE 36.22 Compound optimization. 
Four compounds are evaluated for 
characteristics including the IC,,, log(P), 
and C,,4, (the maximal concentration of 
compound present) measured in the serum 
of dogs. The compound shown at the 
bottom (highlighted in yellow) has the 
weakest inhibitory power (measured by 
IC59) but by far the best bioavailability 
(measured by c,,,,,). This compound was 
selected for further development, leading 
to the drug indinavir (Crixivan). 


FIGURE 36.23 The effect of anti-HIV 
drug development. Death rates from HIV 
infection (AIDS) reveal the tremendous 
effect of HIV protease inhibitors and their 
use in combination with inhibitors of HIV 
reverse transcriptase. The death rates in 
his graph are from the leading causes of 
death among persons 24 to 44 years old in 
he United States. [Data from Centers for 
Disease Control.] 
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FIGURE 36.24 COX2-specific inhibitors. These compounds have protuberances (shown in 
red) that fit into a pocket in the COX2 isozyme but sterically clash with the COX1 isozyme. 


a strategy for constructing COX2-specific inhibitors—namely, to synthesize 
compounds that had a protuberance that would fit into the pocket in the 
COX2 enzyme. Such compounds were designed and synthesized and then 
further refined to produce effective drugs familiar as Celebrex and Vioxx 
(Figure 36.24). Vioxx was subsequently withdrawn from the market because 
some people experienced adverse effects. Some of these effects may be due 
to the inhibition of COX2, the intended target. Thus, although the develop- 
ment of these drugs is a triumph for structure-based drug design, these 
outcomes highlight the fact that the inhibition of important enzymes can 
lead to complex physiological responses. 


36.3 Analyses of Genomes Hold Great Promise 
for Drug Discovery 


The completion of the sequencing of the human and other genomes is a 
potentially powerful driving force for the development of new drugs. 
Genomic sequencing and analysis projects have vastly increased our knowl- 
edge of the proteins encoded by the human genome. This new source of 
knowledge may greatly accelerate early stages of the drug-development 
process or even allow drugs to be tailored to the individual patient. 


Potential targets can be identified in the human proteome 


The human genome encodes approximately 21,000 proteins, not counting the 
variation produced by alternative mRNA splicing and posttranslational 
modifications. Many of these proteins are potential drug targets, in particular 
those that are enzymes or receptors and have significant biological effects 
when activated or inhibited. Several large protein families are particularly rich 
sources of targets. For example, the human genome includes genes for more 
than 500 protein kinases that can be recognized by comparing the deduced 
amino acid sequences. Many of these kinases are known to play a role in the 
progression of a variety of diseases. For example, Bcr-Abl kinase, a dysregu- 
lated kinase formed from a specific chromosomal defect, is known to contrib- 
ute to certain leukemias and is the target of the drug imatinib mesylate 
(Gleevec; Section 14.5). Some of the other protein kinases undoubtedly play 
central roles in particular cancers as well. Similarly, the human genome 
encodes approximately 800 7T’M receptors (Section 14.1), of which approxi- 
mately 350 are odorant receptors. Many of the remaining 7TM receptors are 
known or potential drug targets. For example, the beta-blockers, which are 
widely used to treat hypertension, target the B,-adrenergic receptor, and the 


antiulcer medication ranitidine (Zantac) targets the histamine H, receptor 1049 
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Novel proteins that are not part of large families already supplying drug 
targets can be more readily identified through the use of genomic informa- 
tion. There are a number of ways to identify proteins that could serve as tar- 
gets of drug-development programs. One way is to look for changes in 
expression patterns, protein localization, or posttranslational modifications in 
cells from disease-afflicted organisms. Another is to perform studies of tis- 
sues or cell types in which particular genes are expressed. Such analyses of the 
human genome should increase the number of actively pursued drug targets. 


Animal models can be developed to test the validity 
of potential drug targets 


The genomes of a number of model organisms have now been sequenced. 
The most important of these genomes for drug development is that of the 
mouse. Remarkably, the mouse and human genomes are approximately 85% 
identical in sequence, and more than 98% of all human genes have recogniz- 
able mouse counterparts. Mouse studies provide drug developers with a 
powerful tool—the ability to disrupt (“knock out’’) specific genes in the 
mouse (Section 5.4). If disruption of a gene has a desirable effect, then the 
product of this gene may represent a promising drug target. The utility of this 
approach has been demonstrated retrospectively. For example, disruption of 
the gene for the a subunit of the Ht-Kt ATPase, the key protein for secret- 
ing acid into the stomach, produces mice with a gastric pH of 6.9. Under 
similar conditions, their wild-type counterparts produce a gastric pH of 3.2. 
This protein is the target of the drugs omeprazole (Prilosec) and lansoprazole 
(Prevacid and Takepron), used for treating gastroesophageal reflux disease. 
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Several large-scale efforts are underway to generate hundreds or thou- 
sands of mouse strains, each having a different gene disrupted. The pheno- 
types of these mice are a good indication of whether the protein encoded by 
a disrupted gene is a promising drug target. This approach allows drug 
developers to evaluate potential targets without any preconceived notions 
regarding physiological function. 


FIGURE 36.25 Emerging drug target. 
The structure of a protease from the 
coronavirus that causes SARS (severe acute 
respiratory syndrome) is shown bound to an 
inhibitor. This structure was determined less 
than a year after the identification of the 
virus. [Drawn from 1P9S.pdb.] 
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FIGURE 36.26 Phenotype—genotype 
correlation. Average changes in diastolic 
blood pressure on treatment with 
metoprolol. Persons with two copies of the 
most common allele (Sy93g9) showed 
significant decreases in blood pressure. 
Those with one variant allele (GR or SG) 
showed more modest decreases, and those 
with two variant alleles (GR/SG) showed no 
decrease. [Data from J. A. Johnson et al., Clin. 
Pharmacol. Ther. 74:44-52, 2008.] 
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Potential targets can be identified in the 
genomes of pathogens 


Human proteins are not the only important drug tar- 
gets. Drugs such as penicillin and HIV protease inhibi- 
tors act by targeting proteins within a pathogen. The 
genomes of hundreds of pathogens have now been 
sequenced, and these genome sequences can be mined 
for potential targets. 

New antibiotics are needed to combat bacteria that 
are resistant to many existing antibiotics. One approach 
seeks proteins essential for cell survival that are con- 
served in a wide range of bacteria. Drugs that inactivate 
such proteins are expected to be broad-spectrum antibi- 
otics, useful for treating infections from any of a range 
of different bacteria. One such protein is peptide deformylase, the enzyme 
that removes formyl groups present at the amino termini of bacterial pro- 
teins immediately after translation (Figure 30.19). 

Alternatively, a drug may be needed against a specific pathogen. An 
example of such a pathogen is the strain of coronavirus responsible for 
severe acute respiratory syndrome (SARS). Within one month of the recog- 
nition of this emerging disease, investigators had isolated the virus that 
causes the syndrome, and, within weeks, completely sequenced its 29,751- 
base genome. This sequence revealed the presence of a gene encoding a viral 
protease, known to be essential for viral replication from studies of other 
members of the coronavirus family to which the SARS virus belongs. This 
structure has opened the possibility for specific antiviral treatments for this, 
and related, viruses (Figure 36.25). 


Genetic differences influence individual responses to drugs 


Many drugs are not effective in everyone, often because of genetic differences 
among individuals. Nonresponding persons may have slight differences in 
either a drug’s target molecule or the proteins taking part in drug transport and 
metabolism. The goal of the emerging fields of pharmacogenetics and phar- 
macogenomics is to design drugs that either act more consistently from person 
to person or are tailored to individual persons with particular genotypes. 

Drugs such as metoprolol that target the B,-adrenergic receptor are 
popular treatments for hypertension. These drugs are often referred to as 
“beta-blockers.” 


Metoprolol 


However, some individuals do not respond well to beta-blockers. Two vari- 
ants of the gene coding for the B, -adrenergic receptor are common in the U.S. 
population. The most common allele has serine in position 49 and arginine in 
position 389. In some persons, however, glycine replaces one or the other of 
these residues. In clinical studies, participants with two copies of the most 
common allele responded well to metoprolol: their daytime diastolic blood 
pressure was reduced by 14.7 + 2.9 mm Hg on average. In contrast, partici- 
pants with one variant allele showed a smaller reduction in blood pressure, 
and the drug had no significant effect on participants with two variant alleles 
(Figure 36.26). These observations suggest the potential utility of genotyping 


individual patients at these positions. One could then predict whether treat- 
ment with metoprolol or other beta-blockers is likely to be effective. 

Given the importance of ADME and toxicity properties in determining 
drug efficacy, it is not surprising that variations in proteins participating 
in drug transport and metabolism can alter a drug’s effectiveness. An 
important example is the use of thiopurine drugs such as 6-thioguanine, 
6-mercaptopurine, and azothioprine to treat diseases including leukemia, 
immune disorders, and inflammatory bowel disease. 
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A minority of patients who are treated with these drugs show signs of toxic- 
ity at doses that are well tolerated by most patients. These differences 
between patients are due to rare variations in the gene encoding the xenobi- 
otic-metabolizing enzyme thiopurine methyltransferase, which adds a 
methyl group to sulfur atoms. 
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The variant enzyme is less stable. Patients with these variant enzymes can 
build up toxic levels of the drugs if appropriate care is not taken. Thus, 
genetic variability in an enzyme participating in drug metabolism plays a 
large role in determining the variation in the tolerance of different persons 
to particular drug levels. Many other drug-metabolism enzymes and drug- 
transport proteins have been implicated in controlling individual reactions 
to specific drugs. The identification of the genetic factors will allow a deeper 
understanding of why some drugs work well in some individuals but poorly 
in others. In the future, doctors may examine a patient’s genes to help plan 
drug-therapy programs. 


36.4 The Clinical Development of Drugs Proceeds 
Through Several Phases 


In the United States, the FDA requires demonstration that drug candidates 
be effective and safe before they may be used in human beings on a large 
scale. This requirement is particularly true for drug candidates that are to 
be taken by people who are relatively healthy. More side effects are accept- 
able for drug candidates intended to treat significantly ill patients such as 
those with serious forms of cancer, where there are clear, unfavorable con- 
sequences for not having an effective treatment. 
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FIGURE 36.27 Clinical-trial phases. 
Clinical trials proceed in phases examining 
safety and efficacy in increasingly large 
groups. 
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Clinical and expensive 
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Efficacy side effects date drug before it is approved by 
Side effects the FDA for general use. These trials 


proceed in at least three phases 
(Figure 36.27). In Phase I, a small number (typically from 10 to 100) of 
usually healthy volunteers take the drug for an initial study of safety. These 
volunteers are given a range of doses and are monitored for signs of toxicity. 
The efficacy of the drug candidate is not specifically evaluated. 

In Phase II, the efficacy of the drug candidate is tested in a small number 
of persons who might benefit from the drug. Further data regarding the 
drug’s safety are obtained. Such trials are often controlled and double- 
blinded. In a controlled study, subjects are divided randomly into two 
groups. Subjects in the treatment group are given the treatment under 
investigation. Subjects in the control group are given either a placebo—that 
is, a treatment such as sugar pills known to not have intrinsic value—or the 
best standard treatment available, if withholding treatment altogether 
would be unethical. In a double-blinded study, neither the subjects nor the 
researchers know which subjects are in the treatment group and which are 
in the control group. A double-blinded study prevents bias in the course of 
the trial. When the trial has been completed, the assignments of the subjects 
into treatment and control groups are unsealed and the results for the two 
groups are compared. A variety of doses are often investigated in Phase II 
trials to determine which doses appear to be free of serious side effects and 
which doses appear to be effective. 

The power of the placebo effect—that is, the tendency to perceive 
improvement in a subject who believes that he or she is receiving a poten- 
tially beneficial treatment—should not be underestimated. In a study of 
arthroscopic surgical treatment for knee pain, for example, subjects who 
were led to believe that they had received surgery through the use of video- 
tapes and other means showed the same level of improvement, on average, 
as subjects who actually received the procedure. 

In Phase III, similar studies are performed on a larger and more diverse 
population. This phase is intended to more firmly establish the efficacy of 
the drug candidate and to detect side effects that may develop in a small 
percentage of the subjects who receive treatment. Thousands of subjects 
may participate in a typical Phase III study. 

Clinical trials can be extremely costly. Hundreds or thousands of 
patients must be recruited and monitored for the duration of the trial. Many 
physicians, nurses, clinical pharmacologists, statisticians, and others par- 
ticipate in the design and execution of the trial. Costs can run from tens of 
millions to hundreds of millions of dollars. Extensive records must be kept, 
including documentation of any adverse reactions. These data are compiled 
and submitted to the FDA. The full cost of developing a drug is currently 
estimated to be more than $800 million. 

Even after a drug has been approved and is in use, difficulties can arise. 
Clinical trials run after a drug has entered the market, referred to as Phase 
IV studies, are designed to identify low-frequency side effects that may only 
emerge after widespread or long-term use. As mentioned earlier, rofecoxib 
(Vioxx), for example, was withdrawn from the market after significant car- 
diac side effects were detected in Phase IV clinical trials. Such events high- 
light the necessity for users of any drug to balance beneficial effects against 
potential risks. 


The evolution of drug resistance can limit the utility of 
drugs for infectious agents and cancer 


Many drugs are used for long periods of time without any loss of effective- 
ness. However, in some cases, particularly for the treatment of cancer or 
infectious diseases, drug treatments that were initially effective become less 
so. In other words, the disease becomes resistant to the drug therapy. Why 
does this resistance develop? Infectious diseases and cancer have a common 
feature—namely, that an affected person contains many cells (or viruses) 
that can mutate and reproduce. These conditions are necessary for evolution 
to take place. Thus, an individual microorganism or cancer cell may, by 
chance, have a genetic variation that makes it more suitable for growth and 
reproduction in the presence of the drug. These microorganisms or cells are 
more fit than others in their population, and they will tend to take over the 
population. As the selective pressure due to the drug is continually applied, 
the population of microorganisms or cancer cells will tend to become more 
and more resistant to the presence of the drug. Note that resistance can 
develop by a number of mechanisms. 

The HIV protease inhibitors discussed earlier provide an important 
example of the evolution of drug resistance. Retroviruses are very well 
suited to this sort of evolution because reverse transcriptase carries out rep- 
lication without a proofreading mechanism. In a genome of approximately 
9750 bases, each possible single point mutation is estimated to appear in a 
virus particle more than 1000 times per day in each infected person. Many 
multiple mutations also occur. Most of these mutations either have no effect 
or are detrimental to the virus. However, a few of the mutant virus particles 
encode proteases that are less susceptible to inhibition by the drug. In the 
presence of an HIV protease inhibitor, these virus particles will tend to rep- 
licate more effectively than does the population at large. With the passage of 
time, the less-susceptible viruses will come to dominate the population and 
the virus population will become resistant to the drug. 

Pathogens may become resistant to antibiotics by completely different 
mechanisms. Some pathogens contain enzymes that inactivate or degrade 
specific antibiotics. For example, many bacteria are resistant to B-lactams 
such as penicillin because they contain B-lactamase enzymes. These 
enzymes hydrolyze the B-lactam ring and render the drugs inactive. 
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Many of these enzymes are encoded in plasmids, small circular pieces of 
DNA often carried by bacteria. Many plasmids are readily transferred from 
one bacterial cell to another, transmitting the capability for antibiotic resis- 
tance. Plasmid transfer thus contributes to the spread of antibiotic resis- 
tance, a major health-care challenge. On the other hand, plasmids have been 
harnessed for use in recombinant DNA methods (Section 5.2). 

Drug resistance commonly emerges in the course of cancer treatment. 
Cancer cells are characterized by their ability to grow rapidly without the 
constraints that apply to normal cells. Many drugs used for cancer chemo- 
therapy inhibit processes that are necessary for this rapid cell growth. 
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However, individual cancer cells may accumulate genetic changes that miti- 
gate the effects of such drugs. These altered cancer cells will tend to grow 
more rapidly than others and will become dominant within the cancer-cell 
population. This ability of cancer cells to mutate quickly has been a chal- 
lenge to one of the major breakthroughs in cancer treatment: the develop- 
ment of inhibitors for proteins specific to cancer cells present in certain 
leukemias (Section 14.5). For example, tumors become undetectable in 
patients treated with imatinib mesylate, which is directed against the Ber- 
Abl protein kinase. Unfortunately, the tumors of many of the patients 
treated with imatinib mesylate recur after a period of years. In many of these 
cases, mutations have altered the Bcr-Abl protein so that it is no longer 
inhibited by the concentrations of imatinib mesylate used in therapy. 

Cancer patients often take multiple drugs concurrently in the course of 
chemotherapy and, in many cases, cancer cells become simultaneously 
resistant to many or all of them. This multiple-drug resistance can be due to 
the proliferation of cancer cells that overexpress a number of ABC trans- 
porter proteins that pump drugs out of the cell (Section 13.2). Thus, cancer 
cells can evolve drug resistance by over expressing normal human proteins 
or by modifying proteins responsible for the cancer phenotype. 


SUMMARY 


36.1 The Development of Drugs Presents Huge Challenges 

Most drugs act by binding to enzymes or receptors and modulating 
their activities. To be effective, drugs must bind to these targets with 
high affinity and specificity. However, compounds with the desired 
affinity and specificity do not necessarily make suitable drugs. Most 
compounds are poorly absorbed or rapidly excreted from the body or 
they are modified by metabolic pathways that target foreign com- 
pounds. Consequently, when taken orally, these compounds do not 
reach their targets at appropriate concentrations for a sufficient period 
of time. A drug’s properties related to its absorption, distribution, 
metabolism, and excretion are called ADME properties. Oral bioavail- 
ability is a measure of a drug’s ability to be absorbed; it is the ratio of 
the peak concentration of a compound given orally to the peak concen- 
tration of the same dose directly injected. The structure of a compound 
can affect its bioavailability in complicated ways, but generalizations 
called Lipinski’s rules provide useful guidelines. Drug-metabolism 
pathways include oxidation by cytochrome P450 enzymes (phase I 
metabolism) and conjugation to glutathione, glucuronic acid, and sul- 
fate (phase IT metabolism). A compound may also not be a useful drug 
because it is toxic, either because it modulates the target molecule too 
effectively or because it also binds to proteins other than the target. The 
liver and kidneys play central roles in drug metabolism and excretion. 


36.2 Drug Candidates Can Be Discovered by Serendipity, Screening, or Design 
Many drugs have been discovered by serendipity—that is, by chance 
observation. The antibiotic penicillin is produced by a mold that acci- 
dentally contaminated a culture dish, killing nearby bacteria. Drugs such 
as chlorpromazine and sildenafil were discovered to have beneficial, but 
unexpected, effects on human physiology. The cholesterol-lowering 
statin drugs were developed after large collections of compounds were 
screened for potentially interesting activities. Combinatorial chemistry 
methods have been developed to generate large collections of chemically 
related yet diverse compounds for screening. In some cases, the three- 
dimensional structure of a drug target is available and can be used to aid 


the design of potent and specific inhibitors. Examples of drugs designed 1055 
in this manner are the HIV protease inhibitor indinavir and cyclooxy- Problems 
genase 2 inhibitors such as celecoxib. 


36.3 Analyses of Genomes Hold Great Promise for Drug Discovery 

The human genome encodes approximately 21,000 proteins, and 
many more if derivatives due to alternative mRNA splicing and post- 
translational modification are included. The genome sequences can be 
examined for potential drug targets. Large families of proteins known 
to participate in key physiological processes such as the protein kinases 
and 7TM receptors have each yielded several targets for which drugs 
have been developed. The genomes of model organisms also are useful 
for drug-development studies. Strains of mice with particular genes 
disrupted have been useful in validating certain drug targets. The 
genomes of bacteria, viruses, and parasites encode many potential 
drug targets that can be exploited owing to their important functions 
and their differences from human proteins, minimizing the potential 
for side effects. Genetic differences between individuals can be exam- 
ined and correlated with differences in responses to drugs, potentially 
aiding both clinical treatments and drug development. 


36.4 The Clinical Development of Drugs Proceeds Through Several Phases 
Before compounds can be given to human beings as drugs, they must be 
extensively tested for safety and efficacy. Clinical trials are performed in 
stages: first testing safety, then safety and efficacy in a small population, 
and finally safety and efficacy in a larger population to detect rarer 
adverse effects. Largely due to the expenses associated with clinical 
trials, the cost of developing a new drug has been estimated to be more 
than $800 million. Even when a drug has been approved for use, compli- 
cations can arise. With regard to infectious diseases and cancer, patients 
often develop resistance to a drug after it has been administered for a 
period of time because variants of the disease agent that are less suscep- 
tible to the drug arise and replicate, even when the drug is present. 


KEY TERMS 
ligand (p. 1035) xenobiotic compounds (p. 1037) atheroma (p. 1043) 
dissociation constant (K4) (p. 1035) drug metabolism (p. 1037) myopathy (p. 1044) 
side effect (p. 1035) oxidation (p. 1037) high-throughput screening (p. 1045) 
inhibition constant (K;,) (p. 1036) conjugation (p. 1037) combinatorial chemistry (p. 1045) 
Cheng—Prusoff equation (p. 1036) phase I transformation (p. 1039) split-pool synthesis (p. 1045) 
ADME (p. 1036) phase II transformation (p. 1039) structure-based drug design (p. 1046) 
oral bioavailability (p. 1036) first-pass metabolism (p. 1039) structure—activity relationship 
Lipinski’s rules (p. 1036) glomeruli (p. 1039) (SAR) (p. 1046) 
compartment (p. 1037) enterohepatic cycling (p. 1040) placebo (p. 1052) 
blood-brain barrier (p. 1037) therapeutic index (p. 1041) placebo effect (p. 1052) 

PROBLEMS 
1. Routes to discovery. For each of the following drugs, indi- 2. Lipinski’s rules. Which of the following compounds sat- 
cate whether the physiological effects of the drug were isfy all of Lipinski’s rules? [Log(P) values are given in 
known before or after the target was identified. parentheses. | 
(a) Penicillin (d) Atorvastatin (Lipitor) (a) Atenolol (0.23) 
(b) Sildenafil (Viagra) (e) Aspirin (b) Sildenafil (3.18) 


(c) Rofecoxib (Vioxx) (f) Indinavir (Crixivan) (c) Indinavir (2.78) 
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3. Calculating log tables. Considerable effort has been 
expended to develop computer programs that can estimate 
log(P) values entirely on the basis of chemical structure. 
Why would such programs be useful? 


4, An ounce of prevention. Legislation has been proposed 
that would require that N-acetylcysteine be added to acet- 
aminophen tablets. Speculate about the role of this additive. 


5. Clinical-trial design. Distinguish between Phase I and 
Phase II clinical trials in regard to number of persons enrolled, 
the state of health of the subjects, and the goals of the study. 


6. Drug interactions. As noted in this chapter, coumadin can 
be a very dangerous drug because too much can cause uncon- 
trolled bleeding. Persons taking coumadin must be careful 
about taking other drugs, particularly those that bind to albu- 
min. Propose a mechanism for this drug—drug interaction. 


7. A bad combination. Explain why drugs that inhibit P450 
enzymes may be particularly dangerous when used in com- 
bination with other medications. 


8. Mechanistically speaking. Name one advantage of a non- 
competitive inhibitor as a potential drug compared with a 
competitive inhibitor. 


9. A helping hand. You have developed a drug that is capa- 
ble of inhibiting the ABC transporter MDR. Suggest a pos- 
sible application for this drug in cancer chemotherapy. 


10. Find the target. Trypanosomes are unicellular parasites 
that cause sleeping sickness. During one stage of their life 
cycle, these organisms live in the bloodstream and derive all 
of their energy from glycolysis, which takes place in a spe- 
cialized organelle called a glycosome inside the parasite. 
Propose potential targets for treating sleeping sickness. 
What are some potential difficulties with your approach? 


11. Knowledge is power. How might genomic information 
be helpful for the effective use of imatinib mesylate 
(Gleevec) in cancer chemotherapy? 


12. Multiple targets, same goal. Sildenafil induces its phys- 
iological effects by increasing the intracellular concentra- 
tions of cGMP, leading to muscle relaxation. On the basis 
of the scheme shown in Figure 36.17, identify another 
approach for increasing cGMP levels with a small molecule. 


Mechanism Problem 


13. Variations on a theme. The metabolism of amphetamine by 
cytochrome P450 enzymes results in the conversion shown here. 
Propose a mechanism and indicate any additional products. 
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Data Interpretation Problem 


14. HIV protease inhibitor design. Compound A is one of 
a series that were designed to be potent inhibitors of HIV 
protease. 


Compound A 
Compound A was tested by using two assays: (1) direct 
inhibition of HIV protease in vitro and (2) inhibition of viral 
RNA production in HIV-infected cells, a measure of viral 
replication. The results of these assays are shown below. The 
HIV protease activity is measured with a substrate peptide 
present at a concentration equal to its K,, value. 


Compound A (nM) HIV protease activity (arbitrary units) 

0 11.2 

0.2 9.9 

0.4 7.4 

0.6 5.6 

0.8 4.3 

1 4.0 

2 2.2 

10 0.9 

100 0.2 
Compound A (nM) Viral RNA production (arbitrary units) 

0 760 

1.0 740 

2.0 380 

3.0 280 

4.0 180 

5.0 100 

10 30 

50 20 


Estimate the values for the K; of compound A in the 
protease-activity assay and for its IC,, in the viral- RNA- 
production assay. 

Treating rats with the relatively high oral dose of 20 mg 
kg! results in a maximum concentration of compound A of 
0.4 wM. On the basis of this value, do you expect com- 
pound A to be effective in preventing HIV replication when 
taken orally? 


ANSWERS TO PROBLEMS 


Chapter 1 

1. The hydrogen-bond donors are the NH and NH, groups. 

The hydrogen-bond acceptors are the carbonyl oxygen atoms and 
those ring nitrogen atoms that are not bonded to hydrogen or to 
deoxyribose. 

2. Interchange the positions of the single and double bonds in the 
six-membered ring. 

3. (a) Ionic interactions; (b) van der Waals interactions. 

4, Processes a and b. 

5. AS.stem = —661 J mol”! K~1(—158 kcal mol! K~!) 


AS surroundings = +842 J mol! K71(+201 cal mol ! kK") 
6. (a) 1.0; (b) 13.0; (c) 1.3; (d) 12.7 
7. 2.88 

8. 1.96 

9. 55.5M 

11.83 

447; 0.00050 

0.00066 M 

6.0 

5.53 

6.48 

7.8 

100 

(a) 1.6; (b) 0.51; (c) 0.16. 
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[OH] added — 


20. 0.1 M sodium acetate solution: 6.34; 6.03; 5.70; 4.75. 

0.01 M sodium acetate solution: 5.90; 4.75; 3.38; 1.40. 

21. 90 mM acetic acid; 160 mM sodium acetate, 0.18 moles 
acetic acid; 0.32 moles sodium acetate; 10.81 g acetic acid; 26.25 g 
sodium acetate. 

22. 0.50 moles of acetic acid; 0.32 moles of NaOH; 30.03 g of 
acetic acid; 12.80 g of NaOH. 

23. 250 mM; yes; no, it will also contain 90 mM NaCl. 

24. 8.63 g Na,HPO,; 4.71¢ NaH, PO, 

25. 7.0; this buffer will not be very / asefualy because the pH value is 
far from the pK, value. 

26. +1.45 kJ ml= 1 (+0.35 kcal mol~!); +57.9 kJ mol! 

(+13.8 kcal mol~!) 

27. There will be approximately 15 million differences. 

28. (20!)/(10!*10!) = 184,756 

29. 7.9% 


Chapter 2 

1. (A) Proline, Pro, P; (B) tyrosine, Tyr, Y; (C) leucine, Leu, L; 
(D) lysine, Lys, K. 

. (a) C, B, A; (b) D; (c) D, B; (d) B, D; (e) B. 

(a) 6; (b) 2; (c) 3 (d ) 1; (e) 4; (£) 5. 

(a) Ala; (b) Tyr; (c) Ser; (d) His. 

Ser, Glu, Tyr, Thr 

(a) Alanine-glycine-serine; (b) Alanine; (c and d): 
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7. At pH'5.5, the net charge is +1: 


ar? 


At pH 7.5, the net charge is 0: 


NW 


8. There are 20 choices for each of the 50 amino acids: 20°, or 1 X 10°. 
9. 


*H;N— CH—C—N— CH—C—O—CH, 


10. The (nitrogen—a carbon-carbony] carbon) repeating unit. 

11. Side chain is the functional group attached to the a-carbon 
atom of an amino acid. 

12. Amino acid composition refers simply to the amino acids 

that make up the protein. The order is not specified. Amino acid 
sequence is the same as the primary structure—the sequence of 
amino acids from the amino terminal to the carboxyl terminal 

of the protein. Different proteins may have the same amino acid 
composition, but amino acid sequence identifies a unique protein. 
13. (a) Each strand is 35 kDa and hence has about 318 residues 
(the mean residue mass is 110 daltons). Because the rise per residue 
in an a helix is 1.5 A, the length is 477 A. More precisely, for an 
a-helical coiled coil, the rise per residue is 1.46 A; so the length is 
464 A. (b) Eighteen residues in each strand (40 minus 4 divided by 2) 
are in a B-sheet conformation. Because the rise per residue is 3.5 A, 
the length is 63 A. 

14. The methyl group attached to the B-carbon atom of isoleucine 
sterically interferes with a-helix formation. In leucine, this methyl 
group is attached to the y-carbon atom, which is farther from the 
main chain and hence does not interfere. 


A1 


A2 
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15. Proline and glycine. The cyclic side chain of proline linking 
the nitrogen and a-carbon atoms limits ¢ to a very narrow range 
(around —60 degrees). The lack of steric hindrance exhibited by 
the side chain hydrogen atom of glycine enables this amino acid to 
access a much greater area of the Ramachandran plot. 

16. The first mutation destroys activity because valine occupies 
more space than alanine does, and so the protein must take a 
different shape, assuming that this residue lies in the closely 
packed interior. The second mutation restores activity because 
of a compensatory reduction of volume; glycine is smaller 

than isoleucine. 

17. Loops invariably are on the surface of proteins, exposed to the 
environment. Because many proteins exist in aqueous environ- 
ments, the exposed loops will be hydrophilic to interact with water. 
18. The native conformation of insulin is not the thermodynami- 
cally most stable form, because it contains two separate chains 
linked by disulfide bonds. Insulin is formed from proinsulin, a 
single-chain precursor, that is cleaved to form insulin, a 51-residue 
molecule, after the disulfide bonds have formed. 

19. A segment of the main chain of the protease could hydrogen 
bond to the main chain of the substrate to form an extended paral- 
lel or antiparallel pair of B strands. 

20. Glycine has the smallest side chain of any amino acid. Its size 
is often critical in allowing polypeptide chains to make tight turns 
or to approach one another closely. 

21. Glutamate, aspartate, and the terminal carboxylate can form 
salt bridges with the guanidinium group of arginine. In addition, 
this group can be a hydrogen-bond donor to the side chains of 
glutamine, asparagine, serine, threonine, aspartate, tyrosine, and 
glutamate and to the main-chain carbonyl group. Histidine can 
form hydrogen bonds with arginine at pH 7. 

22. Disulfide bonds in hair are broken by adding a thiol-containing 
reagent and applying gentle heat. The hair is curled, and an oxi- 
dizing agent is added to re-form disulfide bonds to stabilize the 
desired shape. 

23. Some proteins that span biological membranes are “‘the excep- 
tions that prove the rule” because they have the reverse distribu- 
tion of hydrophobic and hydrophilic amino acids. For example, 
consider porins, proteins found in the outer membranes of many 
bacteria. Membranes are built largely of hydrophobic chains. 
Thus, porins are covered on the outside largely with hydrophobic 
residues that interact with the neighboring hydrophobic chains. In 
contrast, the center of the protein contains many charged and polar 
amino acids that surround a water-filled channel running through 
the middle of the protein. Thus, because porins function in hydro- 
phobic environments, they are “inside out” relative to proteins that 
function in aqueous solution. 

24. The amino acids would be hydrophobic in nature. An « helix 
is especially suited to crossing a membrane because all of the amide 
hydrogen atoms and carbonyl oxygen atoms of the peptide back- 
bone take part in intrachain hydrogen bonds, thus stabilizing these 
polar atoms in a hydrophobic environment. 

25. This example demonstrates that the pK, values are affected by 
the environment. A given amino acid can have a variety of pK, val- 
ues, depending on the chemical environment inside the protein. 

26. Recall that hemoglobin exists as a tetramer while myoglo- 
bin is a monomer. Consequently, the hydrophobic residues on 
the surface of hemoglobin subunits are probably involved in 
van der Waals interactions with similar regions on the other 
subunits, and will be shielded from the aqueous environment 
by this interaction. 

27. A possible explanation is that the severity of the symptoms cor- 
responds to the degree of structural disruption. Hence, substitution 


of alanine for glycine might result in mild symptoms, but substi- 
tution of the much larger tryptophan may result in little or no 
collagen triple-helix formation. 

28. The energy barrier that must be crossed to go from the polym- 
erized state to the hydrolyzed state is large even though the reac- 
tion is thermodynamically favorable. 

29. Using the Henderson—Hasselbach equation, we find the ratio 
of alanine-COOH to alanine-COO™ at pH 7 to be 107+. The ratio 
of alanine-NH, to alanine-NH,*, determined in the same fashion, 
is 10~!. Thus, the ratio of neutral alanine to the zwitterionic spe- 
ciesis10* x 10°' = 10°. 

30. The assignment of absolute configuration requires the assign- 
ment of priorities to the four groups connected to a tetrahedral 
carbon atom. For all amino acids except cysteine, the priorities are: 
(1) amino group; (2) carbonyl group; (3) side chain; (4) hydrogen. 
For cysteine, because of the sulfur atom in its side chain, the side 
chain has a greater priority than does the carbonyl group, leading 
to the assignment of an R rather than S configuration. 

31. ELVISISLIVINGINLASVEGAS 

32. No, Pro—X would have the characteristics of any other peptide 
bond. The steric hindrance in X—Pro arises because the R group 
of Pro is bonded to the amino group. Hence, in X—Pro, the proline 
R group is near the R group of X, which would not be the case in 
Pro-X. 

33. A, c; B, e; C, d; D, a; E, b. 

34. The reason is that the wrong disulfides formed pairs in urea. 
There are 105 different ways of pairing eight cysteine molecules 

to form four disulfides; only one of these combinations is enzy- 
matically active. The 104 wrong pairings have been picturesquely 
termed “scrambled” ribonuclease. 


Chapter 3 

1. (a) Phenyl isothiocyanate; (b) urea; B-mercaptoethanol to 
reduce disulfides; (c) chymotrypsin; (d) CNBr; (e) trypsin. 

2. For each cell within an organism, the genome is a fixed prop- 
erty. However, the proteome is dynamic, reflecting different 
environmental conditions and external stimuli. Two different 

cell types will likely express different subsets of proteins encoded 
within the genome. 

3. The S-aminoethylcysteine side chain resembles that of lysine. 
The only difference is a sulfur atom in place of a methylene group. 
4. A1 mg m7! solution of myoglobin (17.8 kDa; Table 3.2) cor- 
responds to 5.62 X 10-° M. The absorbance of a 1-cm path length 
is 0.84, which corresponds to an I)/I ratio of 6.96. Hence 14.4% of 
the incident light is transmitted. 

5. The sample was diluted 1000-fold. The concentration after 
dialysis is thus 0.001 M, or 1 mM. You could reduce the salt con- 
centration by dialyzing your sample, now 1 mM, in more buffer 
free of (NH,),5O,. 

6. Ifthe salt concentration becomes too high, the salt ions interact 
with the water molecules. Eventually, there will not be enough 
water molecules to interact with the protein, and the protein will 
precipitate. If there is lack of salt in a protein solution, the proteins 
may interact with one another—the positive charges on one protein 
with the negative charges on another or several others. Such an 
aggregate becomes too large to be solubilized by water alone. If salt 
is added, the salt neutralizes the charges on the proteins, preventing 
protein-protein interactions. 

7. Tropomyosin is rod shaped, whereas hemoglobin is approxi- 
mately spherical. 

8. The frictional coefficient, f, and the mass, m, determine s. 
Specifically, fis proportional to r (see equation 2 on p. 71). Hence, 
fis proportional to m!/%, and so s is proportional to m2/3 (see the 


equation on p. 76). An 80-kDa spherical protein undergoes sedi- 
mentation 1.59 times as rapidly as a 40-kDa spherical protein. 

9. The long hydrophobic tail on the SDS molecule (see p. 72) dis- 
rupts the hydrophobic interactions in the interior of the protein. 
The protein unfolds, with the hydrophobic R groups now interact- 
ing with SDS rather than with one another. 

10. 50kDa 

11. The protein may be modified. For instance, asparagine resi- 
dues in the protein may be modified with carbohydrate units 
(Section 2.6). 

12. A fluorescence-labeled derivative of a bacterial degradation 
product (e.g., a formylmethiony] peptide) would bind to cells con- 
taining the receptor of interest. 

13. (a) Trypsin cleaves after arginine (R) and lysine (K), generat- 
ing AVGWR, VK, and 8S. Because they differ in size, these prod- 
ucts could be separated by molecular exclusion chromatography. 
(b) Chymotrypsin, which cleaves after large aliphatic or aromatic 
R groups, generates two peptides of equal size (AVGW) and 
(RVKS). Separation based on size would not be effective. The 
peptide RVKS has two positive charges (R and K), whereas the 
other peptide is neutral. Therefore, the two products could be 
separated by ion-exchange chromatography. 

14. Antibody molecules bound to a solid support can be used for 
affinity purification of proteins for which a ligand molecule is not 
known or unavailable. 

15. Ifthe product of the enzyme-catalyzed reaction is highly 
antigenic, it may be possible to obtain antibodies to this particular 
molecule. These antibodies can be used to detect the presence of 
product by ELISA, providing an assay format suitable for the 
purification of this enzyme. 

16. An inhibitor of the enzyme being purified might have been 
present and subsequently removed by a purification step. This 
removal would lead to an apparent increase in the total amount of 
enzyme present. 

17. Many proteins have similar masses but different sequences 
and different patterns when digested with trypsin. The set of 
masses of tryptic peptides forms a detailed “fingerprint” of a 
protein that is very unlikely to appear at random in other proteins 
regardless of size. (A conceivable analogy is: “Just as similarly 
sized fingers will give different individual fingerprints, so also 
similarly sized proteins will give different digestion patterns 

with trypsin.”) 

18. Isoleucine and leucine are isomers and, hence, have identical 
masses. Peptide sequencing by mass spectrometry as described in 
this chapter is incapable of distinguishing these residues. Further 
analytical techniques are required to differentiate these residues. 
19. See the table at the bottom of the page. 

20. (a) Ion exchange chromatography will remove Proteins A and 
D, which have substantially lower isoelectric point; then gel filtra- 
tion chromatography will remove Protein C, which has a lower 
molecular weight. (b) If Protein B carries a His tag, a single affinity 
chromatography step with an immobilized nickel(II) column may 
be sufficient to isolate the desired protein from the others. 
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21. Protein crystal formation requires the ordered arrangement of 
identically positioned molecules. Proteins with flexible linkers can 
introduce disorder into this arrangement and prevent the forma- 
tion of suitable crystals. A ligand or binding partner may induce 
an ordered conformation to this linker and could be included in the 
solution to facilitate crystal growth. Alternatively, the individual 
domains separated by the linker may be expressed by recombinant 
methods and their crystal structures solved separately. 
22. Treatment with urea will disrupt noncovalent bonds. Thus the 
original 60-kDa protein must be made of two 30-kDa subunits. 
When these subunits are treated with urea and B-mercaptoethanol, 
a single 15-kDa species results, suggesting that disulfide bonds 
link the 30-kDa subunits. 
23. (a) Electrostatic repulsion between positively charged e-amino 
groups hinders a-helix formation at pH 7. At pH 10, the side 
chains become deprotonated, allowing a-helix formation. 
(b) Poly-L-glutamate is a random coil at pH 7 and becomes « helical 
below pH 4.5 because the y-carboxylate groups become protonated. 
24. The difference between the predicted and the observed masses 
for this fragment equals 28.0, exactly the mass shift that would be 
expected in a formylated peptide. This peptide is likely formylated 
at its amino terminus, and corresponds to the most N-terminal 
fragment of the protein. 
25. Light was used to direct the synthesis of these peptides. Each 
amino acid added to the solid support contained a photolabile 
protecting group instead of a t-Boc protecting group at its a-amino 
group. Illumination of selected regions of the solid support led 
to the release of the protecting group, which exposed the amino 
groups in these sites to make them reactive. The pattern of masks 
used in these illuminations and the sequence of reactants define 
the ultimate products and their locations. 
26. Mass spectrometry is highly sensitive and capable of detecting 
the mass difference between a protein and its deuterated counter- 
part. Fragmentation techniques can be used to identify the amino 
acids that retained the isotope label. Alternatively, NMR spectros- 
copy can be used to detect the isotopically labeled atoms because the 
deuteron and the proton have very different nuclear-spin properties. 
27. First amino acid: A 

Last amino acid: R (not cleaved by carboxypeptidase). 

Sequence of N-terminal tryptic peptide: AVR (tryptic peptide 

ends in K) 

Sequence of N-terminal chymotryptic peptide: AVRY 

(chymotryptic peptide ends in Y) 

Sequence: AVRYSR 
28. First amino acid: 5 

Last amino acid: L 

Cyanogen bromide cleavage: M is 10th position 

C-terminal residues are: (25,L,W) 

Amino-terminal residues: (G,K,5S,Y), tryptic peptide, ends in K 

Amino-terminal sequence: SYGK 

Chymotryptic peptide order: (S,Y), (G,K,L), (F,1,5), (M,T), 

(S,W), (5,L) 

Sequence: SYGKLSIFTMSWSL 


Purification Total protein Total activity Specific activity Purification Yield 
procedure (mg) (units) (units mg!) level (%) 
Crude extract 20,000 4,000,000 200 1 100 
(NH,),50, precipitation 5,000 3,000,000 600 3 75 
DEAE-cellulose chromatography 1,500 1,000,000 667 3.3 25 
Gel-filtration chromatography 500 750,000 1,500 WS 19 
Affinity chromatography 45 675,000 15,000 75 17 
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29. Ifthe protein did not contain any disulfide bonds, then the 
electrophoretic mobility of the trypsin fragments would be the 
same before and after performic acid treatment: all the fragments 
would lie along the diagonal of the paper. If one disulfide bond 
were present, the disulfide-linked trypsin fragments would run as 
a single peak in the first direction, then would run as two separate 
peaks after performic acid treatment. The result would be two 
peaks appearing off the diagonal: 


No disulfides present One disulfide present 


First direction of electrophoresis —> _ First direction of electrophoresis > 
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These fragments could then be isolated from the chromatography 


paper and analyzed by mass spectrometry to determine their amino 
acid composition and thus identify the cysteines participating in 
the disulfide bond. 


Chapter 4 

1. A nucleoside is a base attached to a ribose or deoxyribose sugar. 
A nucleotide is a nucleoside with one or more phosphoryl groups 
attached to the ribose or deoxyribose. 

2. Hydrogen-bond pairing between the base A and the base T as 
well as hydrogen-bond pairing between the base G and the base C 
in DNA. 

3. Tis always equal to A, and so these two nucleotides constitute 
40% of the bases. G is always equal to C, and so the remaining 60% 
must be 30% G and 30% C. 

4. Nothing, because the base-pair rules do not apply to single- 
stranded nucleic acids. 

5. (a) TTGATC; (b) GTTCGA,; (c) ACGCGT; (d) ATGGTA. 
6. (a) [T] + [C] = 0.46. (b) [T] = 0.30, [C] = 0.24, and 

[A] + [G] = 0.46. 

7. Stable hydrogen bonding occurs only between GC and AT pairs. 
Moreover, two purines are too large to fit inside the double helix, 
and two pyrimidines are too small to form base pairs with each other. 
8. The thermal energy causes the chains to wiggle about, which 
disrupts the hydrogen bonds between base pairs and the stacking 
forces between bases and thereby causes the strands to separate. 

9. The probability that any sequence will appear is 1/4", where 4 
is the number of nucleotides and n is the length of the sequence. 
The probability of any 15-base sequence appearing is 1/4!°, or 
1/1,073,741,824. Thus, a 15-nucleotide sequence would be likely 
to appear approximately three times (3 billion X probability of 
appearance). The probability of a 16-base sequence appearing is 
1/416, which is equal to 1/4,294,967,296. Such a sequence will be 
unlikely to appear more than once. 

10. One end of a nucleic acid polymer ends with a free 5’-hydroxyl 
group (or a phosphory] group esterified to the hydroxyl group), and 
the other end has a free 3’-hydroxyl group. Thus, the ends are differ- 
ent. Two strands of DNA can form a double helix only if the strands 
are running in different directions—that is, have opposite polarity. 
11. Although the individual bonds are weak, the population of 
thousands to millions of such bonds provides much stability. There 
is strength in numbers. 


12. There would be too much charge repulsion from the negative 
charges on the phosphory] groups. These charges must be coun- 
tered by the addition of cations. 

13. The three forms are the A-DNA, the B-DNA and the 
Z-DNA, with B-DNA being the most common. There are many 
differences (Table 4.2). Some key differences are: A-DNA and 
B-DNA are right-handed, whereas Z-DNA is left-handed. 
A-DNA forms in less-hydrated conditions than does B-DNA. 
The A form is shorter and wider than the B form. 

14. 5.88 X 10° base pairs. 

15. The diameter of DNA is 20 A and1 A = 0.1 nm, so the diam- 
eter is 2 nm. Because 1 ~m = 107 nm, the length is 2 X 104 nm. 
Thus, the axial ratio is 1 X 101. 

16. A template is the sequence of DNA or RNA that directs the 
synthesis of a complementary sequence. A primer is the initial seg- 
ment of a polymer that is to be extended during elongation. 

17. In conservative replication, after 1.0 generation, half of the 
molecules would be !°N-!5N, the other half ‘#N-!4N. After 2.0 
generations, one-quarter of the molecules would be !°N-!5N, the 
other three-quarters !4N-!4N. Hybrid !4N-!5N molecules would 
not be observed in conservative replication. 

18. The nucleotides used for DNA synthesis have the triphos- 
phate attached to the 5’-hydroxyl group with free 3’ -hydroxyl 
groups. Such nucleotides can be utilized only for 5’-to-3’ DNA 
synthesis. 

19. (a) Tritiated thymine or tritiated thymidine. (b) dATP, dGTP, 
dCTP, and TTP labeled with 3?P in the innermost (a) phosphorus 
atom. 

20. Molecules in parts a and b would not lead to DNA synthesis, 
because they lack a 3’-OH group (a primer). The molecule in part 
d has a free 3'-OH group at one end of each strand but no template 
strand beyond. Only the molecule in part c would lead to DNA 
synthesis. 

21. A retrovirus is a virus that has RNA as its genetic material. 
However, for the information to be expressed, it must first be con- 
verted into DNA, a reaction catalyzed by the enzyme reverse tran- 
scriptase. Thus, at least initially, information flow is opposite that 
of anormal cell: RNA — DNA rather than DNA > RNA. 

22. A thymidylate oligonucleotide should be used as the primer. 
The poly(A) template specifies the incorporation of T; hence, 
radioactive thymidine triphosphate (labeled in the a phosphoryl 
group) should be used in the assay. 

23. The ribonuclease serves to degrade the RNA strand, a neces- 
sary step in forming duplex DNA from the RNA—DNA hybrid. 
24. Treat one aliquot of the sample with ribonuclease and another 
with deoxyribonuclease. Test these nuclease-treated samples for 
infectivity. 

25. Deamination changes the original G - C base pair intoa G- U 
pair. After one round of replication, one daughter duplex will con- 
tain a G - C pair and the other duplex will contain an A - U pair. 
After two rounds of replication, there will be two G - C pairs, one 
A. U pair, and one A - T pair. 

26. (a) 4° = 65,536. In computer terminology, there are 64K 
8-mers of DNA. 

(b) A bit specifies two bases (say, A and C) and a second bit speci- 
fies the other two (G and T). Hence, two bits are needed to specify 
a single nucleotide (base pair) in DNA. For example, 00, 01, 10, 
and 11 could encode A, C, G, and T. An 8-mer stores 16 bits (2!® = 
65,536), the E. coli genome (4.6 X 10° bp) stores 9.2 10° bits, 
and the human genome (3.0 X 10° bases) stores 6.0 X 10° bits of 
genetic information. 

(c) A standard CD can hold about 700 megabytes, which is equal 
to 5.6 X 10° bits. A large number of 8-mer sequences could be 


stored on such a CD. The DNA sequence of E. coli could be writ- 
ten on a single CD with room to spare for a lot of music. One CD 
would not be quite enough to record the entire human genome. 
27. (a) Deoxyribonucleoside triphosphates versus ribonucleoside 
triphosphates. 

(b) 5’ — 3’ for both. 

(c) Semiconserved for DNA polymerase I; conserved for RNA 
polymerase. 

(d) DNA polymerase I needs a primer, whereas RNA polymerase 
does not. 

28. The template strand has a sequence complementary to that of the 
RNA transcript. The coding strand has the same sequence as that of 
the RNA transcript except for thymine (T) in place of uracil (U). 
29. Messenger RNA encodes the information that, on transla- 
tion, yields a protein. Ribosomal RNA is the catalytic component 
of ribosomes, the molecular complexes that synthesize proteins. 
Transfer RNA is an adaptor molecule, capable of binding a spe- 
cific amino acid and recognizing a corresponding codon. Transfer 
RNAs with attached amino acids are substrates for the ribosome. 
30. Three nucleotides encode an amino acid; the code is nonover- 
lapping; the code has no punctuation; the code exhibits direction- 
ality; the code is degenerate. 

31. (a) 5'-UAACGGUACGAU-3' 

(b) Leu-Pro-Ser-Asp-Trp-Met 

(c) Poly(Leu-Leu-Thr-Tyr) 

32. The 2'-OH group in RNA acts as an intramolecular nucleo- 
phile. In the alkaline hydrolysis of RNA, it forms a 2'-3' cyclic 
intermediate. 
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34. Gene expression is the process of expressing the information 
of a gene in its functional molecular form. For many genes, the 
functional information is a protein molecule. Thus, gene expres- 
sion includes transcription and translation. 

35. A nucleotide sequence whose bases represent the most- 
common, but not necessarily the only, members of the sequence. 


AS 


Answers to Problems 


A consensus sequence can be thought of as the average of many 
similar sequences. 

36. Cordycepin terminates RNA synthesis. An RNA chain con- 
taining cordycepin lacks a 3’-OH group. 

37. Only single-stranded RNA can serve as a template for protein 
synthesis. 

38. Degeneracy of the code refers to the fact that most amino acids 
are encoded by more than one codon. 

39. Ifonly 20 of the 64 possible codons encoded amino acids, then 
a mutation that changed a codon would likely result in a nonsense 
codon, leading to termination of protein synthesis. With degen- 
eracy, a nucleotide change might yield a synonym or a codon for an 
amino acid with similar chemical properties. 

40. (a) 2, 4, 8; (b) 1, 6, 10; (c) 3, 5, 7, 9. 

41. (a) 3; (b) 6; (c) 2; (d) 5; (e) 7; (£) 1; (g) 4. 

42. Incubation with RNA polymerase and only UTP, ATP, and 
CTP led to the synthesis of only poly(UAC). Only poly(GUA) 
was formed when GTP was used in place of CTP. 

43. A peptide terminating with Lys (UGA is a stop codon), 
another containing -Asn-Glu-, and a third containing -Met-Arg-. 
44. Phe-Cys-His-Val-Ala-Ala 

45. Exon shuffling is a molecular process that can lead to the 
generation of new proteins by the rearrangement of exons within 
genes. Because many exons encode functional protein domains, 
exon shuffling is a rapid and efficient means of generating 

new genes. 

46. Alternate splicing allows one gene to code for several different 
but related proteins. 

47. It shows that the genetic code and the 
biochemical means of interpreting the code 


5- 2 oo” are common to even very distantly related 
| life forms. It also testifies to the unity 
0 of life: that all life arose from a common 
| ancestor. 
48. (a) A codon for lysine cannot be 
CH, 
fe) Base changed to one for aspartate by the muta- 


tion of a single nucleotide. (b) Arg, Asn, 
Gln, Glu, Ile, Met, or Thr. 
49. The genetic code is degenerate. Of the 
20 amino acids, 18 are specified by more 
ee than one codon. Hence, many nucleotide 
changes (especially in the third base of 
a codon) do not alter the nature of the 
encoded amino acid. Mutations leading to 
an altered amino acid are usually more del- 
eterious than those that do not and hence 
| are subject to more stringent selection. 
50. GC base pairs have three hydrogen 
bonds compared with two for AT base 
pairs. Thus, the higher content of GC 
means more hydrogen bonds and greater 
helix stability. 
| 51. Cot value essentially corresponds to 
the complexity of the DNA sequence— 

g in other words, how long it will take for a 
sequence of DNA to find its complemen- 
tary strand to form a double helix. The 

more complex the DNA, the slower it reassociates to make the 
double-stranded form. 

52. Increasing salt increases the melting temperature. Because 

the DNA backbone is negatively charged, there is a tendency for 
charge repulsion to destabilize the helix and cause it to melt. The 
addition of salt neutralizes the charge repulsion, thereby stabilizing 
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the double helix. The results show that within the parameters of 
the experiment, more salt results in more stabilization, which is 
reflected as a higher melting temperature. 


Chapter 5 

1. Taq polymerase is the DNA polymerase from the thermophilic 
bacterium that lives in hot springs. Consequently, it is heat stable 
and can withstand the high temperatures required for PCR with- 
out denaturing. 

2. Ovalbumin cDNA should be used. E. coli lacks the machinery 
to splice the primary transcript arising from genomic DNA. 

3. Consistent with its planar, aromatic structure, ethidium bro- 
mide is a DNA intercalator: it aligns itself between the paired 
bases in a DNA duplex. 

4. The presence of the AlulI sequence would, on average, be 
(1/4)*, or 1/256, because the likelihood of any base being at any 
position is one-fourth and there are four positions. By the same 
reasoning, the presence of the NotI sequence would be (1/4)8, or 
1/65,536. Thus, the average product of digestion by AluI would be 
250 base pairs (0.25 kb) in length, whereas that by NotI would be 
66,000 base pairs (66 kb) in length. 

5. No, because most human genes are much longer than 4 kb. 

A fragment would contain only a small part of a complete gene. 

6. Southern blotting of an MstII digest would distinguish between 
the normal and the mutant genes. The loss of a restriction site 
would lead to the replacement of two fragments on the Southern 
blot by a single longer fragment. Such a finding would not prove 
that GTG replaced GAG; other sequence changes at the restric- 
tion site could yield the same result. 

7. Although the two enzymes cleave the same recognition site, 
they each break different bonds within the 6-bp sequence. 
Cleavage by KpnI yields an overhang on the 3’ strand, whereas 
cleavage by Acc65I produces an overhang on the 5’ strand. These 
sticky ends do not overlap. 


GGTACC 
CCATGG 
Cleave with Cleave with 
Kpnl Acc651 


G GTACC 
CCATG G 


GTACC 
Ss 


Incompatible sticky ends 


8. A simple strategy for generating many mutants is to synthesize 
a degenerate set of cassettes by using a mixture of activated nucleo- 
sides in particular rounds of oligonucleotide synthesis. Suppose 
that the 30-bp coding region begins with GTT, which encodes 
valine. If a mixture of all four nucleotides is used in the first and 
second rounds of synthesis, the resulting oligonucleotides will begin 
with the sequence XYT (where X and Y denote A, C, G, or T). 
These 16 different versions of the cassette will encode proteins 
containing either Phe, Leu, Ile, Val, Ser, Pro, Thr, Ala, Tyr, His, 
Asn, Asp, Cys, Arg, or Gly at the first position. Likewise, 
degenerate cassettes can be made in which two or more codons 
are simultaneously varied. 

9. Because PCR can amplify as little as one molecule of DNA, 
statements claiming the isolation of ancient DNA need to be 


greeted with some skepticism. The DNA would need to be 
sequenced. Is it similar to human, bacterial, or fungal DNA? If so, 
contamination is the likely source of the amplified DNA. Is it sim- 
ilar to that of birds or crocodiles? This sequence similarity would 
strengthen the case that it is dinosaur DNA because these species 
are evolutionarily close to dinosaurs. 

10. PCR amplification is greatly hindered by the presence of 
G-—C-rich regions within the template. Owing to their high melting 
temperatures, these templates do not denature easily, preventing 
the initiation of an amplification cycle. In addition, rigid secondary 
structures prevent the progress of DNA polymerase along the tem- 
plate strand during elongation. 

11. At high temperatures of hybridization, only very close 
matches between primer and target would be stable because all 

(or most) of the bases would need to find partners to stabilize the 
primer-target helix. As the temperature is lowered, more mis- 
matches would be tolerated; so the amplification is likely to yield 
genes with less sequence similarity. In regard to the yeast gene, 
synthesize primers corresponding to the ends of the gene, and 

then use these primers and human DNA as the target. If nothing 
is amplified at 54°C, the human gene differs from the yeast gene, 
but a counterpart may still be present. Repeat the experiment at a 
lower temperature of hybridization. 

12. Digest genomic DNA with a restriction enzyme, and select the 
fragment that contains the known sequence. Circularize this frag- 
ment. Then carry out PCR with the use of a pair of primers that 
serve as templates for the synthesis of DNA away from the known 
sequence. 

13. The encoded protein contains four repeats of a specific sequence. 
14. Use chemical synthesis or the polymerase chain reaction to 
prepare hybridization probes that are complementary to both ends 
of the known (previously isolated) DNA fragment. Challenge 
clones representing the library of DNA fragments with both of 

the hybridization probes. Select clones that hybridize to one of the 
probes but not the other; such clones are likely to represent DNA 
fragments that contain one end of the known fragment along with 
the adjacent region of the particular chromosome. 

15. The codon(s) for each amino acid can be used to determine the 
number of possible nucleotide sequences that encode each peptide 
sequence (Table 4.5): 


Ala—Met—Ser—Leu—Pro—Trp: 

4X 1X 6X 6X 4X 1 = 576 total sequences 
Gly—Trp—Asp—Met-His-Lys: 
4X1xX2X1X 2X 2 = 32 total sequences 
Cys—Val-Trp—Asn—Lys-lle: 

2X4X1X 2X 2 X 3 = 96 total sequences 


Arg—Ser—Met—Leu—Gln—Asn: 
6X6X1X 6X 2 X 2 = 864 total sequences 


The set of DNA sequences encoding the peptide Gly-Trp-Asp- 
Met-His-Lys would be most ideal for probe design because it 
encompasses only 32 total oligonucleotides. 

16. Within a single species, individual dogs show enormous varia- 
tion in body size and substantial diversity in other physical char- 
acteristics. Therefore, genomic analysis of individual dogs would 
provide valuable clues concerning the genes responsible for the 
diversity within the species. 

17. On the basis of the comparative genome map shown in 
Figure 5.28, the region of greatest overlap with human chromo- 
some 20 can be found on mouse chromosome 2. 

18. T,, is the melting temperature of a double-stranded nucleic 
acid. If the melting temperatures of the primers are too different, 


the extent of hybridization with the target DNA will differ during 
the annealing phase, which would result in differential replications 
of the strands. 

19. Careful comparison of the sequences reveals that there is 

a 7-bp region of complementarity at the 3’ ends of these two 
primers: 


5'-GGATCGATGCTCGCGA-3' 


eeeee 
3'-GAGCGCTGGGCTAGGA-5’ 


In a PCR experiment, these primers would likely anneal to one 
another, preventing their interaction with the template DNA. 
During DNA synthesis by the polymerase, each primer would act 
as a template for the other primer, leading to the amplification of a 
25-bp sequence corresponding to the overlapped primers. 

20. A mutation in person B has altered one of the alleles for 

gene X, leaving the other intact. The fact that the mutated allele 
is smaller suggests that a deletion has occurred in one copy of 

the gene. The one functioning copy is transcribed and translated 
and apparently produces enough protein to render the person 
asymptomatic. 

Person C has only the smaller version of the gene. This gene 
is neither transcribed (negative northern blot) nor translated 
(negative western blot). 

Person D has a normal-size copy of the gene but no corre- 
sponding RNA or protein. There may be a mutation in the pro- 
moter region of the gene that prevents transcription. 

Person E has a normal-size copy of the gene that is tran- 
scribed, but no protein is made, which suggests that a mutation 
prevents translation. There are a number of possible explanations, 
including a mutation that introduced a premature stop codon in 
the mRNA. 

Person F has a normal amount of protein but still displays the 
metabolic problem. This finding suggests that the mutation affects 
the activity of the protein—for instance, a mutation that compro- 
mises the active site of enzyme Y. 

21. Chongqing: residue 2, L—> R, CTG—CGG 

Karachi: residue 5, A > P, GCC > CCC 

Swan River: residue 6, D— G, GAC > GGC 
22. This particular person is heterozygous for this particular 
mutation: one allele is wild type, whereas the other carries a point 
mutation at this position. Both alleles are PCR amplified in this 
experiment, yielding the “dual peak” appearance on the sequenc- 
ing chromatogram. 


Chapter 6 

1. There are 26 identities and two gaps for a score of 210. The two 
sequences are approximately 26% identical. This level of homology 
is likely to be statistically significant. 

2. They are likely related by divergent evolution, because three- 
dimensional structure is more conserved than is sequence identity. 
3. (1) Identity score = —25; Blosum score = 14; (2) identity score = 
15; Blosum score = 3. 

4. U. One possible structure: 
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5. There are 449, or 1.2 X 1024, different molecules. Each molecule 
has a mass of 2.2 X 10~?°, because 1 mol of polymer has a mass of 
330 g mol! X 40, and there are 6.02 X 107° molecules per mole. 
Therefore, 26.4 kg of RNA would be required. 

6. Because three-dimensional structure is much more closely asso- 
ciated with function than is sequence, tertiary structure is more 
evolutionarily conserved than is primary structure. In other words, 
protein function is the most important characteristic, and protein 
function is determined by structure. Thus, the structure must be 
conserved but not necessarily a specific amino acid sequence. 

7. Alignment score of sequences (1) and (2) is 6 X 10 = 60. 

Many answers are possible, depending on the randomly reordered 
sequence. A possible result is 


Shuffled sequence: (2) TKADKAGEYL 


(1) ASNFLDKAGK 
(2) TKADKAGEYL 


Alignment: 


Alignment score is 4 X 10 = 40. 


8. (a) Almost certainly diverged from a common ancestor. 

(b) Almost certainly diverged from a common ancestor. (c) May 
have diverged from a common ancestor, but the sequence align- 
ment may not provide supporting evidence. (d) May have diverged 
from a common ancestor, but the sequence alignment is unlikely to 
provide supporting evidence. 

9. Replacement of cysteine, glycine, and proline never yields a 
positive score. Each of these residues exhibits features unlike those 
of its other 19 counterparts: cysteine is the only amino acid capable 
of forming disulfide bonds, glycine is the only amino acid without 
a side-chain and is highly flexible, and proline is the only amino 
acid that is highly constrained through the bonding of its side 
chain to its amine nitrogen. 

10. Protein A is clearly homologous to protein B, given 65% 
sequence identity, and so A and B are expected to have quite simi- 
lar three-dimensional structures. Likewise, proteins B and C are 
clearly homologous, given 55% sequence identity, and so B and C 
are expected to have quite similar three-dimensional structures. 
Thus, proteins A and C are likely to have similar three-dimensional 
structures, even though they are only 15% identical in sequence. 

11. The likely secondary structure is 


12. To detect pairs of residues with correlated mutations, there 
must be variability in these sequences. If the alignment is over- 
represented by closely related organisms, there may not be enough 
changes in their sequences to allow the identification of potential 
base-pairing patterns. 

13. After RNA molecules have been selected and reverse tran- 
scribed, PCR is performed to introduce additional mutations into 
these strands. The use of this error-prone, thermostable poly- 
merase in the amplification step would enhance the efficiency of 
this random mutagenesis. 

14. The initial pool of RNA molecules used in a molecular- 
evolution experiment is typically much smaller than the total num- 
ber of possible sequences. Hence, the best possible RNA sequences 
will likely not be represented in the initial set of oligonucleotides. 
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Mutagenesis of the initial selected RNA molecules allows for the 
iterative improvement of these sequences for the desired property. 
15. 107 or 108 identities (depending on which annotated human 
sequence is chosen). 


Chapter 7 

1. The whale swims long distances between breaths. A high con- 
centration of myoglobin in the whale muscle maintains a ready 
supply of oxygen for the muscle between breathing episodes. 

2. (a) 2.96 X 10-11 g 

(b) 2.74 X 108 molecules 

(c) No. There would be 3.17 X 108 hemoglobin molecules in a 

red cell if they were packed in a cubic crystalline array. Hence, the 
actual packing density is about 84% of the maximum possible. 

3. 2.65 g (or 4.75 X 107? mol) of Fe 

4, (a) In human beings, 1.44 X 1072 g (4.49 X 1074 mol) of O, 
per kilogram of muscle. In sperm whale, 0.144 g (4.49 X 107% mol) 
of O, per kilogram. 

(b) 128 

5. The pK, is (a) lowered; (b) raised; and (c) raised. 

6. Deoxy Hb A contains a complementary site, and so it can add 
on to a fiber of deoxy Hb S. The fiber cannot then grow further, 
because the terminal deoxy Hb A molecule lacks a sticky patch. 

7. 62.7% oxygen-carrying capacity 

8. Myoglobin does not exhibit a Bohr effect. The interactions 
responsible for mediating the Bohr effect in hemoglobin are depen- 
dent upon a tetrameric structure. Myoglobin is a monomer. 

9. A higher concentration of BPG would shift the oxygen-binding 
curve to the right, causing an increase in P,... The larger value of 
P.. would promote dissociation of oxygen in the tissues and would 
thereby increase the percentage of oxygen delivered to the tissues. 
10. (a) The transfusion would increase the number of red blood 
cells, which increases the oxygen-carrying capacity of the blood, 
allowing more sustained effort. 

(b) BPG stabilizes the T-state of hemoglobin, which results in a more 
efficient release of oxygen. If BPG is depleted, the oxygen will not be 
released even though the red blood cells are carrying more oxygen. 
11. Oxygen binding appears to cause the copper ions and their 
associated histidine ligands to move closer to one another, thereby 
also moving the helices to which the histidines are attached (in 
similar fashion to the conformational change in hemoglobin). 

12. The modified hemoglobin should not show cooperativity. 
Although the imidazole in solution will bind to the heme iron (in 
place of histidine) and will facilitate oxygen binding, the imidazole 
lacks the crucial connection to the particular a helix that must 
move so as to transmit the change in conformation. 

13. Inositol pentaphosphate (part c) is highly anionic, much like 
2,3-bisphosphoglycerate. 

14. 
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15. Release of acid will lower the pH. A lower pH promotes oxy- 
gen dissociation in the tissues. However, the enhanced release of 


oxygen in the tissues will increase the concentration of deoxy-Hb, 
thereby increasing the likelihood that the cells will sickle. 

16. (a) Y = 0.5 when pO, = 10 torr. The plot of Y versus pO, 
appears to indicate little or no cooperativity. 
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(b) The Hill plot shows slight cooperativity with n = 1.3 in the 
central region. 
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(c) Deoxy dimers of lamprey hemoglobin could have lower affinity 
for oxygen than do the monomers. If the binding of the first oxy- 
gen atom to a dimer causes dissociation of the dimer to give two 
monomers, then the process would be cooperative. In this mecha- 
nism, oxygen binding to each monomer would be easier than bind- 
ing the first oxygen atom to a deoxy dimer. 

17. (a) 2; (b) 4; (c) 2; (d) 1. 

18. The electrostatic interactions between BPG and hemoglobin 
would be weakened by competition with water molecules. The T 
state would not be stabilized. 

19. Fetal hemoglobin contains two a chains and two y chains. 
The increase in levels of the y chain provides an alternative to the 
mutated B chain in sickle-cell anemia patients. Hence, by increas- 
ing fetal hemoglobin expression, hydroxyurea reduces the potential 
for the formation of insoluble hemoglobin aggregates. 


Chapter 8 

1. Rate enhancement and substrate specificity. 
2. A cofactor. 

3. Coenzymes and metals. 


4, Vitamins are converted into coenzymes. 

5. Enzymes facilitate the formation of the transition state. 

6. The intricate three-dimensional structure of proteins allows 

the construction of active sites that will recognize only specific 

substrates. 

7. The energy required to reach the transition state (the activation 

energy) is returned when the transition state proceeds to product. 
[P] kp 107+ 

8. (a)K = 

[S]_ kr 10° 

AG* = —11.42 kJ mol7! (—2.73 kcal mol™') 


100. Using equation 5 in the text, 


(b) kp = 10-2 s-! and kp = 10~* s~!. The equilibrium constant 
and AG°’ values are the same for both the uncatalyzed and cata- 
lyzed reactions. 

9. Protein hydrolysis has a large activation energy. Protein synthe- 
sis must require energy to proceed. 

10. The enzymes help protect the fluid that surrounds eyes from 
bacterial infection. 

11. Binding energy is the free energy released when two molecules 
bind together, such as when an enzyme and a substrate interact. 
12. Binding energy is maximized when an enzyme interacts with 
the transition state, thereby facilitating the formation of the transi- 
tion state and enhancing the rate of the reaction. 

13. There would be no catalytic activity. If the enzyme-substrate 
complex is more stable than the enzyme-transition-state com- 
plex, the transition state would not form and catalysis would not 
take place. 

14. Transition states are very unstable. Consequently, molecules 
that resemble transition states are themselves likely to be unstable 
and, hence, difficult to synthesize. 

15. (a) 0; (b) +28.53; (c) —22.84; (d) —11.42; (e) +5.69. 

16. (a) AG°’ = —RT In Ke, 


+1.8 = —(1.98 X 1073 kcal7! K7~! mol7!)(298 K) 
(In[G1P]/[G6P]) 


—3.05 = In[G1P]/[GoP] 
+3.05 = In[G6P]/[G1P] 
Kig’ = 21 or Kig = 4.8 X 10°? 


Because [G6P]/[G1P] = 21, there is 1 molecule of G1P for every 
21 molecules of G6P. Because we started with 0.1 M, the [G1P] 

is 1/22(0.1 M) = 0.0045 M and [G6P] must be 21/22(0.1 M) or 
0.096 M. Consequently, the reaction does not proceed as written to 
a significant extent. 

(b) Supply G6P at a high rate and remove G1P at a high rate by 
other reactions. In other words, make sure that the [G6P]/[G1P] 

is kept large. 

17. Kes = 19, AG = —7.3kJ mol7! (1.77 kcal mol~!) 

18. The three-dimensional structure of an enzyme is stabilized by 
interactions with the substrate, reaction intermediates, and prod- 
ucts. This stabilization minimizes thermal denaturation. 

19. At substrate concentrations near the K,,, the enzyme dis- 
plays significant catalysis yet is sensitive to changes in substrate 
concentration. 

20. No, Ky, is not equal to the dissociation constant, because the 
numerator also contains k,, the rate constant for the conversion of 
the enzyme-substrate complex into enzyme and product. If, how- 
ever, k, is much smaller than R_, Kyy = Kj. 

21. When [S] = 10 K\y, Vp = 0.91 V,.,,. When [S] = 20 Ky, 

Vy = 95 Vinay 
So any Michaelis—Menten curves showing that the enzyme actually 
attains V__,, are pernicious lies. 
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22. (a) 31.1 pmol; (b) 0.05 mol; (c) 622 s~!, a midrange value for 
enzymes (Table 8.5). 

23. (a) Yes, Ky, = 5.2 X 107° M; (b) V,,,,, = 6.8 X 10719 mol 
minute™!; (c) 337 s71, 

24. Penicillinase, like glycopeptide transpeptidase, forms an acyl- 
enzyme intermediate with its substrate but transfers the interme- 
diate to water rather than to the terminal glycine residue of the 
pentaglycine bridge. 

25. (a) Vinay 18 9.5 wmol minute” !, Ky, is 1.1 Xx 1075 M, the same 
as without inhibitor. 

(b) Noncompetitive. 

(c) 2.5 X 10-5 M 

(d) fgg = 0.73, in the presence or absence of this noncompetitive 
inhibitor. 

26. (a) Vo = Vinax ~ (Vo/TS]) Kyy 
(b) Slope = —Ky,, y-intercept = V 


, x-intercept = V, 
max 
(c) An Eadie—Hofstee plot. 


max 


/Kyy 


1 No inhibitor 


2 Competitive inhibitor 


3 Noncompetitive inhibitor 


V/[S] 


27. Sequential reactions are characterized by the formation of a 
ternary complex consisting of the enzyme and both substrates. 
Double-displacement reactions always require the formation of a 
temporarily substituted enzyme intermediate. 

28. The rates of utilization of substrates A and B are given by 


Reat 
Va= ( rt) EIA 


V, = (Fe) E][B 
ae al ][B] 


Hence, the ratio of these rates is 


-_ Reat Reat 
Vale = (Fe) tau Fe) 


Thus, an enzyme discriminates between competing substrates on 
the basis of their values of k,,,/Ky, rather than of K,, alone. 

29. The mutation slows the reaction by a factor of 100 because the 
activation free energy is increased by +11.42 kJ mol! (+2.73 kcal 
mol‘). Strong binding of the substrate relative to the transition 
state slows catalysis. 

30. 11 mol minute! 

31. (a) This piece of information is necessary for determining the 
correct dosage of succinylcholine to administer. 

(b) The duration of the paralysis depends on the ability of the 
serum cholinesterase to clear the drug. If there were one-eighth the 
amount of enzyme activity, paralysis could last eight times as long. 
(c) Ky, is the concentration needed by the enzyme to reach % V,,.,,- 
Consequently, for a given concentration of substrate, the reaction 
catalyzed by the enzyme with the lower K,, will have the higher 
rate. The mutant patient with the higher K), will clear the drug at 
a much lower rate. 

32. (a) Ky, 1s a measure of affinity only if k, is rate limiting, which 
is the case here. Therefore, the lower K,, means higher affinity. 
The mutant enzyme has a higher affinity. 


and 
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(b) 50 pmol minute~!. 10 mM is K,,, and Kj, yields % Vinase V mas 
is 100 pmol minute~!, and so. . . 

(c) Enzymes do not alter the equilibrium of the reaction. 

33. Enzyme 2. Despite the fact that enzyme 1 has a higher V,,_,.. 
than enzyme 2, enzyme 2 shows greater activity at the concentra- 
tion of the substrate in the environment because enzyme 2 has a 
lower K, for the substrate. 

34. (a) The most effective means of measuring the efficiency 

of any enzyme-substrate complex is to determine the k.,./ Ky, 
values. For the three substrates in question, the respective values 
of k.,,/ Ky, are: 6, 15, and 36. Thus, the enzyme exhibits a strong 
preference for cleaving peptide bonds in which the second amino 
acid is a large hydrophobic amino acid. 

(b) The k.,,/ Ky, for this substrate is 2. Not very effective. This 
value suggests that the enzyme prefers to cleave peptide bonds 
with the following specificity: small R group—large hydrophobic 
R group. 

35. If the total amount of enzyme (E+) is increased, V,,,,, will 


max 


increase, because V,.,,. = R,[E];. But Ky = (R-1 + k2)/k1; that 
is, it is independent of substrate concentration. The middle graph 


describes this situation. 


36. 
Experimental condition Vas Ky 
a. Twice as much enzyme 
is used Doubles No change 
b. Half as much enzyme 
is used Half as large No change 
c. A competitive inhibitor 
is present No change Increases 
d. An uncompetitive inhibitor 
is present Decreases Decreases 
e. A pure non-competitive 
1s present Decreases No change 
37. (a) 


ae 


1/Vo 


1/[S] 


(b) This behavior is substrate inhibition: at high concentrations, 
the substrate forms unproductive complexes at the active site. The 
adjoining drawing shows what might happen. Substrate normally 
binds in a defined orientation, shown in the drawing as red to red 
and blue to blue. At high concentrations, the substrate may bind at 
the active site such that the proper orientation is met for each end 
of the molecule, but two different substrate molecules are binding. 


Enzyme 
active site 


? 


Normal substrate binding at 
the active site. Substrate will 
be cleaved to red and blue 
balls. e. 6.8 


Enzyme 
active site 


ty 


Substrate inhibition 


38. The first step will be the rate-limiting step. Enzymes Ep and 
Eq are operating at % V,.,,, whereas the K,, for enzyme Eq is 
greater than the substrate concentration. E, would be operating at 
approximately 107?Vinax- 

39. The fluorescence spectroscopy reveals the existence of an 
enzyme-serine complex and of an enzyme-serine—indole complex. 
40. (a) When [S*] is much greater than the value of K,,, pH will 
have a negligible effect on the enzyme because St will interact with 
E~ as soon as the enzyme becomes available. 


V, 


max 


max | 


pH 


(b) When [S*] is much less than the value of Ky, the plot of Vp 
versus pH becomes essentially a titration curve for the ionizable 
groups, with enzyme activity being the titration marker. At low 
pH, the high concentration of H* will keep the enzyme in the EH 
form and inactive. As the pH rises, more and more of the enzyme 
will be in the E~ form and active. At high pH (low H*), all of the 
enzyme is E~. 

(c) The midpoint on this curve will be the pK, of the ionizable 
group, which is stated to be pH 6. 

41. (a) Incubating the enzyme at 37°C leads to a denaturation 

of enzyme structure and a loss of activity. For this reason, most 
enzymes must be kept cool if they are not actively catalyzing their 
reactions. 

(b) The coenzyme apparently helps to stabilize enzyme struc- 
ture, because enzyme from PLP-deficient cells denatures faster. 
Cofactors often help stabilize enzyme structure. 

42. The slow second order step occurs when complementary 
sequences on two different molecules bind with one another. 
Once the initial sequence alignment occurs, the remainder of the 
molecule can quickly locate complementary sequences from this 
nucleation site and reanneal, a first order reaction. 


Chapter 9 

1. For the amide substrate, the formation of the acyl-enzyme 
intermediate is slower than the hydrolysis of the acyl-enzyme 
intermediate, and so no burst is observed. A burst is observed for 
ester substrates; the formation of the acyl-enzyme intermediate is 
faster, leading to the observed burst. 

2. The histidine residue in the substrate can substitute to some 
extent for the missing histidine residue of the catalytic triad of the 
mutant enzyme. 

3. No. The catalytic triad works as a unit. After this unit has been 
made ineffective by the mutation of histidine to alanine, the further 
mutation of serine to alanine should have only a small effect. 

4. The substitution corresponds to one of the key differences 
between trypsin and chymotrypsin, and so trypsin-like speci- 
ficity (cleavage after lysine and arginine) might be predicted. 


In fact, additional changes are required to effect this specificity 
change. 

5. Imidazole is apparently small enough to reach the active site 
of carbonic anhydrase and compensate for the missing histidine. 
Buffers with large molecular components cannot do so, and the 
effects of the mutation are more evident. 

6. No. The odds of such a sequence being present are approxi- 
mately 1 in 4!° = 1,048,576. Because a typical viral genome 

has only 50,000 bp, the target sequence would be unlikely to be 
present. 

7. No, because the enzyme would destroy the host DNA before 
protective methylation could take place. 

8. No. The bacteria receiving the enzyme would have their own 
DNA destroyed because they would likely lack the appropriate 
protective methylase. 

9. EDTA will bind to Zn2+ and remove the ion, which is required 
for enzyme activity, from the enzyme. 

10. (a) The aldehyde reacts with the active-site serine. (b) A hemi- 
acetal is formed. 

1. Trypsin. 

2. The reaction is expected to be slower by a factor of 10 
because the rate depends on the pK, of the zinc-bound water. 
Reat = 60,000 8-7, 

13. EDTA binds the magnesium necessary for the reaction. 

14. ATP hydrolysis is reversible within the active site. ATP 
hydrolysis takes place within the active site with the incorpora- 
tion of !8O, ATP is re-formed, and the ATP is released back into 
solution. 

15. If the aspartate is mutated, the protease is inactive and the 
virus will not be viable. 

16. Water substitutes for the hydroxyl group of serine 236 in 
mediating proton transfer from the attacking water and the 
-phosphoryl group. 

17. For subtilisin, the catalytic power is approximately 

30s 1/1078 s~! = 3 X 10°. For carbonic anhydrase (at pH 7), 
the catalytic power is approximately 500,000 s~!/0.15 s~! = 


3.3 X 10°. Subtilisin is the more powerful enzyme by this criterion. 


18. The triple mutant still catalyzes the reaction by a factor 

of approximately 1000-fold compared to the uncatalyzed 
reaction. This could be due to the enzyme binding the substrate 
and holding it in a conformation that is susceptible to attack 

by water. 

19. (a) Cysteine protease: The same as Figure 9.8, except that 
cysteine replaces serine in the active site and no aspartate is 
present. 

(b) Aspartyl protease: 
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(c) Metalloprotease: 
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Chapter 10 

1. The enzyme catalyzes the first step in the synthesis of pyrimi- 
dines. It facilitates the condensation of carbamoyl phosphate and 
aspartate to form N-carbamoylaspartate and inorganic phosphate. 
2. The protonated form of histidine probably stabilizes the nega- 
tively charged carbonyl oxygen atom of the scissile bond (bond to 
be broken) in the transition state. Deprotonation would lead to a 
loss of activity. Hence, the rate is expected to be half maximal at a 
pH of about 6.5 (the pK of an unperturbed histidine side chain in 
a protein) and to decrease as the pH is raised. 

3. The inhibition of an allosteric enzyme by the end product of 
the pathway controlled by the enzyme. It prevents the production 
of too much end product and the consumption of substrates when 
product is not required. 

4. High concentrations of ATP might signal two overlapping situ- 
ations. The high levels of ATP might suggest that some nucleotides 
are available for nucleic acid synthesis, and consequently, UTP 
and CTP should be synthesized. The high levels of ATP indicate 
that energy is available for nucleic acid synthesis, and so UTP and 
CTP should be produced. 

5. All of the enzyme would be in the R form all of the time. There 
would be no cooperativity. The kinetics would look like that of a 
Michaelis-Menten enzyme. 

6. The enzyme would show simple Michaelis—Menten kinetics 
because it is essentially always in the R state. 

7. CTP is formed by the addition of an amino group to UTP. 
Evidence indicates the UTP is also capable of inhibiting ATCase 
in the presence of CTP. 

8. Homotropic effectors are the substrates of allosteric enzymes. 
Heterotropic effectors are the regulators of allosteric enzymes. 
Homotropic effectors account for the sigmoidal nature of the 
velocity versus substrate concentration curve, whereas heterotropic 
effectors alter the K,, of the curve. Ultimately, both types of effec- 
tors work by altering the T/R ratio. 

9. The reconstitution shows that the complex quaternary structure 
and the resulting catalytic and regulatory properties are ultimately 
encoded in the primary structure of individual components. 

10. If substrates had been used, the enzyme would catalyze the 
reaction. Intermediates would not accumulate on the enzyme. 
Consequently, any enzyme that crystallized would have been free 
of substrates or products. 

11. (a) 100. The change in the [R]/[T] ratio on binding one 
substrate molecule must be the same as the ratio of the substrate 
affinities of the two forms. 

(b) 10. The binding of four substrate molecules changes the 
[R]/[T] by a factor of 100* = 108. The ratio in the absence of 
substrate is 10’. Hence, the ratio in the fully liganded molecule 

is 10° x 10°” = 10. 

12. The fraction of molecules in the R form is 107°, 0.004, 0.615, 
0.998, and 1 when 0, 1, 2, 3, and 4 ligands, respectively, are bound. 
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13. The sequential model can account for negative cooperativity, 
whereas the concerted model cannot. 
14. 


Allosteric enzyme — 


ue k—— Allosteric enzyme 
plus inhibitor Y 


Allosteric enzyme 
plus activator 


| 


Michaelis-Menten enzyme 


i) 1/[S] 


15. The binding of PALA switches ATCase from the T to 

the R state because PALA acts as a substrate analog. An 
enzyme molecule containing bound PALA has fewer free 
catalytic sites than does an unoccupied enzyme molecule. 
However, the PALA-containing enzyme will be in the R state 
and, hence, have a higher affinity for the substrates. The 
dependence of the degree of activation on the concentration of 
PALA is a complex function of the allosteric constant L, and 
of the binding affinities of the R and T states for the analog 
and substrates. 

16. The net outcome of the two reactions is the hydrolysis of ATP 
to ADP and P., which has a AG of —50 kJ mol~! (—12 kcal mol~!) 
under cellular conditions. 

17. Isozymes are homologous enzymes that catalyze the same 
reaction but have different kinetic or regulatory properties. 

18. Although the same reaction may be required in a variety of 
tissues, the biochemical properties of tissues will differ accord- 

ing to their biological function. Isozymes allow the fine-tuning 

of catalytic and regulatory properties to meet the specific needs of 
the tissue. 

19. (a) 7; (b) 8; (c) 11; (d) 6; (e) 1; (£) 12; (g) 3; (h) 4; @) 5; G) 2; 

(k) 10; (1) 9. 

20. When phosphorylation takes place at the expense of ATP, suf- 
ficient energy is expended to dramatically alter the structure and 
hence activity of a protein. Moreover, because ATP is the cellular 
energy currency, protein modification is linked to the energy status 
of the cell. 

21. Covalent modification is reversible, whereas proteolytic cleav- 
age is irreversible. 

22. Activation is independent of zymogen concentration because 
the reaction is intramolecular. 

23. Although quite rare, cases of enteropeptidase deficiency have 
been reported. The affected person has diarrhea and fails to thrive 
because digestion is inadequate. In particular, protein digestion is 
impaired. 

24. Add blood from the second patient to a sample from the 
first. If the mixture clots, the second patient has a defect different 
from that of the first. This type of assay is called a complementa- 
tion test. 

25. Activated factor X remains bound to blood-platelet mem- 
branes, which accelerates its activation of prothrombin. 

26. Antithrombin III is a very slowly hydrolyzed substrate of 
thrombin. Hence, its interaction with thrombin requires a fully 
formed active site on the enzyme. 


27. Replacing methionine with leucine would be a good choice. 
Leucine is resistant to oxidation and has nearly the same volume 
and degree of hydrophobicity as methionine has. 

28. Inappropriate clot formation could block arteries in the brain, 
causing a stroke, or the heart, causing a heart attack. 

29. Thrombin catalyzes the hydrolysis of fibrinogen to form active 
fibrin. But it also has a role in shutting down the cascade by acti- 
vating protein C, a protease that digests other clotting enzymes V, 
and VIII. 

30. Tissue-type plasminogen activator, or TPA, is a serine pro- 
tease that leads to the dissolution of blood clots. TPA activates 
plasminogen that is bound to a fibrin clot, converting it into active 
plasmin, which then hydrolyzes the fibrin of the clot. 

31. A mature clot is stabilized by amide linkages between the side 
chains of lysine and glutamine that are absent in a soft clot. The 
linkages are formed by transglutaminase. 

32. The simple sequential model predicts that the fraction of cata- 
lytic chains in the R state, fp, 1s equal to the fraction containing 
bound substrate, Y. The concerted model, in contrast, predicts that 
fp increases more rapidly than Y as the substrate concentration 1s 
increased. The change in fp leads to the change in Y on addition of 
substrate, as predicted by the concerted model. 

33. The binding of succinate to the functional catalytic sites of 
the native c, moiety changed the visible absorption spectrum of 
nitrotyrosine residues in the other c, moiety of the hybrid enzyme. 
Thus, the binding of substrate analog to the active sites of one tri- 
mer altered the structure of the other trimer. 

34. According to the concerted model, an allosteric activator 
shifts the conformational equilibrium of all subunits toward the 

R state, whereas an allosteric inhibitor shifts it toward the T state. 
Thus, ATP (an allosteric activator) shifted the equilibrium to the 
R form, resulting in an absorption change similar to that obtained 
when substrate is bound. CTP had a different effect. Hence, this 
allosteric inhibitor shifted the equilibrium to the T form. Thus, the 
concerted model accounts for the ATP-induced and CTP-induced 
(heterotropic), as well as for the substrate-induced (homotropic), 
allosteric interactions of ATCase. 

35. (a) When crowded, the control group displays gregarious 
behavior. (b) Inhibition of PKA appears to prevent gregarious 
behavior, while inhibition of PKG has no effect on the behavior. 
(c) The effect of PKG inhibition was investigated to establish that 
the effect seen with PKA inhibition is specific and not just due to 
the inhibition of any kinase. (d) PKA plays a role in altering behav- 
ior of insects. (e) Gregariousness in locusts that are always gregari- 
ous is not affected by PKA inhibition, suggesting that PKA may 
play a role in modifying behavior patterns but not in establishing 
the patterns. 

36. Residues a and d are located in the interior of an a-helical 
coiled coil, near the axis of the superhelix. Hydrophobic interac- 
tions between these side chains contribute to the stability of the 
coiled coil. 

37. In the R state, ATCase expands and becomes less dense. This 
decrease in density results in a decrease in the sedimentation value 
(see the formula on p. 76). 

38. The interaction between trypsin and the inhibitor is so stable 
that the transition state is rarely formed. Recall that maximal bind- 
ing energy is released when an enzyme binds to the transition state. 
If the substrate-enzyme interaction is too stable, the transition 
state rarely forms. 

39. Dicoumarol is a competitive inhibitor of y-glutamy] 
carboxylase. Consequently, y-carboxyglutamate, which is 
required for prothrombin to be converted into thrombin, 

is not formed. Amino acid composition is determined by 


hydrolyzing the protein at elevated temperatures in the presence 
of strong acid. Under these conditions, the carboxyl group of 
y-carboxyglutamate is removed from the normal prothrombin, 
leaving simply glutamate. 


40. 
41. 
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Chapter 11 


1. Carbohydrates were originally regarded as hydrates of carbon 
because the empirical formula of many of them is (CH,O), . 

2. Three amino acids can be linked by peptide bonds in only six 
different ways. However, three different monosaccharides can be 
linked in a plethora of ways. The monosaccharides can be linked 
ina linear or branched manner, with a or B linkages, with bonds 
between C-1 and C-3, between C-1 and C-4, between C-1 and 
C-6, and so forth. In fact, the three monosaccharides can form 
12,288 different trisaccharides. 

3. (a) 10; (b) 6; (c) 8; (d) 9; (e) 2; (£) 4; (g) 1; (A) 5; (@) 7; G) 3. 

4. (a) aldose-ketose; (b) epimers; (c) aldose-ketose (d) anomers; 
(e) aldose-ketose; (f) epimers. 
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5. Erythrose: tetrose aldose; ribose: pentose aldose; glyceral- 
dehyde: triose aldose; dihydroxyacetone: triose ketose; erythru- 
lose: tetrose ketose; ribulose: pentose ketose; fructose: hexose 


ketose. 
6. 

CHO CHO CHO 
H—C—\OH HO—-C——H HO—-C——-H 
H—C—\OH H—C— OH HO—-C——-H 
H—C—OH H—C— OH H—C—OH 
H—C—\OH H—C— OH H—C— OH 

CH,OH CH,OH CH,OH 

p-Allose p-Altrose p-Mannose 

CHO CHO CHO 
H—C—OH HO—-C——-H H—C—OH 
H— C—OH H—C— OH HO—-C——-H 

HO—C——H HO—-C——-H HO—-C——-H 
H—C—OH H—C— OH H—C—OH 

CH,0OH CH,OH CH,0OH 

p-Gulose p-Idose p-Galactose 

CHO 

HO—-C——-H 
HO—-C——-H 
HO—-C——-H 
H—C—OH 
CH,0OH 
p-Talose 
73 
CH,OH 
OH 
OH H 
OH O— CH, 
OH OQ OH 
H OH H 
OH H 
H H 
H OH 


a-Glucosyl-(1— 6)-galactose 


8. The proportion of the a anomer is 0.36, and that of the B anomer 
is 0.64. 

9. Glucose is reactive because of the presence of an aldehyde 
group in its open-chain form. The aldehyde group slowly con- 
denses with amino groups to form aldimine products of a type 
called Schiff-base adducts. 
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10. A pyranoside reacts with two molecules of periodate; formate 
is one of the products. A furanoside reacts with only one molecule 
of periodate; formate is not formed. 
11. (a) B-D-Mannose; (b) B-D-galactose; (c) B-D-fructose; 
(d) B-D-glucosamine. 
12. The trisaccharide itself should be a competitive inhibitor of 
cell adhesion if the trisaccharide unit of the glycoprotein is critical 
for the interaction. 
13. Reducing ends would form 1,2,3,6-tetramethylglucose. The 
branch points would yield 2,3-dimethylglucose. The remainder of 
the molecule would yield 2,3,6-trimethylglucose. 
14. (a) Not a reducing sugar; no open-chain forms are possible. 
(b) D-galactose, D-glucose, D-fructose. (c) D-galactose and sucrose 
(glucose + fructose). 
15. CH,OH The hemiketal linkage of the a anomer is 
broken to form the open form. Rotation about 
the C-1 and C-2 bonds allows the formation 
OH OH of the B anomer, and a mixture of isomers 
HO H results. 
B-p-Mannose 16. Heating converts the very sweet pyranose 
form into the more-stable but less-sweet fura- 

nose form. Consequently, the sweetness of the preparation is diffi- 
cult to accurately control, which also accounts for why honey loses 
sweetness with time. See Figure 11.5 for structures. 
17. (a) Each glycogen molecule has one reducing end, whereas 
the number of nonreducing ends is determined by the number of 
branches, or a-1,6 linkages. (b) Because the number of nonreduc- 
ing ends greatly exceeds the number of reducing ends in a collec- 
tion of glycogen molecules, all of the degradation and synthesis of 
glycogen takes place at the nonreducing ends, thus maximizing the 
rate of degradation and synthesis. 
18. No, sucrose is not a reducing sugar. The anomeric carbon 
atom acts as the reducing agent in both glucose and fructose but, 
in sucrose, the anomeric carbon atoms of fructose and glucose are 
joined by a covalent bond and are thus not available to react. 
19. Glycogen is a polymer of glucose linked by a-1,4-glycosidic 
bonds with branches formed approximately every 10 glucose 
units by a-1,6-glycosidic bonds. Starch consists of two poly- 
mers of glucose. Amylose is a straight-chain polymer formed by 
a-1,4-glycosidic bonds. Amylopectin is similar to glycogen 
but amylopectin has fewer branches, one branch per 30 or so 
glucose units. 
20. Cellulose is a linear polymer of glucose joined by B-1,4 link- 
ages. Glycogen is a branched polymer with the main chain being 
formed by a-1,4-glycosidic bonds. The B-1,4 linkages allow the 
formation of a linear polymer ideal for structural roles. The a-1,4 
linkages of glycogen form a helical structure, which allows the stor- 
age of many glucose moieties in a small space. 
21. Simple glycoproteins are often secreted proteins and thus 
play a variety of roles. For example, the hormone EPO isa gly- 
coprotein. Usually, the protein component constitutes the bulk 
of the glycoprotein by mass. In contrast, proteoglycans and 
mucoproteins are predominantly carbohydrates. Proteoglycans 
have glycosaminoglycans attached, and play structural roles as in 
cartilage and the extracellular matrix. Mucoproteins often serve as 
lubricants and have multiple carbohydrates attached through an 
N-acetylgalactosamine moiety. 
22. The attachment of the carbohydrate allows the EPO to stay in 
circulation longer and thus to function for longer periods of time 
than would a carbohydrate-free EPO. 
23. The glycosaminoglycan, because it is heavily charged, binds 
many water molecules. When cartilage is stressed, such as when 


——0. OH 


your heel hits the ground, the water is released, thus cushioning 
the impact. When you lift your heel, the water rebinds. 

24. The lectin that binds the mannose 6-phosphate might be defec- 
tive and not recognize a correctly addressed protein. 

25. Asparagine, serine, and threonine. 

26. Different molecular forms of a glycoprotein that differ in 
the amount of carbohydrate attached or the location of attach- 
ment or both. 

27. The total collection of carbohydrates synthesized by a 

cell at particular times and under particular environmental 
conditions. 

28. The genome comprises all of the genes present in an organism. 
The proteome includes all of the possible protein products and 
modified proteins that a cell expresses under any particular set of 
circumstances. The glycome consists of all of the carbohydrates 
synthesized by the cell under any particular set of circumstances. 
Because the genome is static, but any given protein can be vari- 
ously expressed and modified, the proteome is more complex than 
the genome. The glycome, which includes not only glycoforms of 
proteins, but also many possible carbohydrate structures, must be 
even more complex. 

29. An asparagine residue can be glycosylated if the residue is part 
of an Asn-X-Ser or Asn-X-Thr sequence, in which X can be any 
residue, except proline. Your roommate’s hedging is because not all 
potential sites are glycosylated. 

30. It suggests that carbohydrates are on the cell surfaces of all 
organisms for the purpose of recognition by other cells, organisms, 
or the environment. 

31. A glycoprotein is a protein that is decorated with carbohydrates. 
A lectin is a protein that specifically recognizes carbohydrates. A 
lectin can also be a glycoprotein. 

32. Each site either is or is not glycosylated, and so there are 

2° = 64 possible proteins. 

33. The 20 amino acid components of proteins and the 4 nucleo- 
tide components of nucleic acids are linked by the same type of 
bond—the peptide bond in proteins and the 5'-to-3’ phospho- 
diester linkage in nucleic acids. In contrast, there are many dif- 
ferent carbohydrates that can be modified and linked in a variety 
of ways. Moreover, oligosaccharides can be branched. Finally, 
many sugars have the same or similar chemical formula, and 
similar chemical properties, making specific identification and 
linkage difficult. 

34. As discussed in Chapter 9, many enzymes display stereo- 
chemical specificity. Clearly, the enzymes of sucrose synthesis are 
able to distinguish between the isomers of the substrates and link 
only the correct pair. 

35. Ifthe carbohydrate specificity of the lectin is known, an affin- 
ity column with the appropriate carbohydrate attached could be 
prepared. The protein preparation containing the lectin of inter- 
est could be passed over the column. The use of this method 

was indeed how the glucose-binding lectin concanavalin A was 
purified. 

36. (a) Aggrecan is heavily decorated with glycosaminoglycans. If 
glycosaminoglycans are released into the media, aggrecan must be 
undergoing degradation. 

(b) Another enzyme might be present that cleaves glycosamino- 
glycans from aggrecan without degrading aggrecan. Other experi- 
ments not shown established that glycosaminoglycan release is an 
accurate measure of aggrecan destruction. 

(c) The control provides a baseline of “background” degradation 
inherent in the assay. 

(d) Aggrecan degradation is greatly enhanced. 


(e) Aggrecan degradation is reduced to the background system. 
(f) It is an in vitro system in which not all the factors contributing 
to cartilage stabilization in vivo are present. 


Chapter 12 

1. 2.86 X 10° molecules, because each leaflet of the bilayer con- 
tains 1.43 X 10° molecules. 

2. Essentially an “inside-out” membrane. The hydrophilic groups 
would come together on the interior of the structure, away from 
the solvent, whereas the hydrocarbon chains would interact with 
the solvent. 

3. 2X 10°7cm, 6 X 10 °cm, and2 X 10-*cm. 

4. The radius of this molecule is 3.1 X 10°’ cm, and its diffu- 
sion coefficient is 7.4 x 10~° cm’s !. The average distances 
traversed are 1.7 X 10 ‘cmin1 ps, 5.4 X 10-° cm in 1 ms, and 
1.7X 10°-*cmin1s. 

5. The membrane underwent a phase transition from a highly 
fluid to a nearly frozen state when the temperature was lowered. A 
carrier can shuttle ions across a membrane only when the bilayer is 
highly fluid. A channel, in contrast, allows ions to traverse its pore 
even when the bilayer is quite rigid. 

6. The presence of a cis double bond introduces a kink in the fatty 
acid chain that prevents tight packing and reduces the number of 
atoms in van der Waals contact. The kink lowers the melting point 
compared with that of a saturated fatty acid. Trans fatty acids do 
not have the kink, and so their melting temperatures are higher, 
more similar to those of saturated fatty acids. Because trans fatty 
acids have no structural effect, they are rarely observed. 

7. Palmitic acid is shorter than stearic acid. Thus, when the chains 
pack together, there is less opportunity for van der Waals interac- 
tion and the melting point is thus lower than that of the longer 
stearic acid. 

8. Hibernators selectively feed on plants that have a high 
proportion of polyunsaturated fatty acids with lower melting 
temperature. 

9. The initial decrease in fluorescence with the first addition 

of sodium dithionite results from the quenching of NBD-PS 
molecules in the outer leaflet of the bilayer. Sodium dithionite 
does not traverse the membrane under these experimental con- 
ditions; hence, it does not quench the labeled phospholipids in 
the inner leaflet. A second addition of sodium dithionite has 

no effect, as the NBD-PS molecules in the outer leaflet remain 
quenched. However, after a 6.5-hour incubation, about half 

the NBD-PS has flipped over to the outer leaflet of the bilayer, 
resulting in the 50% decrease in fluorescence when sodium 
dithionite is added. 

10. The addition of the carbohydrate introduces a significant 
energy barrier to the flip-flop because a hydrophilic carbohy- 
drate moiety would need to be moved through a hydropho- 

bic environment. This energetic barrier enhances membrane 
asymmetry. 

11. The C,, alkyl chain is attached by an ether linkage. The C-2 
carbon atom of glycerol has only an acetyl group attached by 

an ester linkage instead of a fatty acid, as is the case with most 
phospholipids. 

12. Ina hydrophobic environment, the formation of intrachain 
hydrogen bonds stabilizes the amide hydrogen atoms and carbonyl 
oxygen atoms of the polypeptide chain, and so an a helix forms. In 
an aqueous environment, these groups are stabilized by interaction 
with water, and so there is no energetic reason to form an « helix. 
Thus, the a helix would be more likely to form in a hydrophobic 
environment. 
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13. The protein may contain an @ helix that passes through the 
hydrophobic core of the protein. This helix is likely to feature a 
stretch of hydrophobic amino acids similar to those observed in 
transmembrane helices. 

14. The shift to the lower temperature would decrease fluidity by 
enhancing the packing of the hydrophobic chains by van der Waals 
interactions. To prevent this packing, new phospholipids having 
shorter chains and a greater number of cis double bonds would be 
synthesized. The shorter chains would reduce the number of van 
der Waals interactions, and the cis double bonds, which cause the 
kink in structure, would prevent the packing of the fatty acid tails 
of the phospholipids. 

15. Each of the 21 v-SNARE proteins could interact with each 

of 7 t-SNARE partners. Multiplication gives the total number of 
different interacting pairs: 7 X 21 = 147 different vSNARE- 
t-SNARE pairs. 

16. (a) The graph shows that, as temperature increases, the 
phospholipid bilayer becomes more fluid. T,,, is the temperature 
of the transition from the predominantly less fluid state to the 
predominantly more fluid state. Cholesterol broadens the transi- 
tion from the less-fluid to the more-fluid state. In essence, cho- 
lesterol makes membrane fluidity less sensitive to temperature 
changes. 

(b) This effect is important because the presence of cholesterol 
tends to stabilize membrane fluidity by preventing sharp transi- 
tions. Because protein function depends on the proper fluidity of 
the membrane, cholesterol maintains the proper environment for 
membrane-protein function. 

17. The protein plotted in part c is a transmembrane protein from 
C. elegans. It spans the membrane with four a helices that are 
prominently displayed as hydrophobic peaks in the hydropathy 
plot. Interestingly, the protein plotted in part a also is a membrane 
protein, a porin. This protein is made primarily of B strands, which 
lack the prominent hydrophobic window of membrane helices. 
This example shows that, although hydropathy plots are useful, 
they are not infallible. 

18. (a) Prostaglandin H, synthase-1 recovers its activity imme- 
diately after removal of ibuprofen, suggesting that this inhibitor 
dissociates rapidly from the enzyme. In contrast, the enzyme 
remains significantly inhibited 30 minutes after removal of indo- 
methacin, suggesting that this inhibitor dissociates slowly from 
its active site. 

(b) Aspirin covalently modifies prostaglandin H, synthase-1, indi- 
cating that it would dissociate very slowly (if at all). Hence, one 
would anticipate that very low activity would be evident in all con- 
ditions where inhibitor has been added (columns 2, 3, and 4). 

19. To purify any protein, the protein must first be solubilized. For 
a membrane protein, solubilization usually requires a detergent— 
hydrophobic molecules that bind to the protein and thus replace the 
lipid environment of the membrane. If the detergent is removed, 
the protein aggregates and precipitates from solution. Often, the 
steps in purification, such as ion-exchange chromatography, are dif- 
ficult to perform in the presence of sufficient detergent to solubilize 
the protein. Crystals of appropriate protein—detergent complexes 
must be generated. 


Chapter 13 

1. In simple diffusion, the substance in question can diffuse down 
its concentration gradient through the membrane. In facilitated 
diffusion, the substance is not lipophilic and cannot directly dif- 
fuse through the membrane. A channel or carrier is required to 
facilitate movement down the gradient. 
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2. The two forms are (1) ATP hydrolysis and (2) the movement 
of one molecule down its concentration gradient coupled with the 
movement of another molecule up its concentration gradient. 

3. The three types of carriers are symporters, antiporters, and uni- 
porters. Symporters and antiporters can mediate secondary active 
transport. 


4. The free-energy cost is +32 kJ mol~! (+7.6 kcal mol~!). The 
chemical work performed is +20.4 kJ mol~! (+4.9 kcal mol~!), and 
the electrical work performed is +11.5 kJ mol! (+2.8 kcal mol~!). 
5. For chloride, z = —1; for calcium z = +2. At the concentra- 
tions given, the equilibrium potential for chloride is -97 mV and 
the equlibrium potential for calcium is +122 mV. 

6. The concentration of glucose inside the cell is 66 times as great 
as that outside the cell [(c2/c,) = 66] when the free-energy input 
is +10.8 kJ mol~! (+2.6 kcal mol~!). 

7. By analogy with the Ca2* ATPase, with three Na‘ ions bind- 
ing from inside the cell to the E, conformation and with two K* 
ions binding from outside the cell to the E, conformation, a plau- 
sible mechanism is as follows: 


(1) A catalytic cycle could begin with the enzyme in its unphos- 
phorylated state (E,) with three sodium ions bound. 

(il) The E, conformation binds ATP. A conformational change 
traps sodium ions inside the enzyme. 

(iii) The phosphoryl group is transferred from ATP to an aspartyl 
residue. 

(iv) On ADP release, the enzyme changes its overall conformation, 
including the membrane domain. This new conformation (E,) 
releases the sodium ions to the side of the membrane opposite that 
at which they entered and binds two potassium ions from the side 
where sodium ions are released. 

(v) The phosphorylaspartate residue is hydrolyzed to release inor- 
ganic phosphate. With the release of phosphate, the interactions 
stabilizing E, are lost, and the enzyme everts to the E, conforma- 
tion. Potassium ions are released to the cytoplasmic side of the 
membrane. The binding of three sodium ions from the cytoplasmic 
side of the membrane completes the cycle. 


8. Establish a lactose gradient across vesicle membranes that 
contain properly oriented lactose permease. Initially, the pH 
should be the same on both sides of the membrane and the lactose 
concentration should be higher on the “exit” side of lactose per- 
mease. As the lactose flows “in reverse” through the permease, 
down its concentration gradient, it can be tested whether or not 
a pH gradient becomes established as the lactose gradient is 
dissipated. 

9. Ligand-gated channels open in response to the binding of a 
molecule by the channel, whereas voltage-gated channels open in 
response to changes in the membrane potential. 

10. An ion channel must transport ions in either direction at the 
same rate. The net flow of ions is determined only by the composi- 
tion of the solutions on either side of the membrane. 

11. Uniporters act as enzymes do; their transport cycles include 
large conformational changes, and only a few molecules interact 
with the protein per transport cycle. In contrast, channels, after 
having opened, provide a pore in the membrane through which 
many ions may pass. As such, channels mediate transport at a 
much higher rate than do uniporters. 

12. FCCP effectively creates a pore in the bacterial membrane 
through which protons can pass rapidly. Protons that are pumped 
out of the bacteria will pass through this pore preferentially (the 
“path of least resistance”), rather than participate in Ht /lactose 
symport. 


13. Cardiac muscle must contract in a highly coordinated manner 
in order to pump blood effectively. Gap junctions mediate the 
orderly cell-to-cell propagation of the action potential through the 
heart during each beat. 

14. The positively charged guanidinium group resembles Nat and 
binds to negatively charged carboxylate groups in the mouth of the 
channel. 

15. SERCA, a P-type ATPase, uses a mechanism by which a 
covalent phosphorylated intermediate (at an aspartate residue) 

is formed. At steady state, a subset of the SERCA molecules is 
trapped in the E,-P state and, as a result, radiolabeled. The MDR 
protein is an ABC transporter and does not operate through a 
phosphorylated intermediate. Hence, a radiolabeled band would 
not be observed for MDR. 

16. The blockage of ion channels inhibits action potentials, lead- 
ing to loss of nervous function. Like tetrodotoxin, these toxin mol- 
ecules are useful for isolating and specifically inhibiting particular 
ion channels. 

17. After repolarization, the ball domains of the ion channels 
engage the channel pore, rendering them inactive for a short 
period of time. During this time, the channels cannot be reopened 
until the ball domains disengage and the channel returns to the 
“closed” state. 

18. Because sodium ions are charged and because sodium chan- 
nels carry only sodium ions (but not anions), the accumulation of 
excess positive charge on one side of the membrane dominates the 
chemical gradients. 

19. A mutation that impairs the ability of the sodium channel to 
inactivate would prolong the duration of the depolarizing sodium 
current, thus lengthening the cardiac action potential. 

20. No. Channels will likely open or close in response to an exter- 
nal stimulus, but the unit conductance of the open channel will be 
influenced very little. 

21. The ratio of closed to open forms of the channel is 10°, 5000, 
250, 12.5, and 0.625 when zero, one, two, three, and four ligands, 
respectively, are bound. Hence, the fraction of open channels is 
1.0 X 10°, 2.0 x 1074, 4.0 x 10°9, 7.4 x 10-7, and 0.62. 

22. These organic phosphates inhibit acetylcholinesterase by 
reacting with the active-site serine residue to form a stable phos- 
phorylated derivative. They cause respiratory paralysis by blocking 
synaptic transmission at cholinergic synapses. 

23. (a) The binding of the first acetylcholine molecule increases 
the open-to-closed ratio by a factor of 240, and the binding of the 
second increases it by a factor of 11,700. (b) The free-energy con- 
tributions are +14 kJ mol~! (+3.3 kcal mol~!) and +23 kJ mol7! 
(+5.6 kcal mol~'), respectively. (c) No; the MWC model predicts 
that the binding of each ligand will have the same effect on the 
open-to-closed ratio. 

24. Batrachotoxin blocks the transition from the open to the 
closed state. 

25. (a) Chloride ions flow into the cell. (b) Chloride flux is inhibi- 
tory because it hyperpolarizes the membrane. (c) The channel con- 
sists of five subunits. 

26. After the addition of ATP and calcium, SERCA will pump 
Ca?* ions into the vesicle. However, the accumulation of Ca2* 
ions inside the vesicle will rapidly lead to the formation of an 
electrical gradient that cannot be overcome by ATP hydrolysis. 
The addition of calcimycin will allow the pumped Ca2* ions to 
flow back out of the vesicle, dissipating the charge buildup, and 
enabling the pump to operate continuously. 

27. The catalytic prowess of acetylcholinesterase ensures that the 
duration of the nerve stimulus will be short. 


28. See reaction below. 


protein o> 
| Zo *N(CHs) 
3/3 
O.. a HQ” SN HR He) a Sa 
7 Acetylcholine — 
cin 
— + 
5) ie N(CHs3)3 


ee: 
me) @) 
0.0 HQ” Sn He 
S —————-> 
HO 


29. (a) Only ASIC1a is inhibited by the toxin. (b) Yes; when the 
toxin was removed, the activity of the acid-sensing channel began 
to be restored. (c) 0.9 nM. 

30. This mutation is one of a class of mutations that result in slow- 
channel syndrome (SCS). The results suggest a defect in chan- 

nel closing; so the channel remains open for prolonged periods. 
Alternatively, the channel may have a higher affinity for acetylcho- 
line than does the control channel. 

31. The mutation reduces the affinity of acetylcholine for the 
receptor. The recordings would show the channel opening only 
infrequently. 

32. Glucose displays a transport curve that suggests the participa- 
tion of a carrier because the initial rate is high but then levels off 

at higher concentrations, consistent with saturation of the carrier, 
which is reminiscent of Michaelis-Menten enzymes (Section 8.4). 
Indole shows no such saturation phenomenon, which implies 

that the molecule is lipophilic and simply diffuses across the 
membrane. Ouabain is a specific inhibitor the Nat—-K* pump. 

If ouabain were to inhibit glucose transport, then a Na*-glucose 
cotransporter would be assisting in transport. 


Chapter 14 

1. The negatively charged glutamate residues mimic the nega- 
tively charged phosphoserine or phosphothreonine residues and 
stabilize the active conformation of the enzyme. 

2. No. Phosphoserine and phosphothreonine are considerably 
shorter than phosphotyrosine. 

3. The GTPase activity terminates the signal. Without such activ- 
ity, after a pathway has been activated, it remains activated and is 
unresponsive to changes in the initial signal. If the GTPase activity 
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were more efficient, the lifetime of the GTP-bound G, sub- 
unit would be too short to achieve downstream signaling. 
4. Two identical receptor molecules must recognize different 
aspects of the same signal molecule. 
5. Growth-factor receptors can be activated by dimeriza- 
tion. If an antibody causes a receptor to dimerize, the signal- 
transduction pathway in a cell will be activated. 
6. The mutated a subunit will always be in the GTP form 
and, hence, in the active form, which would stimulate its 
signaling pathway. 
7. A G protein is a component of the signal-transduction 
pathway. GTPYS is not hydrolyzed by the G, subunit, leading 
to prolonged activation. 
8. Calcium ions diffuse slowly because they bind to many 
protein surfaces within a cell, impeding their free motion. 
Cyclic AMP does not bind as frequently, and so it diffuses 
more rapidly. 
9. Fura-2 is a highly negatively charged molecule, with five 
carboxylate groups. Its charge prevents it from effectively 
crossing the hydrophobic region of the plasma membrane. 
10. G,, stimulates adenylate cyclase, leading to the genera- 
tion of cAMP. This signal then leads to glucose mobilization 
(Chapter 21). If cAMP phosphodiesterase were inhibited, then 
cAMP levels would remain high even after the termination of 
the epinephrine signal, and glucose mobilization would continue. 
11. If the two kinase domains are forced to be within close 
proximity of each other, the activation loop of one kinase, in 
its inactivating conformation, can be displaced by the activa- 
tion loop of the neighboring kinase, which acts as a substrate 
for phosphorylation. 
12. The full network of pathways initiated by insulin includes 
a large number of proteins and is substantially more elaborate 
than indicated in Figure 14.26. Furthermore, many additional 
proteins take part in the termination of insulin signaling. A defect 
in any of the proteins in the insulin signaling pathways or in the 
subsequent termination of the insulin response could potentially 
cause problems. Therefore, it is not surprising that many different 
gene defects can cause type 2 diabetes. 
13. The binding of growth hormone causes its monomeric recep- 
tor to dimerize. The dimeric receptor can then activate a separate 
tyrosine kinase to which the receptor binds. The signaling pathway 
can then continue in similar fashion to the pathways that are acti- 
vated by the insulin receptor or other mammalian EGF receptors. 
14. The truncated receptor will dimerize with the full-length 
monomers on EGF-binding, but cross-phosphorylation can- 
not take place, because the truncated receptor possesses neither 
the substrate for the neighboring kinase domain nor its own 
kinase domain to phosphorylate the C-terminal tail of the other 
monomer. Hence, these mutant receptors will block normal EGF 
signaling. 
15. Insulin would elicit the response that is normally caused 
by EGF. Insulin binding will likely stimulate dimerization and 
phosphorylation of the chimeric receptor and thereby signal the 
downstream events that are normally triggered by EGF binding. 
Exposure of these cells to EGF would have no effect. 
16. 10° 
17. The formation of diacylglycerol implies the participation of 
phospholipase C. A simple pathway would entail receptor activa- 
tion by cross-phosphorylation, followed by the binding of phos- 
pholipase C (through its SH2 domains). The participation of 
phospholipase C indicates that IP would be formed and, hence, 
calcium concentrations would increase. 
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18. Other potential drug targets within the EGF signaling cascade 
include, but are not limited to, the kinase active sites of the EGF 
receptor, Raf, MEK, or ERK. 

19. In the reaction catalyzed by adenylate cyclase, the 3’-OH 
group nucleophilically attacks the a-phosphorus atom attached to 
the 5'-OH group, leading to displacement of pyrophosphate. The 
reaction catalyzed by DNA polymerase is similar except that the 
3’-OH group is on a different nucleotide. 

20. ATP-competitive inhibitors are likely to act on multiple 
kinases because every kinase domain contains an ATP-binding 
site. Hence, these drugs may not be selective for the desired kinase 
target. 

21. (a)X ~10°-’M; Y = 5 X 10°°M; Z = 10° M. (b) Because 
much less X is required to fill half of the sites, X displays the 
highest affinity. (c) The binding affinity almost perfectly matches 
the ability to stimulate adenylate cyclase, suggesting that the 
hormone-receptor complex leads to the stimulation of adenylate 
cyclase. (d) Try performing the experiment in the presence of anti- 
bodies to G,,.. 

22. (a) The total binding does not distinguish binding to a specific 
receptor from binding to different receptors or from nonspecific 
binding to the membrane. 

(b) The rationale is that the receptor will have a high affinity for 
the ligand. Thus, in the presence of excess nonradioactive ligand, 
the receptor will bind to nonradioactive ligand. Therefore, any 
binding of the radioactive ligand must be nonspecific. 

(c) The plateau suggests that the number of receptor-binding sites 
in the cell membrane is limited. 

23. Number of receptors per cell = 


10* cpm mg of membrane protein 
mg of membrane protein 10!° cells 
mmol _ 6.023 x 10°’ molecules _ ang 
1017 cpm mmol 


Chapter 15 

1. The highly integrated biochemical reactions that take place 
inside the cell. 

2. Anabolism is the set of biochemical reactions that use energy 

to build new molecules and ultimately new cells. Catabolism is the 
set of biochemical reactions that extract energy from fuel sources 
or break down biomolecules. 

3. You reply that vandalism is disrespectful and expensive. Part of 
your tuition money will now have to pay to remove the vandalism. 
Plus, the fool should know that Gibbs free energy is at a minimum 
when a system is in equilibrium. 

4. Cellular movements and the performance of mechanical work; 
active transport; biosynthetic reactions. 

5. 1. f;2.h; 3.4; 4. a; 5. g; 6. b; 7. c; 8. e; 9.3; 10. d. 

6. These ions neutralize the charges on the ATP and also facilitate 
interactions with macromolecules that bind ATP. 

7. Charge repulsion, resonance stabilization, increase in entropy, 
and stabilization by hydration. 

8. Trick question. The answer is not known. Adenine appears to 
form more readily under prebiotic conditions, so ATP may have 
predominated initially. 

9. Having only one nucleotide represent the available energy 
allows the cell to better monitor its energy status. 

10. Increasing the concentration of ATP or decreasing the concen- 
tration cellular ADP or P; (by rapid removal by other reactions, for 
instance) would make the reaction more exergonic. Likewise, altering 
the Mg?* concentration could raise or lower the AG of the reaction. 


11. The free-energy changes of the individual steps in a pathway 
are summed to determine the overall free-energy change of the 
entire pathway. Consequently, a reaction with a positive free- 
energy value can be powered to take place if coupled to a suffi- 
ciently exergonic reaction. 

12. Reactions in parts a and c, to the left; reactions in parts b and 
d, to the right. 

13. None whatsoever. 

14. (a) AG" = +31.4kJ mol7! (+7.5 kcal mol~!) and Wea = 3.06 
X 107; (b) 3.28 x 104. 

15. AG = +7.1 kJ mol7! (+1.7 kcal mol~!). The equilibrium 
ratio is 17.5. 

16. (a) Acetate + CoA + H* goes to acetyl CoA + H,O, AG®’ = 
—31.4 kJ mol! (—7.5 kcal mol~!). ATP hydrolysis to AMP 
and PP., AG®’ = —45.6k]J mol~! (—10.9 kcal mol~'). Overall 
reaction, AG’ = —14.2 kJ mol~! (—3.4 kcal mol7!). 

(b) With pyrophosphate hydrolysis, AG°’ = —33.4 kJ mol! 
(—7.98 kcal mol~!). Pyrophosphate hydrolysis makes the overall 
reaction even more exergonic. 


17. (a) For an acid AH, 


[A ][H*] 
[AH] 
The pK is defined as pK = —log,,) K. AG®’ is the standard free- 
energy change at pH 7. Thus, AG’ = —RT1n K = —2.303 RT 
log,y K = +2.303 RT pK. 
(b) AG°’ = +27.32 kJ mol”! (+6.53 kcal mol7!), 
18. Arginine phosphate in invertebrate muscle, like creatine phos- 
phate in vertebrate muscle, serves as a reservoir of high-potential 
phosphoryl groups. Arginine phosphate maintains a high level of 
ATP in muscular exertion. 
19. An ADP unit. 
20. (a) The rationale behind creatine supplementation is that it 
would be converted into creatine phosphate and thus serve as a 
rapid means of replenishing ATP after muscle contraction. (b) 
If creatine supplementation is beneficial, it would affect activi- 
ties that depend on short bursts of activity; any sustained activity 
would require ATP generation by fuel metabolism, which, as 
Figure 15.7 shows, requires more time. 
21. Under standard conditions, AG°’ = —RT In [products]/ 
[reactants]. Substituting +23.8 kJ mol! (+5.7 kcal mol!) for 
AG*’ and solving for [products]/[reactants] yields 9.9 X 107°. In 
other words, the forward reaction does not take place to a signifi- 
cant extent. Under intracellular conditions, AG is —1.3 kJ mol! 
(—0.3 kcal mol~!). Using the equation AG = AG°’ + RT In 
[products]/[reactants] and solving for [products]/[reactants] gives 
a ratio of 5.96 X 107°. Thus, a reaction that is endergonic under 
standard conditions can be converted into an exergonic reaction by 
maintaining the [products]/[reactants] ratio below the equilibrium 
value. This conversion is usually attained by using the products in 
another coupled reaction as soon as they are formed. 
22. Under standard conditions, 
[Bleg [ADP ]eqlPileg 
[Aleq [ATP ]eq 
At equilibrium, the ratio of [B] to [A] is given by 
[Bleq ,, [ATPleg 
[Aleq _ [ADP ]eq[Pileg 

The ATP-generating system of cells maintains the [ATP]/[ADP] 
[P,] ratio at a high level, typically about 500 M~|. For this ratio, 

[Bleq 

[A]. 


AH>—A°+H?* K= 


= = 10/136 — 9.67 x 107 


= 2.67 X 107 X 500 = 1.34 X 10° 
q 


This equilibrium ratio is strikingly different from the value of 
1.15 X 1073 for the reaction A > B in the absence of ATP hydro- 
lysis. In other words, coupling the hydrolysis of ATP with the 
conversion of A into B has changed the equilibrium ratio of B to A 
by a factor of about 108. 

23. Liver: —45.2 kJ mol! (—10.8 kcal mol~'); muscle: —48.1 kJ 
mol~! (—11.5 kcal mol~!); brain: —48.5 kJ mol~! (—11.6 kcal 
mol~'). The AG is most negative in brain cells. 

24. (a) Ethanol; (b) lactate; (c) succinate; (d) isocitrate; (e) malate. 
25. Recall that AG = AG®’ + RT In [products]/[reactants]. 
Altering the ratio of products to reactants will cause AG to vary. In 
glycolysis, the concentrations of the components of the pathway 
result in a value of AG greater than that of AG®’. 

26. Higher organisms cannot make vitamins, and thus are depen- 
dent on obtaining them from other organisms. 

27. Unless the ingested food is converted into molecules capable 
of being absorbed by the intestine, no energy can ever be extracted 
by the body. 

28. NADH and FADH, are electron carriers for catabolism; 
NADPH is the carrier for anabolism. 

29. The electrons of the C—O bond cannot form resonance struc- 
tures with the C-S bond that are as stable as those that they can 
form with the C—O bond. Thus, the thioester is not stabilized by 
resonance to the same degree as an oxygen ester is stabilized. 

30. Oxidation—reduction reactions; ligation reactions; isomeriza- 
tion reactions; group-transfer reactions; hydrolytic reactions; cleav- 
age of bonds by means other than hydrolysis or oxidation. 

31. Controlling the amount of enzymes; controlling enzyme activ- 
ity; controlling the availability of substrates. 

32. Although the reaction is thermodynamically favorable, the 
reactants are kinetically stable because of the large activation 
energy. Enzymes lower the activation energy so that reactions take 
place on time scales required by the cell. 

33. The activated form of sulfate in most organisms is 
3'-phosphoadenosine-5’-phosphosulfate. 

34. (a) As the Mg?* concentration falls, the AG of hydrolysis 
rises. Note that pMg is a logarithmic plot, and so each number on 
the x-axis represents a 10-fold change in [Mg?*]. 

(b) Mg? would bind to the phosphates of ATP and help to miti- 
gate charge repulsion. As the [Mg?*] falls, charge stabilization 

of ATP would be less, leading to greater charge repulsion and an 
increase in AG on hydrolysis. 


Chapter 16 

1. Two molecules of ATP are produced per molecule of glycer- 
aldehyde 3-phosphate and, because two molecules of GAP are 
produced per molecule of glucose, the total ATP yield is four. 
However, two molecules of ATP are required to convert glucose 
into fructose 1,6-bisphosphate. Thus, the net yield is only two 
molecules of ATP. 

2. (a) 4; (b) 3; (c) 1; (d) 6; (e) 8; (£) 2; (g) 10; (h) 9; (@) 7; G) 5. 

3. In both cases, the electron donor is glyceraldehyde 3-phosphate. 
In lactic acid fermentation, the electron acceptor is pyruvate, con- 
verting it into lactate. In alcoholic fermentation, acetaldehyde is 
the electron acceptor, forming ethanol. 

4. (a) 3 ATP; (b) 2 ATP; (c) 2 ATP; (d) 2 ATP; (e) 4 ATP. 

5. Glucokinase enables the liver to remove glucose from the blood 
when hexokinase is saturated, ensuring that glucose is captured for 
later use. 

6. The GAP formed is immediately removed by subsequent reac- 
tions, resulting in the conversion of DHAP into GAP by the enzyme. 
7. A thioester couples the oxidation of glyceraldehyde 3-phosphate to 
3-phosphoglycerate with the formation of 1,3-bisphosphoglycerate. 
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1,3-Bisphosphoglycerate can subsequently power the formation 

of ATP. 

8. Glycolysis is a component of alcoholic fermentation, the path- 

way that produces alcohol for beer and wine. The belief was that 

understanding the biochemical basis of alcohol production might 

lead to a more-efficient means of producing beer. 

9. The conversion of glyceraldehyde 3-phosphate into 1,3- bisphos- 

phoglycerate would be impaired. Glycolysis would be less effective. 

10. Glucose 6-phosphate must have other fates. Indeed, it can 

be converted into glycogen (Chapter 21) or be processed to yield 

reducing power for biosynthesis (Chapter 20). 

11. The energy needs of a muscle cell vary widely, from rest to 

intense exercise. Consequently, the regulation of phosphofructo- 

kinase by energy charge is vital. In other tissues, such as the liver, 

ATP concentration is less likely to fluctuate and will not be a key 

regulator of phosphofructokinase. 

12. (a) 6; (b) 1; (c) 7; (d) 3; (e) 2; (f) 5; (g) 4. 

13. The AG” for the reverse of glycolysis is +90 kJ mol~! 

(+22 kcal mol~!), far too endergonic to take place. 

14. The conversion of glucose into glucose 6-phosphate by 

hexokinase; the conversion of fructose 6-phosphate into fructose 

1,6-bisphosphate by phosphofructokinase; the formation of pyru- 

vate from phosphoenolpyruvate by pyruvate kinase. 

15. Lactic acid is a strong acid. If it remained in the cell, the pH of 

the cell would fall, which could lead to the denaturation of muscle 

protein and result in muscle damage. 

16. GLUT2 transports glucose only when the blood concentration 

of glucose is high, which is precisely the condition in which the B 

cells of the pancreas secrete insulin. 

17. Fructose + ATP ——- fructose 1-phosphate + ADP: 
Fructokinase 


Fructose 1-phosphate ——-> dihydroxyacetone phosphate + 
glyceraldehyde: Fructose 1-phosphate aldolase 


Glyceraldehyde + ATP ——-=> glyceraldehyde 3-phosphate + 
ADP: Triose kinase 


The primary controlling step of glycolysis catalyzed by phospho- 
fructokinase is bypassed by the preceding reactions. Glycolysis 
will proceed in an unregulated fashion. 

18. (a) A, B; (b) C, D; (c) D; (d) A; (e) B; (£) G; (g) As (h) D; 

(1) none; (j) A; (k) A. 

19. (a) 4; (b) 10; (c) 1; (d) 5; (e) 7; (£) 8; (g) 9; (A) 2; (a) 3; G) 6. 

20. Without triose isomerase, only one of the two three-carbon 
molecules generated by aldolase could be used to generate ATP. 
Only two molecules of ATP would result from the metabolism of 
each glucose. But two molecules of ATP would still be required to 
form fructose 1,6-bisphosphate, the substrate for aldolase. The net 
yield of ATP would be zero, a yield incompatible with life. 

21. Glucose is reactive because its open-chain form contains an 
aldehyde group. 

22. (a) The label is in the methyl carbon atom of pyruvate. (b) 5 
mCimM~!. The specific activity is halved because the number of 
moles of product (pyruvate) is twice that of the labeled substrate 
(glucose). 

23. (a) Glucose + 2 P; + 2 ADP — 2 lactate + 2 ATP. 

(b) AG = —114kJ mol~! (—27.2 kcal mol7!). 

24. 3.06 X 10° 

25. The equilibrium concentrations of fructose 1,6-bisphosphate, 
dihydroxyacetone phosphate, and glyceraldehyde 3-phosphate are 
7.8 x 104M, 2.2 x 1074M, and 2.2 x 107-4 M, respectively. 
26. All three carbon atoms of 2,3-BPG are '*C labeled. The phos- 
phorus atom attached to the C-2 hydroxyl group is *P labeled. 
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27. Hexokinase has a low ATPase activity in the absence of a 
sugar because it is in a catalytically inactive conformation. The 
addition of xylose closes the cleft between the two lobes of the 
enzyme. However, xylose lacks a hydroxymethyl group, and so 

it cannot be phosphorylated. Instead, a water molecule at the site 
normally occupied by the C-6 hydroxymethyl group acts as the 
acceptor of the phosphoryl group from ATP. 

28. (a) The fructose 1-phosphate pathway forms glyceraldehyde 
3-phosphate. 

(b) Phosphofructokinase, a key control enzyme, is bypassed. 

29. The reverse of glycolysis is highly endergonic under cellular 
conditions. The expenditure of six NTP molecules in gluconeo- 
genesis renders gluconeogenesis exergonic. 

30. (a) 2, 3, 6, 9; (b) 1, 4, 5, 7, 8. 

31. Lactic acid is capable of being further oxidized and is thus 
useful energy. The conversion of this acid into glucose saves the 
carbon atoms for future combustion. 

32. In glycolysis, the formation of pyruvate and ATP by 
pyruvate kinase is irreversible. This step is bypassed by two 
reactions in gluconeogenesis: (1) the formation of oxaloacetate 
from pyruvate and CO, by pyruvate carboxylase and (2) the 
formation of phosphoenolpyruvate from oxaloacetate and GTP by 
phosphoenolpyruvate carboxykinase. The formation of fructose 
1,6-bisphosphate by phosphofructokinase is bypassed by fructose 
1,6-bisphosphatase in gluconeogenesis, which catalyzes the con- 
version of fructose 1,6-bisphosphate into fructose 6-phosphate. 
Finally, the hexokinase-catalyzed formation of glucose 
6-phosphate in glycolysis is bypassed by glucose 6-phosphatase, 
but only in the liver. 

33. Reciprocal regulation at the key allosteric enzymes in the 

two pathways. For instance, PFK is stimulated by fructose 
2,6-bisphosphate and AMP. The effect of these signals is opposite 
that of fructose 1,6-bisphosphatase. If both pathways were oper- 
ating simultaneously, a futile cycle would result. ATP would be 
hydrolyzed, yielding only heat. 

34. Muscle is likely to produce lactic acid during contraction. 
Lactic acid is a strong acid and cannot accumulate in muscle or 
blood. Liver removes the lactic acid from the blood and converts 
it into glucose. The glucose can be released into the blood or stored 
as glycogen for later use. 

35. Glucose produced by the liver could not be released into the 
blood. Tissues that rely on glucose as an energy source would not 
function as well unless glucose was provided in the diet. 

36. Glucose is an important energy source for both tissues and 

is essentially the only energy source for the brain. Consequently, 
these tissues should never release glucose. Glucose release is pre- 
vented by the absence of glucose 6-phosphatase. 

37. 6 NTP (4 ATP and 2 GTP); 2 NADH. 

38. (a) None; (b) none; (c) 4 (2 ATP and 2 GTP); (d) none. 

39. Ifthe amino groups are removed from alanine and aspartate, 
the ketoacids pyruvate and oxaloacetate are formed. Both of these 
molecules are components of the gluconeogenic pathway. 

40. (a) Increased; (b) increased; (c) increased; (d) decreased. 

41. Fructose 2,6-bisphosphate, present at high concentration 
when glucose is abundant, normally inhibits gluconeogenesis by 
blocking fructose 1,6-bisphosphatase. In this genetic disorder, the 
phosphatase is active irrespective of the glucose level. Hence, sub- 
strate cycling is increased. The level of fructose 1,6-bisphosphate is 
consequently lower than normal. Less pyruvate is formed and thus 
less ATP is generated. 

42. Reactions in parts b and e would be blocked. 

43. There will be no labeled carbons. The CO, added to pyruvate 
(formed from the lactate) to form oxaloacetate is lost with the con- 
version of oxaloacetate into phosphoenolpyruvate. 


44. The net reaction in the presence of arsenate is 


Glyceraldehyde 3-phosphate + NAD* + H,O —> 
3-phosphoglycerate + NADH + 2 Ht 


Glycolysis proceeds in the presence of arsenate, but the ATP nor- 
mally formed in the conversion of 1,3-bisphosphoglycerate into 
3-phosphoglycerate is lost. Thus, arsenate uncouples oxidation and 
phosphorylation by forming a highly labile acyl arsenate. 

45. This example illustrates the difference between the stoichio- 
metric and the catalytic use of a molecule. If cells used NAD* stoi- 
chiometrically, a new molecule of NAD* would be required each 
time a molecule of lactate was produced. As we will see, the syn- 
thesis of NAD* requires ATP. On the other hand, if the NAD* 
that is converted into NADH could be recycled and reused, a small 
amount of the molecule could regenerate a vast amount of lactate, 
which is the case in the cell. NAD* is regenerated by the oxidation 
of NADH and reused. NAD* is thus used catalytically. 

46. Consider the equilibrium equation of adenylate kinase: 


Keg = [ATP][AMP]/ [ADP]? (1) 


or 
AMP = K,, [ADP]*/[ATP](2) 
Recall that [ATP] > [ADP] > [AMP] in the cell. As ATP is 


utilized, a small decrease in its concentration will result in a 
larger percentage increase in [ADP] because its concentration 

is greater than that of ADP. This larger percentage increase 

in [ADP] will result in an even greater percentage increase in 
[AMP] because the concentration of AMP is related to the 
square of [ADP]. In essence, equation 2 shows that monitoring 
the energy status with AMP magnifies small changes in [ATP], 
leading to tighter control. 

47. The synthesis of glucose during intense exercise provides 

a good example of interorgan cooperation in higher organisms. 
When muscle is actively contracting, lactate is produced from 
glucose by glycolysis. The lactate is released into the blood and 
absorbed by the liver, where it is converted by gluconeogenesis 
into glucose. The newly synthesized glucose is then released and 
taken up by the muscle for energy generation. 

48. The input of four additional high-phosphory]-transfer-potential 
molecules in gluconeogenesis changes the equilibrium constant 

by a factor of 10°”, which makes the conversion of pyruvate into 
glucose thermodynamically feasible. Without this energetic input, 
gluconeogenesis would not take place. 

49. The mechanism is analogous to that for triose phosphate isom- 
erase (Figure 16.5). It proceeds through an enediol intermediate. 
The active site would be expected to have a general base (analo- 
gous to Glu 165 in TPI) and a general acid (analogous to His 95 

in TPI). 

50. Galactose is a component of glycoproteins. Possibly, the 
absence of galactose leads to the improper formation or function of 
glycoproteins required in the central nervous system. More gener- 
ally, the fact that the symptoms arise in the absence of galactose 
suggests that galactose is required in some fashion. 

51. Using the Michaelis-Menten equation to solve for [S] when 
Ky, = 50 uM and V, =0.9V__., shows that a substrate concentra- 
tion of 0.45 mM yields 90% of V,,,,- Under normal conditions, the 
enzyme is essentially working at V,,.,. 

52. Fructose 2,6-bisphosphate stabilizes the R state of the enzyme. 
53. (a) Curiously, the enzyme uses ADP as the phosphoryl! donor 
rather than ATP. 

(b) Both AMP and ATP behave as competitive inhibitors of ADP, 
the phosphoryl donor. Apparently, the P. furiosus enzyme is not 
allosterically inhibited by ATP. 


54. (a) If both enzymes operated simultaneously, the following 
reactions would take place: 


Fructose 
a 6-phosphate x 
, Fructose 
Phosphofructokinase 1,6-bisphosphatase 
ADP Fructose H,0 


1,6-bisphosphate 


The net result would be simply: 
ATP + H,O 


> ADP + P; 


The energy of ATP hydrolysis would be released as heat. 

(b) Not really. For the cycle to generate heat, both enzymes must 
be functional at the same time in the same cell. 

(c) The species B. terrestris and B. rufocinctus might show some 
futile cycling because both enzymes are active to a substantial 
degree. 

(d) No. These results simply suggest that simultaneous activity of 
phosphofructokinase and fructose 1,6-bisphosphatase is unlikely 
to be employed to generate heat in the species shown. 

55. ATP initially stimulates PFK activity, as would be expected 
for a substrate. Higher concentrations of ATP inhibit the enzyme. 
Although this effect seems counterintuitive for a substrate, recall 
that the function of glycolysis in muscle is to generate ATP. 
Consequently, high concentrations of ATP signal that the ATP 
needs are met and glycolysis should stop. In addition to being a 
substrate, ATP is an allosteric inhibitor of PFK. 


Chapter 17 
1. The pyruvate dehydrogenase complex catalyzes the following 
reaction, linking glycolysis and the citric acid cycle: 


Pyruvate + CoA + NAD*t ——> acetyl CoA + NADH 
+ H* + CO, 


2. Pyruvate dehydrogenase catalyzes the decarboxylation of pyru- 
vate and the formation of acetyllipoamide. Dihydrolipoy] trans- 
acetylase catalyzes the formation of acetyl CoA. Dihydrolipoyl 
dehydrogenase catalyzes the reduction of the oxidized lipoic acid. 
The kinase associated with the complex phosphorylates and inacti- 
vates the complex, whereas the phosphatase dephosphorylates and 
activates the complex. 
3. Thiamine pyrophosphate plays a role in the decarboxylation 
of pyruvate. Lipoic acid (as lipoamide) transfers the acetyl group. 
Coenzyme A accepts the acetyl group from lipoic acid to form acetyl 
CoA. FAD accepts the electrons and hydrogen ions when reduced 
lipoic acid is oxidized. NAD* accepts electrons from FADH),. 
4. Catalytic coenzymes (TPP, lipoic acid, and FAD) are modified 
but regenerated in each reaction cycle. Thus, they can play a role 
in the processing of many molecules of pyruvate. Stoichiometric 
coenzymes (coenzyme A and NAD‘) are used in only one reaction 
because they are the components of products of the reaction. 
5. The remaining steps regenerate oxidized lipoamide, which is 
required to begin the next reaction cycle. Moreover, this regenera- 
tion results in the production of high-energy electrons in the form 
of NADH. 
6. The advantages are as follows: 
The reaction is facilitated by having the active sites in proximity. 
The reactants do not leave the enzyme until the final product 
is formed. 
Constraining the reactants minimizes loss due to diffusion and 
minimizes side reactions. 
All of the enzymes are present in the correct amounts. 
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Regulation is more efficient because the regulatory enzymes— 

the kinase and phosphatase—are part of the complex. 
7. (a) After one round of the citric acid cycle, the label emerges in 
C-2 and C-3 of oxaloacetate. (b) The label emerges in CO, in the 
formation of acetyl CoA from pyruvate. (c) After one round of the 
citric acid cycle, the label emerges in C-1 and C-4 of oxaloacetate. 
(d) and (e) Same fate as that in part a. 
8. (a) Isocitrate lyase and malate synthase are required in addition 
to the enzymes of the citric acid cycle. 
(b) 2 Acetyl CoA + 2 NAD* + FAD + 3 H,O — oxaloacetate + 
2 CoA + 2 NADH + FADH, + 3H?*. 
(c) No. Hence, mammals cannot carry out the net synthesis of oxa- 
loacetate from acetyl CoA. 
9. —41.0kJ mol !(—9.8 kcal mol” ') 
10. Enzymes or enzyme complexes are biological catalysts. Recall 
that a catalyst facilitates a chemical reaction without the catalyst 
itself being permanently altered. Oxaloacetate can be thought of as 
a catalyst because it binds to an acetyl group, leads to the oxidative 
decarboxylation of the two carbon atoms, and is regenerated at the 
completion of a cycle. In essence, oxaloacetate (and any cycle inter- 
mediate) acts as a catalyst. 
11. Thiamine thiazolone pyrophosphate is a transition-state ana- 
log. The sulfur-containing ring of this analog is uncharged, and so 
it closely resembles the transition state of the normal coenzyme in 
thiamine-catalyzed reactions (e.g., the uncharged resonance form 
of hydroxyethyl-TPP). 
12. (a) 6; (b) 10; (c) 1; (d) 7; (e) 2; (£) 8; (g) 3; (h) 4; (4) 5; G) 9. 
13. A decrease in the amount of O, will necessitate an increase in 
anaerobic glycolysis for energy production, leading to the genera- 
tion of a large amount of lactic acid. Under conditions of shock, the 
kinase inhibitor is administered to ensure that pyruvate dehydro- 
genase is operating maximally. 
14. (a) As is stated in the previous problem, DCA inhibits pyru- 
vate dehydrogenase kinase. 
(b) The fact that inhibiting the kinase results in more dehydroge- 
nase activity suggests that there must be some residual activity that 
is being inhibited by the kinase. 
15. Acetyllipoamide and acetyl CoA. 
16. In muscle, the acetyl CoA generated by the complex is used 
for energy generation. Consequently, signals that indicate an 
energy-rich state (high ratios of ATP/ADP and NADH/NAD*) 
inhibit the complex, whereas the reverse conditions stimulate the 
enzyme. Calcium as the signal for muscle contraction (and, hence, 
energy need) also stimulates the enzyme. In liver, acetyl CoA 
derived from pyruvate is used for biosynthetic purposes, such as 
fatty acid synthesis. Insulin, the hormone denoting the fed state, 
stimulates the complex. 
17. (a) Enhanced kinase activity will result in a decrease in the 
activity of the PDH complex because phosphorylation by the 
kinase inhibits the complex. 
(b) Phosphatase activates the complex by removing a phosphate. 
If the phosphatase activity is diminished, the activity of the PDH 
complex also will decrease. 
18. She might have been ingesting, in some fashion, the arsenite from 
the peeling paint or the wallpaper. Also, she might have been breath- 
ing arsine gas from the wallpaper, which would be oxidized to arse- 
nite in her body. In any of these circumstances, the arsenite inhibited 
enzymes that require lipoic acid—notably, the PDH complex. 
19. (a) 5; (b) 7; (c) 1; (d) 10; (e) 2; (£) 4; (g) 9; (h) 3; (a) 8; Gj) 6. 
20. The TCA cycle depends on a steady supply of NAD* as an 
oxidant, generating NADH. O, is never directly utilized in the 
cycle. However, NAD* is regenerated via donation of electrons to 
O, by way of the electron transport chain, so eventually a lack of 
O, will cause the cycle to cease due to a lack of NAD*. 
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21. Succinate dehydrogenase is the only enzyme in the citric acid 
cycle that is embedded in the mitochondrial membrane, which 
makes it associated with the electron-transport chain. 

22. (a) The steady-state concentrations of the products are low 
compared with those of the substrates. (b) The ratio of malate to 
oxaloacetate must be greater than 1.57 X 104 for oxaloacetate to be 
formed. 

23. 


Pyruvate 
dehydrogenase 


complex 
Pyruvate + CoA + NAD+ —————> 


acetyl CoA + CO, + NADH 


Pyruvate 
carboxylase 


Pyruvate + CO, + ATP + H,O 
oxaloacetate + ADP+ P. + Ht 


Citrate 
synthase 


Oxaloacetate + acetyl CoA + H,O 
citrate + CoA + Ht 


. Aconitase . . 
Citrate —————————> _isocitrate 


Isocitrate 


Isocitrate + NADt = 


a-ketoglutarate + CO, + NADH 


Net: 2 Pyruvate + 2 NADt + ATP 4 H,O > 
a-ketoglutarate + CO, + ADP + P; + 2NADH + 3 Ht 


24. Succinate will increase in concentration, followed by a- 
ketoglutarate and the other intermediates “upstream” of the site of 
inhibition. Succinate has two methylene groups that are required 
for the dehydrogenation, whereas malonate has but one. 

25. Pyruvate carboxylase should be active only when the acetyl 
CoA concentration is high. Acetyl CoA might accumulate if the 
energy needs of the cell are not being met, because of a deficiency 
of oxaloacetate. Under these conditions the pyruvate carboxyl- 
ase catalyzes an anaplerotic reaction. Alternatively, acetyl CoA 
might accumulate because the energy needs of the cell have been 
met. In this circumstance, pyruvate will be converted back into 
glucose, and the first step in this conversion is the formation of 
oxaloacetate. 

26. The energy released when succinate is oxidized to fumarate is 
not sufficient to power the synthesis of NADH but is sufficient to 
reduce FAD. 

27. Citrate is a tertiary alcohol that cannot be oxidized, because 
oxidation requires a hydrogen atom to be removed from the alco- 
hol and a hydrogen atom to be removed from the carbon atom 
bonded to the alcohol. No such hydrogen exists in citrate. The 
isomerization converts the tertiary alcohol into isocitrate, which is 
a secondary alcohol that can be oxidized. 

28. The enzyme nucleoside diphosphokinase transfers a phos- 
phoryl group from GTP (or any nucleoside triphosphate) to ADP 
according to the reversible reaction: 


GTP + ADP == GDP + ATP 


29. The reaction is powered by the hydrolysis of a thioester. 
Acetyl CoA provides the thioester that is converted into citryl 
CoA. When this thioester is hydrolyzed, citrate is formed in an 
irreversible reaction. 

30. It enables organisms such as plants and bacteria to convert 
fats, through acetyl CoA, into glucose. 


31. We cannot get the net conversion of fats into glucose, because 
the only means to get the carbon atoms from fats into oxaloac- 
etate, the precursor of glucose, is through the citric acid cycle. 
However, although two carbon atoms enter the cycle as acetyl 
CoA, two carbon atoms are lost as CO, before oxaloacetate is 
formed. Thus, although some carbon atoms from fats may end 
up as carbon atoms in glucose, we cannot obtain a net synthesis 
of glucose from fats. 

32. Acetyl CoA will inhibit the complex. Glucose metabolism to 
pyruvate will be slowed because acetyl CoA is being derived from 
an alternative source. 

33. The enol intermediate of acetyl CoA attacks the carbonyl 
carbon atom of glyoxylate to form a C-C bond. This reac- 

tion is like the condensation of oxaloacetate with the enol 
intermediate of acetyl CoA in the reaction catalyzed by citrate 
synthase. Glyoxylate contains a hydrogen atom in place of the 
—CH,COO*™ group of oxaloacetate; the reactions are otherwise 
nearly identical. 

34. Citrate is a symmetric molecule. Consequently, the investiga- 
tors assumed that the two -CH,COO™ groups in it would react 
identically. Thus, for every citrate molecule undergoing the reac- 
tions shown in path 1, they thought that another citrate molecule 
would react as shown in path 2. If so, then only half the label 
should have emerged in the CO,,. 


CH; 
é coo- 
00c S-CoA HS-CoA OH 
2 YS coo- 
> ———— 
COoO7 Ccoo- 
coo7 coo- 
co, 
> O ea > -oo¢ 
‘coo- 
Path 1 
CH; 
Oo 
00G S-CoA HS-CoA 
° oe a 
‘coo- 
coo- coo- 
co, 
coo- 
> O ea > -ooc 
COO" 


Path 2 (does not occur) 


35. Call one hydrogen atom A and the other B. Now suppose that 
an enzyme binds three groups of this substrate—X, Y, and H—at 
three complementary sites. The adjoining diagram shows X, Y, and 
H, bound to three points on the enzyme. In contrast, X, Y, and 
Hg cannot be bound to this active site; two of these three groups 
can be bound, but not all three. Thus, H, and Hg will have differ- 
ent fates. 


Sterically nonequivalent groups such as H, and Hg will almost 
always be distinguished in enzymatic reactions. The essence of 
the differentiation of these groups is that the enzyme holds the 
substrate in a specific orientation. Attachment at three points, as 
depicted in the diagram, is a readily visualized way of achieving a 
particular orientation of the substrate, but it is not the only means 
of doing so. 

36. (a) The complete oxidation of citrate requires 4.5 mol of O, 
for every micromole of citrate. 


CeHgO7 + 4.5 Oo 


>6CO, + 4H,O 


Thus, 13.5 mol of O, would be consumed by 3 mol of citrate. 
(b) Citrate led to the consumption of far more O, than can be 
accounted for simply by the oxidation of citrate itself. Citrate thus 
facilitated O, consumption. 

37. (a) In the absence of arsenite, the amount of citrate remained 
constant. In its presence, the concentration of citrate fell, suggest- 
ing that it was being metabolized. 

(b) The action of arsenite is not altered. Citrate still disappears. 

(c) Arsenite is preventing the regeneration of citrate. Recall 

(pp. 515-516) that arsenite inhibits the pyruvate dehydrogenase 
complex. 

38. (a) The initial infection is unaffected by the absence of isocit- 
rate lyase, but the absence of this enzyme inhibits the latent phase 
of the infection. 

(b) Yes. 

(c) A critic could say that, in the process of deleting the isocitrate 
lyase gene, some other gene was damaged, and it is the absence 

of this other gene that prevents latent infection. Reinserting the 
isocitrate lyase gene into the bacteria from which it had been 
removed renders the criticism less valid. 

(d) Isocitrate lyase enables the bacteria to synthesize carbohydrates 
that are necessary for survival, including carbohydrate components 
of the cell membrane. 


Chapter 18 

1. In fermentations, organic compounds are both the donors and 
the acceptors of electrons. In respiration, the electron donor is 
usually an organic compound, whereas the electron acceptor is an 
inorganic molecule, such as oxygen. 

2. Biochemists use E, the value at pH 7, whereas chemists use 
Ey, the value in 1 M H*. The prime denotes that pH 7 is the 
standard state. 

3. The reduction potential of FADH, is less than that of NADH 
(Table 18.1). Consequently, when those electrons are passed along 
to oxygen, less energy is released. The consequence of the differ- 
ence is that electron flow from FADH, to O, pumps fewer protons 
than do the electrons from NADH. 

4. The AG®’ for the reduction of oxygen by FADH, is —200 kJ 
mol”! (—48 kcal mol!). 

5. AG” is +67 kJ mol! (+16.1 kcal mol!) for oxidation by 
NAD? and —3.8 kJ mol! (—0.92 kcal mol” ') for oxidation by 
FAD. The oxidation of succinate by NAD* is not thermodynami- 
cally feasible. 


A23 


Answers to Problems 


6. An oxidizing agent, or oxidant, accepts electrons in oxidation— 
reduction reactions. A reducing reagent, or reductant, donates 
electrons in such reactions. 

7. Pyruvate accepts electrons and is thus the oxidant. NADH 
gives up electrons and is the reductant. 

8. AG°’ = —nFAE) 

9. The AE9 value of iron can be altered by changing the environ- 
ment of the ion. 

10. c,e, b,a, d. 

11. (a) 4; (b) 5; (c) 2; (d) 10; (e) 3; (£) 8; (g) 9; (h) 7; @) 1; G) 6. 

12. (a) 4; (b) 3; (c) 1; (d) 5; (e) 2. 

13. The 10 isoprene units render coenzyme Q soluble in the 
hydrophobic environment of the inner mitochondrial membrane. 
The two oxygen atoms can reversibly bind two electrons and two 
protons as the molecule transitions between the quinone form and 
quinol form. 

14. Rotenone: NADH, NADH-Q oxidoreductase will be 
reduced. The remainder will be oxidized. Antimycin A: NADH, 
NADH-Q oxidoreductase and coenzyme Q will be reduced. The 
remainder will be oxidized. Cyanide: All will be reduced. 

15. Complex I would be reduced, whereas Complexes II, III, and 
IV would be oxidized. The citric acid cycle would halt because it 
has no way to oxidize NADH. 

16. The respirasome is another example of the use of supramo- 
lecular complexes in biochemistry. Having the three complexes 
that are proton pumps associated with one another will enhance 
the efficiency of electron flow from complex to complex, which in 
turn will cause more-efficient proton pumping. 

17. Succinate dehydrogenase is a component of Complex II. 

18. Hydroxyl radical (OH -), hydrogen peroxide (H,O,), superoxide 
ion (O2_), and peroxide (O»? ~ ). These small molecules react with a 
host of macromolecules—including proteins, nucleotides, and mem- 
branes—to disrupt cell structure and function. 

19. The ATP is recycled by ATP-generating processes, most 
notably oxidative phosphorylation. 

20. (a) 12.5; (b) 14; (c) 32; (d) 13.5; (e) 30; (£) 16. 

21. (a) It blocks electron transport and proton pumping at 
Complex IV. (b) It blocks electron transport and ATP synthesis by 
inhibiting the exchange of ATP and ADP across the inner mito- 
chondrial membrane. (c) It blocks electron transport and proton 
pumping at Complex I. (d) It blocks ATP synthesis without inhib- 
iting electron transport by dissipating the proton gradient. (e) It 
blocks electron transport and proton pumping at Complex IV. (f ) 
It blocks electron transport and proton pumping at Complex III. 
22. If the proton gradient is not dissipated by the influx of protons 
into a mitochondrion with the generation of ATP, eventually the 
outside of the mitochondrion develops such a large positive charge 
that the electron-transport chain can no longer pump protons 
against the gradient. 

23. The subunits are jostled by background thermal energy 
(Brownian motion). The proton gradient makes clockwise rotation 
more likely because that direction results in protons flowing down 
their concentration gradient. 

24. Dicyclohexylcarbodiimide reacts readily with carboxyl groups. 
Hence, the most likely targets are aspartate and glutamate side 
chains. In fact, Asp 61 of subunit c of E. coli Fp is specifically 
modified by this reagent. The conversion of Asp 61 into asparagine 
by site-specific mutagenesis eliminates proton conduction. 

25. In the presence of poorly functioning mitochondria, the only 
means of generating ATP is by anaerobic glycolysis, which will 
lead to an accumulation of lactic acid in blood. 

26. If ADP cannot get into mitochondria, the electron-transport 
chain will cease to function because there will be no acceptor for 
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the energy. NADH will build up in the matrix. Recall that NADH 
inhibits some citric acid cycle enzymes and that NAD? is required 
by several citric acid cycle enzymes. Glycolysis will switch to lactic 
acid fermenation so that the NADH can be reoxidized to NADt 
by lactate dehydrogenase. 

27. (a) No effect; mitochondria cannot metabolize glucose. 

b) No effect; no fuel is present to power the synthesis of ATP. 

c) The [O,] falls because citrate is a fuel and ATP can be formed 
from ADP and P.. 

d) Oxygen consumption stops because oligomycin inhibits ATP 
synthesis, which is coupled to the activity of the electron-transport 
chain. 

e) No effect, for the reasons given in part d. 

f) [O,] falls rapidly because the system is uncoupled and does not 
require ATP synthesis to lower the proton-motive force. 

(g) [O,] falls, though at a lower rate. Rotenone inhibits Complex 

I, but the presence of succinate will enable electrons to enter at 
Complex II. 

(h) Oxygen consumption ceases because Complex IV is inhibited 
and the entire chain backs up. 

28. (a) The P : O ratio is equal to the product of (H*/2e ) and 
(P/H*). Note that the P : O ratio is identical with the P : 2 e™ ratio. 
(b) 2.5 and 1.5, respectively. 

29. Cyanide can be lethal because it binds to the ferric form of 
cytochrome oxidase and thereby inhibits oxidative phosphoryla- 
tion. Nitrite converts ferrohemoglobin into ferrihemoglobin, 
which also binds cyanide. Thus, ferrihemoglobin competes with 
cytochrome c oxidase for cyanide. This competition is therapeuti- 
cally effective because the amount of ferrihemoglobin that can be 
formed without impairing oxygen transport is much greater than 
the amount of cytochrome c oxidase. 

30. Such a defect (called Luft syndrome) was found in a 38-year-old 
woman who was incapable of performing prolonged physical work. 
Her basal metabolic rate was more than twice normal, but her thy- 
roid function was normal. A muscle biopsy showed that her mito- 
chondria were highly variable and atypical in structure. Biochemical 
studies then revealed that oxidation and phosphorylation were not 
tightly coupled in these mitochondria. In this patient, much of the 
energy of fuel molecules was converted into heat rather than ATP. 
31. Triose phosphate isomerase converts dihydroxyacetone phos- 
phate (a potential dead end) into glyceraldehyde 3-phosphate (a 
mainstream glycolytic intermediate). 

32. This inhibitor (like antimycin A) blocks the reduction of cyto- 
chrome c, by QH,, the crossover point. 

33. If oxidative phosphorylation were uncoupled, no ATP could be 
produced. In a futile attempt to generate ATP, much fuel would be 
consumed. The danger lies in the dose. Too much uncoupling would 
lead to tissue damage in highly aerobic organs such as the brain and 
heart, which would have severe consequences for the organism as 

a whole. The energy that is normally transformed into ATP would 

be released as heat. To maintain body temperature, sweating might 
increase, although the very process of sweating itself depends on ATP. 
34. If ATP and ADP cannot exchange between the matrix and the 
mitochondria, ATP synthase will cease to function because its sub- 
strate ADP is absent. The proton gradient will eventually become 
so large that the energy released by the electron-transport chain 
will not be great enough to pump protons against the larger-than- 
normal gradient. 

35. Add the inhibitor with and without an uncoupler, and 
monitor the rate of O, consumption. If the O, consumption 
increases again in the presence of inhibitor and uncoupler, the 
inhibitor must be inhibiting ATP synthase. If the uncoupler 
has no effect on the inhibition, the inhibitor is inhibiting the 
electron-transport chain. 


36. Presumably, because the muscle has greater energy needs, espe- 
cially during exercise, it will require more ATP. This requirement 
means that more sites of oxidative phosphorylation are called for, and 
these sites can be provided by an increase in the amount of cristae. 
37. The arginine residue, with its positive charge, will facilitate 
proton release from aspartic acid by stabilizing the negatively 
charged aspartate. 

38. 4; 4.7 

39. The ATP synthase would pump protons at the expense of 
ATP hydrolysis, thus maintaining the proton-motive force. The 
synthase would function as an ATPase. There is some evidence 
that damaged mitochondria use this tactic to maintain, at least 
temporarily, the proton-motive force. 

40. It suggests that malfunctioning mitochondria may play a role 
in the development of Parkinson disease. Specifically, it implicates 
Complex I. 

41. The extra negative charge on ATP relative to that on ADP 
accounts for ATP’s more-rapid translocation out of the mitochon- 
drial matrix. If the charge differences between ATP and ADP were 
lessened by the binding of Mg?*, ADP might more readily com- 
pete with ATP for transport to the cytoplasm. 

42. When all of the available ADP has been converted into 

ATP, ATP synthase can no longer function. The proton gradient 
becomes large enough that the energy of the electron-transport 
chain is not enough to pump against the gradient, and electron 
transport and, hence, oxygen consumption falls. 

43. The effect on the proton gradient is the same in each case. 

44 ATP export from the matrix. Phosphate import into the matrix. 
45. Recall from the discussion of enzyme-catalyzed reactions 

that the direction of a reaction is determined by the AG difference 
between substrate and products. An enzyme speeds up the rate of 
both the forward and the backward reactions. The hydrolysis of 
ATP is exergonic, and so ATP synthase will enhance the hydro- 
lytic reaction. 

46. The cytoplasmic kinases thereby obtain preferential access to 
the exported ATP. 

47. The organic acids in the blood are indications that the mice are 
deriving a large part of their energy needs through anaerobic gly- 
colysis. Lactate is the end product of anaerobic glycolysis. Alanine 
is an aminated transport form of pyruvate, which is formed from 
lactate. Alanine formation plays a role in succinate formation, 
which is caused by the reduced state of the mitochondria. 


NADH NADH 


Asparate oxaloacetate me > malate i > fumarate 


FADH2 


pyruvate 


alanine succinate 


Lactate 


The electron-transport chain is slowed because the inner mito- 
chondrial membrane is hyperpolarized. Without ADP to accept 
the energy of the proton-motive force, the membrane becomes 
polarized to such an extent that protons can no longer be pumped. 
The excess HO, 1s probably due to the fact that the superoxide 
radical is present in higher concentration because the oxygen can 
no longer be effectively reduced. 


O,:7 + O,:-— + 2H*—>0, + H,0, 
Indeed, these mice display evidence of such oxidative damage. 
48. (a) Vitamins C and E. 


(b) Exercise induces superoxide dismutase, which converts ROS in 
hydrogen peroxide and oxygen. 


(c) The answer to this question is not fully established. Two pos- 
sibilities are (1) the suppression of ROS by vitamins prevents the 
expression of more superoxide dismutase and (2) some ROS may be 
signal molecules required to stimulate insulin-sensitivity pathways. 
49. (a) DNP is an uncoupler that prevents the use of the proton- 
motive force for ATP synthesis. Consequently, the rate of oxygen 
consumption rises (reflecting the rate of the electron transport 
chain) in a futile attempt to synthesize ATP. Because mitochon- 
drial ATP synthesis in inhibited, the rate of glycolysis, as mea- 
sured by ECAR, increases in an attempt meet the cells ATP needs. 
(b) Because glycolysis is now inhibited, no lactic acid will be produced 
and the rate of extracellular acidification will fall. Because DNP is 
still present, oxygen consumption will still occur at a high rate. 

(c) A key step in glycolysis is the isomerization of glucose 
6-phosphate to fructose 6-phosphate. 2-Deoxyglucose is inca- 
pable of undergoing the isomerization. 

(d) Rotenone inhibits electron flow through Complex I, the elec- 
tron transport chain is inhibited and oxygen consumption ceases. 
50. (a) Succinate is oxidized by Complex II, and the electrons are 
used to establish a proton-motive force that powers ATP synthesis. 
(b) The ability to synthesize ATP is greatly reduced. 

(c) The goal was to measure ATP hydrolysis. If succinate had been 
added in the presence of ATP, no reaction would have taken place, 
because of respiratory control. 

(d) The mutation has little effect on the ability of the enzyme to 
catalyze the hydrolysis of ATP. 

(e) They suggest two things: (1) the mutation did not affect the cat- 
alytic site on the enzyme, because ATP synthase is still capable of 
catalyzing the reverse reaction, and (2) the mutation did not affect 
the amount of enzyme present, given that the controls and patients 
had similar amounts of activity. 

51. The absolute configuration of thiophosphate indicates that 
inversion at phosphorus has taken place in the reaction catalyzed 
by ATP synthase. This result is consistent with an inline phosphoryl- 
transfer reaction taking place in a single step. The retention of con- 
figuration in the Ca2*-ATPase reaction points to two phosphoryl- 
transfer reactions—inversion by the first and a return to the 
starting configuration by the second. The Ca?*-ATPase reaction 
proceeds by a phosphorylated enzyme intermediate. 


Chapter 19 

1. Ultimately, all of the carbon atoms of which we are made, not 
just carbohydrates, enter the biosphere through the process of 
photosynthesis. Moreover, the oxygen that we require is produced 
by photosynthesis. 

2. 2 NADP* + 3 ADP3- + 3P,?2- + H*-—>O, + 2 NADPH + 
3 ATP*- + H,O 

3. (a) 7; (b) 5; (c) 4; (d) 10; (e) 1; (£) 2; (g) 9; (h) 3; @) 8; G) 6. 

4. Photosystem IJ, in conjunction with the oxygen-generating 
complex, powers oxygen release. The reaction center of photosys- 
tem IT absorbs light maximally at 680 nm. 

5. Oxygen consumption will be maximal when photosystems I 
and II are operating cooperatively. Oxygen will be efficiently gen- 
erated when electrons from photosystem II fill the electron holes in 
photosystem I, which were generated when the reaction center of 
photosystem I was illuminated by light of 700 nm. 

6. Photosystem I generates ferredoxin, which reduces NADP* to 
NADPH, a biosynthetic reducing power. Photosystem II activates 
the water-oxidizing complex, generating electrons for photosyn- 
thesis, and generating protons to form a proton gradient and to 
reduce ferredoxin and O,. 

7. The light reactions take place on thylakoid membranes. 
Increasing the membrane surface increases the number of ATP- 
and NADPH -generating sites. 
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8. These complexes absorb more light than can a reaction center 
alone. The light-harvesting complexes funnel light to the reaction 
centers. 

9. NADP? is the acceptor. H,O is the donor. Light energy. 

10. The charge gradient, a component of the proton-motive force 
in mitochondria, is neutralized by the influx of Mg?* into the 
lumen of the thylakoid membranes. 

11. Chlorophyll is readily inserted into the hydrophobic interior of 
the thylakoid membranes. 

12. Protons released by the oxidation of water; protons pumped 
into the lumen by the cytochrome bf complex; protons removed 
from the stroma by the reduction of NADP* and plastoquinone. 
13. 700-nm photons have an energy content of +172 kJ mol7!. 
The absorption of light by photosystem I results ina AE9 of —1.0V. 
Recall that AG) = —nFAE$, where F = 96.48 kJ mol”! V1. 
Under standard conditions, the energy change for the electrons is 
96.5 kJ. Thus, the efficiency is 96.5/172 = 56%. 

14. The electron flow from photosystem II to photosystem I is 
uphill, or exergonic. For this uphill flow, ATP would need to be 
consumed, defeating the purpose of photosynthesis. 

15. AE) = 10.11 V, and AG” = —21.3kJ mol~! (—5.1 kcal mol~!). 
16. (a) All ecosystems require an energy source from outside the 
system, because the chemical energy sources will ultimately be 
limited. The photosynthetic conversion of sunlight is one example 
of such a conversion. 

(b) Not at all. Spock would point out that chemicals other than 
water can donate electrons and protons. 

17. DCMU inhibits electron transfer in the link between photosys- 
tems IJ and I. O, can evolve in the presence of DCMU if an artificial 
electron acceptor such as ferricyanide can accept electrons from Q. 
18. DCMU will have no effect, because it blocks photosystem II, 
and cyclic photophosphorylation uses photosystem I and the cyto- 
chrome bf complex. 

19. (a) +120 kJ einstein | (+ 28.7 kcal einstein ') 

(b) 1.24 V 

(c) One 1000-nm photon has the free energy content of 2.4 mol- 
ecules of ATP. A minimum of 0.42 photon is needed to drive the 
synthesis of a molecule of ATP. 

20. At this distance, the expected rate is one electron per second. 
21. The distance doubles, and so the rate should decrease by a fac- 
tor of 64 to 640 ps. 

22. The cristae. 

23. In eukaryotes, both processes take place in specialized organ- 
elles. Both depend on high-energy electrons to generate ATP. In 
oxidative phosphorylation, the high-energy electrons originate in 
fuels and are extracted as reducing power in the form of NADH. 
In photosynthesis, the high-energy electrons are generated by light 
and are captured as reducing power in the form of NADPH. Both 
processes use redox reactions to generate a proton gradient, and 
the enzymes that convert the proton gradient into ATP are very 
similar in both processes. In both systems, electron transport takes 
place in membranes inside organelles. 

24. We need to factor in the NADPH because it is an energy-rich 
molecule. Recall from Chapter 18 that NADH is worth 2.5 ATP if 
oxidized by the electron-transport chain. 12 NADPH = 30 ATP. 
Eighteen molecules of ATP are used directly, and so the equivalent 
of 48 molecules of ATP is required for the synthesis of glucose. 

25. Both photosynthesis and cellular respiration are powered by 
high-energy electrons flowing toward a more-stable state. In cel- 
lular respiration, the high-energy electrons are derived from the 
oxidation of carbon fuels as NADH and FADH,. They release 
their energy as they reduce oxygen. In photosynthesis, high-energy 
electrons are generated by absorbing light energy, and they find 
stability in photosystem I and ferridoxin. 
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26. The electrons flow through photosystem II directly to ferri- 
cyanide. No other steps are required. 

27. (a) Thioredoxin 

(b) The control enzyme is unaffected, but the mitochondrial 
enzyme with part of the chloroplast y subunit increases activity as 
the concentration of DTT increases. 

(c) The increase was even larger when thioredoxin was present. 
Thioredoxin is the natural reductant for the chloroplast enzyme, 
and so it presumably operates more efficiently than would DTT, 
which probably functions to keep the thioredoxin reduced. 

(d) They seem to have done so. 

(e) The enzyme is susceptible to control by the redox state. In plant 
cells, reduced thioredoxin is generated by photosystem I. Thus, the 
enzyme is active when photosynthesis is taking place. 

(f) Cysteine. 

(g) Group-specific modification or site-specific mutagenesis. 


Chapter 20 

1. The Calvin cycle is the primary means of converting gaseous 
CO, into organic matter—that is, biomolecules. Essentially, every 
carbon atom in your body passed through rubisco and the Calvin 
cycle at some time in the past. 

2. Autotrophs can use the energy of sunlight, carbon dioxide, and 
water to synthesize carbohydrates, which can subsequently be used 
for catabolic or anabolic purposes. Heterotrophs require chemical 
fuels and are thus ultimately dependent on autotrophs. 

3. Nothing grim or secret about these reactions. They are sometimes 
called the dark reactions because they do not directly depend on light. 
4. 


Krebs cycle 
Matrix 
Carbon chemistry for 


Calvin cycle 
Stroma 
Carbon chemistry for 


photosynthesis oxidative phosphorylation 
Fixes CO, Releases CO, 
Requires high-energy electrons Generates high-energy 
(NADPH) electrons (NADH) 


Regenerates starting 
compound (oxaloacetate) 


Generates ATP 
Simple stoichiometry 


Regenerates starting compound 
(ribulose 1.5-bisphosphate) 


Requires ATP 
Complex stoichiometry 


5. (a) 3-Phosphoglycerate. (b) The other members of the Calvin cycle. 
6. Stage 1 is the fixation of CO, with ribulose 1,5-bisphosphate 
and the subsequent formation of 3-phosphoglycerate. Stage 2 is the 
conversion of some of the 3-phosphoglycerate into hexose. Stage 3 
is the regeneration of ribulose 1,5-bisphosphate. 

7. It catalyzes a crucial reaction, but it is highly inefficient. 
Consequently, it is required in large amounts to overcome its slow 
catalysis. 

8. Because carbamate forms only in the presence of CO,, this 
property prevents rubisco from catalyzing the oxygenase reaction 
exclusively when CO, is absent. 

9. Because NADPH is generated in the chloroplasts by the light 
reactions. 

10. The concentration of 3-phosphoglycerate would increase, 
whereas that of ribulose 1,5-bisphosphate would decrease. 

11. The concentration of 3-phosphoglycerate would decrease, 
whereas that of ribulose 1,5-bisphosphate would increase. 

12. Aspartate + glyoxylate —- oxaloacetate + glycine 

13. The oxygenase activity of rubisco increases with temperature. 
Crabgrass is a C, plant, whereas most grasses lack this capability. 


Consequently, the crabgrass will thrive at the hottest part of the sum- 
mer because the C, pathway provides an ample supply of CO,. 
14. The C, pathway allows the CO, concentration to increase at 
the site of carbon fixation. High concentrations of CO, inhibit 
the oxygenase reaction of rubisco. This inhibition is important for 
tropical plants because the oxygenase activity increases more rap- 
idly with temperature than does the carboxylase activity. 
15. ATP is required to form phosphoenolpyruvate (PEP) from 
pyruvate. The PEP combines with CO, to form oxaloacetate and, 
subsequently, malate. Two ATP molecules are required because a 
second ATP molecule is required to phosphorylate AMP to ADP. 
16. Photorespiration is the consumption of oxygen by plants 
with the production of CO,, but it does not generate energy. 
Photorespiration is due to the oxygenase activity of rubisco. It is 
wasteful because, instead of fixing CO, for conversion into hex- 
oses, rubisco is generating CO,. 
17. As global warming progresses, C, plants will invade the higher 
latitudes, and C, plants will retreat to cooler regions. 
18. The light reactions lead to an increase in the stromal concen- 
trations of NADPH, reduced ferredoxin, and Mg?", as well as an 
increase in pH. 
19. (a) 5; (b) 1; (c) 7; (d) 2; (e) 10; (£) 3; (g) 6; (h) 4; @) 8; G) 9. 
20. The enzymes catalyze the transformation of the five-carbon 
sugar formed by the oxidative phase of the pentose phosphate 
pathway into fructose 6-phosphate and glyceraldehyde 3-phosphate, 
intermediates in glycolysis (and gluconeogenesis). 
21. (a) C; (b) Band F; (c) G; (d) F; (e) E; f) H; (g) I (h) D; (i) AS 
() F; (k) B. 
22. The label emerges at C-5 of ribulose 5-phosphate. 
23. Oxidative decarboxylation of isocitrate to a-ketoglutarate. A 
B-ketoacid intermediate is formed in both reactions. 
24. (a) 5 Glucose 6-phosphate + ATP > 

6 ribose 5-phosphate + ADP + Ht. 
(b) Glucose 6-phosphate + 12 NADP* + 7H,O> 

6CO, + 12 NADPH + 12 Ht + P.. 
25. The nonoxidative phase of the pentose phosphate pathway 
can be used to convert three molecules of ribose 5-phosphate into 
two molecules of fructose 6-phosphate and one molecule of glyc- 
eraldehyde 3-phosphate. These molecules are components of the 
glycolytic pathway. 
26. The conversion of fructose 6-phosphate into fructose 
1,6-bisphosphate by phosphofructokinase requires ATP. 
27. When much NADPH is required. The oxidative phase of 
the pentose phosphate pathway is followed by the nonoxidative 
phase. The resulting fructose 6-phosphate and glyceraldehyde 
3-phosphate are used to generate glucose 6-phosphate through 
gluconeogenesis, and the cycle is repeated until the equivalent of 
one glucose molecule is oxidized to CO,. 
28. Fava beans contain vicine, a purine glycoside that can lead 
to the generation of peroxides—reactive oxygen species that can 
damage membranes as well as other biomolecules. Glutathione 
is used to detoxify the ROS. The regeneration of glutathione 
depends on an adequate supply of NADPH, which is synthe- 
sized by the oxidative phase of the pentose phosphate pathway. 
People with low levels of the dehydrogenase are especially sus- 
ceptible to vicine toxicity. 
29. Because red blood cells do not have mitochondria and the 
only means to obtain NADPH is through the pentose phosphate 
pathway. There are biochemical means to convert mitochondrial 
NADH into cytoplasmic NADPH. 
30. Reactive peroxides are a type of reactive oxygen species. The 
enzyme glutathione peroxidase uses reduced glutathione to neu- 
tralize peroxides by converting them into alcohols while generating 


oxidized glutathione. Reduced glutathione is regenerated by glu- 
tathione reductase with the use of NADPH, the product of the 
oxidative phase of the pentose phosphate pathway. 

31. AE) for the reduction of glutathione by NADPH is +0.09 V. 
Hence, AG® is —17.4 kJ mol7! (—4.2 kcal mol~!), which corresponds 
to an equilibrium constant of 1126. The required [NADPH]/ 
[NADP*] ratio is 8.9 X 107°. 
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34. Incubate an aliquot of a tissue homogenate with glucose 
labeled with !4C at C-1, and incubate another with glucose labeled 
with !4C at C-6. Compare the radioactivity of the CO, produced 
by the two samples. The rationale of this experiment is that only 
C-1 is decarboxylated by the pentose phosphate pathway, whereas 
C-1 and C-6 are decarboxylated equally when glucose is metabo- 
lized by the glycolytic pathway, the pyruvate dehydrogenase 
complex, and the citric acid cycle. The reason for the equivalence 
of C-1 and C-6 in the latter set of reactions is that glyceraldehyde 
3-phosphate and dihydroxyacetone phosphate are rapidly inter- 
converted by triose phosphate isomerase. 

35. Lacking mitochondria, red blood cells metabolize glucose to 
lactate to obtain energy in the form of ATP. The CO, results from 
extensive use of the pentose phosphate pathway coupled with 
gluconeogenesis. This coupling allows the generation of much 
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NADPH with the complete oxidation of glucose by the oxidative 
branch of the pentose phosphate pathway. 
36. (a) Reg °02/KyyOO2 = 3 X 10°s'M7! 


p.,02/ oe on = 4x 103s-!M"! 
(b) Despite the ba that the specificity constant for CO, as a 
substrate is much greater than that of O os the concentration of O, 
in the atmosphere is higher than that of CO,, allowing the oxygen- 
ation reaction to occur. 
37. The reduction of each mole of CO, to the level of a hexose 
requires two moles of NADPH. The reduction of NADP? is 
a two-electron process. Hence, the formation of two moles of 
NADPH requires the pumping of four moles of electrons by 
photosystem I. The electrons given up by photosystem I are 
replenished by photosystem II, which needs to absorb an equal 
number of photons. Hence, eight photons are needed to gener- 
ate the required NADPH. The energy input of eight moles of 
photons is +1594 kJ (+381 kcal). Thus, the overall efficiency of 
photosynthesis under standard conditions is at least 477/1594, 
or 30%. 
38. It is neither a violation nor a miracle. The equation on page 
597 requires not only 18 ATP, but also 12 NADPH. These elec- 
trons, if transferred to NAD* and used in the electron-transport 
chain, would yield 30 ATP. Thus, the synthesis of glucose requires 
the equivalent of 48 ATP. 
39. (a) The curve on the right in graph A was generated by the 
C, plant. Recall that the oxygenase activity of rubisco increases 
with temperature more rapidly than does the carboxylase activity. 
Consequently, at higher temperatures, the C, plants would fix less 
carbon. Because C, plants can maintain a higher CO, concentra- 
tion, the rise in temperature is less deleterious. 
(b) The oxygenase activity will predominate. Additionally, when 
the temperature rise is very high, the evaporation of water might 
become a problem. The higher temperatures can begin to damage 
protein structures as well. 
(c) The C, pathway is a very effective active-transport system for 
concentrating CO,, even when environmental concentrations are 
very low. 
(d) With the assumption that the plants have approximately the 
same capability to fix CO,, the C, pathway is apparently the rate- 
limiting step in C, plants. 


Chapter 21 

1. Step 1 is the release of glucose 1-phosphate from glycogen by 
glycogen phosphorylase. Step 2 is the formation of glucose 
6-phosphate from glucose 1-phosphate, a reaction catalyzed by 
phosphoglucomutase. Step 3 is the remodeling of the glycogen by 
the transferase and the glucosidase. 

2. (a) 8; (b) 3; (c) 6; (d) 5; (e) 9; (£) 2; (g) 10; (h) 1; @) 45 G) 7 

3. Glycogen is an important fuel reserve for several reasons. The 
controlled breakdown of glycogen and release of glucose increase 
the amount of glucose that is available between meals. Hence, 
glycogen serves as a buffer to maintain blood-glucose levels. 
Glycogen’s role in maintaining blood-glucose levels is especially 
important because glucose is virtually the only fuel used by the 
brain, except during prolonged starvation. Moreover, the glucose 
from glycogen is readily mobilized and is therefore a good source 
of energy for sudden, strenuous activity. Unlike fatty acids, the 
released glucose can provide energy in the absence of oxygen and 
can thus supply energy for anaerobic activity. 

4. Asan unbranched polymer, a-amylose has only one nonreduc- 
ing end. Therefore, only one glycogen phosphorylase molecule 
could degrade each a-amylose molecule. Because glycogen is 
highly branched, there are many nonreducing ends per molecule. 
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Consequently, many phosphorylase molecules can release many 
glucose molecules per glycogen molecule. 

5. The patient has a deficiency of the branching enzyme. 

6. In muscle, the b form of phosphorylase is activated by AMP. 

In the liver, the a form is inhibited by glucose. The difference cor- 
responds to the difference in the metabolic role of glycogen in each 
tissue. Muscle uses glycogen as a fuel for contraction, whereas the 
liver uses glycogen to maintain blood-glucose levels. 

7. Cells maintain the [P,]/[glucose 1-phosphate] ratio at greater 
than 100, substantially favoring phosphorolysis. We see here an 
example of how the cell can alter the free-energy change to favor a 
reaction taking place by altering the ratio of substrate and product. 
8. The high level of glucose 6-phosphate in von Gierke disease, 
resulting from the absence of glucose 6-phosphatase or the trans- 
porter, shifts the allosteric equilibrium of phosphorylated glycogen 
synthase toward the active form. 

9. The phosphoryl donor is glucose 1,6-bisphosphate, which is 
formed from glucose 1-phosphate and ATP in a reaction catalyzed 
by phosphoglucokinase. 

10. The different manifestations correspond to the different roles 
of the liver and muscle. Liver glycogen phosphorylase plays a 
crucial role in the maintenance of blood-glucose levels. Recall that 
glucose is the primary fuel for the brain. Muscle glycogen phos- 
phorylase provides glucose only for the muscle and, even then, only 
when the energy needs of the muscle are high, as during exercise. 
The fact that there are two different diseases suggests that there 
are two different isozymic forms of the glycogen phosphorylase— 
a liver-specific isozyme and a muscle-specific isozyme. 

11. Water is excluded from the active site to prevent hydrolysis. 
The entry of water could lead to the formation of glucose rather 
than glucose 1-phosphate. A site-specific mutagenesis experiment 
is revealing in this regard. In phosphorylase, Tyr 573 is hydrogen 
bonded to the 2’-OH group of a glucose residue. The ratio of glu- 
cose 1-phosphate to glucose product is 9000 : 1 for the wild-type 
enzyme, and 500 : 1 for the Phe 573 mutant. Model building sug- 
gests that a water molecule occupies the site normally filled by the 
phenolic OH group of tyrosine and occasionally attacks the oxo- 
carbonium ion intermediate to form glucose. 

12. The amylase activity was necessary to remove all of the glyco- 
gen from the glycogenin. Recall that glycogenin synthesizes oligo- 
saccharides of about eight glucose units, and then activity stops. 
Consequently, if the glucose residues are not removed by extensive 
amylase treatment, glycogenin will not function. 

13. The substrate can be handed directly from the transferase site 
to the debranching site. 

14. During exercise, [ATP] falls and [AMP] rises. Recall that 
AMP is an allosteric activator of glycogen phosphorylase b. Thus, 
even in the absence of covalent modification by phosphorylase 
kinase, glycogen is degraded. 

15. Although glucose 1-phosphate is the actual product of the 
phosphorylase reaction, glucose 6-phosphate is a more versatile 
molecule with respect to metabolism. Among other fates, glucose- 
6-phosphate can be processed to yield energy or building blocks. 
In the liver, glucose 6-phosphate can be converted into glucose and 
released into the blood. 

16. Epinephrine binds to its G-protein-coupled receptor. The resulting 
structural changes activate a G,, protein, which in turn activates ade- 
nylate cyclase. Adenylate cyclase synthesizes cAMP, which activates 
protein kinase A. Protein kinase A partly activates phosphoryl kinase, 
which phosphorylates and activates glycogen phosphorylase. The cal- 
cium released during muscle contraction also activates the phosphory- 
lase kinase, leading to further stimulation of glycogen phosphorylase. 
17. First, the signal-transduction pathway is shut down when 

the initiating hormone is no longer present. Second, the inherent 


GTPase activity of the G protein converts the bound GTP into 
inactive GDP. Third, phosphodiesterases convert cyclic AMP into 
AMP. Fourth, PP1 removes the phosphoryl group from glycogen 
phosphorylase, converting the enzyme into the usually inactive 

b form. 

18. It prevents both from operating simultaneously, which would 
lead to a useless expenditure of energy. 

19. All these symptoms suggest central nervous system prob- 
lems. If exercise is exhaustive enough or the athlete has not pre- 
pared well enough or both, liver glycogen also can be depleted. 
The brain depends on glucose derived from liver glycogen. 

The symptoms suggest that the brain is not getting enough fuel 
(hypoglycemia). 

20. Liver phosphorylase a is inhibited by glucose, which facili- 
tates the R > T transition. This transition releases PP1, which 
inactivates glycogen breakdown and stimulates glycogen synthesis. 
Muscle phosphorylase is insensitive to glucose. 

21. (a) 4; (b) 1; (c) 5; (d) 10; (e) 7; (£) 2; (g) 8; (h) 9; (4) 6; G) 3. 

22. Phosphoglucomutase, UDP-glucose pyrophosphorylase, 
pyrophosphatase, glycogenin, glycogen synthase, and branching 
enzyme. 

23. The enzyme pyrophosphatase converts the pyrophosphate into 
two molecules of inorganic phosphate. This conversion renders the 
overall reaction irreversible. 


Glucose 1-phosphate + UTP == UDP-glucose + PP; 
PP, + H,O —>2P, 


i 


Glucose 1-phosphate + UTP —~> UDP-glucose + 2Pi 


24. The presence of high concentrations of glucose 6-phosphate 
indicates that glucose is abundant and that it is not being used by 
glycolysis. Therefore, this valuable resource is saved by incorpora- 
tion into glycogen. 

25. Free glucose must be phosphorylated at the expense of a 
molecule of ATP. Glucose 6-phosphate derived from glycogen is 
formed by phosphorolytic cleavage, sparing one molecule of ATP. 
Thus, the net yield of ATP when glycogen-derived glucose is pro- 
cessed to pyruvate is three molecules of ATP compared with two 
molecules of ATP from free glucose. 

26. Breakdown: Phosphoglucomutase converts glucose 1-phosphate, 
liberated from glycogen breakdown, into glucose 6-phosphate, 
which can be released as free glucose (liver) or processed in glycolysis 
(muscle and liver). Synthesis: Converts glucose 6-phosphate into 
glucose 1-phosphate, which reacts with UTP to form UDP-glucose, 
the substrate for glycogen synthase. 

27. Glycogen, + P; —> glycogen, _, + glucose 6-phosphate 


Glucose 6-phosphate —-> glucose 1-phosphate 
UTP + glucose 1-phosphate —> UDP-glucose + 2 P; 


Glycogen, _, + UDP-glucose —-~> glycogen, + UDP 


Sum: Glycogen, + UTP —=> glycogen, + UDP + P; 
28. In principle, having glycogen be the only primer for the 
further synthesis of glycogen should be a successful strategy. 
However, if the glycogen granules were not evenly divided 
between daughter cells, glycogen stores for future generations of 
cells might be compromised. Glycogenin synthesizes the primer 
for glycogen synthase. 
29. Insulin binds to its receptor and activates the tyrosine kinase 
activity of the receptor, which in turn triggers a pathway that acti- 
vates protein kinases. The kinases phosphorylate and inactivate 
glycogen synthase kinase. Protein phosphatase 1 then removes 
the phosphate from glycogen synthase and thereby activates the 
synthase. 
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o-1,6-Glucosidase reaction 


31. In the liver, glucagon stimulates the cAMP-dependent path- 
way that activates protein kinase A. Epinephrine binds to a 7TM 
a-adrenergic receptor in the liver plasma membrane, which acti- 
vates phospholipase C and the phosphoinositide cascade. This 
activation causes calcium ions to be released from the endoplasmic 
reticulum, which bind to calmodulin, and further stimulates phos- 
phorylase kinase and glycogen breakdown. 
32. Galactose + ATP + UTP + H,O + glycogen, —~> 
glycogen, ,, + ADP + UDP + 2 P. + H?*. 
33. Phosphorylase, transferase, glucosidase, phosphoglucomutase, 
and glucose 6-phosphatase. 
34. Glucose is an allosteric inhibitor of phosphorylase a. Hence, 
crystals grown in its presence are in the T state. The addition of 
glucose 1-phosphate, a substrate, shifts the R-to-T equilibrium 
toward the R state. The conformational differences between these 
states are sufficiently large that the crystal shatters unless it is sta- 
bilized by chemical cross-links. 
35. Galactose is converted into UDP-galactose to eventually form 
glucose 6-phosphate. 
36. This disease can also be produced by a mutation in the gene 
that encodes the glucose 6-phosphate transporter. Recall that glu- 
cose 6-phosphate must be transported into the lumen of the endo- 
plasmic reticulum to be hydrolyzed by phosphatase. Mutations in 
the other three essential proteins of this system can likewise lead to 
von Gierke disease. 
37. (a) Apparently, the glutamate, with its negatively charged R 
group, can mimic to some extent the presence of a phosphoryl group 
on serine. That the stimulation is not as great is not surprising in that 
the carboxyl group is smaller and not as charged as the phosphate. 
(b) Substitution of aspartate would give some stimulation, but because 
it is smaller than the glutamate, the stimulation would be smaller. 
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38. (a) Glycogen was too large to enter the gel and, because analy- 
sis was by western blot with the use of an antibody specific to gly- 
cogenin, we would not expect to see background proteins. 

(b) a-Amylase degrades glycogen, releasing the protein glyco- 
genin, which can be visualized by a western blot. 

(c) Glycogen phosphorylase, glycogen synthase, and protein phos- 
phatase 1. These proteins might be visible if the gel were stained 
for protein, but a western analysis reveals the presence of glyco- 
genin only. 

39. (a) The smear was due to molecules of glycogenin with 
increasingly large amounts of glycogen attached to them. 

(b) In the absence of glucose in the medium, glycogen is metabo- 
lized, resulting in a loss of the high-molecular-weight material. 

(c) Glycogen could have been resynthesized and added to the gly- 
cogenin when the cells were fed glucose again. 

(d) No difference between lanes 3 and 4 suggests that, by 1 hour, 
the glycogen molecules had attained maximum size in this cell line. 
Prolonged incubation does not apparently increase the amount of 
glycogen. 

(e) a-Amylase removes essentially all of the glycogen, and so only 
the glycogenin remains. 


Chapter 22 
1. (a) 5; (b) 11; (c) 1; (d) 10; (e) 2; (f) 6; (g) 9; (h) 3; (1) 4; G) 7; (k) 8. 
2. Glycerol + 2 NADT + P. + ADP —> 

pyruvate + ATP + H,O + 2 NADH + Ht 


Glycerol kinase and glycerol phosphate dehydrogenase 

3. The ready reversibility is due to the high-energy nature of the 
thioester in the acyl CoA. 

4. To return the AMP to a form that can be phosphorylated by 
oxidative phosphorylation or substrate-level phosphorylation, 
another molecule of ATP must be expended in the reaction: 


ATP + AMP == 2 ADP 


5. b,c, a,g,h,d,e, f. 

6. The citric acid cycle. The reactions that take succinate to oxalo- 
acetate, or the reverse, are similar to those of fatty acid metabolism 
(Section 17.2). 

7. The next-to-last degradation product, acetoacetyl CoA, yields 
two molecules of acetyl CoA with the thiolysis by only one mol- 
ecule of CoA. 

8. Palmitic acid yields 106 molecules of ATP. Palmitoleic acid has 
a double bond between carbons C-9 and C-10. When palmitoleic 
acid is processed in B oxidation, one of the oxidation steps (to 
introduce a double bond before the addition of water) will not take 
place, because a double bond already exists. Thus, FADH, will not 
be generated, and palmitoleic acid will yield 1.5 fewer molecules of 
ATP than palmitic acid, for a total of 104.5 molecules of ATP. 

9. 


Activation fee to form the acyl CoA —2 ATP 
Seven rounds of yield: 
7 acetyl CoA at 10 ATP/acetyl CoA ap HAO) NADY? 
7 NADH at 2.5 ATP/NADH + 17.5 ATP 
7 FADH, at 1.5 ATP/FADH, + 10.5 ATP 
Propionyl CoA, which requires an = iL ATP 
ATP to be converted into succinyl CoA 
Succinyl CoA —> succinate a> il AIM 
Succinate —> fumarate + FADH, ap ilo ANPP 
FADH, at 1.5 ATP/FADH, 
Fumarate —> malate 
Malate — oxaloacetate + NADH ap 25 ANP 
NADH at 2.5 ATP/NADH 
Total 120 ATP 
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10. You might hate yourself in the morning, but at least you won’t 
have to worry about energy. To form stearoyl CoA requires the 
equivalent of 2 molecules of ATP. 


Stearoyl CoA + 8 FAD + 8 NAD* + 8 CoA + 8 H,O —> 
9 acetyl CoA + 8 FADH, + 8 NADH + 8 H* 


9 acetyl CoA at 10 ATP/acetyl CoA +90 ATP 
8 NADH at 2.5 ATP/NADH + 20 ATP 
8 FADH, at 1.5 ATP/FADH, +12ATP 
Activation fee —2.0 

Total 122 ATP 


11. Keep in mind that, in the citric acid cycle, 1 molecule of 
FADH, yields 1.5 ATP, 1 molecule of NADH yields 2.5 ATP, 
and 1 molecule of acetyl CoA yields 10 ATP. Two molecules of 
ATP are produced when glucose is degraded to 2 molecules of 
pyruvate. Two molecules of NADH also are produced, but the 
electrons are transferred to FADH, to enter the electron trans- 
port chain. Each molecule of FADH, can generate 1.5 ATP. 
Each molecule of pyruvate will produce 1 molecule of NADH. 
Each molecule of acetyl CoA generates 3 molecules of NADH, 

1 molecule of FADH,, and 1 molecule of ATP. So, we have a total 
of 10 ATP per acetyl CoA, or 20 for the 2 molecules of acetyl 
CoA. The total for glucose is 30 ATP. Now, what about hexanoic 
acid? Caprioic acid is activated to caprioic CoA at the expense 

of 2 ATP, and so we are 2 ATP in the hole. The first cycle of 

B oxidation generates 1 FADH,, 1 NADH, and 1 acetyl CoA. 
After the acetyl CoA has been run through the citric acid cycle, 
this step will have generated a total of 14 ATP. The second cycle 
of B oxidation generates 1 FADH, and 1 NADH but 2 acetyl 
CoA. After the acetyl CoA has been run through the citric acid 
cycle, this step will have generated a total of 24 ATP. The total is 
36 ATP. Thus, the foul-smelling caprioic acid has a net yield of 
36 ATP. So on a per carbon basis, this fat yields 20% more ATP 
than does glucose, a manifestation of the fact that fats are more 
reduced than carbohydrates. 

12. Stearate + ATP + 13.5H,O + 8 FAD + 8 NAD*—> 

4.5 acetoacetate + 14.5Ht +8 FADH, +8 NADH + AMP + 2P.. 
13. Palmitate is activated and then processed by B oxidation 
according to the following reactions. 


Palmitate + CoA + ATP ——> palmitoyl CoA + AMP + 2 P. 


Palmitoyl CoA + 7 FAD + 7 NAD* + 7 CoASH + H,O —> 
8 acetyl CoA + 7 FADH, + 7 NADH + 7 H* 


The eight molecules of acetyl CoA combine to form four mol- 
ecules of acetoacetate for release into the blood, and so they 
do not contribute to the energy yield in the liver. However, 
the FADH, and NADH generated in the preparation of 
acetyl CoA can be processed by oxidative phosphorylation to 
yield ATP. 


= 10.5ATP 
= 17.5ATP 


1.5 ATP/FADH) X 7 
2.5 ATP/NADH X 7 


The equivalent of 2 ATP were used to form palmitoyl CoA. Thus, 
26 ATP were generated for use by the liver. 

14. NADH produced with the oxidation to acetoacetate = 2.5 ATP. 
Acetoacetate is converted into acetoacetyl CoA. Two molecules 

of acetyl CoA result from the hydrolysis of acetoacetyl CoA, each 
worth 10 ATP when processed by the citric acid cycle. Total ATP 
yield is 22.5. 


H5C 


15. Because a molecule of succinyl CoA is used to form acetoace- 
tyl CoA. Succinyl CoA could be used to generate one molecule of 
ATP, and so someone could argue that the yield is 21.5. 

16. For fats to be combusted, not only must they be converted 
into acetyl CoA, but the acetyl CoA must be processed by the 
citric acid cycle. In order for acetyl CoA to enter the citric acid 
cycle, there must be a supply of oxaloacetate. Oxaloacetate can be 
formed by the metabolism of glucose to pyruvate and the subse- 
quent carboxylation of pyruvate to form oxaloacetate. 

17. (a) 


i I. i it 
CH—(CH)3— CH—(CH2)3 CH (CH) —- CH——CH,—_COO- 
Phytanic acid 


The problem with phytanic acid is that, as it undergoes B oxida- 
tion, we encounter the dreaded pentavalent carbon atom. Because 
the pentavalent carbon atom doesn’t exist, 8 oxidation cannot take 
place and phytanic acid accumulates. 


CH; CH; 
R—C—-C—cCOoOo- > R—C==C—Coo™ > 
H Hy 
CH; Hs 
R—C==C—coo- > R——C—C—C0O™ > 
| 2 
OH 
CH; 


R—C—+C=C00 © 
| Me 
O 

The dreaded pentavalent 
carbon atom 


(b) Removing methyl] groups, though theoretically possible, would 
be time consuming and lacking in elegance. What would we do 
with the methyl groups? Our livers solve the problem by inventing 
& oxidation. 


i i 
R—C—C—COoOo™ > R—C—CH—COO™ > 
H H, H | 
OH 
i il 
R—-C——-C——COO™ > R—C—C—O +0, 
nll | 
fe) fe) 


One round of «@ oxidation rather than B oxidation converts phy- 
tanic acid into a B-oxidation substrate. 

18. The first oxidation removes two tritium atoms. The hydration 
adds nonradioactive H and OH. The second oxidation removes 
another tritium atom from the B-carbon atom. Thiolysis removes 
an acetyl CoA with only one tritium atom; so the tritium-to-carbon 
ratio is 1/2. This ratio will be the same for two of the acetates. The 
last one, however, does not undergo oxidation, and so all tritium 
remains. The ratio for this acetate is 3/2. The ratio for the entire 
molecule is then 5/6. 


19. In the absence of insulin, lipid mobilization will take place to 
an extent that it overwhelms the ability of the liver to convert the 
lipids into ketone bodies. 
20. (a) 10; (b) 1; (c) 5; (d) 8; (e) 3; (f) 95 (g) 6; (h) 7; (a) 4; G) 2 
21. (a) Oxidation in mitochondria; synthesis in the cytoplasm. 
(b) Coenzyme A in oxidation; acyl carrier protein for synthesis. 
(c) FAD and NAD* in oxidation; NADPH for synthesis. 
(d) The L isomer of 3-hydroxyacyl CoA in oxidation; the D isomer 
in synthesis. 
(e) From carboxy] to methyl in oxidation; from methyl to carboxyl 
in synthesis. 
(f) The enzymes of fatty acid synthesis, but not those of oxidation, 
are organized in a multienzyme complex. 
22. Bicarbonate is required for the synthesis of malonyl CoA from 
acetyl CoA by acetyl CoA carboxylase. 
23. 7acetyl CoA + 6 ATP + 12 NADPH +12 Ht—> 

myristate + 7 CoA + 6 ADP + 6P. + 12 NADP* + 5H,O. 
24. We will need six acetyl CoA units. One acetyl CoA unit will 
be used directly to become the two carbon atoms farthest from 
the acid end. The other five units must be converted into malo- 
nyl CoA. The synthesis of each malonyl CoA molecule costs a 
molecule of ATP; so 5 molecules of ATP will be required. Each 
round of elongation requires 2 molecules of NADPH, 1 molecule 
to reduce the keto group to an alcohol and 1 molecule to reduce 
the double bond. As a result, 10 molecules of NADPH will be 
required. Therefore, 5 molecules of ATP and 10 molecules of 
NADPH are required to synthesize lauric acid. 
25. eb,dac 
26. Such a mutation would inhibit fatty acid synthesis because the 
enzyme cleaves cytoplasmic citrate to yield acetyl CoA for fatty 
acid synthesis. 
27. (a) False. Biotin is required for acetyl CoA carboxylase activity. 
(b) True. 
) False. ATP is required to synthesize malonyl CoA. 
) True. 
) 


f) False, Fatty acid synthase is a dimer. 

) True. 
(h) False. Acetyl CoA carboxylase is stimulated by citrate, which is 
cleaved to yield its substrate acetyl CoA. 
28. Fatty acids with odd numbers of carbon atoms are synthesized 
starting with propionyl ACP (instead of acetyl ACP), which is 
formed from propionyl CoA by acetyl transacetylase. 
29. All of the labeled carbon atoms will be retained. Because we 
need 8 acetyl CoA molecules and only 1 carbon atom is labeled in 
the acetyl group, we will have 8 labeled carbon atoms. The only 
acetyl CoA used directly will retain 3 tritium atoms. The 7 acetyl 
CoA molecules used to make malonyl CoA will lose 1 tritium 
atom on addition of the CO, and another one at the dehydration 
step. Each of the 7 malonyl CoA molecules will retain 1 tritium 
atom. Therefore, the total retained tritium is 10 atoms. The ratio 
of tritium to carbon is 1.25. 
30. With a diet rich in raw eggs, avidin will inhibit fatty acid syn- 
thesis by reducing the amount of biotin required by acetyl CoA 
carboxylase. Cooking the eggs will denature avidin, and so it will 
no longer bind biotin. 
31. The only acetyl CoA used directly, not in the form of malonyl 
CoA, provides the two carbon atoms at the w end of the fatty acid 
chain. Because palmitic acid is a C,, fatty acid, acetyl CoA will 
have provided carbons 15 and 16. 
32. HCO3 is attached to acetyl CoA to form malonyl CoA. When 
malonyl CoA condenses with acetyl CoA to form the four-carbon 
ketoacyl CoA, the HCO is lost as CO. 


(c 
(d 
(e 
( 
(g 
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33. Phosphofructokinase controls the flux down the glycolytic 
pathway. Glycolysis functions to generate ATP or building blocks 
for biosynthesis, depending on the tissue. The presence of citrate 
in the cytoplasm indicates that those needs are met, and there is no 
need to metabolize glucose. 

34. C-1 is more radioactive. 

35. The mutant enzyme will be persistently active because it can- 
not be inhibited by phosphorylation. Fatty acid synthesis will be 
abnormally active. Such a mutation might lead to obesity. 

36. (a) Palmitoleate; (b) linoleate; (c) linoleate; (d) oleate; 

(e) oleate; (f) linolenate. 

37. Decarboxylation drives the condensation of malonyl ACP 

and acetyl ACP. In contrast, the condensation of two molecules of 
acetyl ACP is energetically unfavorable. In gluconeogenesis, decar- 
boxylation drives the formation of phosphoenolpyruvate from 
oxaloacetate. 

38. Fat mobilization in adipocytes is activated by phosphorylation. 
Hence, overproduction of the cAMP.-activated kinase will lead to an 
accelerated breakdown of triacylglycerols and a depletion of fat stores. 
39. Carnitine translocase deficiency and glucose 6-phosphate 
transporter deficiency. 

40. In the fifth round of 8 oxidation, cis-A*-enoyl CoA is formed. 
Dehydration by the classic hydratase yields D-3-hydroxyacyl CoA, 
the wrong isomer for the next enzyme in B oxidation. This dead 
end is circumvented by a second hydratase that removes water 

to give trans-A?-enoyl CoA. The addition of water by the clas- 

sic hydratase then yields L-3-hydroxyacyl CoA, the appropriate 
isomer. Thus, hydratases of opposite stereospecificities serve to 
epimerize (invert the configuration of) the 3-hydroxyl group of 
the acyl CoA intermediate. 

41. An advantage of this arrangement is that the synthetic activ- 
ity of different enzymes is coordinated. In addition, intermediates 
can be efficiently handed from one active site to another without 
leaving the assembly. Furthermore, a complex of covalently joined 
enzymes is more stable than one formed by noncovalent attrac- 
tions. Each of the component enzymes is recognizably homologous 
to its bacterial counterpart. 

42. The probability of synthesizing an error-free polypeptide 
chain decreases as the length of the chain increases. A single mis- 
take can make the entire polypeptide ineffective. In contrast, a 
defective subunit can be spurned in the formation of a noncovalent 
multienzyme complex; the good subunits are not wasted. 

43. The absence of ketone bodies is due to the fact that the liver, 
the source of ketone bodies in the blood, cannot oxidize fatty acids 
to produce acetyl CoA. Moreover, because of the impaired fatty 
acid oxidation, the liver becomes more dependent on glucose as an 
energy source. This dependency results in a decrease in gluconeo- 
genesis and a drop in blood-glucose levels, which is exacerbated by 
the lack of fatty acid oxidation in muscle and a subsequent increase 
in glucose uptake from the blood. 

44. Peroxisomes enhance the degradation of very long chain fatty 
acids. Consequently, increasing the activity of peroxisomes could 
help to lower levels of blood triglycerides. In fact, clofibrate is 
rarely used because of serious side effects. 

45. Citrate works by facilitating, in cooperation with the protein 
MIG12, the formation of active filaments from inactive mono- 
mers. In essence, it increases the number of active sites available, 
or the concentration of enzyme. Consequently, its effect is visible 
as an increase in the value of V,,,,. Allosteric enzymes that alter 
their V_..,, Values in response to regulators are sometimes called 
V-class enzymes. The more common type of allosteric enzyme, in 
which K,, 1s altered, comprises K-class enzymes. Palmitoyl CoA 
causes depolymerization and thus inactivation. 
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46. The thiolate anion of CoA attacks the 3-keto group to forma 
tetrahedral intermediate. This intermediate collapses to form acyl 
CoA and the enolate anion of acetyl CoA. Protonation of the eno- 


late yields acetyl CoA. 
47. 
gk P 
nepge ee 
2 
Malonyl-ACP 
o* fe) 
| ie 
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nes Gis ae” cH, 
Acetyl-ACP 
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H2 
Acetoacetyl-ACP 


48. (a) Fats burn in the flame of carbohydrates. Without carbohy- 
drates, there would be no anapleurotic reactions to replenish the 
components of the citric acid cycle. With a diet of fats only, the 
acetyl CoA from fatty acid degradation would build up. 

(b) Acetone from ketone bodies. 

(c) Yes. Odd-chain fatty acids would lead to the production of 
propionyl CoA, which can be converted into succinyl CoA, a citric 
acid cycle component. It would serve to replenish the citric acid 
cycle, allow glucose synthesis and mitigate the halitosis. 

49. A labeled fat can enter the citric acid cycle as acetyl CoA and 
yield labeled oxaloacetate, but only after two carbon atoms have 
been lost as CO,. Consequently, even though oxaloacetate may be 
labeled, there can be no net synthesis of oxaloacetate and hence no 
net synthesis of glucose or glycogen. 

50. (a) Glucose is the primary fuel used by the brain. Lack of 
pyruvate dehydrogenase would prevent complete oxidation of 
glucose-derived pyruvate by cellular respiration. (b) Because ATP 
cannot be generated aerobically, glucose would be metabolized to 
lactate to obtain some ATP. (c) Such a diet would generate ketone 
bodies for use as fuel by the brain. 

51. I-cell disease (Section 11.3) results when lysosomal enzymes 
are secreted instead of imported into the lysosome. 

52. (a) The V,,,,, is decreased and the K,, is increased. V__.,. 
type) = 13 nmol minute! mg; K,, (wild type) = 45 ~M; 
Vinax (Mutant) = 8.3 nmol minute! mg™!; K,, (mutant) = 74 pM. 
(b) Both the V,,,, and the K,, are decreased. V,_, (wild type) = 

41 nmol minute! mg~!; K,, (wild type) = 104 4M; V__,, (mutant) = 
23 nmol minute~! mg™!; K,, (mutant) = 69 uM. 

(c) The wild type is significantly more sensitive to malonyl CoA. 
(d) With respect to carnitine, the mutant displays approximately 
65% of the activity of the wild type; with respect to palmitoyl CoA, 
approximately 50% activity. On the other hand, 10 yM of malonyl 
CoA inhibits approximately 80% of the wild type but has essen- 
tially no effect on the mutant enzyme. 

(e) The glutamate appears to play a more prominent role in regula- 
tion by malonyl CoA than in catalysis. 

53. (a) Leucine is changed to alanine. 

(b) No change relative to control. 

(c) Tyrosine to alanine. 


(wild 


(d) Greatly reduced lipase binding compared to the control. 

(e) Lipase activity was diminished. 

(f) This suggests that they are separate. The L16A mutation allows 
binding of the lipase but apparently does not allow activation of 
the lipase. 

(g) This mutation decreased both binding and activity, sug- 
gesting that the tyrosine is involved in both aspects of colipase 
activity. 

54. (a) Soraphen A inhibits fatty acid synthesis in a dose-dependent 
manner. 

(b) Fatty acid oxidation is increased in the presence of soraphen A. 
(c) Recall that acetyl CoA carboxylase 2 synthesizes malonyl CoA to 
inhibit the transport of fatty acids into the mitochondria, thereby 
preventing fatty acid oxidation. Soraphen A apparently inhibits 
both forms of the carboxylase. 

(d) Phospholipid synthesis was inhibited in a dose-dependent 
manner. 

(e) Phospholipids are required for membrane synthesis. 

(f) Soraphen A inhibits cell proliferation, especially at higher 
concentrations. 


Chapter 23 

1. When the proteins are denatured, all of the peptide bonds are 
accessible to proteolytic enzymes. If the three-dimensional struc- 
ture of a protein is maintained, access to many peptide bonds is 
denied to the proteolytic enzymes. 

2. First, the ubiquitin-activating enzyme (E1) links ubiquitin to 
a sulfhydryl group on E1 itself. Next, the ubiquitin is transferred 
to a cysteine residue on the ubiquitin-conjugating enzyme (E2) by 
E2. The ubiquitin—protein ligase (E3), using the ubiquitinated E2 
as a substrate, transfers the ubiquitin to the target protein. 

3. (a) 7; (b) 4; (c) 2; (d) 10; (e) 5; (£) 3; (g) 9; (h) 1; @) 6; (9) 8. 

4. (a) The ATPase activity of the 265 proteasome resides in the 
19S subunit. The energy of ATP hydrolysis is used to unfold the 
substrate, which is too large to enter the catalytic barrel. ATP may 
also be required for translocation of the substrate into the barrel. 
(b) Substantiates the answer in part a. Because they are small, the 
peptides do not need to be unfolded. Moreover, small peptides 
could probably enter all at once and not require translocation. 

5. (a) Pyruvate; (b) oxaloacetate; (c) a-ketoglutarate; (d) a- 
ketoisocaproate; (e) phenylpyruvate; (f) hydroxyphenylpyruvate. 
6. (a) Aspartate + a-ketoglutarate + GTP + ATP + 2H,O 4 
NADH + H* > % glucose + glutamate + CO, + ADP + 
GDP + NAD* + 2P.. 

The required coenzymes are pyridoxal phosphate in the transami- 
nation reaction and NAD*/NADH in the redox reactions. 

(b) Aspartate + CO, + NH,* + 3 ATP + NAD* + 4H,O0—> 
oxaloacetate + urea + 2 ADP + 4P.+ AMP + NADH + H*. 
7. In the eukaryotic proteasome, the distinct B subunits have dif- 
ferent substrate specificities, allowing proteins to be more thor- 
oughly degraded. 

8. The six subunits probably exist as a heterohexamer. Cross- 
linking experiments could test the model and help determine 
which subunits are adjacent to one another. 

9. Thiamine pyrophosphate. 

10. Aminotransferases transfer the a-amino group to a-ketoglutarate 
to form glutamate. Glutamate is oxidatively deaminated to form 
an ammonium ion. 

11. Aspartate (oxaloacetate), glutamate (a-ketoglutarate), alanine 
(pyruvate). 

12. Serine and threonine. 

13. They are either fuels for the citric acid cycle, components of 
the citric acid cycle, or molecules that can be converted into a fuel 
for the citric acid cycle in one step. 


14. It acts as an electron sink. 

15. Carbamoyl phosphate and aspartate. 

16. (a) 4; (b) 5; (c) 1; (d) 6; (e) 7; £) 3; (g) 2. 

17. A, arginine; B, citrulline; C, ornithine; D, arginosuccinate. 

The order of appearance: C, B, D, A. 

18. CO, + NH,* + 3ATP + NAD* + aspartate + 3H,O—~> 
urea + 2 ADP + 2P. + AMP + PP; + NADH + H* + 

oxaloacetate. 


Four high-transfer-potential phosphoryl] groups are spent. Note, 
however, that an NADH is generated if fumarate is converted 
into oxaloacetate. NADH can generate 2.5 ATP in the electron- 
transport chain. Taking these ATP into account, only 1.5 high- 
transfer-potential phosphoryl groups are spent. 

19. The synthesis of fumarate by the urea cycle is important 
because it links the urea cycle and the citric acid cycle. Fumarate 
is hydrated to malate, which, in turn, is oxidized to oxaloacetate. 
Oxaloacetate has several possible fates: (1) transamination to 
aspartate, (2) conversion into glucose by the gluconeogenic path- 
way, (3) condensation with acetyl CoA to form citrate, or (4) con- 
version into pyruvate. You can collect. 

20. Ornithine transcarbamoylase (Compound A is analogous to 
PALA; Chapter 10). 

21. Ammonia could lead to the amination of a-ketoglutarate, pro- 
ducing a high concentration of glutamate in an unregulated fashion. 
a-Ketoglutarate for glutamate synthesis could be removed from the 
citric acid cycle, thereby diminishing the cell’s respiration capacity. 
22. The mass spectrometric analysis strongly suggests that three 
enzymes—pyruvate dehydrogenase, a-ketoglutarate dehydroge- 
nase, and the branched-chain a-ketoacid dehydrogenase—are defi- 
cient. Most likely, the common E, component of these enzymes 
is missing or defective. This proposal could be tested by purify- 
ing these three enzymes and assaying their ability to catalyze the 
regeneration of lipoamide. 

23. Benzoate, phenylacetate, and arginine would be given to 
supplement a protein-restricted diet. Nitrogen would emerge in 
hippurate, phenylacetylglutamine, and citrulline. 

24. The liver is the primary tissue for capturing nitrogen as urea. 
If the liver is damaged (for instance, by hepatitis or the excessive 
consumption of alcohol), free ammonia is released into the blood. 


33. 


2-O5P 
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25. This defect can be partly bypassed by providing a surplus of 
arginine in the diet and restricting the total protein intake. In the 
liver, arginine is split into urea and ornithine, which then reacts 
with carbamoyl phosphate to form citrulline. This urea-cycle 
intermediate condenses with aspartate to yield argininosuccinate, 
which is then excreted. Note that two nitrogen atoms—one from 
carbamoyl phosphate and the other from aspartate—are elimi- 
nated from the body per molecule of arginine provided in the 

diet. In essence, argininosuccinate substitutes for urea in carry- 
ing nitrogen out of the body. The formation of argininosuccinate 
removes the nitrogen, and the restriction on protein intake relieves 
the aciduria. 

26. Aspartame, a dipeptide ester (L-aspartyl- L-phenylalanine 
methy] ester), is hydrolyzed to L-aspartate and L-phenylalanine. 
High levels of phenylalanine are harmful in phenylketonurics. 

27. N-Acetylglutamate is synthesized from acetyl CoA and gluta- 
mate. Once again, acetyl CoA serves as an activated acetyl donor. 
This reaction is catalyzed by N-acetylglutamate synthase. 

28. Not all proteins are created equal: some are more important 
than others. Some proteins would be degraded to provide the miss- 
ing amino acid. The nitrogen from the other amino acids would be 
excreted as urea. Consequently, more nitrogen would be excreted 
than ingested. 

29. The carbon skeletons of ketogenic amino acids can be con- 
verted into ketone bodies or fatty acids. Only leucine and lysine are 
purely ketogenic. Glucogenic amino acids are those whose carbon 
skeletons can be converted into glucose. 

30. As shown in Figure 23.28, alanine, a gluconeogenic amino 
acid, is released during the metabolism of tryptophan to acetyl 
CoA and acetoacetyl CoA. 

31. The branched-chain amino acids leucine, isoleucine, and 
valine. The required enzyme is the branched-chain a-ketoacid 
dehydrogenase complex. 

32. Pyruvate (glycolysis and gluconeogenesis), acetyl CoA (citric 
acid cycle and fatty acid synthesis), acetoacetyl CoA (ketone-body 
formation), a-ketoglutarate (citric acid cycle), succinyl CoA (citric 
acid cycle), fumarate (citric acid cycle), and oxaloacetate (citric acid 
cycle and gluconeogenesis). 
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34. 


o7- LSerine 
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The equilibrium constant for the interconversion of L-serine and 
D-serine is exactly 1. 

35. Double-displacement. A substituted enzyme intermediate is 
formed. 

36. Exposure of such a domain suggests that a component of a 
multiprotein complex has failed to form properly or that one com- 
ponent has been synthesized in excess. This exposure leads to rapid 
degradation and the restoration of appropriate stoichiometries. 

37. (a) Depletion of glycogen stores. When they are gone, proteins 
must be degraded to meet the glucose needs of the brain. The 
resulting amino acids are deaminated, and the nitrogen atoms are 
excreted as urea. 

(b) The brain has adapted to the use of ketone bodies, which are 
derived from fatty acid catabolism. In other words, the brain is 
being powered by fatty acid breakdown. 

(c) When the glycogen and lipid stores are gone, the only available 
energy source is protein. 

38. The precise cause of all of the symptoms is not firmly estab- 
lished, but a likely explanation depends on the centrality of oxa- 
loacetate to metabolism. A lack of pyruvate carboxylase would 
reduce the amount of oxaloacetate. The lack of oxaloacetate would 
reduce the activity of the citric acid cycle and so ATP would be 
generated by lactic acid formation. If the concentration of oxaloac- 
etate is low, aspartate cannot be formed and the urea cycle would 
be compromised. Oxaloacetate is also required to form citrate, 
which transports acetyl CoA to the cytoplasm for fatty acid syn- 
thesis. Finally, oxaloacetate is required for gluconeogenesis. 

39. Deamination to a-keto-B-methylvalerate; oxidative decarbox- 
ylation to a-methylbutyryl CoA; oxidation to tiglyl CoA; hydra- 
tion, oxidation, and thiolysis yield acetyl CoA and propionyl CoA; 
propionyl CoA to succinyl CoA. 

40. Glycogen phosphorylase. The coenzyme serves as an acid— 
base catalyst. 

41. In the Cori cycle, the carbon atoms are transferred from 
muscle to liver as lactate. For lactate to be of any use, it must be 
reduced to pyruvate. This reduction requires high-energy electrons 
in the form of NADH. When the carbon atoms are transferred as 
alanine, transamination yields pyruvate directly. 


42. (a) Virtually no digestion in the absence of nucleotides. 

b) Protein digestion is greatly stimulated by the presence of ATP. 
c) AMP-PNP, a nonhydrolyzable analog of ATP, is no more 
effective than ADP. 

d) The proteasome requires neither ATP nor PAN to digest small 
substrates. 

e) PAN and ATP hydrolysis may be required to unfold the pep- 
tide and translocate it into the proteasome. 

f) Although Thermoplasma PAN is not as effective with the other 
proteasomes, it nonetheless results in threefold to fourfold stimula- 
tion of digestion. 

g) In light of the fact that the archaea and eukarya diverged several 
billion years ago, the fact that Thermoplasma PAN can stimulate 
rabbit muscle suggests homology not only between the protea- 
somes, but also between PAN and the 195 subunit (most likely the 
ATPases) of the mammalian 26S proteasome. 


Chapter 24 

1. Nitrogen fixation is the conversion of atmospheric N, into 
NH,*. Diazotrophic (nitrogen-fixing) microorganisms are able to 
fix nitrogen. 

2. (a) 4; (b) 8; (c) 10; (d) 6; (e) 7; (£) 9; (g) 3; (A) 5; @) 2; G) 1. 

3. The reductase provides electrons with high reducing power, 
whereas the nitrogenase, which requires ATP hydrolysis, uses the 
electrons to reduce N, to NH. 

4. False. Nitrogen is thermodynamically favored. ATP expendi- 
ture by the nitrogenase is required to make the reaction kinetically 
possible. 

5. The bacteria provide the plant with ammonia by reducing 
atmospheric nitrogen. This reduction is energetically expensive, 
and the bacteria use ATP from the plant. 

6. Oxaloacetate, pyruvate, ribose-5-phosphate, phospho- 
enolpyruvate, erythrose-4-phosphate, a-ketoglutarate, and 
3-phosphoglycerate. 

7. Human beings do not have the biochemical pathways 

to synthesize certain amino acids from simpler precursors. 
Consequently, these amino acids are “essential” and must be 
obtained from the diet. 


8. Glucose + 2 ADP + 2 P. + 2 NAD* + 2 glutamate —> 

2 alanine + 2 a-ketoglutrate + 2 ATP + 2 NADH + 

2H,O + 2H?*. 

9. N, —> NH,* — glutamate —> serine — glycine > 
6-aminolevulinate —> porphobilinogen —> heme. 

10. Pyridoxal phosphate (PLP). 

11. S-Adenosylmethionine, tetrahydrofolate, and 
methylcobalamin. 

12. (a) N°,N!9-Methylenetetrahydrofolate; 

(b) N>-methyltetrahydrofolate. 

13. -Glutamyl phosphate is a likely reaction intermediate. 
14. The synthesis of asparagine from aspartate passes through an 
acyl-adenylate intermediate. One of the products of the reaction 
will be '8O-labeled AMP. 

15. The administration of glycine leads to the formation 
of isovalerylglycine. This water-soluble conjugate, in 
contrast with isovaleric acid, is excreted very rapidly 
by the kidneys. 

16. The nitrogen atom shaded red is derived from 
glutamine. The carbon atom shaded blue is derived 
from serine. 

17. They carry out nitrogen fixation. The absence of 
photosystem II provides an environment in which O, 
is not produced. Recall that the nitrogenase is very 
rapidly inactivated by O,. 

18. The cytoplasm is a reducing environment, 
whereas the extracellular milieu is an oxidizing 
environment. 

19. (a) None; (b) D-glutamate and oxaloacetate. 

20. Succinyl CoA is formed in the mitochondrial 
matrix as part of the citric acid cycle. 

21. Alanine from pyruvate; aspartate from oxaloac- 
etate; glutamate from a-ketoglutarate. 

22. Lysine cyclodeaminase converts L-lysine into the 
six-membered ring analog of proline, also referred to 
as L-homoproline or L-pipecolate: 


Pipecolate 


23. Y could inhibit the C > D step, Z could inhibit the C > F 
step, and C could inhibit A > B. This scheme is an example of 
sequential feedback inhibition. Alternatively, Y could inhibit the 
C— D step, Z could inhibit the C — F step, and the A> B step 
would be inhibited only in the presence of both Y and Z. This 
scheme is called concerted feedback inhibition. 

24. The rate of the A > B step in the presence of high levels of Y 
and Z would be 24 s~! (0.6 X 0.4 X 100s7}). 

25. Lysine 258 is absolutely essential for the activity of aspartate 
aminotransferase, as it is responsible both for the formation of the 
internal aldimine with the pyridoxal phosphate cofactor and for 
transferring the proton between the ketimine and quinonoid inter- 
mediates. Mutation of this residue to cysteine would be expected 
to dramatically impair catalysis, as cysteine cannot occupy the 
same space as lysine and also exhibits differing pK, properties. 
Upon treatment with 2-bromoethylamine, however, the resulting 
thioether now has a shape and pK, similar to the original lysine 
side chain. Hence, some catalytic activity is restored. 

26. An external aldimine forms with SAM, which is deproton- 
ated to form the quinonoid intermediate. The deprotonated carbon 
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atom attacks the carbon atom adjacent to the sulfur atom to form 
the cyclopropane ring and release methylthioadenosine, the other 
product. 

27. An external aldimine forms with L-serine, which is depro- 
tonated to form the quinonoid intermediate. This intermediate is 
reprotonated on its opposite face to form an aldimine with D-ser- 
ine. This compound is cleaved to release D-serine. The equilibrium 
constant for a racemization reaction is 1 because the reactant and 
product are exact mirror images of each other. 

28. (a) In the first step, histidine attacks the methylene group from 
the methionine subgroup of SAM (rather than the usual methyl 
substituent), resulting in the transfer of an aminocarboxypropyl 
group. Three subsequent conventional SAM-mediated methy]- 
ations of the primary amine yield diphthine. 


adenine — 5‘thiomethyl 
adenosine 


5-Adenosyl 
sAM homocysteine 


x3 


Diphthine 


(b) In this chapter, we have observed two examples of an ATP- 
dependent conversion of a carboxylate into an amide: glutamine 
synthetase, which uses an acyl-phosphate intermediate, and 
asparagine synthetase, which uses an acyl-adenylate intermediate. 
Either mechanism is possible in formation of diphthamide from 
diphthine. 

29. Biotin. 

30. Synthesis from oxaloacetate and a-ketoglutarate would deplete 
the citric acid cycle, which would decrease ATP production. 
Anapleurotic reactions would be required to replenish the citric 
acid cycle. 

31. SAM is the donor for DNA methylation reactions that protect 
a host from digestion by its own restriction enzymes. A lack of 
SAM would render the bacterial DNA susceptible to digestion by 
the cell’s own restriction enzymes. 
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32. Acetate — acetyl-CoA — citrate — isocitrate > 
a-ketoglutarate — succinyl-CoA. 

33. The value of K,, of glutamate dehydrogenase for NH,* is 
high (>> 1 mM), and so this enzyme is not saturated when NH, * 
is limiting. In contrast, glutamine synthetase has very high affinity 
for NH,*. Thus, ATP hydrolysis is required to capture ammonia 
when it is scarce. 

34. (a) Asparagine is much more abundant in the dark. More 
glutamine is present in the light. These amino acids show the most 
dramatic effects. Glycine also is more abundant in the light. 

(b) Glutamine is a more metabolically reactive amino acid, used 

in the synthesis of many other compounds. Consequently, when 
energy is available as light, glutamine will be preferentially synthe- 
sized. Asparagine, which carries more nitrogen per carbon atom 
and is thus a more-efficient means of storing nitrogen when energy 
is short, is synthesized in the dark. Glycine is more prevalent in the 
light because of photorespiration. 

(c) White asparagus has an especially high concentration of aspara- 
gine, which accounts for its intense taste. All asparagus has a large 
amount of asparagine. In fact, as suggested by its name, asparagine 
was first isolated from asparagus. 


Chapter 25 

1. In de novo synthesis, the nucleotides are synthesized from 
simpler precursor compounds, in essence from scratch. In 
salvage pathways, preformed bases are recovered and attached 
to riboses. 

2. Carbon 2 and nitrogen 3 come from carbamoyl phosphate. 
Nitrogen 1 and carbons 4, 5, and 6 are derived from aspartate. 

3. Nitrogen 1: aspartate; carbon 2: N!9-formyltetrahydrofolate; 
nitrogen 3: glutamine; carbons 4 and 5 and nitrogen 7: glycine; 
carbon 6: CO,; carbon 8: N!°-formyltetrahydrofolate; nitrogen 9: 
glutamine. 

4, Energy currency: ATP; signal transduction: ATP and GTP; 
RNA synthesis: ATP, GTP, CTP, and UTP; DNA synthesis: 
dATP, dCTP, dGTP, and TTP; components of coenzymes: ATP 
in CoA, FAD, and NAD(P)‘; carbohydrate synthesis: UDP- 
glucose. These are just some of the uses. 

5. A nucleoside is a base attached to ribose. A nucleotide is a 
nucleoside with the ribose bearing one or more phosphates. 

6. (a) 9; (b) 7; (c) 6; (d) 10; (e) 2; (£) 4; (g) 1; (h) 11; (i) 8; (5) 3; (kK) 5. 
7. Substrate channeling is the process whereby the product of 

one active site moves to become a substrate at another active site 
without ever leaving the enzyme. A channel connects the active 
sites. Substrate channeling greatly enhances enzyme efficiency and 
minimizes the diffusion of a substrate to an active site. 

8. Glucose + 2 ATP + 2 NADP* + H,O —> PRPP + CO, + 
ADP + AMP + 2 NADPH + 3 H?. 

9. Glutamine + aspartate + CO, + 2 ATP + NAD* —> orotate + 
2 ADP + 2P; + glutamate + NADH + H*. 

10. (a, c, and d) PRPP; (b) carbamoyl phosphate. 

11. PRPP and formylglycinamide ribonucleotide 

12. dUMP + serine + NADPH +H*t —> TMP + 

NADP? + glycine. 

13. There is a deficiency of N!0-formyltetrahydrofolate. 
Sulfanilamide inhibits the synthesis of folate by acting as an analog 
of p-aminobenzoate, one of the precursors of folate. 

14. (a) Cell A cannot grow ina HAT medium, because it cannot 
synthesize TMP either from thymidine or from dUMP. Cell B 
cannot grow in this medium, because it cannot synthesize purines 
by either the de novo pathway or the salvage pathway. Cell C 

can grow ina HAT medium because it contains active thymidine 


kinase from cell B (enabling it to phosphorylate thymidine to 
TMP) and hypoxanthine guanine phosphoribosyltransferase 

from cell A (enabling it to synthesize purines from hypoxanthine 
by the salvage pathway). 

(b) Transform cell A with a plasmid containing foreign genes of 
interest and a functional thymidine kinase gene. The only cells that 
will grow ina HAT medium are those that have acquired a thy- 
midylate kinase gene; nearly all of these transformed cells will also 
contain the other genes on the plasmid. 

15. The folate derivative N°, N!°-methylenetetrahydrofolate is 
required by thymidylate synthase to add a methyl group to dUMP, 
forming TMP. Insufficient folate could result in spina bifida. 

16. The reciprocal substrate relation refers to the fact that AMP 
synthesis requires GTP, whereas GMP synthesis requires 
ATP. These requirements tend to balance the synthesis of 
ATP and GTP. 

17. Ring carbon 6 in cytosine will be labeled. In guanine, only 
carbon 5 will be labeled with °C. 

18. UTP is first converted into UDP. Ribonucleotide reductase 
generates (UDP. DeoxyUDP is converted to (UMP. Thymidylate 
synthase generates TMP from dUMP. Monophosphate and 
diphosphate kinases subsequently form TTP. 

19. These patients have a high level of urate because of the break- 
down of nucleic acids. Allopurinol prevents the formation of 
kidney stones and blocks other deleterious consequences of hyper- 
uricemia by preventing the formation of urate. 

20. The free energies of binding are —57.7 (wild type), —49.8 
(Asn 27), and —38.1 (Ser 27) kJ mol! (—13.8, —11.9, and 

—9.1 kcal mol™!, respectively). The loss in binding energy is 

+7.9 kJ mol~! (+1.9 kcal mol~!) and +19.7 kJ mol7! 

(+4.7 kcal mol7!). 

21. By their nature, cancer cells divide rapidly and thus require 
frequent DNA synthesis. Inhibitors of TMP synthesis will impair 
DNA synthesis and cancer growth. 

22. Uridine and cytidine are administered to by-pass the defective 
enzyme in orotic aciduria. 

23. Inosine or hypoxanthine could be administered. 

24. N-1 in both cases, and the amine group linked to C-6 in ATP. 
25. Nitrogen atoms 3 and 9 in the purine ring 

26. Allopurinol, an analog of hypoxanthine, is a suicide inhibitor 
of xanthine oxidase. 

27. An oxygen atom is added to allopurinol to form alloxanthine. 


Alloxanthine 


28. 
The synthesis of carbamoyl phosphate requires 2 ATP 2 ATP 
The formation of PRPP from ribose 5-phosphate 


yields an AMP* 2 ATP 
The conversion of UMP to UTP requires 2 ATP 2 ATP 
The conversion of UTP to CTP requires 1 ATP fl AEP 


Total 7 ATP 


*Remember that AMP is the equivalent of 2 ATP because an ATP 
must be expended to generate ADP, the substrate for ATP synthesis. 


29. (a) Carboxyaminoimidazole ribonucleotide; (b) glycinamide 
ribonucleotide; (c) phosphoribosyl amine; (d) formylglycinamide 
ribonucleotide. 

30. The first reaction proceeds by phosphorylation of glycine to 
form an acyl phosphate followed by nucleophilic attack by the amine 
of phosphoribosylamine to displace orthophosphate. The second 
reaction consists of adenylation of the carbonyl group of xanthylate 
followed by nucleophilic attack by ammonia to displace AMP. 

31. The —NH, group attacks the carbonyl carbon atom to form a 
tetrahedral intermediate. Removal of a proton leads to the elimina- 
tion of water to form inosinate. 

32. The enzyme that uses ammonia, carbamoyl phosphate 
synthetase I, forms carbamoyl phosphate for a reaction with 
ornithine, the first step of the urea cycle. The enzyme that 

uses glutamine, carbamoyl phosphate synthetase II, generates 
carbamoyl phosphate for use in the first step of pyrimidine 
biosynthesis. 

33. PRPP is the activated intermediate in the synthesis of phos- 
phoribosylamine in the de novo pathway of purine formation; 

of purine nucleotides from free bases by the salvage pathway; of 
orotidylate in the formation of pyrimidines; of nicotinate ribo- 
nucleotide; of phosphoribosyl ATP in the pathway leading to his- 
tidine; and of phosphoribosylanthranilate in the pathway leading 
to tryptophan. 

34. (a) cAMP; (b) ATP; (c) UDP-glucose; (d) acetyl CoA; 

(e) NADt, FAD; (f) dideoxynucleotides; (g) fluorouracil; 

(h) CTP inhibits ATCase. 

35. In vitamin B,, deficiency, methyltetrahydrofolate cannot 
donate its methyl group to homocysteine to regenerate methionine. 
Because the synthesis of methyltetrahydrofolate is irreversible, the 
cell’s tetrahydrofolate will ultimately be converted into this form. 
No formyl] or methylene tetrahydrofolate will be left for nucleotide 
synthesis. Vitamin B,, is also required to metabolize propionyl 
CoA generated in the oxidation of odd-chain fatty acids and in the 
degradation of methionine. 

36. Because folate is required for nucleotide synthesis, cells that 
are dividing rapidly would be most readily affected. They would 
include cells of the intestine, which are constantly replaced, and 
precursors to blood cells. A lack of intestinal cells and blood cells 
would account for the symptoms often observed. 

37. In patients with glucose 6-phosphatase deficiency, the cyto- 
plasmic level of ATP in the liver falls as a result of increased gly- 
cogenolysis. In all three conditions, AMP rises above normal, and 
the excess AMP is degraded to urate. 

38. Succinate — malate — oxaloacetate by the citric acid cycle. 
Oxaloacetate — aspartate by transamination, followed by pyrimi- 
dine synthesis. Carbons 4, 5, and 6 are labeled. 

39. Glucose will most likely be converted into two molecules of 
pyruvate, one of which will be labeled in the 2 position: 


Now consider two common fates of pyruvate—conversion into 
acetyl CoA and subsequent processing by the citric acid cycle 

or carboxylation by pyruvate carboxylase to form oxaloacetate. 
Formation of citrate by condensing the labeled pyruvate with oxa- 
loacetate will yield labeled citrate: 
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Q~ 0 
4 


CH, 
—00C—C—OH 


CH, 
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oe 
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The labeled carbon will be retained through one round of the citric 
acid cycle but, on the formation of the symmetric succinate, the 
label will appear in two different positions. Thus, when succinate 
is metabolized to oxaloacetate, which may be aminated to form 
aspartate, two carbons will be labeled: 


When this aspartate is used to form uracil, the labeled COO 
attached to the a-carbon is lost and the other COO™ becomes 
incorporated into uracil as carbon 4. 

Suppose, instead, that labeled 2-['*C]pyruvate is carboxylated 
to form oxaloacetate and processed to form aspartate. In this case, 
the a-carbon of aspartate bears the label. 


When this aspartate is used to synthesize uracil, carbon 6 bears 
the label: 


Q 0 
Nee 


Ce 
ar No 


40. HGPRT, which is nonfunctional in Lesch-Nyhan patients, 

is required to form 6-mercaptopurine ribose monophosphate. 
Consequently, de novo purine synthesis continues. 

41. (a) Some ATP can be salvaged from the ADP that is being 
generated. (b) There are equal numbers of high-phosphoryl-trans- 
fer-potential groups on each side of the equation. (c) Because the 
adenylate kinase reaction is at equilibrium, the removal of AMP 
would lead to the formation of more ATP. (d) Essentially, the cycle 
serves as an anaplerotic reaction for the generation of the citric acid 
cycle intermediate fumarate. 

42. (i) The formation of 5-aminoididazole-4-carboxamide ribo- 
nucleotide from 5-aminoimidazole-4-(N-succinylcarboxamide) 
ribonucleotide in the synthesis of IMP. (ii) The formation of 

AMP from adenylosuccinate. (ii) The formation of arginine from 
argininosuccinate in the urea cycle. 

43. Allopurinol is an inhibitor of xanthine oxidase, which is on the 
pathway for urate synthesis. In your pet duck, this pathway is the 
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means by which excess nitrogen is excreted. If xanthine oxidase 
were inhibited in your duck, nitrogen could not be excreted, with 
severe consequences such as the formation of a dead duck. 

44. (a) The enzyme from the LND 2 patient showed much less 
activity than the normal enzyme, suggesting that the defect in the 
enzyme impaired its catalytic ability. The results for LND 1 are 
puzzling. The enzyme displays activity similar to the enzyme from 
the normal cell line, and yet the patient was suffering from Lesch— 
Nyhan disease. 

(b) Possible explanations include: there may be an inhibitor in the 
cells that prevents the enzyme from acting in vivo but is lost dur- 
ing the purification procedure; the enzyme may be degraded more 
rapidly in vivo than the normal enzyme; the enzyme may be inher- 
ently less stable than the normal enzyme. 

(c) The enzyme from LND 1 lost activity much faster than the 
normal enzyme, suggesting that the enzyme was structurally 
unstable and would lose enzyme activity in the cell, accounting for 
the appearance of the disease. 


Chapter 26 

1. Glycerol 3-phosphate is the foundation for both triacylglycerol 
and phospholipid synthesis. Glycerol 3-phosphate is acylated 
twice to form phosphatidate. In triacylglycerol synthesis, the 
phosphoryl group is removed from glycerol 3-phosphate to form 
diacylglycerol, which is then acylated to form triacylglycerol. In 
phospholipid synthesis, phosphatidate commonly reacts with CTP 
to form CDP-diacylglycerol, which then reacts with an alcohol to 
form a phospholipid. Alternatively, diacylglycerol may react with a 
CDP-alcohol to form a phospholipid. 

2. Glycerol 3-phosphate is formed primarily by the reduction of 
dihydroxyacetone phosphate, a gluconeogenic intermediate, and to 
a lesser extent by the phosphorylation of glycerol. 

3. Glycerol + 4 ATP + 3 fatty acids + 4 H,O—> 

triacylglycerol + ADP + 3 AMP + 7 P. + 4H. 

4. Glycerol + 3 ATP + 2 fatty acids + 2H,O + CTP 4 
ethanolamine —> phosphatidylethanolamine + CMP + ADP + 
2 AMP + 6P.+3H?. 

5. Three. One molecule of ATP to form phosphorylethanolamine 
and two molecules of ATP to regenerate CTP from CMP. 

6. All are synthesized from ceramide. In sphingomyelin, the 
terminal hydroxyl group of ceramide is modified with phosphor- 
ylcholine. In a cerebroside, the hydroxyl group has a glucose or 
galactose attached. In a ganglioside, oligosaccharide chains are 
attached to the hydroxyl group. 

7. (1) Activate the diacylglycerol as CDP-DAG. (ii) Activate the 
alcohol as CDP-alcohol. (iii) Use the base-exchange reaction. 

8. (a) CDP-diacylglycerol; (b) CDP-ethanolamine; (c) acyl CoA; 
(d) phosphatidylcholine; (e) UDP-glucose or UDP-galactose; 

(f) UDP-galactose; (g) geranyl pyrophosphate. 

9. Such mutations are seen in mice. The amount of adipose tis- 
sue would decrease severely because diacylglycerol could not be 
formed. Normally, diacylglycerol is acylated to form triacylglycer- 
ols. If there were deficient phosphatidic acid phosphatase activity, 
no triacylglycerols would form. 

10. (a) 8; (b) 4; (c) 1; (d) 9; (e) 3; (f) 10; (g) 5; (h) 2; (a) 6; Gj) 7. 

11. (1) The synthesis of activated isoprene units (isopentyl pyro- 
phosphate), (ii) the condensation of six of the activated isoprene 
units to form squalene, and (iii) cyclization of the squalene to form 
cholesterol. 

12. The amount of reductase and its activity control the regulation 
of cholesterol biosynthesis. Transcriptional control is mediated 

by SREBP. Translation of the reductase mRNA also is controlled. 
The reductase itself may undergo regulated proteolysis. Finally, 


the activity of the reductase is inhibited by phosphorylation by 
AMP kinase when ATP levels are low. 

13. (aand b) None, because the label is lost as CO). 

14. The hallmark of this genetic disease is elevated cholesterol 
levels in the blood of even young children. The excess cholesterol 
is taken up by marcrophages, which eventually results in the for- 
mation of plaques and heart disease. There are many mutations 
that cause the disease, but all result in malfunctioning of the LDL 
receptor. 

15. The categories of mutations are: (i) no receptor is synthesized; 
(ii) receptors are synthesized but do not reach the plasma mem- 
brane, because they lack signals for intracellular transport or do 
not fold properly; (111) receptors reach the cell surface, but they fail 
to bind LDL normally because of a defect in the LDL-binding 
domain; (iv) receptors reach the cell surface and bind LDL, but 
they fail to cluster in coated pits because of a defect in their car- 
boxyl-terminal regions. 

16. “None of your business” and “T don’t talk biochemistry 

until after breakfast” are appropriate but rude and uninformative 
answers. A better answer might be: “Although it is true that cho- 
lesterol is a precursor to steroid hormones, the rest of the statement 
is oversimplified. Cholesterol is a component of membranes, and 
membranes literally define cells, and cells make up tissues. But to 
say that cholesterol ‘makes’ cells and tissues is wrong.” 

17. Statins are competitive inhibitors of HMG-CoA reductase. 
They are used as drugs to inhibit cholesterol synthesis in patients 
with high levels of cholesterol. 

18. No. Cholesterol is essential for membrane function and as a 
precursor for bile salts and steroid hormones. The complete lack of 
cholesterol would be lethal. 

19. Deamination of cytidine to uridine changes CAA (Gln) into 
UAA (stop). 

20. The LDL contains apolipoprotein B-100, which binds to an 
LDL receptor on the cell surface in a region known as a coated pit. 
On binding, the complex is internalized by endocytosis to form 
an internal vesicle. The vesicle is separated into two components. 
One, with the receptor, is transported back to the cell surface and 
fuses with the membrane, allowing continued use of the receptor. 
The other vesicle fuses with lysosomes inside the cell. The choles- 
teryl esters are hydrolyzed, and free cholesterol is made available 
for cellular use. The LDL protein is hydrolyzed to free amino 
acids. 

21. Benign prostatic hypertrophy can be treated by inhibiting 
5a-reductase. Finasteride, the 4-azasteroid analog of dihydrotes- 
tosterone, competitively inhibits the reductase but does not act on 
androgen receptors. Patients taking finasteride have a markedly 
lower plasma level of dihydrotestosterone and a nearly normal 
level of testosterone. The prostate gland becomes smaller, whereas 
testosterone-dependent processes such as fertility, libido, and 
muscle strength appear to be unaffected. 


CONHC(CH3)3 
is 


CH 


Finasteride 


22. Patients who are most sensitive to debrisoquine have a defi- 
ciency of a liver P450 enzyme encoded by a member of the CYP2 
subfamily. This characteristic is inherited as an autosomal recessive 


trait. The capacity to degrade other drugs may be impaired in peo- 
ple who hydroxylate debrisoquine at a slow rate, because a single 
P450 enzyme usually handles a broad range of substrates. 

23. Many hydrophobic odorants are deactivated by hydroxylation. 
Molecular oxygen is activated by a cytochrome P450 monooxygen- 
ase. NADPH serves as the reductant. One oxygen atom of O, goes 
into the odorant substrate, whereas the other is reduced to water. 
24. Recall that dihydrotestosterone is crucial for the development 
of male characteristics in the embryo. If a pregnant woman were 

to be exposed to Propecia, the 5a-reductase of the male embryo 
would be inhibited, which could result in severe developmental 
abnormalities. 

25. The oxygenation reactions catalyzed by the cytochrome P450 
family permit greater flexibility in biosynthesis. Because plants are 
not mobile, they must rely on physical defenses, such as thorns, 
and chemical defenses, such as toxic alkaloids. The larger P450 
array might permit greater biosynthetic versatility. 

26. This knowledge might enable clinicians to characterize the 
likelihood of a patient’s having an adverse drug reaction or being 
susceptible to chemical-induced illnesses. It might also permit a 
personalized and especially effective drug-treatment regime for 
diseases such as cancer. 

27. The honeybees may be especially sensitive to environmental 
toxins, including pesticides, because these chemicals are not read- 
ily detoxified, owing to the minimal P450 system. 

28. A deficiency of the hydroxylase will cause a build-up of 
progesterone, which will then be converted into estradiol and 
testosterone. 

29. The core structure of a steroid is four fused rings: three cyclo- 
hexane rings and one cyclopentane ring. In vitamin D, the B ring is 
split by ultraviolet light. 

30. The negatively charged phosphoserine residue interacts with 
the positively charged protonated histidine residue and decreases 
its ability to transfer a proton to the thiolate. 


CoA—-S~ 


His 


31. The methyl group is first hydroxylated. The hydroxymethyl- 
amine eliminates formaldehyde to form methylamine. 

32. Note that a cytidine nucleotide plays the same role in the syn- 
thesis of these phosphoglycerides as a uridine nucleotide does in 
the formation of glycogen (Section 21.4). In all of these biosynthe- 
ses, an activated intermediate (UDP-glucose, CDP-diacylglycerol, 
or CDP-alcohol) is formed from a phosphorylated substrate 
(glucose 1-phosphate, phosphatidate, or a phosphorylalcohol) and 
a nucleoside triphosphate (UTP or CTP). The activated interme- 
diate then reacts with a hydroxyl group (the terminus of glycogen, 
the side chain of serine, or a diacylglycerol). 

33. The attachment of isoprenoid side chains confers hydropho- 
bic character. Proteins having such a modification are targeted to 
membranes. 

34. 3-Hydroxy-3-methylglutaryl CoA is also a precursor for 
ketone-body synthesis. If fuel is needed elsewhere in the body, as 
might be the case during a fast, 3-hydroxy-3-methylglutaryl CoA 
is converted into the ketone body acetoacetate. If energy needs are 
met, the liver will synthesize cholesterol. Remember also that cho- 
lesterol synthesis occurs in the cytoplasm while ketone bodies are 
synthesized in the mitochondria. 
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35. One way in which phosphatidylcholine can be synthesized 

is by the addition of three methyl groups to phosphatidyletha- 
nolamine. The methyl donor is a modified form of methionine, 
S-adenosylmethionine or SAM (Section 24.2). 

36. Mutations could occur in the gene encoding the sodium chan- 
nel that would prevent the action of DDT. Alternatively, P450 
enzyme synthesis could be increased to accelerate metabolism 

of the insecticide to inactive metabolites. In fact, both types of 
responses to insecticides have been observed. 

37. Citrate is transported out of the mitochondria in times of 
plenty. ATP-citrate lyase yields acetyl CoA and oxaloacetate. The 
acetyl CoA can then be used to synthesize cholesterol. 

38. (a) There is no effect. (b) Because actin is not controlled by 
cholesterol, the amount isolated should be the same in both experi- 
mental groups; a difference would suggest a problem in the RNA 
isolation. (c) The presence of cholesterol in the diet dramatically 
reduces the amount of HMG-CoA reductase protein. (d) A com- 
mon means of regulating the amount of a protein present is to 
regulate transcription, which is clearly not the case here. (e) The 
translation of mRNA could be inhibited, and the protein could be 
rapidly degraded. 

39. (a) The activity of the enzyme from the GD patient is the same 
as that from the control. This suggests that GD is not caused by a 
mutation that impairs enzyme activity. 

(b) There is much less GCase in the GD sample. Because the same 
number of cells were used in the two samples (confirmed by the 
identical levels of actin in both samples), we know that the differ- 
ence in amount of GCase is not due to different quantities of cells. 
Two possible explanations are that transcription is impaired in the 
GD sample or that the enzyme in GD is more readily destroyed 
than the control. 

(c) Apparently, the enzyme is being synthesized but then degraded. 
(d) The defect in the enzyme from the GD patient seems to result 
in more rapid degradation of the enzyme by the proteasome. The 
fact that the amount of GCase present in the control increased 
upon proteasomal inhibition suggests that GCase may undergo 
rapid turnover under normal conditions. 


Chapter 27 
1. Over the 40 years under consideration, our test subject will have 
consumed 


40 years X 365 days year! X 8400 kJ (2000 kcal) day! 
= 1.2 X 10°kJ (2.9 X 10’ kcals) in 40 years 


Thus, over the 40-year span, our subject has ingested 


1.2 X 108(2.9 X 10’ kcal)/38 kJ (9 kcal g') 
= 3.2 X 10°g = 3200 kg of food 


which is equivalent to more than 6 tons of food! 
2. 55 pounds = 25kg = 25,000 g = total weight gain 


40 years X 365 days year | = 14,600 days 
25,000 g/14,600 days = 1.7 g day! 


which is equivalent to an extra pat of butter per day. Her BMI is 
26.5, and she would be considered overweight but not obese. 

3. Adipose tissue is now known to be an active endocrine organ, 
secreting signal molecules called adipokines. 

4. Caloric homeostasis is the condition in which the energy expen- 
diture of an organism is equal to the energy intake. 

5. Leptin and insulin. 

6. CCK produces a feeling of satiety and stimulates the secretion 
of digestive enzymes by the pancreas and the secretion of bile salts 
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by the gall bladder. GLP-1 also produces a feeling of satiety; in 
addition, it potentiates the glucose-induced secretion of insulin by 
the 8 cells of the pancreas. 

7. Obviously, something is amiss. Although the answer is not 
known, the leptin-signaling pathway appears to be inhibited by 
suppressors of cytokine signaling, the regulatory proteins. 

8. 1: a, b; 2: f; 3:c, d, f; 4: c, d; 5: c; 6: f; 7: e; 8:2; 9: e. 

9. Phosphorylation of dietary glucose after it enters the liver; 
gluconeogenesis; glycogen breakdown. 

10. Type 1 diabetes is due to autoimmune destruction of the 
insulin-producing cells of the pancreas. Type 1 diabetes is also 
called insulin-dependent diabetes because affected people require 
insulin to survive. Type 2 diabetes is characterized by insulin 
resistance. Insulin is produced, but the tissues that should respond 
to insulin, such as muscle, do not. 

11. Leptin stimulates processes impaired in diabetes. For instance, 
leptin stimulates fatty acid oxidation, inhibits triacylglycerol synthe- 
sis, and increases the sensitivity of muscle and the liver to insulin. 
12. (a) A watt is equal to 1 joule (J) per second (0.239 calorie per 
second). Hence, 70 W is equivalent to 0.07 kJ s~! (0.017 kcal s~!). 
(b) A watt is a current of 1 ampere (A) across a potential of 1 volt 
(V). For simplicity, let us assume that all the electron flow is from 
NADH to O, (a potential drop of 1.14 V). Hence, the current is 
61.4 A, which corresponds to 3.86 X 107° electrons per second 

(1 A= 1 coulomb s~! = 6.28 x 10!8 charge s~!). 

(c) About 2.5 molecules of ATP are formed per molecule of 
NADH oxidized (two electrons). Hence, 1 molecule of ATP is 
formed per 0.8 electron transferred. A flow of 3.86 X 107° elec- 
trons per second therefore leads to the generation of 4.83 X 107° 
molecules of ATP per second, or 0.80 mmol s~!. 

(d) The molecular weight of ATP is 507. The total body content of 
ATP of 50 g is equal to 0.099 mol. Hence, ATP turns over about 
once in 125 seconds when the body is at rest. 

13. (a) The stoichiometry of the complete oxidation of glucose is 


Ce6H120¢6 + 6 O2 —> 6 CO, + 6H2O 
and that of tripalmitoylglycerol is 
Cs5,H9gO2 + 72.5 O2 —>51 COs + 49 H2O 


Hence, the RQ values are 1.0 and 0.703, respectively. 

(b) An RQ value reveals the relative use of carbohydrates and fats 
as fuels. The RQ of a marathon runner typically decreases from 
0.97 to 0.77 in the course of a race. The lowering of the RQ indi- 
cates the shift in fuel from carbohydrates to fat. 

14. One gram of glucose (molecular weight 180.2) is equal to 
5.55 mmol, and one gram of tripalmitoylglycerol (molecular 
weight 807.3) is equal to 1.24 mmol. The reaction stoichiometries 
(Problem 13) indicate that 6 mol of HO is produced per mole of 
glucose oxidized, and 49 mol of H,O is produced per mole of tri- 
palmitoylglycerol oxidized. Hence, the H,O yields per gram of fuel 
are 33.3 mmol (0.6 g) for glucose and 60.8 mmol (1.09 g) for tri- 
palmitoylglycerol. Thus, complete oxidation of this fat gives 1.82 
times as much water as does glucose. Another advantage of triacyl- 
glycerols is that they can be stored in essentially anhydrous form, 
whereas glucose is stored as glycogen, a highly hydrated polymer. 
A hump consisting mainly of glycogen would be an intolerable 
burden—far more than the straw that broke the camel’s back. 

15. The starved—fed cycle is the nightly hormonal cycle that 
humans experience during sleep and on eating. The cycle main- 
tains adequate amounts of blood glucose. The starved part— 
sleep—is characterized by increased glucagon secretion and 
decreased insulin secretion. After a meal, glucagon concentration 
falls and insulin concentration rises. 


16. Ethanol is oxidized to yield acetaldehyde by alcohol dehy- 
drogenase, which is subsequently oxidized to acetate and acetal- 
dehyde. Ethanol is also metabolized to acetaldehyde by the P450 
enzymes, with the subsequent depletion of NADPH. 

17. First, fatty liver develops owing to the increased amounts of 
NADH that inhibit fatty acid oxidation and stimulate fatty acid 
synthesis. Second, alcoholic hepatitis begins owing to oxidative 
damage and damage due to excess acetaldehyde that results in 

cell death. Finally, fibrous tissues form, creating scars that impair 
blood flow and biochemical function. Ammonia cannot be con- 
verted into urea, and its toxicity leads to coma and death. 

18. A typical macadamia nut has a mass of about 2 g. Because 

it consists mainly of fats (~37 kJ g~!, ~9 kcal g~!), a nut has a 
value of about 75 kJ (18 kcal). The ingestion of 10 nuts results in 
an intake of about 753 kJ (180 kcal). As stated in the answer to 
Problem 12, a power consumption of 1 W corresponds to 1 J s~! 
(0.239 cal s~!), and so 400-W running requires 0.4 kJ s~! (0.0956 
kcal s~!). Hence, a person would have to run 1882 s, or about 31 
minutes, to spend the calories provided by 10 nuts. 

19. A high blood-glucose level triggers the secretion of insulin, 
which stimulates the synthesis of glycogen and triacylglycerols. A 
high insulin level would impede the mobilization of fuel reserves 
during the marathon. 

20. A lack of adipose tissue leads to an accumulation of fats in the 
muscle, with the generation of insulin resistance. The experiment 
shows that adipokines secreted by the adipose tissue, here leptin, 
facilitate in some fashion the action of insulin in muscle. 

21. Such a mutation would increase the phosphorylation of the 
insulin receptor and IRS in muscle and would improve insulin sen- 
sitivity. Indeed, PTP1B is an attractive therapeutic target for type 
2 diabetes. 

22. Lipid mobilization can be so rapid that it exceeds the ability of 
the liver to oxidize the lipids or convert them into ketone bodies. 
The excess is reesterified and released into the blood as VLDLs. 
23. A role of the liver is to provide glucose for other tissues. In the 
liver, glycolysis is used not for energy production but for biosyn- 
thetic purposes. Consequently, in the presence of glucagon, liver 
glycolysis stops so that the glucose can be released into the blood. 
24. The urea cycle and gluconeogenesis. 

25. (a) Insulin inhibits lipid utilization. 

(b) Insulin stimulates protein synthesis, but there are no amino 
acids in the children’s diet. Moreover, insulin inhibits protein 
breakdown. Consequently, muscle proteins cannot be broken down 
and used for the synthesis of essential proteins. 

(c) Because proteins cannot be synthesized, blood osmolarity is too 
low. Consequently, fluid leaves the blood. An especially important 
protein for maintaining blood osmolarity is albumin. 

26. During strenuous exercise, muscle converts glucose into pyru- 
vate through glycolysis. Some of the pyruvate is processed by cel- 
lular respiration. However, some of it is converted into lactate and 
released into the blood. The liver takes up the lactate and converts 
it into glucose through gluconeogenesis. Muscle may process the 
carbon skeletons of branched-chain amino acids aerobically. The 
nitrogens of these amino acids are transferred to pyruvate to form 
alanine, which is released into the blood and taken up by the liver. 
After the transamination of the amino group to a-ketoglutarate, 
the resulting pyruvate is converted into glucose. Finally, muscle 
glycogen may be mobilized, and the released glucose can be used 
by muscle. 

27. This conversion allows muscle to function anaerobically. 
NAD* is regenerated when pyruvate is reduced to lactate, and so 
energy can continue to be extracted from glucose during strenuous 
exercise. The liver converts the lactate into glucose. 


28. Fatty acids and glucose, respectively. 

29. This practice is called carbo-loading. Depleting the glycogen 
stores will initially cause the muscles to synthesis a large amount of 
glycogen when dietary carbohydrates are provided and will lead to 
the supercompensation of glycogen stores. 

30. The oxygen consumption at the end of exercise is used to 
replenish ATP and creatine phosphate and to oxidize any lactate 
produced. 

31. Oxygen is used in oxidative phosphorylation to resynthesize 
ATP and creatine phosphate. The liver converts lactate released 
by the muscle into glucose. Blood must be circulated to return the 
body temperature to normal, and so the heart cannot return to 

its resting rate immediately. Hemoglobin must be reoxygenated 

to replace the oxygen used in exercise. The muscles that power 
breathing must continue working at the same time as the exercised 
muscles are returning to resting states. In essence, all the biochem- 
ical systems activated in intense exercise need increased oxygen to 
return to the resting state. 

32. Ethanol may replace water that is hydrogen bonded to proteins 
and membrane surfaces. This alteration of the hydration state 

of the protein would alter its conformation and hence function. 
Ethanol may also alter phospholipid packing in membranes. The 
two effects suggest that integral membrane proteins would be most 
sensitive to ethanol, as indeed seems to be the case. 

33. Cells from the type I fiber would be rich in mitochondria, 
whereas those of the type II fiber would have few mitochondria. 
34. (a) The ATP expended during this race amounts to about 
8380 kg, or 18,400 pounds. (b) The cyclist would need about 
$1,260,000,000 to complete the race. 

35. Exercise greatly enhances the ATP needs of muscle cells. 

To more efficiently meet these needs, more mitochondria are 
synthesized. 

36. AMPK activity increases as ATP is used for muscle contrac- 
tion. AMPK inactivates acetyl CoA carboxylase. Recall that malo- 
nyl CoA, the product of acetyl CoA carboxylase, inhibits transport 
of fatty acids into the mitochondria. The decrease in muscle malo- 
nyl CoA allows fatty acid oxidation in muscle. 

37. After the first several days of starvation, most tissues are using 
fatty acids. Ketone bodies provide much of the brain’s energy, 
decreasing glucose requirement. The low insulin/high glucagon 
ration stimulates lipolysis and gluconeogenesis. However, gluco- 
neogenesis will not occur because the increase in fatty acid oxida- 
tion increases acetyl CoA and NADH. The high concentration of 
NADH inhibits the TCA cycle as well as gluconeogenesis. The 
acetyl CoA forms ketone bodies. 

38. (a) The increase in all of the ratios is due to the NADH glut 
caused by the metabolism of alcohol. 

(b) The increased amounts of lactate and D-3-hydroxybutyrate are 
released into the blood, accounting for the acidosis. 

(c) Drinking on an empty stomach suggests that glycogen stores 
are low. Because of the NADH glut, gluconeogenesis cannot occur. 
Consequently, hypoglycemia results. A well-fed drinker will have 
glucose in the blood from the meal and thus will not experience 
hypoglycemia. 

39. If glucose is always provided, even in small amounts, the 
brain will use glucose as a fuel rather than adapting to ketone body 
use. During the fast, muscle protein will be broken down to meet 
the brains glucose needs. This protein degradation will lead to 
organ failure sooner than if the brain had adapted to ketone body 
utilization. 

40. The inability of muscle mitochondria to process all of the 

fatty acids produced by overnutrition leads to excessive levels 

of diacylglycerol and ceramide in the muscle cytoplasm. These 
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second-messenger molecules activate enzymes that impair insulin 
signaling. 

41. Both are due to a lack of thiamine (vitamin B, ). Thiamine, 
which is sometimes called aneurin, is required most notably for the 
proper functioning of pyruvate dehydrogenase. 

42. (a) Red blood cells always produce lactate, and fast-twitch or 
type II, muscle fibers (Problem 33) also produce a large amount of 
lactate. 

(b) At that point, the athlete is beginning to move into anaerobic 
exercise, in which most energy is produced by anaerobic glycolysis. 
(c) The lactate threshold is essentially the point at which the ath- 
lete switches from aerobic exercise, which can be done for extended 
periods, to anaerobic exercise, essentially sprinting, which can be 
done for only short periods. The idea is to race at the extreme of his 
or her aerobic capacity until the finish line is in sight and then to 
switch to anaerobic. 

(d) Training increases the amount of blood vessels and the num- 
ber of muscle mitochondria. Together, they increase the ability to 
process glucose aerobically. Consequently, a greater effort can be 
expended before the switch to anaerobic energy production. 

43. Consider the graph of lactate production as a function of effort 
shown in Problem 42. For an athlete racing at her lactate threshold 
or just below it, the RQ value will be 1. With the finish line in 
sight, our runner ups her pace so that she is now also processing 
glucose to lactic acid in addition to processing glucose aerobically; 
that is, she begins running above her lactate threshold. The lactic 
acid released into the blood will ionize 


CH3CH(OH)COOH == CH3;CH(OH)COO™ 


Lactic acid 


The increase in H* will alter the blood buffer system, leading to 
the formation of carbonic acid 


Ht + HCO; = — H2CO3 
Carbonic acid 


The carbonic acid will dissociate into water and carbon dioxide 
H2CO;3 =— CO, + H,0 


This carbon dioxide will be superimposed on the carbon dioxide 
generated by the combustion of glucose aerobically, leading to an 
RQ greated than 1. 

44. The increase in ATP resulting from the processing of glucose 
in the B cells closes a potassium channel. The closing of the potas- 
sium channel alters the voltage across the cell membrane, which 
leads to an opening of a calcium channel. The influx of calcium 
causes the insulin-containing secretory granules to fuse with the 
cell membrane, resulting in the release of insulin. 


Chapter 28 

1. DNA polymerase I uses deoxyribonucleoside triphosphates; 
pyrophosphate is the leaving group. DNA ligase uses DNA-— 
adenylate (AMP joined to the 5'-phosphoryl group) as a reac- 
tion partner; AMP is the leaving group. Topoisomerase I uses a 
DNA-tyrosyl intermediate (5’-phosphoryl group linked to the 
phenolic OH group); the tyrosine residue of the enzyme is the 
leaving group. 

2. Positive supercoiling resists the unwinding of DNA. The melt- 
ing temperature of DNA increases in proceeding from negatively 
supercoiled to relaxed to positively supercoiled DNA. Positive 
supercoiling is probably an adaptation to high temperature. 

3. The nucleotides used for DNA synthesis have the triphosphate 
attached to the 5’-hydroxyl group with free 3’-hydroxyl groups. 
Such nucleotides can be utilized only for 5’-to-3’ DNA synthesis. 
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4. DNA replication requires RNA primers. Without appropriate 
ribonucleotides, such primers cannot be synthesized. 

5. This close contact prevents the incorporation of ribonucleotides 
rather than 2’-deoxyribonucleotides. 

6. (a) 96.2 revolutions per second (1000 nucleotides per second 
divided by 10.4 nucleotides per turn for B-DNA gives 96.2 rps). 
(b) 0.34 pm s~! (1000 nucleotides per second corresponds to 

3400 A s~! because the axial distance between nucleotides in 
B-DNA is 3.4 A). 

7. Eventually, the DNA would become so tightly wound that move- 
ment of the replication complex would be energetically impossible. 
8. Linking number Lk = Tw + Wr = 48 + 3 = 51. If Tw = 50, 
then Wr = 1. 

9. A hallmark of most cancer cells is prolific cell division, which 
requires DNA replication. If the telomerase were not activated, 
the chromosomes would shorten until they became nonfunctional, 
leading to cell death. 

10. No. 

11. Treat the DNA briefly with endonuclease to occasionally nick 
each strand. Add the polymerase with the radioactive dNTPs. At 
the broken bond, or nick, the polymerase will degrade the existing 
strand with its 5’ — 3’ exonuclease activity and replace it with a 
radioactive complementary copy by using its polymerase activity. 
This reaction scheme is referred to as “nick translation” because 
the nick is moved, or translated, along the DNA molecule without 
ever becoming sealed. 

12. If replication were unidirectional, tracks with a low grain den- 
sity at one end and a high grain density at the other end would be 
seen. On the other hand, if replication were bidirectional, the mid- 
dle of a track would have a low density, as shown in the diagram 
below. For E. coli, the grain tracks are denser on both ends than in 
the middle, indicating that replication is bidirectional. 


Unidirectional synthesis 


o> 
Origin 


Bidirectional synthesis 


o> 
Origin 


13. Pro (CCC), Ser (UCC), Leu (CUC), and Phe (UUC). 
Alternatively, the last base of each of these codons could be U. 

14. Potentially deleterious side reactions are prevented. The 
enzyme itself might be damaged by light if it could be activated by 
light in the absence of bound DNA harboring a pyrimidine dimer. 
15. The free DNA ends that appear in the absence of telomeres are 
repaired by DNA fusion. 

16. The free energy of ATP hydrolysis under standard conditions 
is —30.5 kJ mol~! (—7.3 kcal mol~!). In principle, it could be used 
to break three base pairs. 

17. The oxidation of guanine could lead to DNA repair: DNA 
strand cleavage could allow looping out of the triplet repeat regions 
and triplet expansion. 

18. The release of DNA topoisomerase II after the enzyme has 
acted on its DNA substrate requires ATP hydrolysis. Negative 
supercoiling requires only the binding of ATP, not its hydrolysis. 
19. (a) Size; the top is relaxed and the bottom is supercoiled DNA. 
(b) Topoisomers. (c) The DNA is becoming progressively more 
unwound, or relaxed, and thus slower moving. 

20. (a) It was used to determine the number of spontaneous 
revertants—that is, the background mutation rate. 


(b) To firmly establish that the system was working. A known 
mutagen’s failure to produce revertants would indicate that some- 
thing was wrong with the experimental system. 

(c) The chemical itself has little mutagenic ability but is apparently 
activated into a mutagen by the liver homogenate. 

(d) Cytochrome P450 system. 


Chapter 29 
1. The sequence of the coding (+, sense) strand is 


5'-ATGGGGAACAGCAAGAGTGGGGCCCTGTCCAAGGAG- 3’ 
and the sequence of the template (—, antisense) strand is 
3'-TACCCCTTGTCGTTCTCACCCCGGGACAGGTTCCTC-5’ 


2. An error will affect only one molecule of mRNA of many syn- 
thesized from a gene. In addition, the errors do not become a per- 
manent part of the genomic information. 

3. At any given instant, only a fraction of the genome (total DNA) 
is being transcribed. Consequently, speed is not essential. 

4. The active sites are related by convergent evolution. 

5. Heparin, a glycosaminoglycan, is highly anionic. Its negative 
charges, like the phosphodiester bridges of DNA templates, allow 
it to bind to lysine and arginine residues of RNA polymerase. 

6. This mutant o will competitively inhibit the binding of holo- 
enzyme and prevent the specific initiation of RNA chains at pro- 
moter sites. 

7. The core enzyme without o binds more tightly to the DNA 
template than does the holoenzyme. The retention of o after 
chain initiation would make the mutant RNA polymerase slower. 
The mutant enzyme would also be unlikely to bind alternative 

o factors. 

8. A 100-kDa protein contains about 910 residues, which are 
encoded by 2730 nucleotides. At a maximal transcription rate of 
50 nucleotides per second, the mRNA would be synthesized in 55 s. 
9. The RNA polymerase slides along the DNA rapidly rather than 
simply diffusing through three-dimensional space. 

10. The start site is in red: 


5’-GCCGTTGACACCGTTCGGCGATCGATCCGCTATAATGTGTGGATCCGCTT-3’ 


11. Initiation at strong promoters takes place every 2 s. In this 
interval, 100 nucleotides are transcribed. Hence, centers of tran- 
scription bubbles are 34 nm (340 A) apart. 

12. (a) The lowest band on the gel will be that of strand 3 alone 
(i). Band ii will be at the same position as band i because the 
RNA is not complementary to the nontemplate strand, whereas 
band iii will be higher because a complex is formed between RNA 
and the template strand. Band iv will be higher than the others 
because strand 1 is complexed to 2, and strand 2 is complexed to 
3. Band v is the highest because core polymerase associates with 
the three strands. 

(b) None, because rifampicin acts before the formation of the open 
complex. 

(c) RNA polymerase is processive. When the template is bound, 
heparin cannot enter the DNA-binding site. 

(d) When GTP is absent, synthesis stops when the first cytosine 
residue downstream of the bubble is encountered in the template 
strand. In contrast, with all four nucleoside triphosphates present, 
synthesis will continue to the end of the template. 

13. Backtracking must occur prior to cleavage leading to dinucleo- 
tide products. 

14. The base-pairing energy of the di- and trinucleotide DNA-— 
RNA hybrids formed at the very beginning of transcription is not 
sufficient to prevent strand separation and loss of product. 


15. (a) Because cordycepin lacks a 3’-OH group, it cannot partici- 
pate in 3’ — 5’ bond formation. (b) Because the poly(A) tail is a 
long stretch of adenosine nucleotides, the likelihood that a molecule 
of cordycepin would become incorporated is higher than with most 
RNA. (c) Yes, it must be converted into cordycepin 5'-triphosphate. 
16. There are 2° = 256 possible products. 

17. The relationship between the —10 and —35 sequences could 
be affected by torsional strain. Since topoisomerase II introduces 
negative supercoils in DNA, this prevents this enzyme from over- 
stimulating the expression of its own gene. 

18. Ser-Ie-Phe-His-Pro-Stop 

19. A mutation that disrupted the normal AAUAAA recognition 
sequence for the endonuclease could account for this finding. In 
fact, a change from U to C in this sequence caused this defect in 

a thalassemic patient. Cleavage occurred at the AAUAAA 900 
nucleotides downstream from this mutant AACAAA site. 

20. One possibility is that the 3’ end of the poly(U) donor strand 
cleaves the phosphodiester bond on the 5’ side of the insertion 

site. The newly formed 3’ terminus of the acceptor strand then 
cleaves the poly(U) strand on the 5’ side of the nucleotide that ini- 
tiated the attack. In other words, a uridine residue could be added 
by two transesterification reactions. This postulated mechanism is 
similar to the one in RNA splicing. 

21. Alternative splicing, RNA editing. Covalent modification of 
the proteins subsequent to synthesis. 

22. Attach an oligo(dT) or oligo(U) sequence to an inert support 
to create an affinity column. When RNA is passed through the 
column, only poly(A)-containing RNA will be retained. 

23. (a) Different amounts of RNA are present for the various genes. 
(b) Although all of the tissues have the same genes, the genes are 
expressed to different extents in different tissues. 

(c) These genes are called housekeeping genes—genes that most 
tissues express. They might include genes for glycolysis or citric 
acid cycle enzymes. 

(d) The point of the experiment is to determine which genes are initiat- 
ed in vivo. The initiation inhibitor is added to prevent initiation at start 
sites that may have been activated during the isolation of the nuclei. 
24. DNA is the single strand that forms the trunk of the tree. 
Strands of increasing length are RNA molecules; the beginning of 
transcription is where growing chains are the smallest; the end of 
transcription is where chain growth stops. Direction is left to right. 
Many enzymes are actively transcribing each gene. 


Chapter 30 

1. The Oxford English Dictionary defines translation as the action 
or process of turning from one language into another. Protein syn- 
thesis converts nucleic acid sequence information into amino acid 
sequence information. 

2. An error frequency of 1 incorrect amino acid every 10* incor- 
porations allows for the rapid and accurate synthesis of proteins as 
large as 1000 amino acids. Higher error rates would result in too 
many defective proteins. Lower error rates would likely slow the 
rate of protein synthesis without a significant gain in accuracy. 

3. (1) Each is a single chain. (ii) They contain unusual bases. 

(iii) Approximately half of the bases are base-paired to form 
double helices. (iv) The 5’ end is phosphorylated and is usually 
pG. (v) The amino acid is attached to the hydroxyl group of the 

A residue of the CCA sequence at the 3’ end of the tRNA. 

(vi) The anticodon is located in a loop near the center of the tRNA 
sequence. (vii) The molecules are L-shaped. 

4. First is the formation of the aminoacy] adenylate, which then 
reacts with the tRNA to form the aminoacyl-tRNA. Both steps are 
catalyzed by aminoacyl-tRNA synthetase. 
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5. Unique features are required so that the aminoacyl-tRNA syn- 
thetases can distinguish among the tRNAs and attach the correct 
amino acid to the proper tRNA. Common features are required 
because all tRNAs must interact with the same protein-synthesiz- 
ing machinery. 

6. An activated amino acid is one linked to the appropriate tRNA. 
7. (a) No; (b) no; (c) yes. 

8. The ATP is cleaved to AMP and PP.. Consequently, a second 
ATP is required to convert AMP into ADP, the substrate for oxi- 
dative phosphorylation. 

9. Amino acids larger than the correct amino acid cannot fit into 
the active site of the tRNA. Smaller but incorrect amino acids 

that become attached to the tRNA fit into the editing site and are 
cleaved from the tRNA. 

10. Recognition sites on both faces of the tRNAs may be required 
to uniquely identify the 20 different tRNAs. 

11. The first two bases in a codon form Watson—Crick base 

pairs that are checked for fidelity by bases of the 165 rRNA. The 
third base is not inspected for accuracy, and so some variation is 
tolerated. 

12. Four bands: light, heavy, a hybrid of light 30S and heavy 508, 
and a hybrid of heavy 305 and light 50S. 

13. Two hundred molecules of ATP are converted into 200 AMP + 
400 P, to activate the 200 amino acids, which is equivalent to 400 
molecules of ATP. One molecule of GTP is required for initiation, 
and 398 molecules of GTP are needed to form 199 peptide bonds. 
14. The reading frame is a set of contiguous, nonoverlapping 
three-nucleotide codons that begins with a start codon and ends 
with a stop codon. 

15. A mutation caused by the insertion of an extra base can be 
suppressed by a tRNA that contains a fourth base in its anticodon. 
For example, UUUC rather than UUU is read as the codon for phe- 
nylalanine by a tRNA that contains 3’-AAAG-5’ as its anticodon. 
16. One approach is to synthesize a tRNA that is acylated with a 
reactive amino acid analog. For example, bromoacetyl-phenylalanyl- 
tRNA is an affinity-labeling reagent for the P site of E. coli 
ribosomes. 

17. The sequence GAGGU is complementary to a sequence of 
five bases at the 3’ end of 16S rRNA and is located several bases 
upstream of an AUG start codon. Hence, this region is a start 
signal for protein synthesis. The replacement of G by A would be 
expected to weaken the interaction of this mRNA with the 165 
rRNA and thereby diminish its effectiveness as an initiation signal. 
In fact, this mutation results in a 10-fold decrease in the rate of 
synthesis of the protein specified by this mRNA. 

18. The peptide would be Phe-Cys- His-Val-Ala-Ala. The codons 
UGC and UGU encode cysteine but, because the cysteine was 
modified to alanine, alanine is incorporated in place of cysteine. 
19. Proteins are synthesized from the amino to the carboxyl end 
on ribosomes, whereas they are synthesized in the reverse direction 
in the solid-phase method. The activated intermediate in ribo- 
somal synthesis is an aminoacyl-tRNA; in the solid-phase method, 
it is the adduct of the amino acid and dicyclohexylcarbodiimide. 
20. GTP is not hydrolyzed until aminoacyl-tRNA is delivered 

to the A site of the ribosome. An earlier hydrolysis of GTP 

would be wasteful because EF-Tu-GDP has little affinity for 
aminoacyl-tRNA. 

21. The translation of an mRNA molecule can be blocked by anti- 
sense RNA, an RNA molecule with the complementary sequence. 
The antisense-sense RNA duplex cannot serve as a template for 
translation; single-stranded mRNA is required. Furthermore, 

the antisense-sense duplex is degraded by nucleases. Antisense 
RNA added to the external medium is spontaneously taken up by 
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many cells. A precise quantity can be delivered by microinjection. 
Alternatively, a plasmid encoding the antisense RNA can be intro- 
duced into target cells. 

22. (a) A;. (b) A, > A, > A; > A,. (c) Synthesis is from the 
amino terminus to the carboxy] terminus. 

23. These enzymes convert nucleic acid information into protein 
information by interpreting the tRNA and linking it to the proper 
amino acid. 

24. The rate would fall because the elongation step requires that the 
GTP be hydrolyzed before any further elongation can take place. 
25. Protein factors modulate the initiation of protein synthesis. 
The role of IF1 and IF3 is to prevent premature binding of the 305 
and 50S ribosomal subunits, whereas IF2 delivers Met-tRNA; 

to the ribosome. Protein factors are also required for elongation 
(EF-G and EF-Tu), for termination (release factors, RFs), and for 
ribosome dissociation (ribosome release factors, RRFs). 

26. The signal sequence, signal-recognition particle (SRP), the 
SRP receptor, and the translocon. 

27. The formation of peptide bonds, which in turn are powered by 
the hydrolysis of the aminoacyl-tRNAs. 

28. The Shine-Dalgarno sequence of the mRNA base-pairs with a 
part of the 165 rRNA of the 30S subunit, which positions the sub- 
unit so that the initiator AUG is recognized. 


29. 
Bacteria Eukaryote 
Ribosome size 70S 80S 
mRNA Polycistronic Not polycistronic 
Initiation Shine-Dalgarno is First AUG is used 
required 
Protein factors Required Many more required 


Relation to Translation can start Transcription and 


transcription before transcription _ translation are 
is completed spatially separated 
First aminoacid fMet Met 


30. The SRP binds to the signal sequence and inhibits further 
translation. The SRP ushers the inhibited ribosome to the ER, 
where it interacts with the SRP receptor (SR). The SRP-SR com- 
plex binds the translocon and simultaneously hydrolyzes GTP. On 
GTP hydrolysis, SRP and SR dissociate from each other and from 
the ribosome. Protein synthesis resumes and the nascent protein is 
channeled through the translocon. 

31. The alternative would be to have a single ribosome translating 
a single mRNA molecule. The use of polysomes allows more pro- 
tein synthesis per mRNA molecule in a given period of time and 
thus the production of more protein. 

32. (a) 1, 2, 3, 5, 6, 10; (b) 1, 2, 7, 8; (c) 1, 4, 8, 9. 

33. Transfer RNAs have roles in several recognition processes. A 
tRNA must be recognized by the appropriate aminoacyl-tRNA 
synthetase, and the tRNA must interact with the ribosome and, in 
particular, with the peptidyl transferase. 

34. The aminoacyl-tRNA can be initially synthesized. However, 
the side-chain amino group attacks the ester linkage to form a six- 
membered amide, releasing the tRNA. 

35. (a, d, and e) Type 2; (b, c, and f ) Type 1. 

36. The error rates of DNA, RNA, and protein synthesis are of 
the order of 1071, 1075, and 10~‘, respectively, per nucleotide 
(or amino acid) incorporated. The fidelity of all three processes 
depends on the precision of base-pairing to the DNA or mRNA 
template. Few errors are corrected in RNA synthesis. In con- 
trast, the fidelity of DNA synthesis is markedly increased by the 


3' — 5’ proofreading nuclease activity and by postreplicative 
repair. In protein synthesis, the mischarging of some tRNAs is 
corrected by the hydrolytic action of aminoacyl-tRNA synthe- 
tase. Proofreading also takes place when aminoacyl-tRNA occu- 
pies the A site on the ribosome; the GTPase activity of EF-Tu 
sets the pace of this final stage of editing. 

37. EF-Ts catalyzes the exchange of GTP for GDP bound to 
EF-Tu. In G-protein cascades, an activated 7TM receptor cata- 
lyzes GTP—GDP exchange in a G protein. 

38. The a subunits of G proteins are inhibited by a similar mecha- 
nism in cholera and whooping cough (Section 14.5). 

39. Glu-tRNAG" is formed by misacylation. The activated glu- 
tamate is subsequently amidated to form Gln-tRNA@, Ways 

in which glutamine is formed from glutamate were discussed 

in Section 24.2. In regard to H. pylori, a specific enzyme, Glu- 
tRNA@! amidotransferase, catalyzes the following reaction: 


Gln + Glu-tRNAG" + ATP —> 
Gln-tRNAG* + Glu + ADP + P. 


Glu-tRNA®" is not a substrate for the enzyme; so the transferase 
must also recognize aspects of the structure of tRNAG". 

40. The primary structure determines the three-dimensional 
structure of the protein. Thus, the final phase of information trans- 
fer from DNA to RNA to protein synthesis is the folding of the 
protein into its functional state. 

41. (a) elF-4H has two effects: (1) the extent of unwinding is 
increased and (2) the rate of unwinding is increased, as indicated 
by the increased rise in activity at early reaction times. 

(b) To firmly establish that the effect of e[F-H4 was not due to any 
inherent helicase activity. 

(c) Half-maximal activity was achieved at 0.11 wM of elF-4H. 
Therefore, maximal stimulation would be achieved at a ratio of 1:1. 
(d) eIF-4H enhances the rate of unwinding of all helices, but the 
effect is greater as the helices increase in stability. 

(e) The results in graph C suggest that e[F-4H increases the 
processivity. 

42. (a) The three peaks represent, from left to right, the 405 ribo- 
somal subunit, the 60S ribosomal subunit, and the 805 ribosome. 
(b) Not only are ribosomal subunits and the 805 ribosome present, 
but polysomes of various lengths also are apparent. The individual 
peaks in the polysome region represent polysomes of discrete length. 
(c) The treatment significantly inhibited the number of polysomes 
while increasing the number of free ribosomal subunits. This 
outcome could be due to inhibited protein-synthesis initiation or 
inhibited transcription. 


Chapter 31 

1. (a) Cells will express B-galactosidase, lac permease, and thio- 
galactoside transacetylase even in the absence of lactose. (b) Cells 
will express B-galactosidase, lac permease, and thiogalactoside 
transacetylase even in the absence of lactose. (c) The levels of cata- 
bolic enzymes such as B-galactosidase and arabinose isomerase will 
remain low even at low levels of glucose. 

2. The concentration is 1/(6 X 1023) moles per 1075 liter = 1.7 X 
10-9 M. Because Ky= 10713 M, the single molecule should be 
bound to its specific binding site. 

3. The number of possible 8-bp sites is 4° = 65,536. In a genome 
of 4.6 X 10° base pairs, the average site should appear 4.6 X 
10°/65,536 = 70 times. Each 10-bp site should appear 4 times. 
Each 12-bp site should appear 0.27 times (many 12-bp sites will 
not appear at all). 

4. The lac repressor does not bind DNA when the repressor is bound 
to a small molecule (the inducer), whereas the pur repressor binds 


DNA only when the repressor is bound to a small molecule (the 

corepressor). The E. coli genome contains only a single lac repressor- 

binding region, whereas it has many sites for the pur repressor. 

5. Anti-inducers bind to the conformation of repressors, such as 

the lac repressor, that are capable of binding DNA. They occupy 

a site that overlaps that for the inducer and, therefore, compete for 

binding to the repressor. 

6. The inverted repeat may be a binding site for a dimeric DNA- 

binding protein or it may correspond to a stem-loop structure in 

the encoded RNA. 

7. Bacteriophage \ would be more likely to enter the lytic phase 

because the cooperative binding of the \ repressor to Op2 and 

Ol, which supports the lysogenic pathway, would be disrupted. 

8. Nrepressor gene —10 region GATTTA —35 region TAGATA 
Cro gene —10 region TAATGG —35 region TTGACT 

There are four differences in the —10 region and three differences 

in the —35 region. 

9. Increased Cro concentration reduces the expression of the \ 

repressor gene. Increased \ repressor concentration reduces the 

expression of the Cro gene. At low A repressor concentrations, 

increased \ repressor concentration increases the expression of the 

d repressor gene. At higher \ repressor concentrations, increased 

\ repressor concentration decreases the expression of the A repres- 

sor gene. 

10. Normally, bacterial mRNAs have a leader sequence in which 

a Shine-Dalgarno sequence precedes the AUG start codon. 

The absence of a leader would be expected to lead to inefficient 

translation. 

11. Add each compound to a culture of V. fischeri at low density 

and look for the development of luminescence. 

12. ACC, 7; ACA, 1; ACU, 0; ACG, 0. 

13. Retention. The reaction likely proceeds in two steps: Attack of 

Glu 537 on the carbon of the galactose moiety (with inversion) fol- 

lowed by attack of water on this carbon (with a second inversion) 

to release the galactose from the enzyme. 

14. The footprint appears to have approximately 50% of its inten- 

sity near 3.7 nM so that the dissociation constant is approximately 

3.7 nM, corresponding to a standard free energy of binding of 

—48 kJ/mol (—11 kcal/mol) at T = 298K. 


Chapter 32 

1. The distribution of charged amino acids is H2A (13 K, 13 R, 
2D,7E, charge = +17), H2B (20K, 8R, 3D, 7E, charge = +18), 
H3 (13K, 18R,4D,7E, charge = +20), H4 (11 K, 14R, 3D, 
4E, charge = +18). The total charge of the histone octamer is esti- 
mated to be 2 X (17 + 18 + 20 + 18) = +146. The total charge 
on 150 base pairs of DNA is —300. Thus, the histone octamer 
neutralizes approximately one-half of the charge. 

2. The presence of a particular DNA fragment could be detected 
by hybridization, by PCR, or by direct sequencing. 

3. The total length of the DNA is estimated to be 145 bp X 

3.4 A/bp = 493 A, which represents 1.75 turns or 1.75 X 2ar = 
11.0r. Thus, the radius is estimated to be r = 493 A/11.0 = 44.8 A. 
4, 5-Azacytidine cannot be methylated. Some genes, normally 
repressed by methylation, will be active. 5-Azacytidine is also an 
inhibitor of DNA methyltransferase, resulting in lower levels of 
methylation and, hence, lower levels of gene repression. 

5. Proteins containing these domains will be targeted to methylated 
DNA in repressed promoter regions. They would likely bind in the 
major groove because that is where the methyl group is located. 

6. Gene expression is not expected to respond to the presence of 
estrogen. However, genes for which expression normally responds 
to estrogen will respond to the presence of progesterone. 
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7. The acetylation of lysine will reduce the charge from +1 to 0. 
The methylation of lysine will not reduce the charge. 

8. On the basis of the pattern of cysteine and histidine residues, 
this region appears to contain three zinc-finger domains. 

9. 10/4000 = 0.25%. 0.25% of 12 Mb = 30 kilobase pairs. 

10. Monomethylated-activated; Trimethylated-repressed. 

11. The addition of an IRE to the 5’ end of the mRNA is expected 
to block translation in the absence of iron. The addition of an IRE 
to the 3’ end of the mRNA is not expected to block translation, but 
it might affect mRNA stability. 

12. The sequences of all of the mRNAs would be searched for 
sequences that are fully or nearly complementary to the sequence 
of the miRNA. These sequences would be candidates for regula- 
tion by this mRNA. 

13. The amino group of the lysine residue, formed from the 
protonated form by a base, attacks the carbonyl group of acetyl 
CoA to generate a tetrahedral intermediate. This intermediate 
collapses to form the amide bond and release CoA. 

14. In mouse DNA, most of the Hpall sites are methylated and 
therefore not cut by the enzyme, resulting in large fragments. 
Some small fragments are produced from CpG islands that are 
unmethylated. For Drosophila and E. coli DNA, there is no meth- 
ylation and all sites are cut. 


Chapter 33 

1. The transgenic nematode would avoid the compound. The 
identity of the ligand is determined by the receptor, whereas the 
behavioral response is dictated by the neuron in which the receptor 
is expressed. 

2. Only a mixture of compounds C,-COOH and HOOC-C,- 
COOH is predicted to yield this pattern. 

3. Bitter and sweet sensations are mediated by G proteins coupled 
to 7TM receptors, leading to millisecond time resolution. Salty and 
sour sensations are mediated directly by ion channels, which may 
lead to faster time resolution. 

4. Sound travels 0.15 m in 428 ws. The human hearing system is 
capable of sensing time differences of close to a microsecond, and 
so the difference in arrival times at the two ears is substantial. A 
system based on G proteins is unlikely to be able to reliably distin- 
guish between signals arriving at the two ears, because G proteins 
typically respond in milliseconds. 

5. Ifa plant tastes bitter, animals will avoid eating it even if it is 
nontoxic, which may provide a selective advantage to the plant. 

6. Using mice in which either the gene for T1R1 or the gene for 
T1R3 has been disrupted, test the taste responses of these mice to 
glutamate, aspartate, and a wide variety of other amino acids. 

7. 530 nm light will be absorbed to some degree by all three pho- 
toreceptors, with the largest absorption from the “green” receptor. 
8. These women have four functional color receptors: blue, 

red, green, and a red-green hybrid. The additional color recep- 

tor allows some colors that appear identical to most people to be 
distinguished. 

9. 380 (one for each receptor); there are (380 X 379)/2! = 72,010 
combinations of two receptors; (380 X 379 X 378)/3! = 9,073,260 
combinations of three receptors. 

10. The absorption of light converts 11-cis-retinal into all- 
trans-retinal. 

11. These compounds are enantiomers and must bind to protein 
receptors to elicit a smell. Even these subtle structural differ- 
ences can affect relative receptor binding affinities and, hence, the 
elicited odor. 

12. Vision: cCGMP-gated channel; taste: amiloride-sensitive 
sodium channel; hearing; tip-link channel. 
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13. For all senses, ATP hydrolysis is required to generate and 
maintain ion gradients and membrane potential. Olfaction: ATP 
is required for the synthesis of cAMP. Gustation: ATP is required 
for the synthesis of cyclic nucleotides, and GTP is required for 
the action of gustducin in the detection of bitter and sweet tastes. 
Vision: GTP is required for the synthesis of CGMP and for the 
action of transducin. Hearing and touch: ATP hydrolysis is 
required to generate and maintain ion gradients and membrane 
potential and may be required for other roles as well. 

14. 
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Chapter 34 

1. The innate immune system responds rapidly to common fea- 
tures present in many pathogens. The genes for the innate immune 
system’s key molecules are expressed without substantial modifica- 
tion. In contrast, the adaptive immune system responds to specific 
features present only in a given pathogen. Its genes undergo sig- 
nificant rearrangement and mutation to enable specific recognition 
of a vast number of potential binding surfaces. 

2. VJ and V(D)J recombination; variability in segment joining by 
the action of terminal deoxyribonucleotidyl transferase; somatic 
mutation. 

3. Affinity refers to the strength of a single interaction; avidity 
refers to the cumulative strength of multiple independent binding 
interactions. Avidity may play a significant role in the interaction 
between IgM and antigen because this immunoglobulin class fea- 
tures 10 binding sites. 

4. The intracellular signaling domain common to each of the 
TLRs is responsible for docking other proteins and reporting that 
a targeted pathogen-associated molecular pattern (PAMP), such 

as LPS, has been detected. If a mutation within this domain inter- 
fered with the intracellular docking and signal transduction, then 
TLR4 would not respond to LPS. 

5. Viruses that contain dsRNA genomes would be expected to 
stimulate a TLR3-mediated immune response. 

6. (a) AG?’ = —37kJ mol7!(—8.9 kcal mol!) 

(b) K, = 3.3 x 10°M 7! 

(c) Ron = 4 X 10° M's !. This value is close to the diffusion- 
controlled limit for the combination of a small molecule with a 
protein (p. 231). Hence, the extent of structural change is likely to 
be small; extensive conformational transitions take time. 

7. The fluorescence enhancement and the shift to blue indicate 
that water is largely excluded from the binding site when the hap- 
ten is bound. Hydrophobic interactions contribute significantly to 
the formation of most antigen—antibody complexes. 

8. (a) An antibody combining site is formed by CDRs from both 
the H and the L chains. The V,; and V; domains are essential. A 
small proportion of F.,, fragments can be further digested to pro- 
duce F,,, a fragment that contains just these two domains. C};1 and 
C,_ contribute to the stability of F.,, but not to antigen binding. 


(b) A synthetic F, analog 248 residues long was prepared by 
expressing a synthetic gene consisting of a V,, gene joined to a V;_ 
gene through a linker. [J.S. Huston et al., Proc. Natl. Acad. Sci. 
U.S.A. 85:5879-5883, 1988. ] 

9. (a) Multivalent antigens lead to the dimerization or oligomer- 
ization of transmembrane immunoglobulins, an essential step in 
their activation. This mode of activation is reminiscent of that of 
receptor tyrosine kinases (Section 14.2). 

(b) An antibody specific for a transmembrane immunoglobulin 
will activate a B cell by cross-linking these receptors. This experi- 
ment can be carried out by using, for example, a goat antibody to 
cross-link receptors on a mouse B cell. 

10. Bcells do not express T-cell receptors. The hybridiza- 

tion of T-cell cDNAs with B-cell mRNAs removes cDNAs 
that are expressed in both cells. Hence, the mixture of cDNAs 
subsequent to this hybridization are enriched in those encoding 
T-cell receptors. This procedure, called subtractive hybridiza- 
tion, is generally useful in isolating low-abundance cDNAs. 
Hybridization should be carried out by using mRNAs from 

a closely related cell that does not express the gene of inter- 

est. [See S. M. Hedrick, D. I. Cohen, E. A. Nielsen, and 

M. M. Davis, Nature 308:149-153, 1984, for an interest- 

ing account of how this method was used to obtain genes for 
T-cell receptors. | 

11. TLR4 is the receptor for LPS, a toxin found specifically in the 
walls of Gram-negative bacteria. Mutations that inhibit the func- 
tion of TLR4 impair an affected person’s defenses against this class 
of bacteria. 

12. Ifthe HLA alleles are not matched, then the recipient’s T cell 
receptors will identify the MHC proteins of the transplanted tissue 
as nonself and transplant rejection is likely. 

13. Purify an antibody with a specificity to one antigen. Unfold 
the antibody and allow it to re-fold either in the presence of the 
antigen or in the absence of the antigen. Test the re-folded anti- 
bodies for antigen-binding ability. 

14. In some cases, V-D-J rearrangement will result in com- 
bining V, D, and J segments out of frame. mRNA molecules 
produced from such rearranged genes will produce truncated 
molecules if translated. This possibility is excluded by degrading 
the mRNA. 

15. The mutant bacteria may still stimulate an immune response 
without causing disease. Hence, they may be valuable starting 
points for the design of a live attenuated vaccine for the original 
pathogenic strain. 

16. The peptide is LLQATYSAV (L in second position, V in last). 
17. Catalysis is likely to require a base for removing a proton from 
a water molecule. A histidine, glutamate, or aspartate residue is 
most likely. In addition, a potential hydrogen-bond donor may be 
present and will interact with the negatively charged oxygen atom 
that forms in the transition state. 

18. A phosphotyrosine residue in the carboxyl terminus of Src and 
related protein tyrosine kinases binds to its own SH2 domain to 
generate the inhibited form of Src (Section 14.5). Removal of the 
phosphoryl group from this residue will activate the kinase. 

19. (a) Kq = 10°’ M; (b) Kg = 107° M. The gene was probably 
generated by a point mutation in the gene for antibody A rather 
than by de novo rearrangement. 


Chapter 35 
1. (a) Skeletal muscle and eukaryotic cilia derive their free energy 
from ATP hydrolysis; the bacterial flagellar motor uses a proton- 
motive force. 


(b) Skeletal muscle requires myosin and actin. Eukaryotic cilia 
require microtubules and dynein. The bacterial flagellar motor 
requires MotA, MotB, and FliG, as well as many ancillary 
components. 

2. 6400 A/80 A = 80 body lengths per second. For a 10-foot 
automobile, this body-length speed corresponds to a speed of 

80 X 10 feet = 800 feet per second, or 545 miles per hour. 

3. 4pN = 8.8 X 107! pounds. The weight of a single motor 
domain is 100,000 g mol~!/(6.023 X 1023 molecules mol~!) = 
1.7 X 10719 g = 3.7 X 10”? pounds. Thus, a motor domain can 
lift (8.8 X 107!3/3.7 x 10722) = 2.4 X 10° times its weight. 

4. Both actin filaments and microtubules are built from subunits 
and these subunits bind and hydrolyze nucleoside triphosphates. 
Actin filaments are built of a single type of subunit and these sub- 
units bind ATP. Microtubules are built of two different types of 
subunits and these subunits bind GTP. 

5. The light chains in myosin stiffen the lever arm. The light 
chains in kinesin bind cargo to be transported. 

6. After death, the ratio of ADP to ATP increases rapidly. In 

the ADP form, myosin motor domains bind tightly to actin. 
Myosin—actin interactions are possible because the drop in ATP 
concentration also allows the calcium concentration to rise, clear- 
ing the blockage of actin by tropomyosin through the action of the 
troponin complex. 

7. Above its critical concentration, ATP-actin will polymerize. 
The ATP will hydrolyze through time to form ADP-actin, which 
has a higher critical concentration. Thus, if the initial subunit 
concentration is between the critical concentrations of ATP-actin 
and ADP.-actin, filaments will form initially and then disappear on 
ATP hydrolysis. 

8. A one-base step is approximately 3.4 A = 3.4 X 1074 ym. If 
a stoichiometry of one molecule of ATP per step is assumed, this 
distance corresponds to a velocity of 0.017 um s~'. Kinesin moves 
at a velocity of 6400 A per second, or 0.64 um s~!. 

9. A proton-motive force across the plasma membrane is 
necessary to drive the flagellar motor. Under conditions of star- 
vation, this proton-motive force is depleted. In acidic solution, 
the pH difference across the membrane is sufficient to power 
the motor. 

10. The mean distance between tumbles would be longer when the 
bacterium is moving up a gradient of a chemoattractant. 

11. (a) 1.13 X 1079 dyne 

(b) 6.8 X 10!* erg 

(c) 6.6 X 107!! erg per 80 molecules of ATP. A single kinesin 
motor provides more than enough free energy to power the 
transport of micrometer-size cargoes at micrometer-per-second 
velocities. 

12. The spacing between identical subunits on microtubules 

is 8nm. Thus, a kinesin molecule with a step size that is not a 
multiple of 8 nm would have to be able to bind at more than one 
type of site on the microtubule surface. 

3. KIF1A must be tethered to an additional microtubule-binding 
element that retains an attachment to the microtubule when the 
motor domain releases. 

4. Filaments built from subunits can be arbitrarily long, can be 
dynamically assembled and disassembled, and require only a small 
amount of genetic information to encode. 

15. Protons still flow from outside to inside the cell. Each 
proton might pass into the outer half-channel of one MotA— 
MotB complex, bind to the MS ring, rotate clockwise, and pass 
into the inner half-channel of the neighboring MotA—MotB 
complex. 
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16. Ata high concentration of calcium ion, Ca2* binds to calmod- 
ulin. In turn, calmodulin binds to a protein kinase that phos- 
phorylates myosin light chains and activates it. At low calcium ion 
concentration, the light chains are dephosphorylated by a Ca?*- 
independent phosphatase. 

17. (a) The value of k,,, is approximately 13 molecules per second, 
whereas the K,, value for ATP is approximately 12 uM. 

(b) The step size is approximately (380 — 120)/7 = 37 nm. 

(c) The step size is very large, which is consistent with the presence 
of six light-chain-binding sites and, hence, very long lever arms. 
The rate of ADP release is essentially identical with the overall 
Rk. 80 ADP release is rate limiting, which suggests that both 
motor domains can bind to sites 37 nm apart simultaneously. ADP 
release from the hindmost domain allows ATP to bind, leading to 
actin release and lever-arm motion. 


Chapter 36 

1. (a) Before; (b) after; (c) after; (d) after; (e) before; (f) after. 

2. (a) Yes; (b) yes; (c) no (MW > 600). 

3. If computer programs could estimate log(P) values on the basis 
of chemical structure, then the required laboratory time for drug 
development could be shortened. The determination of the relative 
solubilities of pharmaceutical candidates by allowing each com- 
pound to equilibrate between water and an organic phase would no 
longer be necessary. 

4. Perhaps N-acetylcysteine would conjugate to some of the 
N-acetyl-p-benzoquinone imine that is produced by the metabo- 
lism of acetaminophen, thereby preventing the depletion of the 
liver’s supply of glutathione. 

5. In Phase I clinical trials, approximately 10 to 100 usually 
healthy volunteers are typically enrolled in a study designed to 
assess safety. In contrast, a larger number of subjects are enrolled 
in a typical Phase II trial. Moreover, these persons may benefit 
from the drug administered. In a Phase II trial, efficacy, dosage, 
and safety can be assessed. 

6. The binding of other drugs to albumin could cause extra cou- 
madin to be released. (Albumin is a general carrier for hydropho- 
bic molecules.) 

7. A drug that inhibits a P450 enzyme may dramatically affect 
the disposition of another drug that is metabolized by that same 
enzyme. If this inhibited metabolism is not accounted for when 
dosing, the second drug may reach very high, and sometimes toxic, 
levels in the blood. 

8. Unlike competitive inhibition, noncompetitive inhibition can- 
not be overcome with additional substrate. Hence, a drug that acts 
by a noncompetitive mechanism will be unaffected by changing 
levels of the physiological substrate. 

9. An inhibitor of MDR could prevent the efflux of a che- 
motherapeutic drug from tumor cells. Hence, this type of 

an inhibitor could be useful in averting resistance to cancer 
chemotherapy. 

10. Agents that inhibit one or more enzymes of the glycolytic 
pathway could act to deprive trypanosomes of energy and thus be 
useful for treating sleeping sickness. A difficulty is that glycolysis 
in the host cells also would be inhibited. 

11. Imatinib is an inhibitor of the Bcr-Abl kinase, a mutant 
kinase present only in tumor cells that have undergone a trans- 
location between chromosomes 9 and 22 (Figure 14.34). Before 
initiating treatment with imatinib, we could sequence the 
DNA of the tumor cells and determine (a) whether this translo- 
cation has taken place and (b) whether the sequence of bcr-abl 
carries any mutations that would render the kinase resistant 
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to imatinib. If the translocation has not taken place or if the 
gene carries resistance mutations, then imatinib would likely 
not be an effective treatment for the patient carrying this par- 
ticular tumor. 

12. Sildenafil increases CGMP levels by inhibiting the phospho- 
diesterase-mediated breakdown of cGMP to GMP. Intracellular 
cGMP levels can also be increased by activating its synthesis. This 
activation can be achieved with the use of NO donors (such as 
sodium nitroprusside and nitroglycerin) or compounds that acti- 
vate guanylate cyclase activity. Drugs that act by the latter mecha- 
nism are currently in clinical trials. 

13. A reasonable mechanism would be an oxidative deamination 
following an overall mechanism similar to that in Figure 36.10, 
with release of ammonia. 


NADPH + H*+ 0, + 


NADP* + H,O + NH, + 
CH 


14. Ky ~ 0.3nM. ICso ~ 2.0 nM. Yes, compound A should be 
effective when taken orally because 400 nM is much greater than 
the estimated values of K, and IC,,. 
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ion gradients in, 433-434, 433f 

in mitochondria, 550-551, 551f 

in muscle, 813 

in muscle contraction, 469-470 

in nitrogen fixation, 715-718 

phosphoryl-transfer potential, 429, 
430-431, 430f 

in random sequential reaction, 232 

resonance structures, 429, 429f 

structure of, 426, 427 

synthesis, 432—433 


effect on prostaglandin H2 synthase-1, 354f ATP formation 


as prostaglandin inhibitor, 669 
Assays 
defined, 67 


enzyme-linked immunosorbent (ELISA), 


82-83, 83f 
in protein purification, 67 
ATCase (aspartate transcarbamoylase) 
active site of, 289, 289f, 289 
allosteric interactions in, 288-291 
catalytic subunit, 287-288, 288f 
concerted model, 290 
cooperativity, 290 
defined, 285 
feedback inhibition, 287 
inhibition by CTP, 758 
reaction, 286, 286f 
regulatory subunit, 287-288, 288f 
separable subunits, 287—288 
sigmoidal curve, 291, 291f 
sigmoidal kinetics, 287, 287f 
structure of, 288-289, 289f 
T-to-R transition, 290-291, 290f 
ultracentrifugation studies of, 287, 288f 
Atherosclerosis, 661, 784—785, 787-788 
Atomic motion, 971, 972f 
Atorvastatin, 1044, 1044 
ATP (adenosine triphosphate), 108, 108, 
275, 295t 
as activated carrier, 435 
binding of, 186-187, 186f, 187f 


binding to molecular motors, 1014-1016, 


1015f, 1016f 


1,3-bisphosphoglycerate (1,3-BPG) and, 
459-460 

citric acid cycle and, 508 

by cyclic photophosphorylation, 580, 
580f, 581 

enzyme-bound, 544 

fermentation in, 462 

from glucose to pyruvate conversion, 
461-462 

from glycolysis, 459-462 

from phosphoryl transfer, 459-460 

proton-motive force and, 544-545, 545f 

pyruvate in, 460-462 


ATP hydrolysis 


ABC transporters and, 376 
ATP as base to promote, 277, 277f 
in DNA replication, 832-833, 833f 
in DNA supercoiling, 837-839, 838f 
in driving metabolism, 427-429 
enzyme use of, 276 
as exergonic, 426-427 
free energy from, 426-431 
‘y subunit in, 546 
hydrolysis, 276 
in kinesin motion, 1024-1026, 1025f 
magnesium ions in, 276-277 
myosin and, 276-278 
myosin conformational changes, 

278, 278f 
power stroke in, 1019 
reversible, within myosin active site, 279 
rotational motion, 546 


in transcription, 868, 868f 
transition state formation for, 277—278 
ATP synthase 
a subunit of, 543, 544, 546-547, 546f, 578 
a subunit of, 543, 543 
ATP-driven rotation in, 545-546, 546f 
b subunit of, 543, 544, 578 
B subunit of, 543, 543, 545, 546, 578 
bind-change mechanism for, 544-546, 545f 
c ring of, 545, 546-548 
of chloroplasts, 578-580 
defined, 542 
5 subunit of, 543, 544 
discovery, 542, 542b 
efficiency of, 548, 548b 
e subunit of, 580 
Fo subunit of, 543, 543, 544 
F, subunit of, 543, 543, 544, 545 
G proteins and, 548-549 
yy subunit of, 543, 543, 545, 548, 1027 
ye stalk, 544, 545 
inhibition of, 556 
nucleotide-binding sites, 545, 545f 
proton flow through, 542, 544-546 
proton motion across membrane in, 
547, 547£ 
proton path through membrane in, 
547, 547£ 
proton-conducting unit, 546-547, 546f 
regulation of, 554 
rotational catalysis in, 546, 546f 
ATP synthasome, 552 
ATP synthesis 
in chloroplasts, 578-580 
efficiency of, 544 
as endergonic process, 542 
mechanism, 544, 544f 
oxidative phosphorylation uncoupling 
from, 554-556, 555f 
proton flow around c ring in, 546-548 
proton gradients in, 541-549 
uncoupling electron transport from, 
556-557 
ATP yield, 552-553, 553t 
ATP-ADP cycle, 427, 432, 432f 
ATP-ADP translocase, 550-551, 551f, 557 
ATPases 
AAA, 685, 1012, 1014 
domains of myosin, 276, 276f, 277, 277f 
mitochondrial. See ATP synthase 
Na*-K*, 370, 373 
P-type, 367, 370-373, 374 
SERCA, 370, 371-373, 372f, 372 
ATP-binding cassettes (ABCs). See also 
ABC transporters 
ATP binding with, 375, 375f 
defined, 374 
distance between, 375 
ATP-citrate lyase, 666 
ATP-grasp fold, 745 
Atractyloside, 557 
Atrazine, 584, 584 
Attenuation, 935-937, 936f 


Autoimmune diseases, 1005, 1005f 
Autoinducers, 934, 934f 
Autotrophs, 566, 590 

Axel, Richard, 963 

Axial bonds, 320-321, 321f 
Axoneme, 1023 

Azothioprine, 1051, 1051 


B cells 
activation of, 993-994, 993f 
defined, 984 
memory, 1006 
B vitamins. See Vitamins 
B2-microglobulin, 996-997 
Bacillus amyloliquefaciens, 259 
Bacillus anthracis, 839 
Bacillus subtilis, 867 
Backbone models, 61—62, 62f 
Backbones 
in nucleic acids, 106-107, 106f 
in polypeptide chains, 36, 36f 
Bacteria, 3, 3f. See also Escherichia coli 
(E. coli) 
carcinogen action on, 852, 852f 
cell membranes of, 359, 359f 
defined, 3 
flagella movement of, 1026-1029 
gene expression in, 925-938 
glyoxylate cycle in, 516-517, 517f 
nitrogen fixation in, 714-718 
photosynthetic, 584, 585t 
rapid growth, 128 
restriction enzymes in, 137 
transcription factors in, 945 
transcription in, 861-871 
translation in, 906 
upstream promoter elements in, 864 
Bacterial artificial chromosomes 
(BACs), 147 
Bacterial reaction center 
cyclic electron flow in, 572, 572f 
defined, 569-570 
electron chain in, 570, 570f 
Bacteriochlorophylls, 570 
Bacteriopheophytins, 570-571 
Bacteriorhodopsin, 351-352, 351f 
Baculovirus, 160 
BAL (British anti-lewisite), 516 
Ball-and-chain model, 384—385, 385f 
Ball-and-stick models, 22—23, 23f, 61, 61f 
Band centrifugation, 77—78, 77f 
Basal transcription apparatus, 875 
Base-exchange reactions, 771 
Base-excision repair, 848, 849f 
Base-pair substitutions, tests for, 852 
Base-paired hairpins, 123 
Bases/base pairs 
adenine, 4, 5f 
anticodon, 898, 898f 
compositions, 110, 111t 
cytosine, 4, 5f 
damage to, 846-847 
double helix, 10 
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guanine, 4, 5f 
hydrogen bonds, 5 
minor-groove side of, 830 
mutagenic, 846 
nomenclature of, 744 
non-Watson-Crick, 846, 848, 863 
nucleic acids, 109 
in nucleotides, 107-108 
purine, 107, 107 
pyrimidine, 107, 107 
rRNA in, 903 
rules for, 110 
sequence of, 106, 114, 124 
stacking of, 10, 10f, 111, 111f, 116 
thymine, 4, 5f 
in translation, 896, 903 
tRNA, 896 
uracil, 107, 107 
Watson-Crick, 110, 110f, 113, 114f, 898 
Basic-leucine zipper (bZip), 945, 945f 
B-cell receptor, 993, 993f 
Bcr-abl gene, 417, 417f 
Bcr-Abl kinase, 1048, 1054 
B-DNA helix, 111 
Bennett, Claude, 991 
Benson, Andrew, 589 
Benzoate, 697 
Benzoyl CoA, 697 
Berg, Howard, 1027 
Beriberi, 515, 515b 
Beta blockers, 1048-1049, 1050 
B chains, 205, 205f 
8 oxidation, 813, 815, 818 
B pleated sheets 
amino acid residues in, 51—52, 51t 
antiparallel, 42f 
defined, 42 
mixed, structure of, 43f 
parallel, 43f 
protein rich, 43f 
twisted, schematic, 43f 
B strands 
defined, 42 
DNA recognition through, 927, 927f 
of membrane proteins, 352, 352f 
Ramachandran plot for, 42f 
structure of, 42f 
6 turns, 44, 51-52 
B-adrenergic receptor, 399-400, 400f 
B-adrenergic-receptor kinase, 404 
B-chain, of hemoglobin, 191 
B-globin gene 
coding sequences, 127 
structure of, 127, 127f 
transcription and processing of, 
127-128, 128f 
B-hemoglobin, 207-208 
B-lactam ring, 239 
B-oxidation pathway, 650-651 
B-thalassemia, 207, 208 
B-form DNA, 111-112, 111f, 112t 
Bile, 788 
Bile acids, 646, 646f 
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Bile salts 
defined, 788 
synthesis of, 788-789, 789f 
Bilirubin, 737 
Biliverdin, 736 
Biliverdin reductase, 737 
Bimolecular reactions, 226 
Bimolecular sheets 
defined, 348 
formation of, 348-350 
Binding-change mechanism, 544-546, 545f 
Binding energy, 225, 252 
Biochemistry 
defined, 1 
environmental factors influencing, 20 
evolution timeline, 3f 
as evolving science, 1—23 
genome sequencing and, 17-19 
genomic revolution and, 17-22 
unifying concepts, 1-4 
Biofilms, 934—935 
Biological diversity 
biochemical unity underlying, 1-4 
similarity and, 2f 
Biological macromolecules, 2 
Biological membranes. See Membranes 
Biopterin, 703 
Biotin 
defined, 479 
in gluconeogenesis, 479-480, 479f 
Biotin carboxyl carrier protein (BCCP), 
479, 479£ 
1,3-bisphosphoglycerate (1,3-BPG) 
as acyl phosphate, 457 
ATP formation and, 459-460 
in glycolysis, 433, 457-460, 459f 
oxidation of energy, 433 
phosphoryl-transfer potential, 430 
2,3-bisphosphoglycerate (2,3-BPG) 
allosteric effector, 200-201 
binding of, 201, 201f 
defined, 200 
in oxygen affinity determination, 
200-201, 201f 
Bitter taste, 966, 967-968 
Bjorkman, Pamela, 996 
Blackburn, Elizabeth, 845 
BLAST (Basic Local Alignment Search Tool) 
search, 177, 178f 
Blood, isozymes in, 293 
Blood clotting, 286, 299 
clots, 307-308 
enzymatic cascades, 303-304, 303f 
extrinsic pathway of, 304 
hemophilia and, 308 
intrinsic pathway of, 304 
regulation of, 307-308 
vitamin K in, 304, 306-307 
zymogen activation in, 303-304 
Blood groups, 331-332, 331f 
Blood-brain barrier, 1037 
Blosum-62 
alignment of identities only versus, 176f 


defined, 174 
graphic view of, 175f 
Blotting techniques, 136, 138 
Boat form, 320-321, 321f 
Body mass index (BMJ), 802, 803f 
Body weight, regulation of, 802-804 
Bohr, Christian, 202 
Bohr effect, 202-203, 203f 
Boltzmann’s constant, 222 
Bombardment-mediated transformation, 163 
Bonds 
axial, 320-321 
covalent, 5, 6—7 
disulfide, 36-37, 37£ 
electrostatic interactions, 7 
equatorial, 320-321 
glycosidic, 322, 322f, 325, 325f, 326 
hydrogen. See hydrogen bonds 
hydrophobic effect and, 9, 9f 
hydrophobic interactions and, 9 
ionic interactions, 7—8, 10f 
isopeptide, 683 
length measurement, 7b 
length of, 7b 
noncovalent, 7—8 
peptide. See peptide bonds 
units of measure for, 7b 
van der Waals interactions, 7, 8, 8f, 10, 
348, 990 
Bonitus, Jacob, 515b 
Bordetella pertussis, 418 
Bortezomib, 687 
Bovine-spongiform encephalopathy, 56 
Brain 
in caloric homeostasis, 804—807 
leptin effects in, 806-807, 806f 
role in caloric homeostasis, 804-807 
sensory connections to, 962, 962f 
Branch points, 880, 881f 
Branched-chain amino acids, 687-689 
Branched-chain ketoaciduria, 705 
Branching enzyme, 631-632, 632f 
Brittle bone disease, 46 
Bromodomains, 953, 953 
Brown, Michael, 784 
Brown adipose tissue (BAT), 555, 555f 
Brown fat mitochondria, 555 
Buchner, Edward, 450 
Buck, Linda, 963 
Buffers 
action, 15f 
pH and, 15-17 
protonation, 16f 


C genes, 991-992 
C4 pathway, 599-601, 600f 
Cz plants, 601 
C4 plants, 601 
CAAT box, 874, 874£ 
Caenorhabditis elegans, 66, 956 
genome of, 153 
RNA interference in, 163 
Calcineurin, 994 


Calcitriol, 795 
Calcium ion channels, 382 
Calcium ion pump, 372-374, 372f 
Calcium ions 
calmodulin activation, 407 
coordination to multiple ligands, 
406, 406f 
imaging, 406, 406f 
in phosphorylase activation, 626, 627f 
pump structure, 371 
pumping, 372, 372f 
as second messenger, 398, 399, 405-406 
in signal transduction, 405-407 
in vision, 972—973 
Calmodulin 
defined, 626 
in mitochondrial biogenesis, 813 
in signal transduction, 407, 407f 
Calmodulin-dependent protein kinases 
(CaM kinases), 407, 407f 
Calnexin, 335 
Caloric homeostasis, 802-807 
in body weight regulation, 802-804 
brain role in, 804—807 
defined, 801 
in diabetes, 807-812 
dieting and, 807 
evolution and, 802-804 
insulin in, 805-806 
leptin in, 805-807 
obesity and, 801-812 
satiation signals in, 804-805, 804f 
signaling in, 804-807 
suppressors of cytokine signaling (SOCS) 
in, 806-807, 807£ 
Calreticulin, 335 
Calvin, Melvin, 589 
Calvin cycle, 589-601 
ATP (adenosine triphosphate) in, 597 
C4 pathway and, 599-601, 600f 
carbon dioxide in, 591-598, 599-601 
defined, 566, 589 
five-carbon sugar formation in, 595, 596f 
glycolate in, 593-594, 593f 
hexose sugar formation in, 594-596, 595f 
illustrated, 590f 
light regulation of, 598, 598f 
magnesium ion and, 592, 592f, 598-599 
NADPH in, 589-590, 597 
overview of, 589-590 
oxygenase reaction in, 593-594, 593f 
pentose phosphate pathway and, 610 
3-phosphoglycerate formation in, 
591-592, 592f 
photorespiration in, 594, 594f 
reactions in, 590-597, 597£ 
regulation of, 598-601 
ribulose 1,5-biphosphate in, 591-592, 
591f, 592f, 592 
rubisco in, 591-594, 591f, 592f, 593f, 
598-599 
salvage pathway in, 593-594 
stages, 590-591, 590f 


thioredoxin in, 599, 599t, 599 
cAMP. See Cyclic AMP 
cAMP response element binding (CREB) 
protein, 403 
Camptothecin, 839 
Cancer 
aerobic glycolysis in, 474-476 
breast, 793 
citric acid cycle defects in, 513-514 
colorectal, 851, 851f, 1006 
defective DNA repair in, 850-852 
drug resistance in, 1053-1054 
fatty acid metabolism in, 667-668 
hypoxia inducible factor 1 (HIF-1) in, 513 
immune system in, 1005-1006 
in Li-Fraumeni syndrome, 851 
ovarian, 793 
pyruvate dehydrogenase kinase (PDK) 
in, 513 
signal transduction abnormalities in, 
416-417, 416f, 417£ 
skin, in xeroderma pigmentosum, 
851, 851f 
tumor hypoxia in, 475-476, 476t 
tumor suppressor genes and, 851 
tumor visualization, 474, 475f 
Warburg effect in, 474 
Cancer therapy, 759-760 
Capillary electrophoresis, 139 
Capsaicin, 977-978 
Capsaicin receptor, 977-978, 977f, 978f 
Captopril, 263 
Carbamate 
in Calvin cycle, 592 
formation of, 204, 204f 
Carbamate groups, 204f 
Carbamoy] phosphate 
formation of, 693-694 
in pyrimidine synthesis, 745 
reaction, 694-695 
in urea cycle, 693-695 
Carbamoy] phosphate synthetase I, 693-694 
Carbamoyl phosphate synthetase IT, 745 
in pyrimidine synthesis, 745 
structure of, 745, 745 
Carbanion, 606 
Carbanion intermediates, 607, 607f 
Carbohydrate-asparagine adduct, 57 
Carbohydrates, 315-338 
as building block, 20 
disaccharides, 323-324 
glucose generation from, 450 
isomeric forms of, 317f 
lectins and, 333-336, 334f 
monosaccharide, 316-323 
N-linked, 326, 326f 
oligosaccharides, 323, 331, 331f 
O-linked, 326 
overview of, 315-316 
polysaccharides, 324, 325f 
Carbon dioxide 
binding site, 266, 266f 
in Calvin cycle, 591-598, 599-601 


defined, 203 
hydration acceleration, 264 
hydration of, 268, 268f 
oxygen release and, 202-204 
pH and, 203, 203f, 204 
transport from tissue to lungs, 204, 204f 
Carbon fuels 
free energy of, 432-435, 432f 
prominent, 432, 432f 
Carbon monoxide 
defined, 201 
in oxygen transport disruption, 201-202 
Carbonic acid, 203 
Carbonic anhydrases, 204 
bound zinc in, 265 
in carbon dioxide hydration 
acceleration, 264 
catalytic strategies of, 251 
defined, 264 
evolution of, 268 
mechanism of, 266, 266f 
pH effect on, 265, 265f 
proton shuttle, 267-268, 268f 
structure of, 265, 265 
synthetic analog model system for, 
266-267, 267f 
Carbonium ion intermediate, 621 
Carbon-nitrogen bond, 253 
Carboxamides, 33 
y-carboxyglutamate, 57-58, 306-307, 306f 
Carboxyhemoglobin, 201 
Carboxylation, of pyruvate, 479-480, 514 
Carboxyl-terminal domain (CTD) 
in coupling transcription to 
pre-mRNA, 884f 
phosphorylation of, 875 
serine residues of, 872 
Carcinoembryonic antigen (CEA), 1006 
Carcinogens, tests for, 852, 852f 
Cardiotonic steroids, 373 
Cargo receptor, 919 
Carnitine, 649-650, 650f 
Carnitine acyltransferase I, 650 
Carnitine acyltransferase II, 650 
Carotenoids, 581-582 
Carriers, 367 
Cartilage, 328-329, 329f 
Caspases, 558 
Cassette mutagenesis, 151, 151f 
Catabolism 
ATP generation by, 432 
defined, 424, 714b 
energy from, 432-435 
free energy of, 431, 432-435, 435f 
stages of, 434-435, 434f 
Catabolite activator protein (CAP), 931-932, 
931f, 932f 
Catabolite repression, 931-932 
Catalysis 
by approximation, 252 
by ATCase, 285 
Circe effect in, 231, 231b 
covalent, 252, 253-255, 254f, 255f 
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efficiency of, 230-231, 231f 
enzyme-substrate binding in, 225 
enzyme-substrate complex formation in, 
222-223, 222f 
general acid-base, 252 
maximal rate, 228-229, 230 
metal ion, 252-253 
as selective binding of transition 
state, 241 
specificity constant, 230-231, 231f 
transition state formation and, 222 
velocity of, 231 
zinc activation of water molecule, 
265-267, 266f 
Catalytic antibodies, production of, 241 
Catalytic strategies, 251-283 
of carbonic anhydrases, 251, 264-269 
in chymotrypsin, 253-255, 254f, 255f 
of myosins, 251-252, 275-281 
overview of, 251-252 
of restriction endonucleases, 251, 269-275 
of serine proteases, 251, 255-258 
Catalytic triads 
in chymotrypsin, 255-258, 256f 
in elastase, 258, 259f 
in hydrolytic enzymes, 258-260, 258f, 259f 
serine in, 255-258 
site-directed mutagenesis and, 260, 260f 
of subtilisin, 259, 259f 
in trypsin, 258, 258f, 259f 
Cataract formation, 469, 469f 
Cataracts, 469, 469f 
CD3, 999, 999f 
CD4, 1001, 1001f, 1004-1005 
CD8, 998-1000, 999f, 1004-1005 
CD28, 1000 
CD45, 1000 
cDNA library, 149 
CDP-ethanolamine, 770, 770 
Cech, Thomas, 859 
Celecoxib (Celebrex), 1048, 1048 
Cell-mediated immunity, 995 
Cells 
defined, 2 
hybridoma, 81-82 
motion within, 1011, 1012f 
Cell-to-cell channels, 389-390, 389f, 390f 
Cellular immune response, 984 
Cellular respiration 
citric acid cycle in, 496-497, 496f 
enzymes in, 217 
in oxidative phosphorylation, 552-558 
Cellulose, 324-325 
Cell-wall biosynthesis, 1042, 1042f 
Central dogma, 105 
Centrifugation 
differential, 68, 68f 
sedimentation coefficients in, 76—77, 
76t, 77£ 
ultracentrifugation, 76—78, 77f, 287, 288f 
zonal, 77, 77£ 
Cephalopods, 329 
Ceramidase, 775 
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Ceramide 

as precursor, 774 

synthesis of, 772, 772f 

in tumor growth, 774-775 
Cerebroside, 346, 346, 772 
cGMP phosphodiesterase, 972 
cGMP-gated ion channels, 972, 973 
Chain, Ernest, 1041 
Chair form, 320-321, 321f 
Chanarin-Dorfman syndrome, 648 
Changeux, Jean-Pierre, 198 
Chaperones, 50 
Charcot, Jean-Martin, 136 
Charcot-Marie-Tooth disease, 1022 
Chargaff, Erwin, 110 
Charged tRNA, 898 
Chemical shifts, 98 
Chemiosmotic hypothesis, 542-543, 

542, 543f 

Chemoattractants, 1028, 1029 
Chemokines, 55 
Chemorepellant, 1028 
Chemotaxis, 1028-1029, 1029f 
Cheng, Yung-Chi, 1035 
Cheng-Prusoff equation, 1036 
CheY, 1028 
CheZ, 1028 
Chirality 

of amino acids, 721, 721f 

establishment of, 717, 718f 
Chitin, 329, 329f 
Chlamydomonas, 887 
Chloramphenicol, 914, 914t 
Chlorobium thiosulfatophilum, 584 
Chlorophyll a, 569, 569f, 582f 
Chlorophyll b, 581-582, 582f 
Chlorophylls 

defined, 568 

in photosynthesis, 568-572 

phytol, 569 

structure of, 568-569, 568f 
Chlorophylls, light absorption by, 569, 569f 
Chloroplasts 

ATP synthase, 578-580 

defined, 567 

electron micrograph of, 567f 

evolution of, 568 

genome of, 568 

in photosynthesis, 567-568 

reaction center, 569-570, 570f 

redox conditions, 579-580 

structure of, 567, 567£ 
Chlorpromazine, 1042, 1042 
Cholecystitis, 788 
Cholecystokinin (CKK), 804-805 
Cholera, 417-418 
Choleragen, 417 
Cholesterol 

“bad,” 787 

carbon numbering, 790, 790f 

discovery of, 1043-1044 

excess, effects of, 785, 785f 

fatty acid chains and, 358, 358f 


formation of, 779f 
“good,” 787 
levels, clinical management of, 788 
lipid rafts, 358 
as membrane fluidity regulator, 358 
in membrane lipid, 346 
as precursor for vitamin D, 794-795 
structure of, 346 
Cholesterol synthesis, 776-788 
blocking of, 788 
condensation mechanism of, 778, 778f 
defined, 776, 776b 
isopentenyl pyrophosphate in, 
776-777, 777£ 
labeling of, 776f 
liver in, 779 
metabolism, 784 
mevalonate in, 776-777, 777£ 
rate of, 779-781 
regulation of, 779-788 
site of, 782, 782f, 784 
squalene in, 777—779, 778f, 779f 
stages of, 776 
sterol regulatory element binding protein 
(SREBP) in, 779-781, 780f, 781f 
Choline, 771 
Chorismate, in amino acid synthesis, 
727-729, 727£ 
Chromatin, 943-944, 948-954 
defined, 943 
in gene expression, 943, 948-954 
higher-order, structure of, 944, 944f 
remodeling of, 948-954, 953f 
structure of, 941, 942t 
Chromatin immunoprecipitation (ChIP), 
948, 948f 
Chromatin-remodeling complexes, 953 
Chromatography 
affinity, 70-71, 70f 
gel-filtration, 69, 69f 
high-performance liquid (HPLC), 71, 71f 
ion-exchange, 69-70, 70f 
Chromogenic substrate, 254, 255f 
Chromophore, 971 
Chromosomes 
bacterial artificial, 147 
defined, 109 
Indian muntjac, 109, 109f 
yeast, 941, 942t 
yeast artificial, 147, 147f 
Chronic myelogenous leukemia (CML), 417 
Chronic obstructive pulmonary disease 
(COPD), 303 
Chylomicron remnants, 782 
Chylomicrons, 646-647, 646f, 782, 
782t, 783 
Chymotrypsin, 181, 182 
active site, 255-256, 256f 
catalysis kinetics, 254-255, 255f 
catalytic mechanism, 254, 254f 
catalytic triad in, 255-258, 256f 
conformations of, 301, 301f 
covalent catalysis in, 253-255, 254f, 255f 


diisopropylphosphofluoridate (DIPF) 
and, 254, 254f 
elastase and, 258, 259f 
natural substrate for, 237 
oxyanion hole, 257, 257f 
peptide hydrolysis by, 256, 257f 
S; pockets of, 259f 
serine residue, 253-254 
specificity of, 254f 
specificity pocket of, 258, 258f 
structure of, 255-256 
substrate preferences of, 230-231, 231t 
trypsin and, 258, 258f, 259f 
Chymotrypsinogen, 255 
conformations of, 301, 301f 
defined, 299-300 
proteolytic activation of, 300, 300f 
Cilia, microtubules in, 1023 
Ciprofloxacin, 839 
Circe effects, 231, 231b 
Circular DNA, 113, 113f 
Cirrhosis, 821 
Cis configuration, peptide bonds, 38, 38t, 39t 
Citrate 
in fatty acid metabolism, 666, 671, 672f 
in glyoxylate cycle, 517, 517f 
isomerization of, 504, 504f 
phosphofructokinase inhibition by, 472 
in polymerization, 671 
synthesis of, 513 
transport to cytoplasm, 512 
Citrate synthase 
conformational changes on binding, 
502, 503f 
defined, 502 
mechanism of, 502-503, 503f 
Citric acid cycle, 495-518, 819 
acetyl CoA (acetyl coenzyme A) in, 495, 
498-500, 501-503, 503f, 515 
aconitase in, 504, 504f 
ADP in, 406f, 506 
ATP in, 496-497, 508-510, 513, 514 
biosynthetic roles of, 514-516, 514f 
in cancer, 513-514 
carbon oxidation in, 501—510 
in cellular respiration, 496-497, 496f 
citrate isomerization in, 504, 504f 
citrate synthase in, 502-503, 503f 
citryl CoA in, 502, 503, 503f 
components of, 509t 
control sites of, 512-513, 512f 
defects in, 513-514 
entry to, 510-514 
evolution of, 516 
in exercise, 514, 514f 
glycolysis and, 497-501, 497f 
high-energy electron harvesting in, 
496-497 
illustrated, 509f 
intermediates in, 514 
isocitrate dehydrogenase in, 504, 
512-513, 512f 
isocitrate in, 504—505 


a-ketoglutarate in, 504-505, 505f 
a-ketoglutarate dehydrogenase complex 
in, 505, 512-513, 512f 
ketone bodies in, 819, 819f 
lipoamide in, 500-501, 501f 
in mitochondria, 496, 496f 
overview of, 495-497, 496f 
oxaloacetate in, 496, 502, 507-508 
pathway integration, 514, 514f 
pattern of, 496, 496f 
in photosynthesis. See dark reactions 
pyruvate dehydrogenase in, 511-512 
pyruvate dehydrogenase complex in, 
497-501, 497f, 497t, 501f, 
511-512, 511f 
pyruvate dehydrogenase phosphatase 
(PDP) in, 511 
rate of travel, 512, 512f 
reactions of, 508 
regulation of, 511-513 
replenishment, 514-515 
substrate channeling in, 510 
succinate dehydrogenase in, 507-508 
succinyl CoA in, 505-507, 506f 
succinyl CoA synthetase in, 506-507, 507f 
thiamine pyrophosphate (TPP) in, 
499, 499f 
Citrulline, 694-695 
Citrullinemia, 705t 
Citryl CoA (citryl coenzyme A), 502, 
503, 503f 
Cladribine, 760 
Clamp loaders, 840, 841f 
Class I MHC proteins, 996-997, 996f, 998, 
1002-1003, 1003f 
Class IT MHC proteins, 1000-1003, 1001f 
Class switching, 994-995 
Clathrin, 360, 784 
Clathrin-coated pit, 360 
Cleavage 
DNA. See DNA, cleavage of 
protein, 88-89 


Cleland, W. Wallace, 232 
Clinical trials, 1051-1052, 1052f 
Clofarabine, 760 
Cloning 

of cDNA, 380 


expression, 149 
plasmid vectors in, 144-146 
in recombinant DNA technology, 
147-148 
Cloning vectors, 145 
Closed promoter complex, 865-866, 866f 
Clostridium perfringens, 464 
Clostridium tetani, 1006 
Clotting. See Blood clotting 
Cluster of differentiation, 998 
Coactivators, 950-951 
Coat proteins (COPs), 919 
Coated pits, 784 
Cobalamin, 654-655, 655f. See also 
Vitamin By2 
Cobalt atom, 654-655 


Codon-anticodon interactions, 898 
Codons 
deciphered, 125, 125t 
defined, 21, 106, 123 
maximizing number of, 898 
of mitochondria, 126, 126t 
stop, 910-911, 911f 
as synonyms, 125 
in translation, 897-898, 898f 
Coenzyme A, 437, 437 
Coenzyme By, 654-655, 655f, 656 
Coenzyme Q, 530-531 
Coenzymes 
in amino acid degradation, 682 
defined, 217 
prosthetic groups, 217 
types of, 217t 
vitamin, 438-440, 438t 
Cognate DNA, 269, 272-274, 273f, 
273,274 
Cohesive ends, DNA, 144, 144£ 
Coiled-coil proteins, 44-45, 44f, 44, 45f 
Collagen 
amino acid sequence of, 45, 45f 
ascorbate and, 822 
in cartilage, 328 
defined, 45 
structure of, 45f 
triple helix, 45 
Collagenase, 300 
Color blindness, 974-975, 974£ 
Color vision 
in animals, 974, 974f 
cone receptors, 973-974 
defective, 974-975 
evolution of, 974, 974f 
Combinational association, antibody 
diversity and, 992-993 
Combinatorial chemistry, 185, 1045 
Combinatorial control, 946 
Committed step, 730 
Comparative genomics, 19 
Compartmentalization, 444 
Compartments, drug target, 1037, 1038f 
Competitive inhibition. See also Enzyme 
inhibition 
defined, 234 
dissociation constant, 235 
double-reciprocal plot, 236, 236f 
kinetics of, 235, 235f 
use of, 235 
Complement cascade, 985 
Complementarity-determining regions 
(CDRs), 988-990, 989f 
Complementary DNA (cDNA) 
cloning and sequencing of, 380 
formation of, 149, 149f 
hybridization with, 135 
screening of, 149-150, 150f 
Concerted model 
allosteric enzymes, 290 
defined, 198, 290 
formulation of, 211-212 
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modeling oxygen binding with, 212f 
T and R states, 199f 
tetramers, 199 
Condensing enzyme, 663 
Cones, 970, 973-974 
Conformation selection, 224 
Congenital disorders of glycosylation, 332 
Congenital erythropoietic porphyria, 737 
Congestive heart failure, 373 
Conjugation, in drug metabolism, 1037, 
1038, 1039f 
Connexin, 390 
Connexons, 390 
Conservative substitutions, 174, 175f, 176f 
Controlled termination of replication, 
138-139 
Convergent evolution, 179, 181-182, 181f 
Cooley anemia, 207 
Cooperative binding 
concerted model, 198-199, 199f 
defined, 196 
oxygen delivery and, 196-197, 196f 
sequential model, 199, 199f 
substrates, 233 
Copper ions, in cytochrome c oxidase, 536 
Coproporphyrinogen III, 736, 737f 
Corepressors, 931 
Cori, Carl and Gerty, 637, 638 
Cori cycle, 486, 486f, 493 
Cori disease, 637t 
Corrin ring, 654-655 
Corticosteroids, 792 
Cosmids, 146 
Cotransporters. See Secondary transporters 
Coulomb energy, 7 
Covalent bonds 
in biological molecules, 6—7 
defined, 5 
multiple, 7 
as strongest bonds, 6-7 
Covalent catalysis 
chymotrypsin as example, 253-255, 
254f, 255f 
defined, 252 
Covalent enzyme-bound intermediate, 458 
Covalent modifications 
acetylation, 293-294, 294t 
common, 294t 
dephosphorylation, 293 
in enzyme regulation, 293-298 
irreversible, 294 
phosphorylation, 293, 294t 
COX2 inhibitors, 1047-1048, 1048f 
Crassulacea, 601, 601£ 
Crassulacean acid metabolism (CAM), 601 
Creatine, 232 
Creatine kinase, 431f 
Creatine phosphate, 813-814, 814t 
Creutzfeldt-Jakob disease (CJD), 56 
Crick, Francis, 17, 105, 109, 123, 124, 827 
Critical concentration, 1018 
Cro, in genetic circuit formation, 933, 933f 
Cross talk, 398-399 
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Cross-links, 36-37, 37f 
Cross-peaks, 99 
CTP. See Cytidine triphosphate 
CTP-phosphocholine cytidylyltransferase 
(CCT), 770 
C-type lectins, 334-335, 334f 
Cumulative feedback inhibition, 732 
Cyanobacteria, 568, 568f 
Cyanocobalamin, 655 
Cyclic AMP (cAMP) 
binding to regulatory subunit, 297 
defined, 743 
effects in eukaryotic cells, 297 
in eukaryotic cells, 403 
generation by adenylate cyclase, 402-403 
in glycogen metabolism, 628-629 
phosphorylation stimulation, 403 
as second messenger, 398, 399, 415 
in signal transduction, 402-403 
structure of, 297 
Cyclic GMP (cGMP) 
defined, 743 
as second messenger, 398, 399 
Cyclic nucleotides, 743 
Cyclic photophosphorylation, 580, 580f 
Cyclin-dependent protein kinases, 844 
Cycloheximide, 914, 914t 
Cyclooxygenase inhibitors, development of, 
1047-1048 
Cyclosporin, 994, 994 
Cystathionase, 726 
Cystathionine, 726 
Cystathionine B-synthase, 726 
Cysteine 
linked, 36 
residue modification, 287f 
structure of, 32f, 33 
synthesis of, 722-723, 726 
Cysteine proteases, 261-262, 261f, 261 
Cysteine thiyl radical, 754 
Cystine, 36 
Cytidine, 107 
Cytidine diphosphodiacylglycerol (CDP- 
diacylglycerol), 769, 769 
Cytidine monophosphate (CMP), 769, 769 
Cytidine triphosphate (CTP), 291-292, 
291f, 292f 
ATCase inhibition by, 287, 287f, 291, 758 
defined, 287 
effect on ATCase kinetics, 292, 292f 
in pyrimidine synthesis, 747-748 
T state stabilization, 291, 291f 
Cytidine triphosphate synthetase, 747 
Cytidylate, 108 
Cytochrome 
defined, 534 
in photosynthetic reaction center, 
570, 570£ 
Cytochrome bs, 668, 668f 
Cytochrome bf complex, 572, 575, 575f 
Cytochrome c 
conformation of, 541, 541f 
evolution of, 541, 541f 


in oxidative phosphorylation, 530f, 
533-534 
Cytochrome c oxidase 
defined, 535 
electron flow in, 556 
heme, 536-537 
mechanism, 537-538, 537f 
peroxide bridge, 538, 538f 
proton transport by, 538, 538f 
in respiratory chain, 529-531, 530f, 
530t, 535-538 
structure of, 536, 536f 
subunits, 536 
Cytochrome P450, 1037-1038, 
1038f, 1041 
defined, 790-791 
hydroxylation by, 790-791 
mechanism, 791 
protective function, 791—792 
Cytoglobin, 208 
Cytokines, 1002 
Cytoplasm, 601, 666-667, 666f 
Cytosine, 4, 5f, 110, 110, 850 
Cytoskeleton, 1011, 1022 
Cytotoxic (killer) T cells 
CD8 on, 998-1000 
defined, 984 
foreign peptide recognition, 1001, 1001f 


D amino acids, 29, 29f 
D genes, 992 
Dalgarno, Lynn, 906 
Daltons, 34, 36b 
Dark reactions, 566 
Darst, Seth, 862 
Dawkins, Richard, 53 
De novo pathways. See also Nucleotide 
synthesis 
defined, 744 
purine, 748, 748f 
pyrimidine, 744, 744f 
Deamination 
in amino acid degradation, 692 
example of, 846, 846f 
Death cap, 873f 
Debranching enzyme, 638 
Decarboxylation 
in citric acid cycle, 498 
in gluconeogenesis, 480 
in pentose phosphate pathway, 601 
Decorated actin, 1018 
Dedicated protein kinases, 295 
Deficiency disorders, 20 
Degrons, 684 
Dehydratases, 692 
7-dehydrocholesterol (pro-vitamin D3), 
794-795, 794£ 
Deinococcus radiodurans, 828, 828£ 
Deleterious reactions, 456 
Denisovans, 185, 185f 
Density-gradient equilibrium 
sedimentation, 115, 115f 
Deoxyadenosine, 107 


5’-deoxyadenosy] radical, 655, 655f 
Deoxyadenylate, 107 
Deoxycytidine, 107 
Deoxycytidylate, 107 
Deoxyguanosine, 107 
Deoxyguanosine 3'-monophosphate 
(3'-dGMP), 108, 108 
Deoxyguanylate, 107 
Deoxyhemoglobin 
oP dimers, 195 
carbon dioxide and, 204 
defined, 198 
quaternary structure of, 195f, 195 
Deoxymyoglobin, 192, 193, 193 
Deoxynucleoside 5’-triphosphates, 117 
Deoxyribonucleic acid. See DNA 
Deoxyribonucleoside 3'-phosphoramidites, 
139, 140f, 140 
Deoxyribonucleoside triphosphate (dNTP), 
829-830 
Deoxyribonucleotide synthesis 
control of, 759-760 
deoxyuridylate in, 755-756, 756f 
dihydrofolate reductase in, 756 
overview of, 753 
ribonucleotide reductase in, 753-755, 
755£, 756f, 759, 759f 
thymidylate, 757—758 
thymidylate in, 755-756, 756f 
Deoxyribose, 4, 106, 317, 318f, 320f 
Deoxyribose phosphodiesterase, 849 
Deoxyuridylate (d(UMP), 755-756, 756f 
Dephosphorylation, 293, 296 
Deprotonation 
effect of buffer on, 268, 268f 
water, kinetics of, 267, 267f 
Designer genes, 152 
Desmolase, 792 
Destroying angel, 873 
a-Dextrinase, 450 
Diabetes mellitus 
defined, 808, 808b 
glucose homeostasis in, 808 
incidence of, 808 
insulin in, 808-812 
insulin resistance and, 808-810, 810f 
ketosis in, 659-660, 660f 
obesity and, 807-812 
oxaloacetate in, 657 
type 1, 808, 812, 812f 
type 2, 808, 809-810 
Diacylglycerol (DAG), 398, 399, 647 
in lipid synthesis, 768 
in signal transduction, 405, 405f, 415 
Diacylglycerol kinase, 768 
Dialysis, in protein purification, 69, 69f 
Diastereoisomers, 317, 317f 
Diazotrophic microorganisms, 715 
Dicoumarol, 307 
Dictyostelium discoideum, 276, 629, 1014 
Dicyclohexylcarbodiimide (DCC), 94f, 556 
Diels-Alder reaction, 728 
2,4-dienoyl CoA, 653, 653 


Diet 
fatty acid metabolism and, 672 
ketogenic, 660 
low-carbohydrate, 807 
low-fat, 807 
obesity and, 807 
Differential centrifugation, 68, 68f 
Diffusion 
facilitated, 367, 369 
lateral, 356 
lipid, 356 
membrane protein, 357 
simple, 368 
transverse, 357, 357f 
Diffusion coefficient, 356, 357 
Digestion 
of dietary proteins, 682, 683f 
enzymes in, 299-301, 300f 
of lipids, 645-646, 646f 
proteolytic enzymes in, 299-303 
starved-fed cycle and, 816-817 
Digestive enzymes 
chymotrypsinogen, 299-301, 300f 
synthesis of, 299 
Digitalis, 373 
Digitoxigenin, 373, 373f 
Diglyceride acyltransferase, 768 
Dihydrofolate reductase, 756 
Dihydrolipoyl transacetylase, 499, 500f 
Dihydropteridine reductase, 703 
Dihydrotestosterone (DHT), 793 
Dihydroxyacetone, 316, 317, 317f 
Dihydroxyacetone phosphate (DHAP), 323 
in Calvin cycle, 595 
in glycolysis, 452f, 454-455, 455f, 
456, 456f 
in lipid synthesis, 768, 768f 
Diisopropylphosphofluoridate (DIPF), 237, 
237£, 237, 254, 254f 
Dimerization arm, 412, 412f, 413, 413f 
Dimethylallyl pyrophosphate, 777 
Dimethylbenzimidazole, 655 
2,4-dinitrophenol (DNP), 556-557 
Dioxygenases, 704—705, 821 
Diphosphatidylglycerol (cardiolipin), 
770, 770 
Diphthamide, 915, 915f 
Diphtheria toxin, 914-915, 915f 
Dipolar ions, 29-30, 30f 
Direct DNA repair, 848 
Disaccharides 
defined, 323 
structure of, 323-324, 324f 
types of, 323-324, 323f, 324f 
Diseases and disorders 
albinism, 705t 
alcaptonuria, 705 
alcohol-related, 821-822 
Alzheimer’s disease, 56—57 
amyloidoses, 56-57 
amyotrophic lateral sclerosis (ALS), 143 
Anderson disease, 637t 
anemia, 327 


Angelman syndrome, 685 

anthrax, 839 

argininosuccinase deficiency, 697, 697f 

arsenite poisoning, 516, 516f 

atherosclerosis, 661, 784—785, 787-788 

autoimmune, 1005, 1005f 

beriberi, 515, 515b 

bovine-spongiform encephalopathy, 56 

cancer. See cancer 

carbamoyl phosphate synthetase 
deficiency, 697-698, 698f 

carnitine deficiency, 650 

cataracts, 469, 469f 

Charcot-Marie- Tooth disease, 1022 

cholera, 417-418 

citrullinemia, 705t 

congenital disorders of glycosylation, 332 

congestive heart failure, 373 

Cooley anemia, 207 

Cori disease, 637t 

coronary artery disease, 785 

Creutzfeldt-Jakob disease (CJD), 56 

deficiency disorders, 20 

diabetes. See diabetes mellitus 

drug-resistant, 1053-1054 

emphysema, 303 

environmental factors in, 20 

falciparum malaria, 612 

familial hypercholesterolemia, 767 

galactosemia, 468 

glucose 6-phosphate dehydrogenase 
deficiency, 610-612 

glycogen-storage, 637-638, 637t, 638f 

gout, 761 

Hartnup disease, 682 

heart disease, 788 

hemolytic anemia, 610-612 

hemophilia, 308 

Hers disease, 637t 

HIV infection. See human 
immunodeficiency virus (HIV) 
infection 

homocystinuria, 705t 

Huntington disease, 56, 683, 850 

Hurler disease, 328f, 332 

hyperammonemia, 697 

hyperlysinemia, 705t 

T-cell disease, 332 

lactose intolerance, 467-468 

Leber hereditary optic neuropathy 
(LHON), 557 

Lesch-Nyhan syndrome, 761-762 

Li-Fraumeni syndrome, 851 

long QT syndrome (LQTS), 388-389 

Lynch syndrome, 851, 851f 

mad cow disease, 56 

malaria, 206 

maple-syrup urine disease, 705 

McArdle disease, 637t, 638 

mercury poisoning, 515-516 

mitochondrial, 557 

mucopolysaccharidoses, 328 

multidrug resistance in, 374-376 
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mutations causing, 143 
neurological, protein misfolding in, 
56-57 
ornithine transcarbamoylase deficiency, 
697-698, 698f 
osteoarthritis, 328 
osteogenesis imperfecta, 46 
osteomalacia, 795 
Parkinson disease, 56—57, 683, 685 
phenylketonuria, 705 
Pompe disease, 637-638, 637t, 638f 
porphyrias, 737-738 
predisposition to, 18 
prion, 56-57, 57f 
protein aggregates in, 56-57 
respiratory distress syndrome, 774 
retinitis pigmentosa, 884-885 
ricin poisoning, 915 
rickets, 795 
scrapie, 56 
scurvy, 821 
severe combined immunodeficiency 
(SCID), 164, 760-761 
sickle-cell anemia, 205—206, 205f, 206f 
spina bifida, 762 
Tay-Sachs disease, 774 
thalassemia, 884, 884f 
transmissible spongiform 
encephalopathies, 56-57 
tuberculosis, 687 
tyrosinemia, 705t 
vanishing white matter (VWM) disease, 
913, 913f 
vitamin D deficiency, 795 
von Gierke disease, 637, 637t, 638 
Wernicke-Korsakoff syndrome, 821 
whooping cough, 417-418 
xeroderma pigmentosum, 851, 851t 
Zellweger syndrome, 657 
Dismutation, 539b 
Displacement loop (D-loop), 853 
Dissociation constant (Kg), 229, 235, 
1017-1018 
Distal histidine, 194 
Distribution, drug, 1037, 1037f, 1038f 
Disulfide bonds 
formation of, 36-37, 37f 
reduction, 89, 90f 
Diuron, 584, 584 
Divergent evolution, 181 
DNA 
A-form, 111-112, 111f, 112f, 112t 
ancient, amplification and sequencing of, 
185, 185f 
backbone of, 106f 
bases in, 4, 4, 10, 10f, 108 
B-form, 111-112, 111f, 112t 
building blocks, 4—5 
chemically synthesized linker, 144 
circular, 113, 113f 
coding strand of, 862 
cognate, 269, 272-274, 273f, 273 
cohesive ends, 144, 144f 
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DNA—(continued) 

complementary (cDNA), 135, 149-150, 
149f, 150f, 380 

complementary single-stranded ends, 144 

covalent structure, 4, 4f 

damage to, 828, 845, 847-849 

defined, 105 

denaturation, 14-15, 14f 

directionality of, 108-109 

distortion of, 273-274, 273f 

double helix of, 5, 6, 9-10, 14-15, 105, 
109-111 

double-stranded, 121, 152-153 

electroporation and, 163, 164f 

elongation, 118, 118f 

exchange between species, 184, 184f 

fingerprint, 137 

forensics and, 142, 143f 

functions of, 2 

heredity and, 5 

hybridization, 117 

information flow to RNA, 119, 119f 

ionic interactions, 10f 

length of, 108-109 

as linear polymer, 4, 105, 106 

linking number of, 834-836, 834f 

melting temperature of, 116 

methylation, 274-275 

microinjection of, 159-160, 159f 

nucleotides, 107 

promoter sites in, 122, 122b, 122f, 860 

proofreading of, 847, 848f 

properties of, 4—5 

proviral, 159 

recombinant. See recombinant DNA 
technology 

resolution by density-gradient, 115, 115f 

stability of, 142 

stem-loop structure, 113, 113f 

in storage of genetic information, 4, 5 

structure of, 108, 111f, 113-114, 113f 

sugars, 4 

supercoiled, 113, 113f, 833-839 

synthesis of, 139-141, 140f, 141 

topoisomers of, 835-836, 836f 

transferred (T-DNA), 163 

unwinding of, 833-839, 860, 865-866, 866f 

Watson-Crick model, 111, 116 

winding, 944 

wrapping of, 943-944 

X-ray diffraction of, 109f 

Z-form, 112, 112f, 112t 

DNA, cleavage of, 144, 144f 

EcoRV endonuclease, 269 

metal ion catalysis in, 271-272, 272f 

restriction enzymes and, 269-275 

stereochemistry, 271, 271f 

DNA fragments 

covalent insertion of, 143-144 

insertion and replication of, 145 

insertional inactivation of, 145, 145f 

joined by DNA ligases, 144 

library of, 147-148, 148f 
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NA gyrase, 839 

NA ligase, 135, 849 

defined, 832 

DNA fragments joined by, 144 

in DNA replication, 832, 844 

in forming recombinant DNA molecules, 
143-144 

reaction, 832, 832f 

NA microarrays 

defined, 158 

gene-expression analysis with, 159, 159f 

using, 158-159, 158f 

NA photolyase, 848 

NA polymerase a, 843, 844, 844t 

NA polymerase B, 844t 

NA polymerase 8, 844, 844t 

NA polymerase I, 217, 829, 829, 842, 844t 

NA polymerase II, 844t 

NA polymerase III, 831-832, 840-842, 

840f, 844t 

NA polymerase switching, 843-844 

NA polymerases 

common structural features, 829 

in correcting DNA mistakes, 118 

defined, 117, 829 

DNA fragment of, 830 

error-prone, 846 

eukaryotic, 843-844, 844t 

holoenzyme, 840, 841f, 842-843 

mechanism, 829-830, 830f 

phosphodiester-bridge formation and, 
117-118 

polymerization reaction catalyzed by, 
117, 117£ 

primer, 118, 829 

prokaryotic, 844t 

replication by, 117-119 

specificity of, 830 

structure of, 829, 829 

template, 829 

as template-directed enzymes, 117, 829 

types of, 844t 

NA probes 

amino acid sequences in making, 91 

defined, 138 

generated from protein sequences, 
147, 147£ 

as primer, 140 

synthesis by automated solid-phase 
methods, 139-141 

NA recombination 

in cell division, 855 

defined, 852 

functions of, 853 

Holliday junctions in, 854, 854f 

illustrated, 853f 

initiation of, 853 

mechanism, 854, 854f 

overview of, 828 

RecA in, 853 

recombinases in, 854 

strand invasion in, 853, 853f 

V(D)J, 992, 992£ 


VJ, 991-992, 991f 
DNA recombination synapse, 854 
DNA repair 
base-excision, 848, 849f 
direct, 848 
double-stand, 849 
in E. coli, 847-849 
enzyme complexes in, 848-849, 848f 
example of, 846, 846f 
illustrated, 828f 
ligase in, 849 
mismatch, 848, 848f 
nonhomologous end joining (NHEJ) 
in, 849 
overview of, 828, 845 
proofreading in, 847, 848f 
single-strand, 839-841 
thymine in, 850-851 
trinucleotide repeats and, 850 
uracil DNA glycosylase in, 850, 850f 
DNA replication, 5f, 115f, 116f, 828-845 
ATP hydrolysis in, 832-833, 833f 
cell cycle and, 844, 844f 
clamp loaders in, 840, 841f 
controlled termination of, 138-139, 139f 
coordinated process in, 839-845 
cross-linking in, 847, 847f 
defective, in cancer, 850-852 
defined, 109 
DNA polymerase ITI in, 839, 
840-842, 840f 
DNA polymerases in, 829-830, 
842-843, 844t 
in E. coli, 115, 115f, 839-840, 
842-843, 842f 
error rate, 118 
errors in, 846 
in eukaryotes, 843-844, 843f, 844f 
in eukaryotic cell cycle, 844, 844t 
helicases in, 832-833, 833f 
illustrated, 828f 
lagging strand in, 831f, 832, 
840-842, 841f 
leading strand in, 831f, 832, 
840-842, 841f 
licensing factors in, 843 
ligase in, 832, 832f, 844 
minor-groove interactions in, 830, 830f 
Okazaki fragments in, 831-832, 831f, 841 
origin of, 842-843, 842f, 843f 
origin of replication complexes (ORCs), 
843-844 
overview of, 827 
polymerase switching in, 843-844 
by polymerases, 117-119 
prepriming complex in, 842, 842f 
primer in, 831, 831f 
processivity in, 839-840, 839f 
replication fork in, 831, 841, 841f 
replicon, 843 
RNA polymerases in, 831, 831f 
semiconservative, 115-116 
shape complementarity in, 830, 830f 


shape selectivity in, 830, 831f 
sites of, 842-844 
sliding DNA clamp in, 839-840, 839f 
specificity of, 830 
telomeres in, 844-845, 845f 
template in, 828-829 
trombone model in, 841, 841f 
tumor-suppressor genes in, 851 
DNA sequencing, 89-90 
amino acid sequences and, 90f 
amplification by PCR, 141-142 
ancient DNA, 185, 185f 
by controlled termination of replication, 
138-139, 139f 
decreasing costs of, 18f 
defined, 135 
as gene exploration tool, 136 
human migrations and, 19f 
methods for, 18 
next-generation (NGS), 155-156, 155f 
pyrosequencing, 155-156 
reversible terminator method, 155-156 
semiconductor, 156 
DNA vectors, 143, 144-147 
DnaA, 842, 842f 
DnaB, 840 
DNA-binding domains 
defined, 945 
in eukaryotes, 945-946 
DNA-binding proteins 
basic-leucine zipper (bZip) in, 945, 945f 
binding to regulatory sites in operons, 
927-932 
in eukaryotes, 945-946 
helix-turn-helix motif in, 927, 927f 
homeodomains in, 945, 945f 
in prokaryotes, 926-932 
symmetry matching in, 926-927 
transcription inhibition by, 928-931 
transcription simulation by, 931-932 
zinc-finger domains in, 945-946, 946f 
DNA-binding sites, 931, 931f 
evolution of, 931 
GAL4, 948, 948f 
hypersensitive, 948 
Domains 
acetyllysine-binding, 953, 953f 
defined, 3 
DNA-binding, 926-932, 945-946 
homeodomains, 945, 945f 
immunoglobulin, 986, 988-989 
nuclear-hormone receptor, 950, 950f 
protein, 48, 48f 
Dopamine D2 receptors, 1042 
Double helix, 5, 6 
acid-base reactions and, 14-15 
base pairs, 10 
destabilization, 14-15 
DNA molecule form, 105 
as expression of rules of chemistry, 9-10 
left-handed, 112 
minor groove, 875 
reversibly melted, 116-117 


right-handed, 112, 835 

sequence of bases, 114 

stabilization of, 109-111 

structure, 5,5 

topoisomerases and, 836-839, 837f, 838f 

in transcription, 875, 875f 

in transmission of hereditary information, 
114-117 

unwinding of, 836 


Double helix formation 


from component strands, 6 
entropy and, 13f 

heat release in, 12-13 
illustrated, 6f 

principles of, 10 
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structure-based, 1046-1048, 
1047f, 1048f 
Drug targets, 1033, 1034f 
animal-model testing of, 1049 
compartments, 1037, 1038f 
emerging, 1050, 1050f 
identified in genomes of pathogens, 1050 
identified in human proteome, 1048-1049 
Drugs 
absorption of, 1036-1037, 1037f 
administration routes for, 1036-1037 
delivery with liposomes, 349 
discovery approaches, 1033-1034, 1034f 
distribution of, 1037, 1037f, 1038f 
excretion of, 1039-1040, 1040f 


Double-blind studies, 1052 

Double-displacement reactions, 232, 233 

Double-reciprocal plot, 229, 229f 

Double-stranded DNA, 121, 152-153 

Double-stranded RNA, 162, 162f 

Downstream core promoter element 
(DPE), 874 

Dreyer, William, 991 

Drosophila melanogaster 


half-life of, 1040, 1040f 

metabolism of, 1037-1039 

natural products in, 1043-1044 

resistance to, 1053-1054 

response to genetic variation in, 
1050-1051 

side effects of, 1035 

targets of, 1033, 1034f 

therapeutic index of, 1041 


bases and genes, 66 
fluorescence micrograph of, 82f 
in genetic studies, 847 

genome of, 154, 982 

sensory bristles in, 976 

Toll receptor in, 982 


Drug candidates 


absorption of, 1036-1037, 1037f 
ADME properties of, 1036-1040, 1036f 
administration routes for, 1036-1037 
characteristics of, 1034-1041 
distribution of, 1037, 1037f, 1038f 
effective concentrations of, 1035, 1035f 
excretion of, 1039-1040 

ligand binding in, 1035-1036, 1035f 
metabolism of, 1037-1039 

number of, 1045-1046 

oral bioavailability of, 1036, 1037f 
potency of, 1035-1036 

side effects of, 1035 

target compartments of, 1037, 1038f 
therapeutic index of, 1041 


Drug development, 1033-1055 


animal testing in, 1044 

challenges of, 1034-1041 

clinical trials in, 1051-1052, 1052f 

combinatorial chemistry in, 1045 

genetic differences and, 1050-1051 

genomics in, 1048-1051 

high-throughput screening in, 1045 

natural products in, 1043-1044 

overview of, 1033-1034 

pharmacogenetics/pharmacogenomics 
in, 1050 

phases of, 1051-1052 

screening libraries in, 1044-1046, 1046f 

serendipitous observations in, 1041-1043 

7TM receptors in, 1048 

split-pool synthesis in, 1045, 1045f 


toxicity of, 1040-1041, 1040f 
Dynamic instability, 1023 
Dynein 

AAA ATPases in, 1012, 1014 

ATP binding to, 1014-1016 

structure of, 1014, 1015f 


E. coli. See Escherichia coli 
E site, ribosomal, 905, 905f 
Ear, hair cells of, 975-976, 975f, 976f 
EcoRV endonuclease 
cleavage of DNA, 269, 271-272, 271f 
magnesium ion-binding site in, 
271-272, 272f 
nonspecific and cognate DNA with, 274 
recognition site structure, 272, 272f 
twofold rotational symmetry, 272, 272f 
Edman degradation, 87, 87f 
Effector functions, 895 
EF-hand protein family, 407, 407f 
Eicosanoids, 669, 669f, 670f, 670 
Elastase 
blocking of, 303 
catalytic triad in, 258, 259f 
5; pockets of, 259f 
Electrochemical potential. See Membrane 
potential 
Electron transfer 
half-reactions in, 528-529 
inhibitors, 556-557, 556f 
in oxidative phosphorylation, 526-529, 
540-541, 540f 
in photosynthesis, 568-572 
rate of, 540, 540f 
uncoupling, 556-557 
Electron transport chain, 497 
Electron-density maps, 96-97, 96f, 97f 
Electron-proton transfer reactions, 533, 533f 
Electron-transferring flavoprotein (ETF), 651 
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Electron-transport chain 
components of, 530f, 530t 
in desaturation of fatty acids, 
668-669, 668f 
electron flow through, 538, 539f 
FAD (flavin adenine dinucleotide) and, 
497, 510 
FADH) (flavin adenine dinucleotide 
reduced) and, 526, 527 
inhibition of, 556 
iron-sulfur clusters in, 531-532, 532f 
NADH (nicotinamide adenine 
dinucleotide reduced) and, 497, 510, 
526, 528-529 
in oxidative phosphorylation, 523, 
529-541 
Electron-transport potential, 526 
Electrophilic catalyst, 691 
Electrophoresis 
capillary, 139 
gel, 71-73, 72f, 73f, 137-138, 138f, 
350-351 
SDS, 72-73, 73 
two-dimensional, 74—75, 74f 
Electroporation, 163, 164f 
Electrospray ionization (ESI), 85 
Electrostatic repulsion, 429 
Elongation factors 
bacterial versus eukaryotic, 913 
G (EF-G), 909-910, 910f 
Ts (EF-Ts), 907-908 
Tu (EF-Tu), 907, 907f, 908 
Embden-Meyerhof pathway, 450 
Emphysema, 303 
Enantiomers, 317, 317f 
Encoding of amino acids, 124-127 
Endocytosis, 784 
Endoplasmic reticulum (ER), 360 
protein glycosylation in, 330, 330f 
protein synthesis in, 915-919 
protein targeting from, 915-919 
ribosome binding to, 915-919, 916f 
rough, 916 
smooth, 916 
Endoplasmic reticulum (ER) stress, 811 
Endoplasmic reticulum membrane, 668 
Endosomes 
defined, 361, 784 
LDL release in, 786, 786f 
Endosymbiosis 
mitochondria evolution from, 360 
receptor-mediated, 360-361, 360f 
Endosymbiotic event, 525-526 
Enediol intermediate, in glycolysis, 
455, 456f 
Energy. See also Thermodynamics 
activation, 221-222 
binding, 225, 252 
free. See free energy 
kinetic, 11 
light, 584-585, 970 
potential, 11 
resting human being requirement, 548b 


status, 444 
total, 11 
units of, 219b 
Energy charge, 444, 444f, 482-483 
intracellular, 625 
in regulation, 554, 554f 
Energy coupling agent, 428 
Energy homeostasis. See Caloric 
homeostasis 
Energy transfer 
from accessory pigments to reaction 
centers, 583, 583f 
resonance, 582-583, 582f 
Enhancers 
binding sites for, 947f 
defined, 876, 946-947 
experimental demonstration of, 947f 
in gene expression, 946-947 
in transcription, 876 
Enol CoA hydratase, 651-652, 652f 
Enol intermediates, 502 
Enol phosphate, 461 
Enolase, 461 
Enoy] reductase, 664 
Ensemble studies, 242 
Enterohepatic cycling, 1040, 1040f 
Enthalpy, 11 
Entropy 
change during chemical reaction, 11, 12 
double helix formation and, 13f 
overall, 12 
total, 11 
Envelope form, 321, 321f 
Enzymatic cascades, 303-304, 303f 
Enzymatic velocity, 230 
Enzyme inhibition 
affinity labels, 237-238, 237f, 238f 
competitive, 234 
by diisopropylphosphofluoridate (DIPF), 
237, 237£ 
illustrated, 234f 
irreversible, 234, 237-238 
mechanism-based (suicide), 238, 238f 
noncompetitive, 234-235 
reversible, 234, 234f 
by specific molecules, 234 
by transition-state analogs, 240-241, 241f 
transition-state analogs in, 234 
uncompetitive, 234 
Enzyme kinetics 
for allosteric enzyme, 233, 233f 
chymotrypsin, 254-255, 255f 
of competitive inhibitors, 235, 235f 
defined, 225 
in first-order reactions, 225—226 
Lineweaver-Burk plot, 229, 229f 
maximal rate, 228-229, 230 
Michaelis constant in, 227, 228-230, 229t 
Michaelis-Menten, 229, 233, 235, 254 
Michaelis-Menten model for, 225-233 
of noncompetitive inhibitors, 235, 235f 
in pseudo-first-order reactions, 226 
in second-order reactions, 226 


sigmoidal, 287, 287f 
specificity constant, 230-231, 231f 
steady-state assumption and, 
226-228, 226f 
of uncompetitive inhibitors, 235, 235f 
of water deprotonation, 267, 267f 
Enzyme multiplicity, 732, 732f 
Enzyme regulation 
by covalent modification, 293-298 
of protein kinase A (PKA), 297, 297f 
strategies, 285-286 
Enzyme-linked immunosorbent assay 
(ELISA) 
defined, 82 
indirect, 82, 83f 
sandwich, 82-83, 83f 
Enzyme-pyridoxamine phosphate complex 
(E-PMP), 690 
Enzymes, 215-245 
acceleration of reactions, 221-225 
activation energy and, 221, 221f, 222 
active sites, 216, 223-224, 223f, 230, 252 
allosteric, 233, 233f, 287 
amount control, 286 
assay measurement of, 67 
ATP hydrolysis and, 276 
branching, 631-632, 632f 
catalytic activity, control of, 443-444 
catalytic efficiency, 230-231, 231f 
catalytic power, 215, 216-218 
catalytic reaction, 222-223, 222f 
in cellular respiration, 217 
classes of, 245, 245t 
cofactors for, 217, 217t 
debranching, 638f 
defined, 450b 
digestive, 299 
energy-transducing, 217-218 
ensemble studies, 242f 
homologous, 696f 
isozymes, 285-286, 292-293, 293f 
kinetic properties of, 225-233 
kinetically perfect, 456 
metabolism control of, 443 
multiple forms of, 285-286 
overview of, 215-218 
peptide-cleaving, 260-262 
in photosynthesis, 217 
P-loop structures, 280-281 
processive, 840b 
as proteins, 215 
proteolytic, 216, 286, 299-308 
of purine nucleotide synthesis, 752 
pyridoxal phosphate, 691-692, 691f, 692f 
rate enhancement by, 216, 216t 
reaction rate and, 220-221, 220f 
restriction, 137, 269-275 
reversible covalent modification, 286 
single molecule studies of, 242-243, 242f 
specificity, 215, 216-218, 217f 
spectroscopic characteristics of, 223 
substituted intermediate, 233 
substrate interaction, 223 


ternary complex of, 232 
turnover number of, 230, 230f 
urea cycle, 696-697 
Enzyme-substrate binding 
in catalysis, 225 
conformation selection, 224 
induced-fit model of, 224, 224f, 252, 453 
lock-and-key model of, 224, 224f 
multiple weak attractions, 224 
Enzyme-substrate complex 
defined, 222 
diffusion-controlled encounter, 231 
dissociation constant for, 229 
evidence of existence, 222—223 
fates, 226 
formation, in catalysis, 222-223, 222f 
hydrogen bonds in, 224, 224f 
reformed, 226 
structure of, 223, 223 
Epidermal growth factor (EGF) 
defined, 411 
Ras activation and, 413-414, 413f 
in signal transduction, 397, 411-414 
signaling pathway, 414, 414f 
structure of, 411-412, 411f 
Epidermal growth factor receptor (EGFR) 
defined, 411-412 
dimerization of, 412, 412f, 413 
modulator structure of, 412, 412f 
overexpression of, 416-417 
phosphorylation of carboxyl-terminal 
tail, 413 
unactivated, structure of, 412-413, 413f 
Epigenome, 942 
Epimers, 317f, 318 
Epinephrine 
defined, 399 
in fatty acid metabolism, 671-672 
in glycogen metabolism, 627-629, 
628f, 633 
in signal transduction, 399-407 
synthesis of, 734, 734f 
Epitopes, 79 
Equatorial bonds, 320-321, 321f 
Equilibrium constant 
defined, 219 
dissociation of water, 13-14 
between enzyme-bound reactants and 
products, 279 
metabolism and, 428 
rate constants, 221 
of reactions, 219-220 
Equilibrium potential, 387-388, 387f 
Equilibrium reaction, 220-221 
Error-prone polymerases, 846 
Erythropoietin (EPO), 327, 327f 
Erythrose 4-phosphate, 604, 604f 
Escherichia coli (E. coli) 
amino-terminal protein residues, 
905-906 
DNA polymerase I, 829, 829 
DNA recombination in, 853 
DNA repair in, 847-849 


DNA replication in, 115, 115f, 839-840, 
842-843, 842f 
enterotoxigenic, 773 
fatty acid synthesis in, 664 
flagella, 1012, 1026-1029 
gene expression in, 925-932 
genome of, 109, 109f 
lac operon of, 929-932, 929f, 930f 
membranes, 359 
methionine repressor, 927, 927f 
promoter sequences, 865 
pyruvate dehydrogenase complex of, 
497£, 497 
RecA protein, 933 
recombinant systems and, 78 
restriction enzymes in, 269 
ribonucleotide reductase of, 753 
RNA in, 119, 119t 
RNA polymerases in, 123, 123f, 
861-862, 862t 
two-dimensional electrophoresis and, 74 
Essential amino acids 
defined, 682, 719 
pathway steps, 719-720, 720f 
table of, 682t 
Estradiol, 793, 951 
Estrogen response elements (EREs), 950 
Estrogens 
defined, 789 
DNA-binding receptors and, 949 
pathways for formation of, 794f 
synthesis of, 792-793, 794 
Estrone, 793 
Ethanol 
fermentation of, 462-463, 463f 
in glycolysis, 449 
metabolism, 819-822 
Ethylene, 726, 726f 
Eukarya, 3f 
Eukarya domain, 3 
Eukaryotes 
defining characteristics, 3 
DNA replication in, 843-844, 843f, 844f 
fatty acid synthesis in, 664 
glycolysis in, 451 
membranes, 359-362 
nuclear envelope, 360, 360f 
protein synthesis inhibition in, 914-915 
transcription factors in, 874 
transcription in, 871-886 
translation in, 871-872, 871f 
translation initiation in, 911-912, 912f 
tubulins in, 1023 
Eukaryotic cell cycle, 844, 844f 
Eukaryotic genes 
as discontinuous, 127 
mosaic nature of, 127 
organization of, 942 
quantitation and manipulation of, 
157-159 
Eukaryotic RNA polymerases, 860, 860 
Evolution 
of active sites, 252 
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of amino acid sequences, 37 
of aminoacyl-tRNA synthetases, 902 
of blood types, 331-332 
caloric homeostasis and, 802—803 
of carbonic anhydrases, 268 
of chloroplasts, 568 
of citric acid cycle, 516 
convergent, 181-182, 181f 
of cytochrome c, 541, 541f 
divergent, 181 
of DNA-binding sites, 931 
of drug resistance, 1053-1054 
experimental studies of ancient DNA 
amplification and sequencing in, 
185, 185f 
of globins, 183-184 
of glycogen metabolism, 629 
of glycogen phosphorylase, 629 
of immune system, 981-982 
of introns and exons, 128 
in the laboratory, 186, 186f 
of metabolism, 444-445 
molecular, 185-187 
obesity and, 802-803 
of oxygen, 574b 
of photoreceptors, 973-974, 974f 
of proteasomes, 686, 686f 
of proteins, 128 
of ribosomes, 903 
of signal transduction pathways, 415 
split genes in, 128 
of vitamins, 438—439 
Evolutionary trees, 183-184, 183f 
Excinuclease, 849 
Excretion, drug, 1039-1040, 1040f 
Exemestane, 793 
Exercise 
ATP in, 431, 431f, 476, 813-815, 814t 
citric acid cycle in, 514, 514f 
creatine phosphate in, 813-814, 814t 
diabetes and, 811-812 
fatty acid metabolism in, 813, 813f 
fuel sources for, 813-815, 814t, 815f 
glycolysis during, 476 
mitochondrial biogenesis and, 813, 813f 
oxidative phosphorylation in, 814-815 
super compensation (carbo-loading) 
and, 815 
Exons 
defined, 106 
in encoding protein domains, 128-129 
shuffling, 128, 128f, 129f 
Expression cloning, 149 
Expression vectors, 146 
Extracellular signal-regulated kinases 
(ERKs), 414 
Extrinsic pathway, clotting, 304 


2-'8F.2-p-deoxyglucose (FDG), 474, 475f£ 
Fp fragments, 985-987 

Facilitated diffusion, 367, 369 

F-actin, 1017 

Facultative anaerobes, 464 


C16 
INDEX 


FAD (flavin adenine dinucleotide) 
as catalytic cofactor, 498 
defined, 435 
electron transfer potential of, 500 
structure of oxidized form, 435, 436f 
structure of reactive components of, 
436, 436f 
FADH) (flavin adenine dinucleotide 
reduced), 650-652 
defined, 436 
electron-transport chain and, 497, 510, 
526, 527 
in oxidation-reduction reactions, 
440, 440f 
structure of reactive components of, 
436, 436f 
Falciparum malaria, 612 
Familial hypercholesterolemia, 767, 
784-785 
Faraday constant, 369, 528 
Farnesyl pyrophosphate, 778, 779 
Fast switch, 464 
Fasting, 817 
Fat 
in adipose tissue, 815 
body, energy storage in, 645 
brown, 555 
as building block, 20 
neutral. See triacylglycerols 
Fat cells, 645 
Fatty acid chains 
melting temperature, 357, 357t 
packing of, 358, 358f 
Fatty acid metabolism, 643-672 
acetyl carnitine in, 649-650 
acetyl CoA carboxylase in, 670-672, 671f 
acetyl CoA in, 512, 648-649, 650-652, 
666-667, 666f 
acetyl CoA synthase in, 649 
activation in, 648-649 
acyl carrier protein (ACP) in, 661-662, 
662f, 662, 664-665, 665f 
B-oxidation pathway in, 650-651 
chylomicrons in, 646-647, 646f 
citrate in, 666, 671, 672f 
coenzyme Bj in, 654-655, 655f 
degradation in, 644, 644f, 651-661, 651f, 
657f, 661-662 
diet and, 672 
enoyl CoA hydratase in, 651-652, 652f 
epinephrine in, 671-672 
in exercise, 813, 813f 
FADH) in, 650-652 
glucagon in, 671-672 
insulin in, 671-672 
ketone bodies in, 657—660 
lipases in, 647-648 
lipid mobilization and transport in, 
645-647, 646f 
lipolysis in, 648, 648f 
malonyl CoA in, 662, 671 
methylmalonyl CoA in, 655-656, 
655f, 656 


NADH in, 650-652 

NADPH sources for, 667 

overview of, 643-645 

oxidation in, 648-649, 650-652, 651t, 
656-657 

palmitoyl CoA in, 652, 652f, 653, 671 

pancreatic lipases in, 645-646, 646f 

pathway integration, 667, 667f 

peroxisomes in, 656-657, 657f 

steps in, 644-645, 644f 

triacylglycerols in, 645-648, 647f 

in tumor cells, 667-668 


Fatty acid synthase 


catalytic cycle of, 664, 665f 
defined, 661, 662-663 
structure of, 664, 665f 


Fatty acid synthesis 


in animals, 664—666 

in cytoplasm, 666-667, 666f 
defined, 644-645 

degradation versus, 661-662 
by fatty acid synthase, 661-668 
intermediates in, 662 

malonyl CoA in, 662 

maximal, 670-671 

reactions in, 662-664 

steps in, 644f, 662-664, 664f 


Fatty acid thiokinase, 649 
Fatty acids 


animals and, 660 

B oxidation of, 818 

chain length in, 343-344, 664 

cholesterol and, 358, 358f 

covalent attachment of, 643 

defined, 342 

degree of unsaturation, 343-344, 358 

derivatives, 643 

desaturation of, 668-669, 668f 

as fuel, 643, 647-652 

generation by lipolysis, 648, 648f 

in lipids, 342-344, 343f 

in muscle, 810, 815 

naturally occurring, 343t 

in obesity, 810 

odd-chain, 654, 654f 

physiological roles of, 643 

polyunsaturated, 661, 669 

saturated, 661 

structures of, 343, 343f 

unsaturated, 652-654, 652f, 653f, 661, 
668-670 


Fatty liver, 820 

Favism, 611 

F, fragment, 985, 986 

Feed forward stimulation, 472 
Feedback inhibition, 285, 287, 443 


in amino acid synthesis, 731-732 
cumulative, 732 

enzyme multiplicity and, 732, 732f 

in purine synthesis control, 758-759, 758f 


FeMo cofactor, 716, 716f 
Fermentation 


to alcohol, 449, 462-463, 463f 


as anaerobic process, 464-465 
defined, 462 
in glycolysis, 462-464 
to lactic acid, 449, 463-464, 464£ 
start and end points of, 465t 
Ferredoxin, 580, 716 
Ferredoxin-NADP*, 576-577, 577£ 
Ferredoxin-thioredoxin reductase, 599 
Ferritin, 736, 954-955, 954f 
Ferrochelatase, 241, 736 
Fetal hemoglobin, 201 
Fiber types, muscle, 625, 626t 
Fibrils, 325 
Fibrin, 305, 305f, 308 
Fibrin clots, 305-306, 305f 
Fibrin monomer, 305 
Fibrinogen 
conversion by thrombin, 304-306 
structure of, 304, 305f 
Fibrinopeptides, 304-305 
Fibrous proteins, 44-46 
50S subunit, of ribosomes, 902-903, 903f, 
908, 910 
Fight-or-flight response, 399 
Fingerprint, DNA, 137 
First Law of Thermodynamics, 11 
First-order reactions, 225 
First-pass metabolism, 1039 
Fischer, Emil, 224, 1046 
Fischer projections, 22 
5’ cap, 878-879, 879f 
Flagella 
microtubules in, 1023 
rotary movement of, 1026-1029 
structure of, 1026, 1026f 
Flagellar motor 
chemotaxis and, 1028-1029, 1029f 
components of, 1027, 1027f 
rotation of, 1026-1029 
schematic view of, 1027f 
Flagellin, 1026, 1026f 
Flavin adenine dinucleotide (FAD), 848 
Flavin mononucleotide (FMN), 436, 436f, 
532, 867, 867£ 
Flavoproteins, 500 
Fleming, Alexander, 1041 
FlG, 1027 
FM, 1027 
FIN, 1027 
Flip-flop, 357, 357f 
Florey, Howard, 1041 
Fluconazole, 1038f 
Fluorescence microscopy, 82f, 84, 84f 
Fluorescence recovery after photobleaching 
(FRAP), 356, 356f 
Fluorodeoxyuridylate (F-dUMP), 757 
Fluorouracil, 757 
Folding funnel, 54, 54f 
Foodstuffs, energy extraction from, 
434-435, 434f 
Formylmethionyl-tRNA, 906, 906f, 907 
N-formylmethionine (fMet), 906, 907 
Formyltetrahydrofolate, 749, 750, 750f 


435 preinitiation complex (PIC), 912 
Fourier transform, 96 
Foxglove, 373, 373f 
Fractional saturation, 195 
Francois, Jacob, 928 
Franklin, Rosalind, 109 
Free energy 
of activation, 218 
ATP and, 426-431 
change during chemical reaction, 218—220 
defined, 12 
Gibbs, 12, 218 
in membrane transport, 369, 369f 
negative, 12 
of oxidation of single-carbon compounds, 
432, 432f 
in oxidation-reduction reactions, 528 
of phosphorylated compounds, 431t 
of phosphorylation, 296 
standard change, 219-220 
thermodynamics of metabolism and, 
425-426 
Free radicals. See Reactive oxygen species 
Friedreich’s ataxia, 531-532 
Fructofuranose, 320, 320f 
Fructokinase, 465-466 
Fructopyranose, 320, 320f 
Fructose, 318, 318f 
activation of phosphofructokinase by, 
472, 472£ 
excessive consumption of, 466 
in glycolysis, 465-466, 466f 
metabolism, 466, 466f 
regulation of phosphofructokinase by, 
472, 472£ 
ring form, 320f 
ring structures, 320, 320f 
Fructose 1,6-biphosphate 
in gluconeogenesis, 477f 
in glycolysis, 452f, 453-454, 454f 
Fructose 1-phosphate, in glycolysis, 466 
Fructose 2,6-biphosphate (F-2,6-BP) 
in gluconeogenesis, 483-485, 484f 
in glycolysis, 472, 472f 
Fructose 6-phosphate 
in Calvin cycle, 594, 595f, 598, 598f 
in gluconeogenesis, 477f 
in glycolysis, 452f, 453, 454f, 472 
in pentose phosphate pathway, 604, 604f 
phosphorylation of, 454 
Fructose biphosphatase 2 (FBPase2), 484 
FtsZ, 1023 
Fuel sources for, 816-817 
Fumarase, 508 
Fumarate 
from phenylalamine, 704, 704f 
in purine synthesis, 751 
in urea cycle, 695, 695f 
4-fumarylacetoacetate, 704, 704 
Functional groups, of proteins, 28 
Functional magnetic resonance imaging 
(FMRI), 193-194, 194f 
Furan, 319 


Furanose, 319, 319f, 320-321 
Futile cycles, 485, 486 


G elongation factor (EF-G), 909-910, 910f 
G proteins 
activation of, 400-401, 401f 
ATP synthase and, 548-549 
defined, 400 
in glycogen metabolism, 627-629 
heterotrimeric, 400—402, 401f 
resetting of, 403-404, 403f 
role in signaling pathways, 401 
small, 413 
subunits, 401 
G-actin, 1017 
GAL4 binding sites, 948, 948f 
Galactitol, 469 
Galactokinase, 466 
Galactolipids, 567 
Galactose, 317, 318f 
defined, 466 
in glycolysis, 466-467, 466f 
missing transferase and, 468-469 
ring form, 320f 
Galactose 1-phosphate uridyl 
transferase, 467 
Galactosemia, 468 
B-galactosidase, 332-333, 928, 928f 
B-galactosides, 930 
Galdieria sulphuraria, 184, 184f 
Gangliosides, 346 
disorders of, 774 
structure of, 773, 773f 
synthesis of, 772-773, 773f 
Gap junctions 
closing of, 390 
defined, 368, 389 
illustrated, 389f 
in intercellular communication, 389 
structure of, 390, 390f 
Gas-phase ions, 85 
Gastrointestinal peptides, 804-805, 805t 
GC box, 874, 874£ 
GDP (guanosine diphosphate) 
in citric acid cycle, 506 
hydrolysis of, 427 
in olfaction, 964 
in signal transduction, 401-402, 413, 
413f, 414 
in translation, 907-908 
in tubulin, 1023 
in vision, 972 
Gel electrophoresis 
defined, 71 
isoelectric focusing and, 73, 73f 
polyacrylamide, 72, 72f 
protein separation by, 71-75 
SDS, 72-73, 73£ 
SDS-PAGE, 73, 73f, 74 
two-dimensional electrophoresis and, 
74-75, 746 
Gel-filtration chromatography, 69, 69f 
Gellert, Martin, 836 
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Gene disruption, 160-162 


consequences of, 160-161, 161f 
defined, 160f 
by homologous recombination, 160, 160f 


Gene expression 


analysis with microarrays, 159, 159f 

bacterial, 910 

constitutive, 925 

control of, 925-926 

as DNA information transformation, 
119-124 

levels of, 157-158 

regulated, 925 

RNA in, 119-120 

transcriptional, 910, 925-926 

transcriptional, in eukaryotes, 954-957 

transcriptional, in prokaryotes, 935-937 

translational, 910, 954-956 


Gene expression (eukaryotes), 941-958 


activation domains in, 945, 946 

basic-leucine zipper (bZip) in, 945, 945f 

chromatin in, 943-944, 952-953 

coactivators in, 950-951 

combinatorial control in, 946 

DNA methylation and, 949 

DNA-binding structures in, 945-946 

enhancers in, 946-947 

histones in, 943-944 

homeodomain in, 945, 945£ 

hypomethylation in, 949 

in liver, 942, 942t 

mediator in, 946, 946f 

nuclear hormone receptors in, 
949-951, 950f 

nucleosomes in, 943-944, 943f 

overview of, 941-943 

in pancreas, 942, 942t 

posttranscriptional, 954-957 

versus in prokaryotes, 941-943 

steroid-hormone receptors in, 
951-952, 952f 

tissue-specific, 942t 

transcription factors in, 945-947 

zinc-finger domains in, 945-946, 946f 


Gene expression (prokaryotes), 925-938 


attenuation in, 935-937, 936f 

autoinducers in, 934, 934f 

biofilms and, 934-935 

catabolite activator protein (CAP) in, 
931-932, 931f, 932f 

catabolite repression in, 931-932 

chromatin remodeling in, 948-954, 953f 

corepressors in, 931 

DNA-binding proteins in, 926-932 

DNA-binding sites in, 931, 931f 

versus in eukaryotes, 941-943 

helix-turn-helix motif, 927, 927£ 

lac operon in, 929-932, 929f, 930f 

lac repressor in, 927, 929-930, 929f, 930f 

\ repressor in, 932-933, 933f, 934f 

ligand binding in, 930 

operon model, 928-932 

overview of, 925-926 
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Gene expression (prokaryotes)—(continued ) 
posttranscriptional, 935-937 
pur repressor in, 931, 931f 
quorum sensing in, 934, 934f 
regulatory sites in, 926-927 
signaling in, 933-934 
social interactions in, 934 
symmetry matching in, 927-928 
transcriptional, 925-932 
“Gene guns,” 163 
Gene knockdown, 163 
Gene knockout, 160-162, 160f, 161f 
Gene therapy, 164 
Gene transfer, horizontal, 184, 184f 
General acid-base catalysis, 252 
Genes. See also specific genes 
comparative analysis of, 156-157 
defined, 21 
designer, 152 
horizontal transfer, 184, 184f, 275 
number of, in human genome, 154 
regulator, 928 
reporter, 145 
RNA and, 118-119 
split, 128 
structural, 928 
synthesis by automated solid-phase 
methods, 139-141 
tumor suppressor, 851 
Genetic code 
amino acids, 125, 125t 
aminoacyl-tRNA synthetases and, 
898-902 
defined, 21, 124 
as degenerate, 125 
directionality of, 124 
features of, 125 
as nonoverlapping, 124 
universality of, 126-127 
Genetically modified organisms 
(GMOs), 164 
Genomes 
analysis of, 155-157 
of Arabidopsis thaliana, 154 
of Caenorhabditis elegans, 153 
of chloroplasts, 568 
comparative, 19, 156-157, 156f 
complete, 153f 
defined, 66 
of Drosophila melanogaster, 154, 982 
of E. coli, 109, 109f 
editing, 161-162, 162f 
of Haemophilus influenzae, 153, 177 
human, 154, 154f, 156, 156f 
of Indian muntjac, 109, 109f 
long interspersed elements, 154 
of mitochondria, 525-526, 525f, 526f 
next-generation sequencing, 
155-156, 155f 
noncoding DNA in, 154 
number of genes in, 154 
proteome as functional representation, 66 
of puffer fish, 157, 157f 


of Reclinomonas americana, 526 
of Rickettsia prowazekii, 525-526 
of Saccharomyces cerevisiae, 153, 941 
sequencing of, 17-20, 152-156 
size of, 109 
Genomic library 
creation of, 146-147, 147f 
defined, 147, 148 
screening of, 148, 148f 
Genomic variation, 18 
Genomics 
comparison, 19 
in drug discovery, 1048-1051 
revolution, 17—22 
Geranyl pyrophosphate, 778, 778f 
Gerany] transferase, 778, 779 
Ghrelin, 805, 805t 
Gibbs free energy, 12, 218, 221 
Gigaseal, 379 
Gla domain, in prothrombin, 304 
GlcNAcase, 327 
Gleevec (imatinib mesylate), 1048, 1054 
Globins. See also Hemoglobin; Myoglobin 
defined, 171 
evolution of, 183-184 
in human genome, 208 
sequence alignment in, 171-177 
Glomeruli, 1039 
Glucagon 
in essential enzyme regulation, 485 
excess in type 1 diabetes, 812 
in fatty acid metabolism, 671-672 
Glucagon-like peptide 1 (GLP-1), 805, 805t 
Glucocorticoids, 789 
Glucogenic, amino acids, 698-703 
Glucokinase, 473, 816 
Gluconeogenesis, 476-487 
defined, 449, 476 
energy charge and, 478, 482-483 
fasting and, 817 
fructose 1,6-biphosphate in, 477f 
fructose 2,6-biphosphate in, 
483-485, 484f 
fructose 6-phosphate in, 477f 
glucose 6-phosphate in, 477f, 481, 481f 
glucose generation and, 481 
glycolysis and, 478, 482-487 
historical perspective on, 450 
in kidney, 478 
in liver, 478, 483-485, 818 
in muscle contraction, 485—487, 486f 
overview of, 449, 476-478 
oxaloacetate in, 477f, 478-480, 819 
pathway integration, 486, 486f 
pathway of, 476-478, 477f 
phosphoenolpyruvate (PEP) in, 477f 
pyruvate conversion in, 478-480, 479f 
reactions of, 482t 
reciprocal regulation in, 482-487 
regulation of, 482-487, 812, 812f 
stoichiometry of, 481 
substrate cycles in, 485, 485f 
urea cycle in, 696, 696f 


Glucose, 317-318, 318f 
in adipose tissue, 816 
as cellular fuel, 451 
characteristics of, 451 
complete oxidation of, 552-553, 553t 
conversion into pyruvate, 461-462 
in diabetes, 808 
as essential energy source, 317 
formation of, 451 
generation from glucose 6-phosphate, 
481, 481f 
generation of, 450 
in glycogen, 324, 324f, 617, 618-619 
historical perspective on, 450 
homopolymers of, 324-325 
in liver-glycogen metabolism, 635-636 
metabolism, 424f 
as reducing sugar, 321-322 
ring form, 320f 
storage forms of, 324, 324f 
synthesis from noncarbohydrate 
precursors, 476—482 
Glucose 1-phosphate, in glycogen 
metabolism, 619-620, 622-623 
Glucose 6-phosphate (G-6P), 323 
galactose conversion into, 466-467, 467f 
in gluconeogenesis, 477f, 481, 481f, 623 
in glycolysis, 452f 
homeostasis, 816 
isomerization of, 453-454, 454f 
in liver, 622-623, 637 
metabolism of, 607-610, 618, 618f 
in pentose phosphate pathway, 602, 603f, 
607-610 
transporter, 637 
Glucose 6-phosphate dehydrogenase 
(G6PD) 
deficiency, 610-612 
in pentose phosphate pathway, 602, 
610-612 
in reactive oxygen species protection, 
610-612 
Glucose homeostasis 
in diabetes, 807-812 
in starvation, 818-819 
starved-fed cycle in, 816-817 
Glucose transporters (GLUTs), 368 
families, 473, 474t 
in glycolysis, 473-474, 474t 
Glucose-alanine cycle, 692-693 
Glucose-stimulated insulin secretion 
(GSIS), 810-812 
a-1,6-glucosidase, 638 
Glutamate, 233, 700 
chirality of, 717, 718f 
formation of, 687, 692, 720-721 
in nitrogen fixation, 717-718 
oxidative deamination of, 687-689 
structure of, 33, 34f 
synthesis of, 722, 722f 
Glutamate dehydrogenase, 688-689, 
717, 718 
Glutamate synthase, 718 


Glutamine, 717—718 
degradation, 701 
in nitrogen transport, 693 
in pyrimidine synthesis, 745 
structure of, 32f, 33 
synthesis of, 722 
Glutamine phosphoribosyl 
amidotransferase, 749 
Glutamine synthetase, 717, 718, 732-733 
regulation of, 732-733, 733f 
structure of, 732—733 
Glutathione, 610, 611 
conjugation reaction, 1038, 1039 
defined, 734 
structure of, 734 
Glutathione peroxidase, 611, 735, 735f, 735 
Glutathione reductase, 611, 734 
Glycan-binding proteins, 334 
Glyceraldehyde, 316, 316, 317, 317f 
Glyceraldehyde 3-phosphate (GAP), 323 
in Calvin cycle, 595 
catalytic mechanism of, 458, 459f 
free-energy profiles for, 458, 458f 
in glycolysis, 452f, 454-455, 455f, 456, 
456f, 457-460, 458f, 459f 
oxidation of, 432-433, 458-459, 459f 
in pentose phosphate pathway, 603, 
604, 604£ 
structure of, 458f, 458 
Glycerol 
generation by lipolysis, 648, 648f 
in gluconeogenesis, 476-478 
in liver, 648 
in phosphoglycerides, 344 
Glycerol 3-phosphate 
in lipid synthesis, 768 
reoxidation of, 549 


Glycerol 3-phosphate shuttle, 549-550, 549f 


Glycerol phosphate acyltransferase, 768 
Glycine 
in Calvin cycle, 593 
conversion into serine, 700 
defined, 30 
in liposomes, 349, 349f 
porphyrins from, 736-737, 737f 
structure of, 31f 
synthesis of, 722-723, 723f 
Glycine synthase, 724 
Glycine-glycine-glutamine (GGQ) 
sequence, 911 
Glycoaminoglycans, 326 
Glycocholate, 646f 
Glycogen, 333 
branch point in, 324, 324f 
defined, 324, 617 
glucose storage in, 617, 618-619 
in glycogen metabolism, 627-629, 
628f, 633 
in liver, 618, 618f, 627 
molecules, 617, 618f, 632f 
in muscle, 618, 618f 
phosphorolytic cleavage of, 620-621 
remodeling, 622, 622f 


storage efficiency, 632 
storage sites of, 618, 618f 
structure of, 618f 


Glycogen lakes, 638f 
Glycogen metabolism 


branching enzyme in, 631-632, 632f 

cAMP in, 628-629 

coordinate control of, 632-633, 633f 

degradation of, 618-629 

enzyme breakdown, 619-623 

epinephrine in, 627-629, 628f, 633 

evolution of, 629 

G proteins in, 627-629 

glycogen in, 627-629, 628f, 633 

glycogen phosphorylase in, 619-626 

glycogen synthase in, 630-632, 631f 

glycogen synthase kinase (GSK) in, 632, 
633, 633f 

glycogenin in, 631 

a-1,6-glucosidase in, 621-622 

hormones in, 627-629, 628f 

insulin in, 635, 635f 

in liver, 635-637, 636f 

overview of, 617-619 

pathway integration, 628f 

phosphoglucomutase in, 622, 623f 

phosphoglycerate mutase in, 622 

phosphorolysis in, 619-623 

phosphorylase in, 619-627, 621f, 
636, 636f 

protein kinase A (PKA) in, 629, 633, 633f 

protein phosphatase 1 (PP1) in, 629, 
633-635, 634f, 636 

pyridoxal phosphate (PLP) in, 621, 621f 

regulation of, 618-619, 632-638 

regulatory cascade for, 627-629, 628f 

7TM receptors in, 627-629, 628f 

termination of, 629 

transferase in, 622, 622f 

UDP-glucose and, 630-631 


Glycogen phosphorylase 


a, 623-624, 624f, 626, 627£ 
allosteric regulation of, 624, 624f, 

625, 625t 
amino-terminal domain of, 620-621, 620f 
b, 623, 624f, 625, 626, 627f 
carboxyl-terminal domain of, 620-621, 620f 
defined, 623 
evolution of, 629 
in glycogen metabolism, 619-626 
glycogen-binding site of, 620-621, 620f 
in liver, 623-624, 634f 
in muscle, 625, 625f 
regulation of, 623-627, 624f, 625t, 629 
structure of, 620 


Glycogen synthase 


a-1,6 linkages, 631-632, 631f 
defined, 630 

in glycogen synthesis, 632 
isozymes of, 630-631 


Glycogen synthase kinase (GSK) 


defined, 632 
in glycogen metabolism, 633, 633f 
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Glycogenin, 631 
Glycogen-storage diseases, 637-638, 
637t, 638f 
Glycolate, 593-594, 593f 
Glycolipids, 344, 345-346, 348-350 
N-glycolylneuraminate, 772 
Glycolysis 
aerobic, 474-476, 510 
aldehyde oxidation in, 457-458 
anaerobic mode, 510, 814 
ATP formation in, 459-462 
1,3-bisphosphoglycerate (1,3-BPG) in, 
433, 457-460, 459f 
in cancer, 474-476 
citric acid cycle and, 497-501, 497f 
covalent enzyme-bound intermediate 
in, 458 
defined, 449 
dihydroxyacetone phosphate (DHAP) in, 
452f, 454-455, 455f, 456, 456f 
as Embden-Meyerhof pathway, 450 
enediol intermediate in, 455, 456f 
energy charge and, 482-483 
as energy-conversion pathway, 451-469 
in eukaryotic cells, 451 
fermentation in, 462—464 
first stage of, 451, 451f, 452f 
fructose 1,6-bisphosphate in, 452f, 
453-454, 454f 
fructose 1-phosphate in, 466 
fructose 2,6-bisphosphate in, 472, 472f 
fructose 6-phosphate in, 452f, 453, 
454f, 472 
fructose in, 465-466, 466f 
galactose in, 466-467, 466f 
gluconeogenesis and, 478, 482-487 
glucose 6-phosphate (G-6P) in, 452f 
glucose transporters in, 473-474, 474t 
glucose trapping in, 451-453 
glyceraldehyde 3-phosphate (GAP) in, 
452f, 454-455, 455f, 456, 456f, 
457-460, 458f, 459f 
hexokinase in, 451-453, 453f, 
470-471, 473 
historical perspective on, 450 
insulin in, 816 
isomerization of carbon sugars in, 
453-454, 454f 
in liver, 472-473, 483-485 
in muscle, 469-471, 471f 
NAD* in, 458, 459f, 462-464, 465 
NADH in, 458, 462-464 
overview of, 449, 451 
pathway integration, 486, 486f 
pentose phosphate pathway and, 
602-610 
phosphofructokinase (PFK) in, 469-470, 
470f, 472, 472£ 
phosphoglycerate kinase in, 459 
phosphoglycerate mutase in, 460 
2-phosphoglycerate in, 460-461, 460f 
3-phosphoglycerate in, 460-461, 460f 
pyruvate in, 452f, 459-464 
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Glycolysis—(continued) 
pyruvate kinase in, 471, 473, 473f 
reactions of, 462t—463t 
reciprocal regulation in, 482-487 
regulation of, 812, 812f 
second stage of, 451, 452f, 457f 
stages of, 451, 451f, 452f, 457f 
thioester intermediate in, 457-458, 458f 
triose phosphate isomerase (TPI) in, 
455-456, 455f, 456f 
Glycolytic intermediates, 610 
Glycolytic pathway 
glucokinase in, 473 
hexokinase in, 470-471, 473 
phosphofructokinase (PFK) in, 469-470, 
470f, 472, 472£ 
pyruvate kinase in, 471, 473, 473f 
role of, 469 
Glycopeptide transpeptidase, 239 
Glycophorin, 336, 355, 355f 
Glycoproteins, 325-329 
Glycosaminoglycans, 327-328, 328f 
a-1,6-glucosidase, in glycogen metabolism, 
621-622 


Glycosidic bonds, 322, 322f, 325, 325f, 326f 


Glycosylation, 330, 331, 331f 
congenital disorders of, 332 
errors in, 332 
in nutrient sensing, 327, 327f 
Glycosyltransferases, 331, 331f 
Glyoxylate, 517, 517f, 593 
Glyoxylate cycle, 516-517, 517f 
Glyoxysomes, 517 
GMP (guanosine monophosphate) 
in purine synthesis, 751-752, 751f 
synthesis of, 758-759, 758f 
GMP synthetase, 752 
Goldstein, Joseph, 784 
Golgi complex, 330, 330f, 332, 918 
Gout, 761 
G-protein receptor kinase 2 (GRK2), 404 
G-protein-coupled receptors (GPCRs), 402 
Gradient centrifugation, 77—78, 77f 
Granzymes, 1000 
Grb2, 413, 413f 
Green fluorescent protein (GFP), 58-59, 
58f, 84f 
Greider, Carol, 1, 845 
Group-specific reagents, 237 
Group-transfer reactions, 441, 441f 
GTP (guanosine triphosphate) 
as energy source, 743 
hydrolysis, 403-404 
in olfaction, 964 
in resetting G proteins, 403-404 
in signal transduction, 401-402 
in translation, 907-908 
in tubulin, 1023 
in vision, 972 
GTPases 
defined, 414 
Ras family of, 414, 414t 
in signal transduction, 414, 414t 


Guanidinium chloride, 49 

Guanine, 4, 5f, 110, 110 

Guanine-nucleotide-exchange factor 
(GEF), 413 

Guanosine, 107 

Guanylate, 108, 752 

Guanylate cyclase, 973 

Guide strand, 162-163 

Gustation. See Taste 

Gustducin, 967, 967£ 


H zone, 1020, 1020f 
Haemophilus influenzae, 153, 177 
Hair cells, 975-976, 975f, 976f 
Hairpin turns. See Reverse turns 
Half-life 
of drugs, 1040, 1040f 
of proteins, 683, 684, 685f 
Half-reactions 
defined, 527 
in electron transport, 528-529 
Haptenic determinants, 994 
Haptens, 994 
Hartnup disease, 682 
Hatch, Marshall Davidson, 599 
Haworth projections, 319 
Hearing, 975-977 
hair cells in, 975-976, 975f, 976£ 
ion channels in, 976-977 
mechanism of action, 975 
Heart disease, 788 
Heat 
defined, 11 
oxidative phosphorylation uncoupling in 
generation of, 554 
release in double helix formation, 12-13 
thermodynamics and, 11 
Heat shock protein 70 (Hsp70), 179, 179 


Heat-shock response element, 875-876, 876f 


Heat-shock transcription factor (HSTF), 
875-876 
Heavy (H) chains 
in antibody diversity, 992-993 
antigen-binding and, 991-992 
defined, 985-986 
in immunoglobulin classes, 987t 
Heavy meromyosin (HMM), 1013 
Heinz bodies, 612, 612f 
Helicases 
defined, 832 
in DNA replication, 832-833 
mechanism, 833, 833f 
structure of, 832, 832 
symmetry, 833, 833f 
Helix-turn-helix motif, 48, 48f, 927, 927£ 
Helper T cells 
action, 1002, 1002f 
in cytokine secretion, 1002 
defined, 984, 1000 
in HIV infection, 1003-1004 
peptide recognition, 1001, 1001f 
T-cell receptors (TCRs) and, 1000-1002 
Hemagglutinin, 335-336, 335f 


Heme 
biosynthetic pathway, 737f 
in cytochrome bf complex, 575 
in cytochrome c oxidase, 536-537 
defined, 46 
intrinsic reactivity of, 194-195 
labeling of, 735b, 736, 736f 
in oxygen-binding, 192, 193-194, 194f 
Heme prosthetic group, 534 
Hemichannels, 390 
Hemoglobin, 191-212 
QB tetramer of, 49f 
alignment comparison, 173-174, 173f 
alignment with gap insertion, 173, 173f 
oP dimers, 195 
a chains, 195, 207—208 
amino acid sequences of, 171-172, 
171, 172£ 
B chains, 191, 195, 205, 205f 
conformational changes in, 200, 200f 
cooperativity, 198-199 
defined, 171, 191 
fetal, 201 
globin curve, 195 
Hill plot, 210f 
myoglobin comparison, 191-192 
oxygen delivery by, 196, 196f 
oxygen-binding, 192-202, 196f 
oxygen-binding curve, 196, 196f 
polypeptide chains, 192, 195 
purified, 200, 200f 
quaternary structure of, 195, 195, 
197-198, 198f 
R state, 198, 199f, 200, 211-212 
sequence alignment of, 176, 176f 
sickle-cell, 205, 205f 
space-filling model, 195f 
T state, 198, 199f, 200, 203, 211-212 
tertiary structure of, 178-179 
thalassemia and, 207 
Hemoglobin A (HbA), 195 
Hemoglobin H (HbH), 207 
Hemoglobin § (HbS), 205-206, 206f 
Hemolytic anemia, G6PD deficiency and, 
610-612 
Hemophilia, 308 
Hemorrhagic disposition, 307b 
Hemostasis, 303 
Henderson-Hasselbalch equation, 16 
Heparin, 307 
Heptoses, 316 
HER2 receptor, 413, 417 
Herbicides, photosynthesis and, 584 
Hereditary nonpolyposis colorectal cancer 
(HNPCC), 851, 851f 
Hers disease, 637t 
Heterodimeric sweet receptor, 968-969, 968f 
Heterotrimeric G proteins, 400-402, 401f 
Heterotrophs, 566, 590 
Heterotropic effects, 292 
Hexokinase 
defined, 451 
in glycolysis, 451-453, 453f 


inhibition of, 471 
liver and glycolysis and, 473 
metal ion requirement, 453 
muscle and glycolysis and, 470-471 
Hexose monophosphate pathway. See 
Pentose phosphate pathway 
Hexose monophosphate pool, 595 
Hexose sugars, formation in Calvin cycle, 
594-596, 595f 
Hexoses, 316, 453, 596f 
High-density lipoproteins (HDLs), 
782, 782t 
defined, 783 
properties of, 787 
protective effects and, 787—788 
reverse cholesterol transport in, 787 
High-performance liquid chromatography 
(HPLC), 71, 71f 
High-throughput screening, 1045 
Hill, Archibald, 210 
Hill coefficient, 210, 211f 
Hill plot, 210-211, 210f, 211f 
Histamine, 734, 734f 
Histidine 
degradation, 700, 700f 
distal, 194 
ionization of, 33 
proton shuttle, 268, 268f 
proximal, 193, 200 
structure of, 33, 33f 
Histidine operon, 936-937, 936f 
Histone acetyltransferases (HATS), 952, 952 
Histone code, 954 
Histone deacetylases, 953-954 
Histone octamer, 943-944 
Histones 
acetylation of, 952-953 
amino-terminal tails of, 952-953 
defined, 943 
in gene regulation, 943-944, 944f, 
952-954, 954t 
homologous, 944, 944f 
modifications, 954, 954t 
HIV. See Human immunodeficiency virus 
(HIV) infection 
HMG-CoA reductase, 1044 
HMG-COA reductase inhibitor, 1044-1045 
Hodgkin, Alan, 378 
Hoffman, Felix, 1043 
Hogness box, 122 
Holliday, Robin, 854f 
Holliday junctions, 854, 854f 
Homeodomain, 945, 945f 
Homeostasis, 442, 443f. See also Caloric 
homeostasis; Glucose homeostasis 
Homocysteine 
in activated methyl cycle, 725, 725f 
in amino acid synthesis, 725-726 
in cysteine synthesis, 726 
levels, reducing, 726 
in vascular disease, 726 
Homocysteine methyltransferase, 725 
Homocystinuria, 705t 


Homogenization, in protein purification, 67 
Homogentisate, 704, 704, 705 
Homogentisate oxidase, 704 
Homologies 
databases for, 177, 178f 
defined, 170 
detection of, 171-177 
evolutionary trees and, 183-184, 183f 
repeated motifs and, 180-181 
sequence alignment in, 173-177, 173f, 
179-180 
shuffling in, 173-174, 173f 
statistical analysis in, 173-174 
substitution matrices in, 174-177 
three-dimensional structures and, 179-180 
Homologous recombination, 160, 160f 
Homologous sequences, 177 
Homologs, 170, 171f 
Homolytic cleavage reaction, 655, 655f 
Homopolymers, 324 
Homotropic effects, 290 
Hood, Leroy, 991 
Horizontal gene transfer, 184, 184f, 275 
Hormone receptors, 950-952, 951f 
Hormone-receptor complex, 402 
Hormones 
in acetyl CoA carboxylase regulation, 
671-672 
eicosanoid, 669, 669f, 670f, 670 
in fatty acid metabolism, 670-672 
in gene expression, 950-952 
in glucose metabolism, 485 
in glycogen metabolism, 627-629, 628f 
local, 669 
metabolic functions of, 443—444 
in gluconeogenesis, 485 
in signal transduction, 400-402 
steroid, 789-791, 789f 
Human genome, 154, 154f, 156, 156f 
Human immunodeficiency virus (HIV) 
infection 
classes of, 1003 
drug development effects, 1047, 1047f 
helper T cells in, 1003-1004 
host cell for, 1003 
immune system and, 1003-1004 
protease, 262, 263f 
protease inhibitors, 263, 263f, 792, 1046, 
1046f, 1053 
receptor, 1003f 
schematic representation, 1003f 
vaccine for, 1007 
Humoral immune response, 984 
Huntington disease, 56, 683, 850 
Hurler disease, 328f, 332 
Huxley, Andrew, 378 
Hybridoma cells, 81-82 
Hydrogen bonds, 8f 
acceptors, 15 
in base pairs, 5 
breaking with membranes, 354 
in enzyme-substrate complex, 224, 224f 
interaction, 8 
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as noncovalent bond, 7 
strength of, 8 
water, 9, 10 
Hydrogen ions 
in acid-base reactions, 13 
concentration in solutions, 13 
oxygen release and, 202—204 
Hydrogen-bond donors, 8 
Hydrolases, 245t 
Hydrolysis. See also ATP hydrolysis 
GTP (guanosine triphosphate), 403-404 
of lipids, 647-648 
in metabolism, 427—429, 441-442, 442f 
of peptide bonds, 253, 256, 257f 
of phosphodiester bond, 269-270, 269f 
Hydronium ions, 13 
Hydropathy plots, 355, 356f 
Hydrophilic amino acids, 33, 47 
Hydrophilic moiety, 347, 347f 
Hydrophobic amino acids, 30-32 
structures of, 31f 
types of, 30-32 
Hydrophobic effect, 9, 9f, 10, 32 
Hydrophobic interactions 
defined, 9 
as noncovalent bond, 7 
Hydrophobic moiety, 347, 347f 
3-hydroxy-3-methylglutaryl CoA, 702 
3-hydroxyanthranilate, 704, 704 
25-hydroxycholesterol, 781 
2-hydroxyglutarate, 513 
Hydroxyl groups, 32-33, 57 
Hydroxylation 
cytochrome P450 in, 791-792, 791f 
steroid hormones, 790-791 
3-hydroxy-3-methylglutaryl CoA reductase 
(HMG-CoA reductase), 776-777, 
777£, 779, 781£, 788, 788f 
P-hydroxyphenylpyruvate, 704, 704, 728 
P-hydroxyphenylpyruvate hydroxylase, 704 
Hydroxyproline, 57 
Hyperammonemia, 697 
Hyperbaric oxygen therapy, 202 
Hyperlysinemia, 705t 
Hypersensitive sites, 948 
Hypervariable loops, 988-990, 989f 
Hypochromism, 116, 116f 
Hypomethylation, 949 
Hypoxanthine, 761 
Hypoxanthine-guanine 
phosphoribosyltransferase 
(HGPRT), 752, 761 
Hypoxia 
defined, 475 
tumor, 475-476, 476t 
Hypoxia-inducible factor 1 (HIF-1), 476, 
476f, 513 


I band, 1020, 1020f 

Ibuprofen, 1038, 1038f 

Ice, 9, 9F 

T-cell disease, 332, 332f 

Imatinib mesylate (Gleevec), 1048, 1054 
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Immune response 
basis of, 985 
cellular, 984 
development of, critics of, 985 
humoral, 984 
Immune system, 981-1009 
adaptive, 984-985 
in autoimmune diseases, 1005, 1005f 
B cells in, 984, 993-994, 1006 
in cancer, 1005-1006 
in disease, 1004-1007 
evolution of, 981-982 
innate, 982-983 
MHC proteins in, 995-1004 
overview of, 981-982 
T cells. See T cells 
T-cell receptors. See T-cell receptors 
(TCRs) 
Immunity, cell-mediated, 995 
Immunizations, 1006-1007 
Immunoelectron microscopy, 84, 84f 
Immunoglobulin A (IgA), 987, 987t 
Immunoglobulin D (IgD), 987, 987t 
Immunoglobulin domains. See also 
Immunoglobulin fold 
constant, 988, 988f 
defined, 986 
variable, 988, 988f, 989f 
Immunoglobulin E (IgE), 987-988, 987t 
Immunoglobulin fold 
complementarity-determining regions 
(CDRs), 988-990, 989f 
defined, 988 
elements of, 988-989 
structure of, 989 
Immunoglobulin G (IgG) 
antigen cross-linking, 986, 986f 
cleavage of, 986, 986f 
defined, 985 
properties of, 987t 
structure of, 985-986, 986f 
Immunoglobulin M (IgM) 
defined, 987 
properties of, 987t 
Immunoglobulins. See also Antibodies 
classes of, 986-988, 987f, 987t 
defined, 79, 984 
heavy chains, 985-987, 987t, 991-993 
light chains, 985-987, 987t, 991-993, 
991f, 992F 
production of, 984, 984f 
segmental flexibility of, 987, 987f 
sequence diversity in, 988f 
structure of, 79f 
Immunologic techniques 
enzyme-linked immunosorbent assay 
(ELISA), 82-83, 83f 
fluorescence microscopy, 84, 84f 
hybridoma cells in, 81-82 
monoclonal antibodies in, 80-82, 80f, 81f 
polyclonal antibodies in, 80, 80f 
in protein studies, 79-84 
western blotting, 83-84, 83f 


Immunological memory, 1006 
Immunoreceptor tyrosine-based activation 
motif (ITAM), 993, 993f 
Immunosuppression, 994 
Immunotoxins, 152 
Indian muntjac, 109, 109f 
Indinavir (Crixivan), 263, 263f 
Indinavir complex, 263-264, 263f 
Indirect ELISA, 82, 83f 
Induced pluripotent stem (iPS) cells, 
947, 947£ 
Induced-fit model, 224, 224f, 252 
Influenza virus, 335-336, 335f 
Inhibition 
competitive, 235, 235f, 236f 
noncompetitive, 235-236, 235f, 236f 
reversible, 235-236, 235f, 236f 
uncompetitive, 235, 235f, 236f 
Inhibition constant (Kj), 1036 
Inhibitory factor 1 (IF1), 554 
Innate immune system, 982-983 
Inosinate, 759 
Inosine, 760, 761, 898 


Inositol 1,4,5-trisphosphate (IP3), 398, 399, 


404, 405f, 415 

Insertional inactivation, 145, 145f 
Insulin 

amino acid sequence of, 37f 

binding of, 408-409, 409f 

defined, 27 

in diabetes, 808-812 

in essential enzyme regulation, 485 

in fatty acid metabolism, 671-672 

in glycogen metabolism, 635, 635f 

in glycolysis, 816 

MALDI-TOF mass spectrum of, 86f 

in obesity, 805-806 

pancreatic failure and, 810-812 

in protein synthesis, 816 

release of, 810, 811f 

secretion of, 805 

in signal transduction, 407-411 

signal transduction, 808-809, 808f 
Insulin receptor 

activation of, 408-409, 409f 

binding sites for, 808 

defined, 408 

structure of, 408, 408f 

subunits, 408, 408f 
Insulin resistance 

biochemical basis of, 808-809 

in muscle, 810-812 

in pancreatic failure, 810-812, 811f 
Insulin signaling 

illustrated, 409f 

lipid kinase in, 410, 410f 

pathway, 411, 411f 

termination of, 411 


Insulin-receptor substrates (IRS), 409-411, 


410f, 635 
Integral membrane proteins, 351, 351f 
Interatomic distances, 7b 
Intercalation, 870 


Intermediary metabolism. See Metabolism 
Internal guide sequence (IGS), 887 
Internal ribosome entry sites (IRES), 912 
Intracellular pathogen, 995, 995f 
Intramolecular hemiacetal, 318 
Intramolecular hemiketal, 318-319 
Intramolecular rearrangements, 654, 655f 
Intrinsic pathway, clotting, 304 
Intrinsically unstructured proteins, 55-56 
Introns 
defined, 106 
detection of, 127, 127f 
self-splicing, 887, 887f 
Inverted repeats, 272, 272f 
Ion channels, 378-389 
acetylcholine receptor, 385-387, 
386f, 387£ 
action potentials and, 378, 378f, 387-388 
ball-and-chain model for inactivation, 
384-385, 385f 
calcium, 382 
cell-to-cell, 389-390, 389f, 390f 
cGMP- gated, 972, 973 
defined, 368, 378 
disruption of, 388-389 
energetic basis of selectivity, 382, 382f 
equilibrium potential and, 387-388, 387f 
gap junction, 389-390, 389f, 390f 
in hearing, 976-977 
inactivation of, 384-385, 385f 
ligand-gated, 386 
membrane permeability and, 390-391 
nerve impulses and, 378 
patch-clamp technique, 379, 379f 
potassium, 378, 379-385, 381f, 381, 
384, 385f 
selectivity filter, 381, 381f, 382, 382f 
sequence relations of, 380, 380f 
shaker, 380 
sodium, 378, 382-385, 382f, 383f 
in taste, 969, 969f 
transient receptor potential (TRP), 977 
voltage-gated, 383-384, 384f 
Ion gradients, 433-434, 433f 
Ion semiconductor sequencing, 156 
Ion-exchange chromatography 
defined, 69-70 
illustrated, 70f 
Ionic interactions, 7 
IRE-binding protein (IRP), 955-956, 956f 
Iron 
metabolism of, 954-956 
oxygen-binding to, 192-195, 194f 
structural transition in subunit, 200 
Iron center, of ribonucleotide reductase, 
753, 753f 
Iron-response element (IRE), 955, 956 
Iron-sulfur clusters, 531-532, 532f 
Iron-sulfur proteins 
defined, 531 
in nitrogen fixation, 715-716, 716f 
in oxidative phosphorylation, 531-532 
structure of, 715 


Irreversible inhibition. See also Enzyme 
inhibition 
defined, 234 
in mapping active site, 237-238 
mechanism-based (suicide), 238, 238f 
Isocitrate 
in citric acid cycle, 504-505 
in glyoxylate cycle, 517, 517f 
Isocitrate dehydrogenase 
in citric acid cycle, 504, 512-513, 512f 
mutations in, 513 
Isocitrate lyase, 517 
Isoelectric focusing, 73, 73f 
Isoform, 292 
Isoleucine, 31f, 32 
Isomerases, 245t 
Isomerization reactions, in metabolism, 
441, 441f 
Isomers 
constitutional, 317, 317£ 
diastereoisomers, 317, 317f 
stereoisomers, 317, 317f 
Isopentenyl pyrophosphate, 776-777, 
777£, 777 
Isopeptide bonds, 683 
Isopropylthiogalactoside (IPTG), 930 
Isotope labeling studies, 510, 736 
Isovaleryl CoA, 702 
Isovaleryl CoA dehydrogenase, 702 
Isozymes 
in blood, 293 
defined, 292 
distinguishing, 292 
expression of, 286 
as homologous, 285 
of lactate dehydrogenase (LDH), 
292, 293f 
in varying regulation, 285-286 


J genes, 991-992 
Jagendorf’s demonstration, 578, 578f 
Jenner, Edward, 1006 


K* ions. See Potassium ions 
Reat (rate constant), 230 
Reat/ Ky (specificity constant), 
230-231, 231f 
Kg (dissociation constant), 229, 235, 
1017-1018 
Kendrew, John, 192 
Ketimine, 690 
a-ketoacids, 701-702, 720-721 
B-ketoacyl synthase, 663 
a-ketobutyrate, 701 
Ketogenic diets, 660 
Ketoglutarate, 233 
a-ketoglutarate 
in amino acid degradation, 698, 
700-701, 700£ 
in amino acid synthesis, 717-718, 720 
in citric acid cycle, 504-505, 505f 
in double-displacement reactions, 233 
in glutamate synthesis, 717, 718 


in nitrogen fixation, 717 
oxidative decarboxylation of, 505 
a-ketoglutarate dehydrogenase complex 
in citric acid cycle, 512-513, 512f 
defined, 505 
a-ketoisocaproate, 701 
Ketone bodies 
acetyl CoA formation from, 657-658 
in citric acid cycle, 819, 819f 
defined, 657 
in diabetes, 659-660, 660f 
in fatty acid metabolism, 657-660 
formation of, 657-658, 658f 
as fuel source, 658-660 
high levels of, 659-660 
in liver, 659, 659f, 819, 819£ 
as water soluble, 659 
Ketoses 
aldoses conversion to, 454 
defined, 316 
3-ketosphinganine, 772 
B-ketothiolase, 651 
Keyhole limpet hemocyanin (KLH), 994 
Khorana, Har Gobind, 124 
K; (inhibition constant), 1036 
Kidney 
drug excretion in, 1039 
gluconeogenesis in, 478 
Killed (inactivated) vaccines, 1006 
Killer T cells. See Cytotoxic (killer) T cells 
Kilobase (kb), 120b 
Kilocalories (kcal), 219 
Kilodaltons (kDa), 34, 36b 
Kilojoules (kJ), 219 
Kinase fold, 298 
Kinases, 453, 453f. See also specific kinases 
Kinesin 
ADP in, 1025, 1025f 
ATP binding to, 1014-1016, 1016f 
ATP hydrolysis in, 1024-1026, 1025f 
defined, 1012 
gene mutations, 1022 
monitoring movements mediated 
by, 1024f 
motion processivity, 1024-1026 
movement along microtubules, 1022, 
1024-1026, 1025f 
neck linker, 1016, 1016f 
P-loop NTPase, 1012 
relay helix, 1016, 1016f 
structure of, 1014, 1014£ 
walk, 1025 
Kinetic energy, 11 
Kinetics 
defined, 225 
enzyme. See enzyme kinetics 
as study of reaction rates, 225-226 
of water deprotonation, 267, 267f 
Klenow fragment, 829 
Klug, Aaron, 943 
Ky. See Michaelis constant 
Kornberg, Roger, 862 
Krebs, Hans, 434, 516b 
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Krebs cycle. See Citric acid cycle 
Kringle domain, 304 
Kuhne, Friedrich Wilhelm, 450b 


Lamino acids, 29, 29f 
Laborit, Henri, 1042 
Lac operon, 929-932, 929f, 930f 
Lac regulatory site, 926f 
Lac repressor 
binding of, 929 
components of, 929 
defined, 927 
effects of IPTG on, 930f 
inhibition relief, 931-932 
structure of, 929, 929f, 930 
Lac repressor-DNA complex, 
926-927, 926f 
Lactase, 324 
deficiency of, 467-468 
defined, 450 
Lactate, 232 
in Cori cycle, 486, 486f 
formation from pyruvate, 463-464 
formation of, 485-486, 486f 
in gluconeogenesis, 476 
in glycolysis, 449 
in muscle contraction, 485-486, 486f 
Lactate dehydrogenase (LDH), 292, 
293f, 487 
Lactic acid fermentation, 449, 463-464 
Lactobacillus, 468f 
B-lactoglobulin, 86f 
Lactonases, 602 
Lactose 
defined, 323-324 
structure of, 323-324, 324f 
Lactose intolerance, 467—468 
Lactose operon. See Lac operon; Lac 
repressor 
Lactose permease, 377, 377f, 377 
Lagging strand 
defined, 832 
synthesis of, 831f, 840-842 
template, 841 
‘y chains, 987, 987f 
d phages 
alternative infection modes for, 146, 146f 
for DNA cloning, 144-147 
mutant, as cloning vector, 146, 147f 
\ repressor 
defined, 933 
in gene expression, 932-933 
in genetic circuit formation, 933, 933f 
structure of, 933f 
synthesis control, 934, 934f 
Lamprey, 183, 183f 
Lanosterol, 778, 779, 781 
Lansoprazole, 1049, 1049 
Lateral diffusion, 356, 357, 357f 
Lavoisier, Antoine, 714 
Laws of thermodynamics, 10-12 
Leader peptide sequences, 936, 936f 
Leader sequences, 935-936, 935f 
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Leading strand 
defined, 832 
synthesis of, 831f, 840-842 
Leber hereditary optic neuropathy 
(LHON), 557 
Lectins 
as carbohydrate-binding proteins, 
334-336 
classes, 334-335 
C-type, 334-335, 334f 
defined, 334 
in interactions between cells, 334 
Leder, Philip, 991 
Leghemoglobin 
sequence alignment of, 176, 176f 
tertiary structure of, 178-179 
Leptin 
in caloric homeostasis, 805-807 
defined, 805 
effects in brain, 806-807, 806f 
obesity and, 806-807, 806f 
secretion of, 805, 806 
Leptin resistance, 806-807, 807f 
Lesch-Nyhan syndrome, 761-762 
Leucine, 31f, 32 
Leucine-rich repeats (LRRs), 982 
Leukemia, 474-475 
Leukocyte antigen A2 (HLA-A2), 
996-997, 997£ 
Lever arm, 1015, 1015f, 1022, 1022f 
Levinthal, Cyrus, 53 
Levinthal’s paradox, 53 
Licensing factors, 843 
Li-Fraumeni syndrome, 851 
Ligand binding 
concerted model, 198-199, 199f 
in gene expression, 930 
sequential model, 199, 199f 
Ligand-gated channels, 386 
Ligands, 950 
Ligases, 245t 
Ligation reactions, in metabolism, 440-441 
Light 
absorption, 971-972 
electromagnetic spectrum of, 970, 970f 
in lowering calcium level, 972-973 
Light (L) chains, 985-987, 987t, 991-992 
991f, 992F 
Light absorption, 569, 569f 
Light energy, 584-585, 970 
Light meromyosin (LMM), 1013 
Light reactions. See also Photosynthesis 
in Calvin cycle, 598, 598f 
defined, 566 
herbicides and, 584 
illustrated, 566f 
stoichiometry for, 581 
Light-harvesting complexes, 583 
Light-independent reactions. See Dark 
reactions 
Limit dextrin, 450 
Lineweaver-Burk plot, 229, 229f 
a-1,6 linkages, 631-632, 631f 


Linking number, 834-836, 834f 
Linoleoyl CoA, 653, 653f 
Lipases, 645-646, 646f 
Lipid bilayers. See also Membrane lipids 
as cooperative structures, 348 
defined, 342, 348 
diagram, 348f 
formation of, 348 
as permeability barrier, 349-350, 350f 
self-assembly process, 348 
as self-sealing, 348 
stabilization of, 348 
Lipid droplet, 645 
Lipid rafts, 358 
Lipid vesicles, 348-349, 349f 
Lipids 
digestion of, 645-646, 646f 
energy storage in, 645 
fatty acids in, 342-344, 343f, 343t 
hydrolysis of, 647-648 
membrane. See membrane lipids 
metabolism of. See fatty acid metabolism 
transport of, 646-647, 646f 
Lipinski’s rules, 1036-1037, 1037f 
Lipoamide, in citric acid cycle, 
500-501, 501f 
Lipoic acid, 499, 499f 
Lipolysis, 648, 648f 
Lipopolysaccharide (LPS), 982-983 
Lipoprotein particles, 782 
Lipoproteins, 781, 782f 
chylomicrons, 782, 782t, 783 
components of, 782 
high-density (HDLs), 782, 782t, 783, 
787-788 
intermediate-density (IDLs), 782, 
782t, 783 
low-density (LDLs), 782, 782t, 783-787, 
783f, 786f 
metabolism, 783f 
very-low-density (VLDLs), 782, 782t 
Liposomes 
defined, 348 
formation of, 348-349, 349f 
therapeutic applications of, 349 
Lipoxygenase, 669, 669f 
Live attenuated vaccines, 1006 
Liver 
alcoholic injury of, 820-821 
in amino acid degradation, 687, 692-693 
in cholesterol synthesis, 779 
cirrhosis of, 821 
drug metabolism in, 1039 
ethanol metabolism in, 819-822 
fasting and, 817 
fatty, 820 
gene expression in, 942, 942t 
gluconeogenesis in, 478, 483-485, 818 
glucose 6-phosphate (G-6P) in, 
622-623, 637 
glucose levels and, 622-623 
glucose sensor in, 636 
glycerol in, 648 


glycogen in, 618, 618f, 627 
glycogen metabolism in, 635-637, 636f 
glycogen phosphorylase in, 623-624, 624f 
glycolysis in, 472-473, 472f, 473f, 
483-485 
ketone bodies in, 659, 659f 
ketone body synthesis in, 819, 819f 
peroxisomes in, 657f 
in starvation, 818-819 
in starved-fed cycle, 816-817 
in triacylglycerol synthesis, 769 
L-lectins, 335 
Local hormones, 669f 
Lock-and-key model, 224, 224f 
Long interspersed elements, 154 
Long QT syndrome (LQTS), 388-389 
Loops, 44, 44f 
Lou Gehrig’s Disease, 136 
Lovastatin, 788, 788f, 1044, 1044 
Low-density lipoprotein receptor 
absence of, 784—785 
destruction of, 785-786 
LDL release, 785—786, 786f 
mutations in, 785-786 
Low-density lipoproteins (LDLs), 
782, 782t 
in cholesterol metabolism, 784 
cycling of, 787 
defined, 783 
in familial hypercholesterolemia, 784—785 
metabolism of, 783f 
receptor-mediated endocytosis of, 
784, 784£ 
schematic model of, 783f 
LuxR, 934 
Lyases 
defined, 666b 
reaction type, 245t 
Lymphotactin, 55, 55f 
Lynch syndrome, 851, 851f 
Lysine 
acetylated, 952-953 
structure of, 33, 33f 
Lysogenic pathway, 146 
Lysosomes 
defined, 332 
function of, 332 
in I-cell disease, 332 
with lipids, 774, 774f 
Lysozyme 
antibodies against, 990, 990f 
backbone model, 62f 
ball-and-stick model, 61f 
ribbon diagram, 62f 
space-filling model, 61f 
Lytic pathway, 146 


MacKinnon, Roderick, 380, 384 
Mad cow disease, 56 
Magnesium ions 
in ATP hydrolysis, 276-277 
in Calvin cycle, 592, 592f, 598-599 
in DNA cleavage, 269-271 


Major histocompatible complex (MHC), 
984-985, 996 
Major histocompatible complex proteins, 
995-1004 
anchor residues, 997-998, 997£ 
class I, 996-997, 996f, 998, 1002-1003, 
1003f 
class IT, 1000-1003, 1001f 
diversity of, 1002-1003 
peptide presentation by, 996-998, 996f 
in plasma membrane, 996 
structure of, 996-998 
T-cell receptors and, 998-1003, 998f 
in transplant rejection, 1002-1003 
Malaria, 206, 206f 
Malate, 517, 517f 
in Calvin cycle, 600f 
in citric acid cycle, 508 
in gluconeogenesis, 480 
Malate dehydrogenase, 508, 667 
Malate synthase, 517 
Malate-aspartate shuttle, 550, 550f 
MALDI-TOF mass spectroscopy, 86, 86f 
4-maleylacetoacetate, 704, 704 
Malic enzyme, 667 
Malonyl ACP, 661, 663 
Malonyl CoA, 662, 671 
Malony] transacylase, 663, 663t 
Malonyl/acetyl transacylase (MAT), 
664, 665f£ 
Maltase, 324, 450 
Maltose, 324, 324f 
Mammalian liver homogenate, 852 
Manganese, 574, 574f 
Manganese center, 574, 574f 
Mannose, 317-318, 318f, 320f 
Mannose 6-phosphate, 332, 332f 
Mannose 6-phosphate receptor, 334 
Maple-syrup urine disease, 705 
Mass spectroscopy 
MALDI-TOF, 86, 86f 
peptide sequencing by, 87-88, 88f 
in protein and peptide identification, 
85-92 
proteomic analysis by, 91-92, 92f 
tandem, 87-88, 88f 
Mass-to-charge ratio, 85 
Mast cells, 307, 307f 
Matrix-assisted laser desorption/ionization 
(MALDJ), 85, 332 
Maximal rate, 228-229, 230 
McArdle disease, 637t, 638 
Mechanism-based (suicide) inhibition, 
238, 238f 
Mediator, 946, 946f 
Megasynthases, 670 
Melanin, synthesis of, 734, 734f 
Melanocyte-stimulating hormone (MSH), 
805-806, 806f 
Melting temperature 
of DNA, 116 
of fatty acid chains, 357, 357t 
of phospholipid membrane, 357, 357f 


Membrane anchors, 354, 354f 
Membrane channels 

gap junction, 368 

ion, 368, 378-389 


membrane permeability to, 390-391, 391f 


water, 390-391, 391f 
Membrane diffusion 
facilitated, 367, 369 
lateral, 356, 357, 357£ 
lipid, 356 
protein, 357 
simple, 368 
Membrane lipids. See also Lipid bilayers 
as amphipathic molecule, 347 
archaeal, 346-347, 346f 
carbohydrate moieties, 345-346 
cholesterol, 346, 776-788 
defined, 342 
diffusion coefficient, 356 
fatting acids in, 342-344, 343f, 343t 
glycolipids, 344, 345-346 
hydrophilicity of, 347, 347f 
hydrophobicity of, 347, 347f, 348 
lateral diffusion, 356, 356f, 357, 357f 
metabolism, 775—776 
movement in membranes, 357, 357f 
permeability barrier, 350 
phospholipids, 344-345 
phospholipids, synthesis of, 768-776 
rate of travel, 356 
representations of, 347f 
sphingolipids, synthesis of, 772-774 
synthesis of, 767-797, 775£ 
triacylglycerols, synthesis of, 768-776 
as two-dimensional solutions, 342 
Membrane potential, 369 
Membrane proteins 
a helix of, 351-352, 355-356, 355f, 355t 
ATP hydrolysis, 370-376 
B strands of, 352, 352f 
carriers, 367 
channels. See membrane channels 
content variation, 350 
defined, 342 
diffusion, 357 
function of, 350 
gel patterns of, 350, 350f 
hydropathy plots for, 355, 356f 
hydrophobicity of, 354, 354f 
integral, 351, 351f 
linking to membrane surface, 353 
membrane interaction, 351—354 
peripheral, 351, 351f 
pumps. See pumps 
as two-dimensional solutions, 342 
Membrane transport 
ABC transporters in, 367, 374-376, 
374f, 375£ 
active, 367, 369 
free energy in, 369, 369f 
passive, 369 
potassium ion channels and, 383, 383f 
primary active, 367 
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secondary active, 367 

transporters. See transporters 
Membranes, 341-363 

archaeal, 346-347, 346f 

as asymmetric, 342, 357-358, 359f 

common features of, 342 

defined, 341 

as electrically polarized, 342 

endoplasmic reticulum (ER), 668 

enzymatic activities, 358-359 

fluidity of, 357-359 

ion gradients across, 433-434, 433f 

lipid bilayer, 342, 348-350 

mitochondria, 524-525 

as noncovalent assemblies, 342 

overview of, 341-342 

permeability of, 341, 349-350, 350f, 367 

planar bilayer, 349, 349f 

polarity scale for transmembrane helices, 

355, 355t 

processes of, 350-356 

protein interaction, 351-354 

as sheetlike structures, 342 

synthesis of, 359 
Membrane-spanning «a helices, 351-352, 
355, 355f 
Memory B cells, 1006 
Memory T cells, 1006 
Menten, Maud, 226 
6-mercaptopurine, 1051, 1051 
B-mercaptoethanol, 49, 49f, 50f 
Mercury poisoning, 515-516 
Merrifield, R. Bruce, 94 
Meselson, Matthew, 115, 116 
Metabolic pathways. See also specific 

pathways 

amphibolic, 425 

biosynthetic, 425 

classes of, 424 

defined, 424 

degradative, 425 

formation of, 426 

illustrated, 425f 

motifs, 435-445 

regulation of, 424 

thermodynamically favorable, 425, 426 

thermodynamically unfavorable, 426 
Metabolic syndrome, 809-810, 810f 
Metabolism, 423-446 

activated carriers in, 435-440 

anabolic reactions in, 428-429 

carbon bonds cleavage, 442, 442f 

catabolic reactions in, 428 

catalytic activity control and, 443-444 

cholesterol synthesis, 784 

common motifs in, 423-424, 435-445 

defined, 423 

drug, 1037-1039 

enzyme control and, 443 

evolution of, 444-445 

first-pass, 1039 

free energy in, 426-427 

fructose, 466, 466f 
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Metabolism—(continued) 
glucose, 424f 
of glucose 6-phosphate, 607-610 
group-transfer reactions in, 441, 441f 
hormones and, 443-444 
hydrolytic reactions in, 441-442, 442f 
integration of, 801-822 
interconnected interactions in, 
424-426, 424f 
iron, 954—956 
isomerization reactions in, 441, 441f 
ligation reactions in, 440-441, 440f 
lipid, 775-776 
lipoproteins, 783f 
overview of, 423-424 
oxidation-reduction reactions in, 
440, 440f 
of pyruvate, 462-464 
regulation of, 442-444 
in starvation, 818-819, 819t 
substrate accessibility control and, 444 
substrate cycles in, 485, 485f 
thermodynamics of, 425-426 
types of reactions in, 440t 
Metabolites, 2, 650 
Metal ion catalysis 
in ATP hydrolysis, 276-277 
in carbon dioxide hydrolysis, 265-267 
defined, 252-253 
in DNA cleavage, 271-272, 272f 
Metal ions, 829 
Metalloproteases, 261f, 261, 262 
Metals, cofactor, 217, 217t 
Metamorphic proteins, 55-56 
Metarhodopsin, 972 
Methionine 
degradation, 701 
from homocysteine, 725 
metabolism, 701, 702f 
oxidation of, 303, 303f 
structure of, 31f, 32 
in translation, 906, 906f 
Methionine sulfoxide, 303, 303f 
Methionine synthase, 725 
Methotrexate, 757 
Methylases, 274 
Methylation 
of adenine, 274-275, 275f 
in amino acid synthesis, 724—726 
DNA protection by, 274-275, 274f 
in gene expression, 949 
of phospholipids, 726 
reaction, 654 
Methylcobalamin, 725 
8-methylcrotonyl CoA, 702 
B-methylglutaconyl CoA, 702 
Methylmalonyl CoA, 655-656, 655f, 656 
N-methylmesoporphyrin, 241, 241f 
Metmyoglobin, 194 
Metoprolol, 1050 
Mevalonate, 776-777, 777£ 
Micelles, 348, 348f 
Michaelis, Leonor, 226 


Michaelis constant (Ky) 
in Cheng-Prusoff equation, 1036 
defined, 227 
determination of, 228-229 
as enzyme characteristic, 229-230 
physiological consequences of 
variations, 228 
values of enzymes, 229t 
Michaelis-Menten equation, 229, 233 
Michaelis-Menten kinetics, 225—233, 227f, 
235, 254, 287 
Microbiomes, 19, 19f 
Microbodies, 593 
MicroRNAs, 879, 879f, 956-957, 956f 
Microsomal ethanol-oxidizing system 
(MEOS), 820 
Microtubules 
arrangement of, 1023, 1023f 
in cilia, 1023 
defined, 1022 
dynamic instability of, 1023 
in flagella, 1023 
function of, 1023 
kinesin movement along, 1022, 
1024-1026, 1025f 
structure of, 1022-1023, 1022f 
tubulins in, 1023, 1023f 
Mineralocorticoids, 789 
Minoxidil, 1038 
Mismatch DNA repair, 848, 848f 
Mitchell, Peter, 542, 542b, 558, 578 
Mitochondria 
in apoptosis, 557-558 
ATP synthase in, 550-552 
brown fat, 555 
in citric acid cycle, 496, 496f 
codons of, 126, 126t 
electron entry via shuttles, 549-550 
electron micrograph and diagram of, 525f 
endosymbiotic origin of, 525-526 
genome of, 525-526, 525f, 526f 
matrix, 524 
membrane, 524 
overlapping gene complements of, 526, 526f 
oxidative phosphorylation in, 524-526 
properties of, 524-526 
structure of, 524-525, 525f 
Mitochondrial ATP-ADP translocase, 
550-551, 551f, 557 
Mitochondrial ATPase. See ATP synthase 
Mitochondrial biogenesis, 813, 813f 
Mitochondrial diseases, 557 
Mitochondrial mutants, 557 
Mitochondrial outer membrane 
permeabilization (MOMP), 558 
Mitochondrial porin, 524 
Mitochondrial transporters, 551-552, 
551f, 552f 
defined, 552f 
illustrated, 552f 
for metabolites, 551-552 
structure of, 551, 551f 
Mixed-function oxygenase, 703, 790-791 


Molecular evolution, 185-187 
Molecular heterogeneity, 242-243, 242f 
Molecular imaging agents, 406 
Molecular models 
backbone, 61-62, 62f 
ball-and-stick, 22-23, 23f, 61, 61f 
proteins, 61-62 
small molecules, 22—23 
space-filling, 22, 23f, 61, 61f 
Molecular motors, 1011-1031 
ATP binding to, 1014-1016, 1015f, 1016f 
dynein, 1012, 1014, 1015f, 1022-1026 
flagella, 1012, 1023, 1026-1029 
kinesin, 1012, 1014, 1022-1026 
myosin, 1012, 1013-1014, 1013f, 1014f, 
1018-1022 
operation of, 1011-1012 
overview of, 1011-1012 
P-loop NTPase superfamily, 1012-1016 
structures of, 1012-1016, 1013f, 
1014f, 1015f 
Molecular oxygen, toxic derivatives of, 
538-540 
Molecules 
amphipathic nature of, 348 
Fischer projections, 22 
homologous, 170 
permeability of, 350, 350f 
stereochemical renderings, 22 
three-dimensional structure of, 22—23 
Moloney murine leukemia virus, 159 
Monoclonal antibodies 
defined, 80 
generation for proteins, 80-82 
illustrated, 80f 
preparation of, 81f 
in signal transduction inhibition, 416-417 
Monod, Jacques, 198, 928 
Monomers, 107 
Monooxygenase, 703, 790-791 
Monosaccharides. See also Carbohydrates 
alcohols and, 322 
aldoses, 316 
amines and, 322 
boat form of, 320-321, 321f 
chair form of, 320-321, 321f 
defined, 316 
diastereoisomers, 317, 317f 
envelope form of, 321, 321f 
glycosidic bonds, 322, 322f 
isomers of, 317, 317f 
ketoses, 316 
linkage of, 323-325 
modified, 322-323, 322f 
reducing sugars, 321-322 
ring forms of, 319-321, 319f, 320f 
stereoisomers, 317, 317f 
structure of, 316-318 
Morphine, 1037 
MotA-MotB pairs, 1027, 1028f 
Motifs 
defined, 48 
helix-turn-helix, 48 
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mRNA 


cDNA prepared from, 149-150 
complementarity with DNA, 122, 122f 
defined, 105, 120 

5’ cap of, 878-879, 879f 

FMN and, 867 

modification of, 123, 123f 

poly(A) tail of, 878-879 

precursors, 876, 880-881, 882-883, 882t 
in protein synthesis, 126 

RNA editing and, 879-880, 879f 
structure of, 912, 912f 

transcription and, 883-884, 884f 
transferrin-receptor, 955, 955f 


Mucins 
defined, 326, 329 
overexpression of, 329 
structure of, 329, 329f 
Mucolipidosis, 332 
Mucopolysaccharidoses, 328 
Mucus, 329 
Muller, Hermann, 847 
Multidrug resistance, 374 
Multidrug resistance protein, 374 
Multifunctional protein kinases, 295 
Multiple myeloma, 81 
Multiple-substrate reactions 
classes of, 232 
defined, 231-232 
double-displacement, 232, 233 
sequential, 232 
Muscle 
ATP in contraction, 469-470 
excess fatty acids in, 810 
fatty acids in, 810, 815 
fiber types, 625, 625t 
glycogen in, 618 
glycogen phosphorylase in, 625, 625f 
glycolysis in, 469-471 
insulin resistance in, 810-812 
myofibrils in, 1020 
PP1 regulation in, 634, 634f 
sarcomere in, 1020, 1020f 
Muscle contraction 
actin in, 1019-1021 
alanine in, 486, 486f 
ATP formation in, 813-815 
ATP hydrolysis in, 1019-1022, 1020f 
fuel sources for, 813-815, 814t, 815f 
gluconeogenesis in, 485-487, 486f 
lactate in, 485-486 
myosin in, 1019-1021 
sliding-filament model of, 1020-1021, 1021f 
Muscle-relaxation pathway, 1042-1043, 1043f 
Mutagenesis 
cassette, 151, 151f 
oligonucleotide-directed, 150, 151f 
by PCR, 151-152, 151f 
site-directed, 150-151, 151f, 260, 260f 
Mutagens, 846-847 
Mutase, 460 


Mutations 
deletion, 150, 151-152, 151f 
disease-causing, functional effects of, 152 
in genes encoding hemoglobin 
subunits, 204 
insertion, 150, 151 
at level of function, 177-178 
at level of sequence, 177 
point, 150 
in recombinant DNA technology, 150-152 
substitution, 150 
MWC model, 198-199, 199f 
Mycobacterium tuberculosis, 142, 687 
Myofibrils, 1020 
Myogenin, 160, 161f 
Myoglobin 
alignment comparison, 173-174, 173f 
alignment with gap insertion, 173, 173f 
amino acid sequences of, 171-172, 
171f, 172£ 
as compact molecule, 46 
defined, 46, 171, 191 
distribution of amino acids in, 47f 
hemoglobin comparison, 191-192 
Hill plot, 210f 
oxygen-binding, 192-195, 196f 
oxygen-binding curve, 196, 196f 
as single polypeptide, 192 
structure of, 192, 192, 194-195 
tertiary structure of, 178-179 
three-dimensional structure of, 46f 


Myosin 


active site, 279, 279f 

altered conformation persistence, 278-279 

ATP binding to, 1014-1016, 1015f, 1016f 

ATP complex structure, 276, 276f 

ATP hydrolysis and, 276-278 

ATPase transition-state analog, 277, 277f 

binding to actin filaments, 1018, 1018f 

catalytic strategies of, 251-252 

conformational changes, 278, 278f 

defined, 275, 1012 

heavy chains, 1012, 1014, 1014f 

kinetic studies of, 276 

lever arm, 1015, 1015f, 1022, 1022f 

light chains, 1013, 1013f 

motion observation, 1018-1019, 1019f 

movement along actin, 1016-1022, 1020f 

in muscle contraction, 1019-1021 

P-loop NTPase, 1012 

P-loop structures, 280-281 

power stroke release, 1019 

regulatory light chain, 1012 

relay helix, 1016, 1016f 

S1, 1013, 1013f, 1015, 1015f 

$2, 1013-1014, 1013f 

single-molecule motion, 279-280, 280f, 
1018-1019, 1019f 

as slow enzymes, 278 

structure of, 1013-1014, 1013f 

switch I and switch IJ, 1015-1016, 1016f 

in thick filaments, 1020, 1021f 

in thin filaments, 1020 
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Na’ ions. See Sodium ions 
Na*-K* pumps, 370, 373 
NAD* (nicotinamide adenine dinucleotide) 


binding site for, 465, 465f 

in glycolysis, 458, 459f, 462-464, 465 
light absorption and, 67 

from metabolism of pyruvate, 462-464 
reduction potential of, 528 

regeneration of, 464, 464f 

structures of oxidized form of, 435, 435f 
synthesis of, 734, 734f 


NADH (nicotinamide adenine dinucleotide 


reduced) 
in ATP generation, 436 
electrons from, 549-550 
electron-transport chain and, 497, 510, 
526, 528-529 
ethanol metabolism in, 820-821 
in fatty acid oxidation, 650-652 
in glycolysis, 458, 462-464 
light absorption, 67 
pyruvate reduction by, 464 
in respiratory chain, 532-533, 533f 
structures of oxidized form of, 435, 435f 
transport of, 549-550 


NADH-cytochrome bs reductase, 668, 668f 
NADH-Q oxidoreductase, 529-531, 530f, 


530t, 532-533, 533f 


NADP* (nicotinamide adenine dinucleotide 


phosphate), 602, 607 


NADPH (nicotinamide adenine 


N 
N 
N 
N 
N 
N 
N 
N 
N 
N 


N 
N 
N 
N 


dinucleotide phosphate reduced) 
in Calvin cycle, 589-590, 597 
dihydrofolate reductase and, 756 
in fatty acid synthesis, 667 
formation by photosystem I, 579 
in glucose 6-phosphate conversion, 602 
pathways requiring, 601f 
in pentose phosphate pathway, 608-610 
in reductive biosynthesis, 436 
alidixic acid, 839 
eanderthals, 185, 185f 
eck linker, 1016 
egatively charged amino acids, 33-34, 34f 
eher, Erwin, 379 
ernst equation, 387 
erve impulses, 378, 385 
eural-tube defects, 762 
euroglobin, 208 
eurological diseases. See Diseases and 
disorders 
europeptide Y (NPY), 805-806, 806f 
eurotransmitter release, 360, 361f 
eurotransmitters, 385 
ext-generation sequencing (NGS) 
defined, 155 
detection methods in, 155, 155f 
as highly parallel, 155 
ion semiconductor, 156 
pyrosequencing, 155-156 
reversible terminator method, 155 


NF-kB, 687 
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N 
N 
N 
N 


N 


N 
N 


irenberg, Marshall, 124 

itric oxide (NO), synthesis of, 735, 735f 
itric oxide synthase, 735 

itrogen 

in amino acids, 714 

excess, disposal of, 693-698 
metabolism, 696f, 733 

peripheral tissue transport of, 692-693 
itrogen fixation 

in amino acid synthesis, 714-718 

ATP in, 715-718 

defined, 714 

FeMo cofactor in, 716, 716f 

glutamate in, 717-718 

glutamine in, 717—718 

iron-sulfur proteins in, 715-716, 715f 
P clusters in, 716, 716f 

itrogenase, 715, 716, 716f, 716 
itrogenase complex, 715 


'SN labeling, 735b 
N-linkage, 326, 326f 
N-linked oligosaccharides, 326, 326f, 332 


N 
N 
N 


ociceptors, 977 

ompC, 976 

oncompetitive inhibition. See also 
Enzyme inhibition 

defined, 234 

double-reciprocal plot, 236, 236f 

example of, 236 

inhibitor binding, 236 

kinetics of, 235-236, 235f 


Nonconservative substitutions, 174, 175f 
Noncovalent bonds 


N 


N 


N 
N 
N 
N 
N 
N 
N 
N 
N 


in biological molecules, 7—8 

hydrogen bonds, 5, 7, 8, 8f, 9, 10 
hydrophobic effect and, 9-10, 9f 
types of, 7 

van der Waals interactions, 7, 8, 8f, 10 
onessential amino acids 

defined, 719 

synthesis of, 719, 720 

onheme iron proteins, 531 
onhomologous end joining (NHEJ), 849 
onribosomal peptides, 670 


onshivering thermogenesis, 555f, 556, 558 


orthern blotting, 138 
ovobiocin, 839 

-terminal rule, 684 

TPases, P-loop. See P-loop NTPases 
uclear envelope, 360, 360f 
uclear hormone receptors 
defined, 949 

domain structures, 950, 950f 
drug-binding to, 951-952 

in gene expression, 949-951 
ligand-binding to, 950, 951f 


Nuclear magnetic resonance (NMR) 


spectroscopy, 65, 97-100 
basis of, 98, 98f 
chemical shifts, 98 
defined, 97 
NOESY and, 99-100, 99f, 100f 
one-dimensional, 98, 98f 


RNA structure, 187, 187f 
signals, 98f 
Nuclear Overhauser effect (NOE), 99 
Nuclear Overhauser enhancement 
spectroscopy (NOESY), 99-100, 
99f, 100f 
Nuclear pores, 360 
Nucleation-condensation model, 53 
Nucleic acids. See also DNA; RNA 
backbones in, 106-107, 106f 
bases, 106-109 
complementary sequences, 109-114 


complex structures, 113-114, 113f, 114f 


defined, 105 
melting of, 116 


nucleotides as monomeric units of, 107-108 


polymeric structure, 106f 

solid-phase synthesis of, 136 
Nucleoside 5’-phosphate, 108 
Nucleoside diphosphate, 245 
Nucleoside diphosphate kinase, 427, 747 
Nucleoside diphosphokinase, 506 
Nucleoside monophosphate, 427 
Nucleoside monophosphate kinases, 245, 
427, 747 
Nucleoside phosphorylases, 760, 761 
Nucleosides 

defined, 107, 744 

nomenclature of, 744 

units, 107-108 
Nucleosome core particle, 943, 943f 
Nucleosomes, 943-944 
Nucleotidases, 760 
Nucleotide synthesis, 743-763 

de novo pathways in, 744-752, 744f 

deoxyribonucleotide, 753-758, 

759-760, 759f 

disorders of, 760—762 

inosinate in, 759 

overview of, 743-744 


of purine, 748-752, 748f, 758-759, 758f 


of pyrimidine, 744—748, 744f, 758 

regulation of, 758-760 

salvage pathways in, 744-752, 744f 

substrate channeling in, 745-746, 746f 
Nucleotide triphosphates, 107 
Nucleotides 

cyclic, 743 

defined, 106, 743, 744 

DNA, 107 

in encoding amino acids, 124 

ester linkage, 107 

metabolism disruptions, 760-762 

as monomeric units of nucleic acids, 

107-108 

nomenclature of, 744 

RNA, 107-108 

sequences in DNA, 135-136 

in signal transduction pathways, 743 
Nutrition, 20, 20f 


Obesity, 801-812 
causes of, 803-804 


diabetes and, 807-812 
dieting and, 807 
evolution and, 802-804 
health consequences of, 803t 
insulin in, 805-806 
insulin resistance and, 809-810, 810f 
leptin resistance in, 806-807, 806f 
overview of, 801-802 
prevalence of, 803-804 
Obligate anaerobes, 464, 465t 
Octadecanoic acid, 342 
Odd-chain fatty acids, 654, 654f 
Odorant receptors (ORs), 964f 
activation of, 965f 
conserved and variant regions, 
963-964, 964f 
evolution of, 963-964, 963f 
in olfaction, 963-965 
Off-diagonal peaks, 99 
Okazaki, Reiji, 831 
Okazaki fragments, 831-832, 831f, 841 
Olfaction, 962-966. See also Sensory 
systems 
anosmias, 962 
combinatorial mechanisms in, 
964-966 
electronic nose in, 965—966, 966f 
evolution of, 963-964, 963f 
impaired, 962 
main olfactory epithelium in, 
962, 963f 
molecule shape and, 962 
odorant receptors (ORs) in, 963-965, 
963f, 964f 
7TM receptors in, 963 
signal transduction in, 964, 964f 
Olfactory neurons, 964, 965f 
Oligomeric 7TM receptors, 969 
Oligomerization, in B-cell activation, 
993-994 
Oligomycin, 556 
Oligonucleotide-directed mutagenesis, 
150, 151f 
Oligopeptides. See Peptides 
Oligosaccharide chain, 772 
Oligosaccharides, 332f 
antigen structures, 331, 331f 
attached to erythropoietin, 327, 327f 
defined, 323 
enzymes responsible for assembly, 331 
N-linked, 326, 326f, 332 
O-linked, 326 
sequencing of, 332-333, 333f 
O-linkage, 326 
Omeprazole (Prilosec), 1049, 1049 
Oncogenes, 416 
One-dimensional NMR, 98, 98f 
Open promoter complex, 866, 866f 
Open reading frames (ORFs), 177 
Operator site, 928 
Operons, 929f, 930f 
defined, 928-929 
histidine, 936-937, 936f 


lac, 929-932, 929f, 930f 
structure of, 928, 929f 
threonine, 936, 936f 
trp, 936 
Opsin, 970, 971 
Optical traps, 1019f 
Oral bioavailability, 1036, 1037f 
Origin of replication complexes, 843 
Origin of replication (oriC locus), 842 
Ornithine, 694-695 
Ornithine transcarbamoylase, 694, 696 
Ornithine transcarbamoylase deficiency, 
697-698, 698f 
Orotate, 746-747, 746f, 747£ 
Orotate phosphoribosyltransferase, 747 
Orthologs, 170, 171 
Orthophosphate, 619 
Osteoarthritis, 328 
Osteogenesis imperfecta, 46 
Osteomalacia, 795 
Otto, John, 307b 
Overlap peptides, 89, 89f 
Oxaloacetate, 233 
in amino acid degradation, 698, 700 
in Calvin cycle, 600, 600f 
in citric acid cycle, 496, 502, 507-508 
conversion into phosphoenolpyruvate, 480 
in fasting or diabetes, 657 
formation of, 480 
in gluconeogenesis, 477f, 478-480, 
480f, 819 
in glyoxylate cycle, 517f 
synthesis of, 440-441, 441f 
Oxidation 
activated carriers of electrons for, 435 
of aldehydes, 457 
carbon, 432—433 
in drug metabolism, 1037, 1039 
energy, 433 
in fatty acid metabolism, 648-649, 
650-652, 651t 
of glyceraldehyde 3-phosphate (GAP), 
458-459, 459f 
of membrane proteins, 652f 
phase I transformations in, 1039 
pyruvate dehydrogenase complex and, 
498-499 
of succinate, 507-508 
Oxidation-reduction potential. See 
Reduction potential 
Oxidation-reduction reactions 
free-energy change in, 528 
in metabolism, 440, 440f 
Oxidative phosphorylation, 523-560 
ADP in, 550-551, 551f, 553-554, 554f 
ATP yield in, 552-553, 553t 
ATP-ADP translocase in, 550-551, 
551f, 557 
cellular respiration in, 552-558 
chemiosmotic hypothesis in, 542-543, 
542f, 543f 
defined, 433, 523 
electron transfer in, 526-529 


electron-transport chain in, 548f 
in exercise, 814-815 
inhibition of, 556 
iron sulfur clusters in, 531-532, 532f 
in mitochondria, 524-526, 525f, 526f 
mitochondrial transporters in, 551-552, 
551f, 552f 
NADH in, 526, 528-529 
overview of, 523-524, 524f, 548f 
photosynthesis comparison, 579f 
proton gradients in, 433, 433f, 524, 529, 
541-549 
proton-motive force in, 523, 542-543, 
542f, 544-545, 545f 
rate of, 553-554, 554f 
reduction potential in, 526-528, 526f 
respiratory chain in, 433, 529-541 
respiratory control in, 553-554, 554f 
shuttles in, 549-552, 549f, 550f 
transporters in, 549-552, 551f, 552f 
uncoupling of, 554-556, 555f 
Oxidized lipoamide, 499-500 
Oxidoreductases, 245t 
Oxidosqualene cyclase, 778 
Oxoguanine-adenine base pair, 846f 
Oxyanion hole 
of chymotrypsin, 257, 257f 
defined, 257 
of subtilisin, 259, 259f 
Oxygen 
affinity in red blood cells, 200-201, 201f 
bound, stabilizing, 194f 
carbon monoxide and, 201-202 
concentration in tissues, 197, 197f 
cooperative release of, 197 
evolution of, 574b 
in hydroxylation function, 790-791 
partial pressure, 194-195 
photons for generation of, 574, 574f 
reactive, 194 
toxic derivatives of, 538-540 
Oxygen binding 
with concerted model, 212f 
cooperative, 196-197 
heme in, 193-194 
hemoglobin, 192-195, 196f 
hemoglobin quaternary structure and, 
197-198, 198f 
myoglobin, 192-195, 196f 
pure hemoglobin, 200, 200f 
sites, 197 
Oxygen therapy, hyperbaric, 202 
Oxygenase reaction, 593-594, 593f 
Oxygenases 
in aromatic amino acid degradation, 
703-705 
mixed-function, 703 
Oxygen-binding curve 
defined, 195 
fractional saturation, 195 
for hemoglobin, 196, 196f 
for Hill coefficients, 210-211, 211f 
for myoglobin, 196, 196f 
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sigmoid, 196 

T-to-R transition, 199, 199f 
Oxyhemoglobin, 198 
Oxymyoglobin, 192, 193 


P clusters, 716, 716f 
P site, ribosomal, 905, 905f 
p53, 851 
p160 family, 951 
P680, 572, 573 
P700, 572, 576, 576f 
P960, 571-572 
PALA, 289-290, 289f, 289, 290f 
Palindromes, 137, 137b 
Palmitate, 652, 652f, 666 
Palmitoleate, 652-653, 652f 
Palmitoyl CoA, 652, 652f, 671, 772 
Pamaquine, 611 
Pancreas 
a cells, 802 
B cells, 804 
gene expression in, 942, 942t 
Pancreatic failure, 810-812 
Pancreatic lipases, 645-646, 646f 
Pancreatic trypsin inhibitor, 302-303, 302f 
Pancreatic zymogens, 301-303, 301f 
Papain, 216-217 
Paracoccus denitrificans, 541 
Paralogs, 170, 171f 
Paraquat, 584 
Park, James, 1042 
Parkinson disease, 56—57, 683, 685 
Partial pressure, oxygen, 194-195 
Passenger strand, 162 
Passive membrane transport, 369 
Pasteur, Louis, 450, 1006 
Patch-clamp technique, 379, 379f 
Pathogen-associated molecular patterns 
(PAMPs), 982-983, 983t, 984f 
Pauling, Linus, 193, 204-205, 222b, 240 
PCSK9 (proprotein convertase subtilisin/ 
kexin type), 787, 788 
Penicillin, 1041-1042, 1041f, 1041 
conformations of, 240, 240f, 240 
mechanism of action, 239-240, 240f 
reactive site of, 239, 239f 
thiazolidine ring, 239 
Penicillium citrinum, 1044 
Penicilloyl-enzyme derivative, 240, 240f 
Pentose phosphate pathway, 601-612 
Calvin cycle and, 610 
in cell growth, 610 
defined, 589, 601 
erythrose 4-phosphate in, 604, 604f 
fructose 6-phosphate in, 604, 604f 
glucose 6-phosphate in, 602, 607-610 
glucose 6-phosphate dehydrogenase in, 
610-612 
glyceraldehyde 3-phosphate (GAP) in, 
603, 604, 604f 
glycolysis and, 602-610 
illustrated, 602f 
modes, 608-609 
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Pentose phosphate pathway—(continued) 

NADP? in, 607 

NADPH in, 608-610 

nonoxidative phase of, 602-607, 605t 

overview of, 589-590 

oxidative phase of, 602, 603f, 605t 

rate of, 607 

reactions, 605-607, 605f, 606f 

ribulose 5-phosphate in, 602, 603, 603f, 
604, 608-609 

sedoheptulose 7-phosphate in, 603, 
603f, 604f 

tissues with, 609t 


transaldolase in, 602-605, 606-607, 606f 


transketolase in, 602-606, 605f 
Pentose shunt. See Pentose phosphate 
pathway 
Pentoses, 316 
Peptide bonds 
chemical nature of, 253 
cis configuration, 38, 38t, 39t 
cleavage of, 59, 258 
defined, 35 
formation in translation, 908-909, 908f 
formation of, 35f 
hydrolysis of, 253 
lengths, 38, 38t 
as planar, 38, 38t 
rigidity of, 39 
rotation of, 38-39, 39t 
stability of, 35 
torsion angles, 39, 39b 
trans configuration, 38, 38t, 39t 
X-Pro linkages, 38, 39t 
Peptide mass fingerprinting, 91 
Peptide sequences, 52f 
Peptide-ligation methods, 95 
Peptides 
gastrointestinal, 804-805, 805t 
mass spectroscopy for sequencing, 
87-88, 88f 
nonribosomal, 670 
overlap, 89, 89f 
presentation by internalized 
proteins, 1000 
presentation by MHC proteins, 
996-998, 996f 
synthesis of, 92-95, 94f 
Peptidoglycan, 239, 239f 
Peptidyl transferase center, 908 
Perforin, 1000 
Peripheral membrane proteins, 351, 351f 
Periplasm, 359 
Periplasmic side, 570 
Permeability 
lipid bilayers, 349-350, 350f 
membrane, 341, 367, 390-391 
of potassium ion channels, 382 
selective, 341 
Permeability coefficients, 350, 350f 
Perovskite, 585 
Peroxide bridge, in cytochrome c oxidase, 
538, 538f 


Peroxisomes 
in Calvin cycle, 593 
defined, 657 
electron micrograph of, 594f, 657f 
in fatty acid oxidation, 656-657, 657f 
Perutz, Max, 195 
PEST sequences, 684 
P-glycoprotein, 374 
pH 
buffers and, 15-17 
carbon dioxide and, 203, 204 
defined, 13 
effect on carbonic anhydrase activity, 
265, 265f 
ionization state as function of, 30f 
Pharmacogenetics/pharmacogenomics, in 
drug development, 1050 
Pharmacology, 1034b. See also Drug 
development 
Phase I transformations, in oxidation, 1039 
Phase II transformations, in 
conjugation, 1039 
Phenotype-genotype correlation, 
1050-1051, 1050f 
Pheny] isothiocyanate, 87 
Phenylalanine 
degradation of, 703, 703, 704f 
structure of, 31f, 32 
synthesis of, 727-728, 728f 
Phenylalanine hydroxylase, 703 
Phenylketonuria, 705 
Phenylpyruvate, 728 
Phosphatases 
deficiency, 512 
defined, 295 
in phosphorylation, 294-296, 295t 
PP2A, 296 
Phosphate carrier, 552 
Phosphate esters, 434 
Phosphates, 16, 434 
Phosphatidate, 344, 769, 769 
Phosphatidic acid phosphatase (PAP), 768, 
775-776, 775£ 
Phosphatidylcholine, synthesis of, 
770-771, 771£ 
Phosphatidylethanolamine, 770, 770 
Phosphatidylethanolamine 
methyltransferase, 771 
Phosphatidylinositol, 769, 769 
Phosphatidylinositol 3,4,5-trisphosphate 
(PIP3), 410, 410f, 808-809 
Phosphatidylinositol 4,5-bisphosphate 


(PIP), 404, 405f, 410, 410£, 770, 808 


Phosphatidylinositol-dependent protein 
kinase (PDK), 809 

Phosphatidylserine, 771 
Phosphocreatine, 232 
Phosphoenolpyruvate (PEP) 

formation of, 461 

in gluconeogenesis, 477f 

in glycolysis, 461 

NAD*t regeneration in, 462-464 

phosphoryl-transfer potential, 430, 461 


Phosphoenolpyruvate carboxykinase, 480 
Phosphoenolpyruvate carboxylase, 600 
Phosphofructokinase (PFK), 454 
activation of, 472, 472f 
in liver, 472, 472£ 
in muscle, 469-470, 470f 
regulation of, 472, 472f 
Phosphofructokinase 2 (PFK2), 
483-484, 484f 
Phosphoglucomutase, 623f 
Phosphogluconate pathway. See Pentose 
phosphate pathway 
6-phosphogluconate dehydrogenase, 602 
6-phosphoglucono-8-lactone, 602, 603f 
Phosphoglucose isomerase, 454 
2-phosphoglycerate, in glycolysis, 
460-461, 460f 
3-phosphoglycerate 
in Calvin cycle, 591-592, 592f 
in cysteine synthesis, 722-723 
in glycine synthesis, 722-723, 723f 
in glycolysis, 460-461, 460f 
in oxygenase reaction, 593, 593f 
in serine synthesis, 722-723, 722f 
3-phosphoglycerate dehydrogenase, 
730-731 
Phosphoglycerate kinase, 459 
Phosphoglycerate mutase, 460, 622 
Phosphoglycerides, 344 
Phosphoglycolate, 593, 593f, 594f 
Phosphoinositide cascade, 629 
activation of, 404—405 
defined, 404 
illustrated, 405f 
reaction, 405, 405f 
Phospholipase C, 404 
Phospholipid synthesis 
from activated alcohol, 770 
activated intermediate, 769-770 
phosphatidylcholine, 770-771, 771f 
sources of intermediates in, 
768-776, 768f 
Phospholipids 
base-exchange reactions in generation 
of, 771 
in bimolecular sheet formation, 
348-350 
flip-flop, 357, 357f 
lipid vesicles formed from, 348-349 
melting temperature, 357, 357f 
membrane, 344-345 
platform for, 344 
Phosphopantetheine, 662f 
Phosphopentose epimerase, 595 
Phosphopentose isomerase, 595 
Phosphoribomutase, 760 


5-phosphoribosyl-1-pyrophosphate (PRPP) 


in purine synthesis, 749, 759 
in pyrimidine synthesis, 728, 
746-747, 747£ 
Phosphorolysis, in glycogen metabolism, 
619-623 
Phosphorothioates, 270-271, 271f 


Phosphoryl group, 296 
Phosphorylase. See Glycogen phosphorylase 
Phosphorylase a, 623-624, 624f, 626, 627f, 
636, 636f 
Phosphorylase b, 623, 624f, 625, 626, 627f 
Phosphorylase kinase 
activation of, 626, 627f 
defined, 626 
subunits, 626, 627f 
Phosphorylation 
amplified effects of, 296 
ATP formation by, 580, 580f, 581 
cAMP stimulation of, 403 
of carboxyl-terminal domain (CTD), 875 
of carboxy-terminal tail, 413 
in citric acid cycle, 511-512 
control of, 294-296 
in covalent modification, 293, 294t 
dephosphorylation and, 296 
EGF receptor, 413 
in enzyme regulation, 293-298, 294t 
free energy of, 296 
of fructose 6-phosphate, 454 
in gluconeogenesis, 480, 484 
in glycogen metabolism, 619-620 
in glycolysis, 451-453, 454 
in metabolism, 427, 433 
of nucleoside monophosphates, 427 
oxidative. See oxidative phosphorylation 
in phosphorylase activation, 626, 627f 
in phosphorylase conversion, 626 
in purine nucleotide synthesis, 749-751, 
749t, 750£ 
of pyruvate dehydrogenase, 511 
reactive intermediates, 323 
of serine residue, 484 
substrate-level, 460 
sugars, 323 
Phosphorylethanolamine, 770, 770 
Phosphoryl-transfer potential, 429, 
430-431, 430f, 432-433 
Phosphoserine, 57, 58 
Phosphothreonine, 58 
Phosphotriesters, 139, 140f, 140 
Photoinduced charge separation, 
569-572, 569f 
Photophosphorylation, cyclic, 580, 
580f, 581 
Photoreceptors, 970, 973-974, 973f 
Photorespiration 
in C4 pathway, 599-601 
in Calvin cycle, 594, 594f 
Photosynthesis, 565-586 
accessory pigments in, 581-584 
artificial photosynthetic systems and, 585 
in autotrophs, 566 
bacterial reaction center, 569-572, 
570, 571f 
basic equation of, 565 
chlorophylls in, 568-572 
in chloroplasts, 567-568, 567f 
components of, location of, 
583-584, 584f 


cyclic photophoshorylation in, 580, 
580f, 581 
dark reactions, 566, 589 
defined, 565 
electron transfer in, 568-572 
energy conversion in, 566-567 
enzymes in, 217 
herbicide inhibition of, 584, 584 
in heterotrophs, 566 
light absorption in, 569, 569f 
light energy conversion in, 566-567 
light reactions in, 566, 566f, 581, 584, 589 
light-harvesting complexes in, 583, 583f 
overview of, 565-567 
oxidative phosphorylation 
comparison, 579f 
photo charge separation in, 569-572, 569f 
photosystem I. See photosystem I 
photosystem II. See photosystem II 
proton-motive force in, 578 
reaction centers in, 569-572, 570f, 571, 
572, 576, 576f, 581-584 
resonance energy transfer in, 582-583, 582f 
in thylakoid membranes, 567-568 
yield in, 566b 
Z scheme of photosynthesis, 577, 577f 


Photosynthetic catastrophe, 567b 
Photosynthetic systems, 584-585, 584t 
Photosystem I. See also Photosynthesis 


defined, 567, 572 

electron flow through, 580 

ferredoxin generated by, 576-577 
illustrated, 572f 

inhibitors of, 584 

link to photosystem II, 575, 575f 
NADPH formation by, 579 

reaction center, 572, 576, 576f 
structure of, 575-576, 575f 

Z scheme of photosynthesis, 577, 577 


Photosystem II. See also Photosynthesis 


defined, 567 

electron flow through, 573b 
electron source, 573-574 
illustrated, 572f 

inhibitors of, 584 

link to photosystem I, 575, 575f 
location of, 574-575 
photochemistry of, 573 

reaction center, 572, 573 
structure of, 573, 573f 

Z scheme of photosynthesis, 577, 577f 
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glyoxylate cycle in, 516-517, 517f 
oxaloacetate synthesis, 660 
photosynthesis in. See photosynthesis 
starch in, 597 
sucrose synthesis in, 597-598, 598f 
thylakoid membranes of, 583 
Plaques, in genomic library screening, 
148, 148f 
Plasma-membrane proteins, 916, 916f 
Plasmids 
defined, 144 
engineered, 145 
expression vectors, 146 
genes, 144 
microinjection of, 159f, 160 
polylinker region, 145, 145f 
Ti, 163-164, 163f 
as vectors for DNA cloning, 144-146 
Plasmin, 308 
Plasminogen, 308 
Plasmodium falciparum, 206, 336, 525 
Plastocyanin, 583 
Plastoquinone, 572-573, 583 
Pleckstrin homology domain, 410 
P-loop NTPases 
in helicases, 832 
molecular motors, 1012-1016 
proteins containing, 280-281 
P-loop structures, 280-281 
PLP-Schiff-base linkage, 621 
Pluripotent stem cells, 947 
Point mutation, 150 
Poisoning 
arsenite, 516, 516f 
mercury, 515-516 
ricin, 915 
Polar amino acids 
structure of, 33 
types of, 32-33 
Poly(A) tail-binding protein I (PABPI), 
912, 912f 
Polyacrylamide gel electrophoresis (PAGE), 
72, 72£ 
Polyampholytes, 73 
Polyclonal antibodies, 80, 80f 
Polygalacturonase, 164 
Polyketides, 670 
Polylinker region, plasmid, 145, 145f 
Polymerase chain reaction (PCR) 
cycle steps, 141f, 142, 142f 
defined, 141 


Phytol, 569 
Pichia pastoris, 78 
Ping-pong reactions, 233 
Pinopsin, 974 
pK, values, 34t 
Placebo effect, 1052 
Planar bilayer membrane, 349, 349f 
Planck’s constant, 222 
Plants 
C3, 601 


DNA sequence amplification by, 
141-142, 141f, 142f 

first cycle of, 141f 

as gene exploration tool, 136 

inverse, 151, 151f 

multiple cycles of, 142f 

mutagenesis by, 151-152, 151f 

quantitative (qPCR), 157-158, 158f 

uses of, 142-143 


Cy, 600-601 
genetically engineered, 163-164, 163f 


Polymerase switching, 843-844 
Polymerization reactions, 1018 
Polymorphisms, 1002-1003, 1003f 
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Polypeptide chains 
backbone, 36, 36f 
bond rotation in, 38-39, 39t 
cleavage of, 88, 89t 
components of, 36f 
cross-linked, 36-37, 37f 
directionality of, 35-36 
disulfide bonds of, 36-37, 37f 
flexibility of, 38-39 
formation of, 35 
loops, 44, 44f 
random-coil conformation, 50 
residue, 35 
reverse turns, 44, 44f 
side chains, 36, 36f, 47 
subunit structures, 48—49 
Polypyrimidine tract, 880 
Polysaccharides 
defined, 324 
glycosidic bonds, 325, 325f 
Polysomes, 910, 910f 
Polyunsaturated fatty acids, 661, 669 
Pompe disease, 637-638, 637t, 638f 
Porin 
amino acid distribution in, 47f 
amino acid sequence of, 352, 352f 
defined, 352 
hydropathy plot for, 355, 356f 
structure of, 352 
Porphobilinogen, 736, 737, 737f 
Porphobilinogen deaminase, 736 
Porphyrias, 737-738 
Porphyrins, 241 
disorders of, 737—738 
synthesis of, 736-737, 737f 
Porter, Rodney, 985 
Positively charged amino acids, 33, 33f 
Posttranscriptional gene expression 
in eukaryotes, 954-957 
in prokaryotes, 935-937 
Posttranslational modifications, 90 
Potassium ion channels 
as archetypical structure, 379-380 
hERG, 389 
inactivation of, 384-385, 385f 
path through, 381, 381f 
permeability of, 382 
purification of, 380 
selectivity filter, 381, 381f, 382, 382f 
structure of, 380—382, 381f, 382 
transport model, 383, 383f 
voltage-gated, 383-384, 384f 
Potassium ions 
action potentials and, 378 
dehydration of, 382, 382f 
hydration of, 383f 
Potential energy, 11 
Power stroke, 1019 
PP2A, 296 
Precursor ions, 87 
Pregnenolone 
androgen synthesis by, 792-793 
corticosteroids from, 792 


defined, 792 

estrogen synthesis by, 792-793 
progesterone from, 792 
structure of, 793 

synthesis of, 792-793 


Pre-mRNA processing, 877, 877t, 878-879, 


878f, 884-885, 884f 
Prephenate branch, 728 
Prilosec (omeprazole), 1049, 1049 
Primaquine, 611 
Primary active transport, 367 
Primary antibody, 83-84 
Primary messenger, 398 
Primary protein structure, 27, 35-40, 48, 
78-79 
Primase, 831 
Primers 
defined, 829b 
DNA probes as, 140 
in DNA replication, 831, 831f 
match stringency, 142 
in polymerase chain reaction (PCR), 
141-142, 141f 
RNA, 841 
Prion diseases, 56—57, 57f 
Prions, 56, 56f 
Probes. See DNA probes 
Procaspases, 299 
Processivity 
defined, 621, 839, 840b 
in DNA replication, 839-840, 839f 
kinesin motion, 1024-1026 
Procollagen, 300 
Product ions, 87 
Progesterone, 789, 792, 793 
Progestogen, 789 
Programmed cell death. See Apoptosis 
Proinsulin, synthesis of, 150, 150f 
Prokaryotes. See Bacteria 
Prokaryotic RNA polymerases, 860, 860 
Proliferating cell nuclear antigen 
(PCNA), 832 
Proline 
degradation, 701, 701f 
structure of, 31f, 32 
synthesis of, 722-723 
Prolyl hydroxylase, 821 
Prolyl hydroxylase 2, 513 
Promoter sites 
base sequences, 122b 
defined, 122 
for transcription, 122, 122f 
Promoters 
alternative sequences, 865, 865f 
bacterial, 864, 864f 
closed complex, 865-866, 866f 
core, 864 
defined, 864 
open complex, 866, 866f 
strong, 864 
upstream of, 864 
weak, 864 
Pro-N-terminal degrons, 684 


Proofreading 
in DNA repair, 847, 848f 
in transcription, 863 
in translation, 900-901 
Propionyl CoA, 654, 654f, 701-702 
6-n-propyl-2-thiouracil, 967, 967 
Prostacyclin, 669 
Prostacyclin synthase, 669, 669f 
Prostaglandin, 669 
Prostaglandin H 
attachment of, 353 
defined, 353 
formation of, 353, 353f 
synthase-1, 353f, 354f 
Prostaglandin Hz synthase-1, 353-354, 353f 
Prosthetic groups, 46, 217 
Protease inhibitors 
Q,-antiproteinase, 303 
drugs, 263-264, 263f 
in HIV infection treatment, 263, 263f, 
792, 1046, 1046f, 1053 
initial design of, 1046, 1046f 
pancreatic trypsin inhibitor, 302-303, 302f 
serine, 302 
Proteases, 333 
active sites, 261f 
applied to glycoproteins, 333 
aspartyl, 261f, 261, 262, 262f 
catalytic triad in, 258-260 
cysteine, 261-262, 261f, 261 
metalloproteases, 261f, 261, 262 
reaction facilitation, 253-264 
serine, 251 
Protease-substrate interactions, 258, 258f 
Proteasomes. See also Amino acid 
degradation 
195 regulatory unit, 685, 685f 
205, 685, 685f 
265, 685, 685f 
in amino acid degradation, 685-686 
defined, 681 
evolution of, 686, 686f 
free amino acid generation, 686f 
prokaryotic, 686, 686f 
Protein colipase, 646 
Protein Data Bank, 100 
Protein domains, 48, 48f 
Protein folding, 904f 
essence of, 53 
funnel, 54, 54£ 
as highly cooperative process, 52—53 
illustrated, 21f 
Levinthal’s paradox in, 53 
misfolding and, 811 
nucleation-condensation model, 53 
pathway of chymotrypsin inhibitor, 54f 
by progressive stabilization, 53-54 
transition from folded to unfolded, 52f 
typing-monkey analogy, 53, 53f 
Protein identification 
cleavage in, 88-89, 89f 
genomic and proteomic methods as 
complementary, 89-90 


MALDI-TOF mass spectroscopy in, 
86, 86f 
mass spectroscopy in, 85—92, 86f, 88f, 92f 
peptide mass fingerprinting, 91 
in protein studies, 65 
Protein kinase A (PKA) 
bound to inhibitor, 298, 298 
cAMP activation of, 297, 297£ 
catalytic subunit, 298, 298f 
chains, 403 
consensus sequence, 295 
defined, 286 
gene stimulation, 403 
in glycogen metabolism, 633, 633f 
pseudosubstrate binding to, 298, 298f 
regulation of, 297, 297f 
in signal transduction, 415 
Protein kinase B (PKB), 809 
Protein kinase C (PKC), 810 
Protein kinase inhibitors, as anticancer 
drugs, 417 
Protein kinases 
catalysis, 286 
dedicated, 295 
defined, 294 
kinase fold, 298 
multifunctional, 295 
in phosphorylation, 294—296, 295t 
serine, 294-295, 295t 
in signal transduction, 415 
specificity, 295 
threonine, 294-295, 295t 
Protein phosphatase 1 (PP1) 
functions of, 629, 633-635 
in glycogen metabolism, 629, 633-635, 
634f, 636 
phosphorylase a and b and, 636 
regulation in muscle, 634, 634f 
regulation of glycogen synthesis by, 
633-634, 634f 
substrate for, 636 
subunits, 634 
Protein purification, 78 
affinity chromatography in, 70-71, 70f 
assays in, 67 
binding affinity and, 68-71 
cell release and, 67—68 
charge and, 68-71 
dialysis in, 69, 69f 
electrophoretic analysis of, 75f 
gel electrophoresis in, 71—73, 72f, 73f 
gel-filtration chromatography in, 69, 69f 
high-performance liquid chromatography 
(HPLC) in, 71, 71£ 
homogenization in, 67 
ion-exchange chromatography in, 
69-70, 70 
isoelectric focusing in, 73, 73f 
in protein studies, 65 
quantitative evaluation of, 75-76, 75f 
recombinant DNA technology in, 78-79 
salting out in, 68 
SDS-PAGE in, 73, 73, 74 


sedimentation coefficient and, 76, 77f 


sedimentation-equilibrium technique in, 78 


separation of proteins in, 71—75 
size and, 68-71 
solubility and, 68-71 
two-dimensional electrophoresis in, 
74-75, 74£ 
ultracentrifugation and, 76-78, 77f 
zonal centrifugation in, 77, 77f 
Protein sorting 
defined, 915 
pathways, 918, 918f 
Protein structures, 2f, 3 


amino acid sequence as determinant, 49-59 


amino acid sequences and, 37, 37f 

complex assembly, 28, 28f 

dictating function, 28f 

elucidation of, 100 

family of, 100, 100f 

models of, 22—23, 23f, 61-62 

primary, 27, 35-40, 48, 78-79, 178-179 

quaternary, 27, 48—49 

secondary, 27, 40-46, 48 

synthetic peptides and, 93 

tertiary, 27, 46-48, 178-179 

three-dimensional prediction from 
sequence, 54 

Protein studies 

enzyme-linked immunosorbent assay 
(ELISA) in, 82-83, 83f 

fluorescence microscopy in, 84, 84f 

immunology in, 79-84 


mass spectroscopy in, 85-92, 86f, 88f, 92f 


peptide synthesis, 92-95 
purification methods for, 66-79 
steps in, 65-66 
western blotting in, 83-84, 83f 
x-ray crystallography in, 95-97 
Protein synthesis, 893-921. See also 
Translation 
accuracy of, 894-895, 894t 
adaptors in, 123, 123f 
antibiotic inhibitors of, 913-915, 914, 
914t, 915f 
endoplasmic reticulum (ER) in, 915-919 
eukaryotic versus bacterial, 911-913 
insulin in, 816 
mRNA in, 126 
peptide-ligation methods, 95 
polypeptide-chain growth in, 894, 894f 
ribosomes as site of, 893, 902-911 
Protein targeting, 915-919 
Protein turnover. See also Amino acid 
degradation 
defined, 682 
regulation of, 683-687 
Proteins, 27—63 
acetylation of, 293-294 
adaptor, 410 
affinity tags and, 78 
aggregated, in neurological diseases, 
56-57, 56f 
aggregation of, 683 
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allosteric, 285, 286-292 

alpha helix of, 40-41 

amino acids. See amino acids 

antibody generation to, 79-80 

B pleated sheets of, 44-45, 44f, 45f 

as building block, 20 

building blocks, 2 

carbohydrate units of, 58 

cargo, 918-919 

cellular, degradation of, 682-683 

chaperone, 50 

cleavage of, 57-59, 88-89, 89f 

coat (COPs), 919 

coiled-coil, 44-45, 44f, 44, 45£ 

covalent modification of, 293-298, 294t 

crystallization of, 95 

defined, 2, 36 

degradation of. See amino acid 
degradation 

denatured, 52, 52f 

dietary, digestion and absorption of, 
682, 683f 

disulfide bonds, 36-37 

DNA-binding, 926-932 

enzymes as, 215 

evolution of, 128 

fibrous, 44—46 

flexibility of, 29, 29f 

functional groups of, 27-28 

functions of, 27—29 

genetically engineered, 152 

glycan-binding, 334 

glycosylation of, 327-332 

half-life of, 683, 684, 685t 

hemoglobin as model of, 191-212 

interaction of, 28 

intrinsically unstructured, 55-56 

as linear polymers, 27-28, 35 

loops, 44, 44f 

membrane. See membrane proteins 

metamorphic, 55-56 

misfolding of, 56-57 

molecular weights of, 76t 

overview of, 65—66 

plasma-membrane, 916, 916f 

properties of, 27—29 

pure, 66 

refolding of, 50, 50f 

regulatory, 930 

release factors, 126 

repressor, 928 

reverse turns, 44, 44f 

ribosomal. See ribosomes 

rigidity of, 29 

S values of, 76t 

secretory, 916, 916f 

secretory pathway, 915-916 

separation of, 65, 71-75 

subunits of, 48—49 

tagging for destruction, 683-685 

translocation of, 915-919 

unfolding of, 52, 52f 

unstructured, 55—56 
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Proteoglycans 
in cartilage, 328-329, 329f 
defined, 326 
properties of, 327-328 
structural roles, 327-328 
Proteolysis, 216, 819 
Proteolytic activation, 286, 299-308 
apoptosis by, 299 
of chymotrypsinogen, 299-300, 300f 
zymogens, 301, 301f 
Proteolytic enzymes 


Purine nucleotide synthesis, 24, 34, 724 


AMP in, 751-752 
control of, 758-759, 758f 
de novo pathway for, 450f, 748, 
748£, 749t 
enzymes of, 752 
GMP in, 751-752 
phosphorylation in, 749-751, 749t, 750f 
ribose phosphate in, 749 
salvage pathway for, 748, 752 


Purine nucleotides, 107, 748 


activation of, 299-308 

in blood clotting, 303-304 
chymotrypsinogen as, 299-301, 300f 
in digestion, 299-303 

function of, 216 

inhibitors of, 302-303 


Purinosomes, 752, 752f 

Puromycin, 914, 914t, 914 

Pyran, 318 

Pyranose, 318, 319f, 320-321 

Pyridoxal phosphate enzymes 
bond cleavage in, 691, 691f 


Proteomes, 66 

Prothrombin, 304, 304f 

Proton abstraction, 268 

Proton gradients 
across thylakoid membrane, 578-581 
in ATP synthesis, 541-549 


reaction choice, 691-692, 692f 
stereoelectronic effects, 691, 691f 


Pyridoxal phosphate (PLP), 621, 621f 


in amino acid degradation, 682, 691-692 
in Schiff-base intermediates formation, 
689-690 


cytochrome bf contribution to, 575, 575f 

directionality of, 575, 575f 

in oxidative phosphorylation, 433, 433f, 
523, 524, 529, 541-549 


in Schiff-base linkage, 720 
Pyridoxamine phosphate (PMP), 690, 
720, 720£ 
Pyridoxine (vitamin Be), 689 


power transmission by, 558, 558f 
Proton shuttle, 267-268, 268f 
Proton transport, 538, 538f 
Proton-motive force 

ATP forms without, 544-545, 545f 

in oxidative phosphorylation, 523, 

542-543, 542f 

in photosynthesis, 578 
Proto-oncogenes, 416 
Protoporphyrin, 192 
Protoporphyrin IX, 736, 737f 
Proviral DNA, 159 
Proximal histidine, 193, 200 
PrP, 56 
Prusoff, William, 1035 
Pseudo-first-order reactions, 226 
Pseudogenes, 154 
Pseudosubstrate sequence, 297 
Psoralens, 847, 883 
P-type ATPases 

defined, 367, 370 

as evolutionarily conserved, 374 

SERCA, 370-373 
Puffer fish, genome of, 157, 157f 
Pumps 

action, 371, 371f 

ATP-driven, 367 

calcium ion, 371, 371f, 372-374, 372f 

defined, 367 

ion gradients, 368 

MDR, 374 

Na*-K*, 370, 373 

purification of, 370 
Pur repressor, 931, 931f 
Purine biosynthetic pathway, 696 
Purine catabolism, 760f 


Pyrimidine biosynthetic pathway, 696 
Pyrimidine nucleotide synthesis, 734 


carbamoyl phosphate in, 745, 745 
control of, 758 

cytidine triphosphate (CTP) in, 747-748 
de novo pathway for, 744, 744f 
glutamine in, 745 

mono-, di-, and triphosphates in, 747 
orotate in, 746-747, 746f 

substrate channeling in, 745-746, 746f 


Pyrimidine nucleotides 


defined, 744 
recycling of, 748 


Pyrimidines, 107 
Pyrosequencing, 155-156 
Pyrrolidine rings, 45, 45f 
Pyruvate, 232 


in amino acid degradation, 692-693, 698, 
699-700, 699f 

in ATP formation, 460—461 

carboxylation of, 479-480, 514 

conversion into phosphoenolpyruvate, 
478-480, 479f 

fates of, 462, 462f 

glucose conversion into, 461-462 

in glycolysis, 452f 

metabolism disruption, 515-516 

oxaloacetate synthesis from, 
440-441, 440f 

reduction of, 464, 464f 


Pyruvate carboxylase 


in citric acid cycle, 514-515 
in gluconeogenesis, 478-480, 479f 


Pyruvate decarboxylase, 462-463, 513 
Pyruvate dehydrogenase, in citric acid cycle, 


511-512 


Pyruvate dehydrogenase complex 
carbon dioxide production, 
498-500 
in citric acid cycle, 497-501, 497f, 497t, 
501f, 511-512, 511f 
components of, 498-500, 499f 
of E. coli, 497£, 497 
mechanism, 498-500, 499f 
reactions of, 500-501, 501f 
regulation of, 511-512, 511f 
response to energy change, 511, 511f 


Pyruvate dehydrogenase kinase (PDK), in 


cancer, 513 


Pyruvate dehydrogenase phosphatase, 511 
Pyruvate kinase 


liver and glycolysis and, 473, 473f 
muscle and glycolysis and, 471 
Pyruvate kinase M, 475 


Q cycle 
defined, 535 
in respiratory chain, 535, 535f 
Q-cytochrome c oxidoreductase 
defined, 533-534 
in respiratory chain, 529-531, 530f, 530t, 
533-534 
structure of, 534, 534f 
Quantitative PCR (qPCR), 157-158, 158f 
Quaternary protein structure, 27, 48f 
complex, 49f 
defined, 48 
Quinones 
oxidative states of, 531, 531f 
ubiquitous, 530 
Quinonoid intermediate, 690 
Quorum sensing, 934, 934f 


R groups. See Amino acid side chains 
R state, of hemoglobin, 198, 199f, 200, 
211-212 
Racker, Efraim, 542b 
Raf, 414 
Raloxifene, 952 
Ramachandran plot 
for angles of rotation, 40f 
B stands, 42f 
defined, 39 
for helices, 40f 
Random-coil conformation, 50 
Ranitidine (Zantac), 1049, 1049 
Ras 
activation of, 413, 413f 
affixation to cytoplasmic face, 294 
GTPase activity of, 414, 414t 
in protein kinase cascade initiation, 414 
Rate constants, 225-226 
RBP4, 806 
Reactants, 219 
Reaction centers 
accessory pigments and, 581-584, 583f 
bacterial, 569-572, 570f, 571f 
cyclic electron flow in, 572, 572f 
defined, 569 


photosystem I, 572, 576, 576f 
photosystem IT, 572, 573 
Reaction rates 
enzymes and, 220-221, 220f 
free-energy difference and, 218 
kinetics as study of, 225-226 
rate constants, 225—226 
Reactions 
acid-base, 13-15 
ATP hydrolysis, 279, 279f 
bimolecular, 226 
in Calvin cycle, 590-597, 597f 
carbonic anhydrases and, 264-268 
of citric acid cycle, 508 
deleterious, 456 
DNA ligase, 832, 832f 
double-displacement, 232, 233 
enzyme acceleration of, 221-225 
equilibrium, 220-221 
equilibrium constant of, 220 
in fatty acid oxidation, 651t 
in fatty acid synthesis, 662-664 
first-order, 225 
free-energy difference of, 218, 219 
of gluconeogenesis, 482t 
group-transfer, 441, 441f 
hydrolytic, 441-442, 442f 
isomerization, 441, 441f 
ligation, 440-441, 440f 
multiple-substrate, 231-233 
oxidation-reduction, 440, 440f 
oxygenase, 593-594, 593f 
pentose phosphate pathway, 605-607, 
605f, 606f 
polymerization, 1018 
pseudo-first-order, 226 
of pyruvate dehydrogenase complex, 
500-501, 501f 
reduction potentials of, 527t, 
528-529 
second-order, 226 
sequential, 232 
standard reduction potentials of, 527 
transaldolase, 606—607, 606f 
transketolase, 605-606, 605f 
velocity versus substrate concentration, 
222, 222f 
Reactive oxygen species (ROS) 
defined, 539 
glucose 6-phosphate dehydrogenase in 
protection against, 610-612 
pathological conditions that may entail 
injury, 539t 
release prevention, 194-195 
source of, 656 
Reactive substrate analogs, 237-238, 
237f, 238f 
RecA, 853, 933 
Receptor-mediated endocytosis, 360-361, 
360f, 784, 784£ 
Reclinomonas americana, 526 
Recognition helix, 927 
Recognition sites (sequences), 269 


in cognate and noncognate DNA, 
272-274, 273f, 274£ 
defined, 269 
distortion of, 273-274, 273f 
EcoRV endonuclease, 272, 272f 
structure of, 272, 272t 
tRNA template, 123 
Recombinant DNA 
formation of, 143-144 
manipulation of, 143 
Recombinant DNA technology, 135-166 
amino acid sequence information, 89-90 
bacterial artificial chromosomes in, 147 
in biology, 143-152 
blotting techniques in, 136, 138, 138f 
cloning in, 147-148 
cohesive-end method in, 144, 144f 
complementary DNA (cDNA) in, 
149-150, 149f, 150f 
defined, 135 
in disease-causing mutations 
identification, 143 
DNA probes in, 138, 139-141 
DNA sequencing in, 136, 138-139 
DNA synthesis in, 139-141, 140f, 141 
electroporation in, 163, 164f 
functional effects of disease-causing 
mutations and, 152 
d phase in, 144-146, 147f 
gel electrophoresis in, 137-138, 138f 
gene disruption in, 160-163, 160f, 161f 
gene expression analysis in, 157-159, 
158f, 159f 
gene therapy in, 164 
genomic libraries in, 147-148, 148f 
mutation creation, 150-152 
in mutation identification, 143, 143f 
overview of, 135-136 
in plants, 163-164, 163f 
plasmids in, 144-146 
polymerase chain reaction (PCR) in, 136, 
141-143, 141f, 142 
in protein purification, 78-79 
requirement, 136 
restriction-enzyme analysis in, 136, 137 
RNA interference in, 162-163, 162f 
solid-phase approach in, 139-141, 140f 
tools of, 136-143 
transgenic animals in, 159f, 160 
vectors in, 145-146 
yeast artificial chromosomes (YACs) in, 
147, 147£ 
Recombinases, 854 
Recombination 
in antibodies, 991-995, 991f, 992f 
in color blindness, 974-975, 974£ 
DNA. See DNA recombination 
homologous, 974b 
Recombination signal sequences (RSSs), 992 
Recombination synapse, 854 
Redox balance, 463, 463f 
Redox couples, 527 
Redox potential. See Reduction potential 
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Reducing sugars, 321-322 
Reductase mRNA, in cholesterol regulation, 
781 
5a-reductase, 793 
Reduction potential 
measurement of, 526, 526f 
of NAD*, 528 
in oxidative phosphorylation, 
526-528, 526f 
of reactions, 527t, 528-529 
Reed, Randall, 963 
Regulator genes, 928 
Regulatory domains 
in amino acid synthesis, 731-732 
recurring, 731f 
structures of, 732f 
Regulatory light chain, 1012 
Regulatory protein Py, 733, 733f 
Regulatory proteins, 930 
Regulatory sites, 287 
Relay helix, 1016, 1016f 
Release factors 
defined, 126 
ribosome (RRF), 911 
translation termination by, 910-911, 911f 
Repeating motifs, 91, 91f, 180-181, 180f 
Replication factor C (RFC), 844 
Replication fork. See also DNA replication 
defined, 831 
schematic view of, 840f 
Replication protein A, 843 
Replicon, 843 
Reporter genes, 145 
Repressors 
corepressors, 931 
A, 932-934, 933f 
lac, 928-930, 929f, 930f 
protein, 928 
pur, 931, 931f 
Resistin, 806 
Resonance energy transfer, 582-583, 582f 
Resonance structures 
ATP (adenosine triphosphate), 429 
defined, 7 
depiction of, 7 
improbable, 429f 
Respirasome, 529 
Respiration, 524b 
Respiratory chain 
coenzyme Q in, 530-531 
components of, 530f, 530t 
cytochrome c oxidase in, 529, 530f, 530t, 
535-538 
defined, 523 
electron transfer in, 540-541, 540f 
NADH-Q oxidoreductase in, 529-531, 
530f, 530t, 532-533, 533f 
in oxidative phosphorylation, 529-541 
Q cycle in, 535, 535f 
Q-cytochrome c oxidoreductase in, 529, 
530f, 530t, 533-534 
succinate-Q reductase in, 529, 530f, 
530t, 533 
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Respiratory control, 553-554, 554f 
Respiratory distress syndrome, 774 
Respiratory quotient (RQ), 815 
Restriction enzymes (endonucleases), 251 
analysis, 137 
defined, 137b, 269 
in DNA cleavage, 137, 269-275 
in E. coli, 269 
in forming recombinant DNA molecules, 
143-144 
hydrolysis of phosphodiester bond, 
269-270, 269f 
inverted repeats, 272, 272f 
recognition sites, 272, 272f, 273-274, 273f 
restriction-modification systems, 274 
specificity, 137, 137f, 272, 274 
type II, 275 
Restriction fragments 
separation by gel electrophoresis, 
137-138, 138f 
in Southern blotting, 138 
Restriction-modification systems, 274 
11-cis-retinal, 970, 971-972, 972f 
Retinitis pigmentosa, 884-885 
Retrovir, 263 
Retroviruses 
defined, 118 
in drug resistance evolution, 1053 
flow of information in, 119, 119f 
in gene introduction, 159 
Reverse cholesterol transport, 787 
Reverse transcriptase, 119, 119f 
Reverse turns 
amino acid residues in, 51—52 
defined, 44 
Reversible covalent modification, 286, 443, 
732-733, 733£ 
Reversible inhibition, 234, 234f. See also 
Enzyme inhibition 
Reversible terminator method, 155 
Rhizobium bacteria, 714 
Rho, 868, 868f£ 
Rhodopseudomonas viridis, 569, 570f, 584 
Rhodopsin, 400, 400f, 970-971, 971f 
Rhodopsin kinase, 972 
Rhodospirillum rubrum, 541 
Ribbon diagrams, 62, 62f 
Ribonuclease 
amino acid sequences in, 49-50, 49f 
denatured, 50 
reduction and denaturation of, 50f 
sequence comparison of, 169, 170 
structure of, 49-51, 49f 
Ribonuclease II (RNase IIT), 870 
Ribonuclease P (RNase P), 870 
Ribonucleic acid. See RNA 
Ribonucleotide reductase 
as cancer therapy target, 759-760 
defined, 753 
in deoxyribonucleotide synthesis, 
753-755 
elements of, 753, 753f 
mechanism, 754, 754f 


regulation of, 759, 759f 
stable radicals, 755 
structure of, 753 
tyrosyl radicals, 753, 753f 
Ribose, 106, 106, 317, 318f, 320f 
Ribosomal initiator element (rInr), 873 
Ribosome release factor (RRF), 911 
Ribosomes 
bacterial, 911 
catalysis by proximity and 
orientation, 908 
defined, 126, 893 
endoplasmic reticulum bound, 
915-919, 916f 
eukaryotic, 911 
50S subunit of, 902-903, 903f, 908, 910 
polysomal, 910, 910f 
in protein synthesis, 893, 902-911 
schematic representation, 905f 
705 subunit of, 903, 903f 
sites of, 905, 905f 
structure of, 903, 903f 
subunits of, 902-903, 903f 
30S subunit of, 902-903, 903f, 907, 907f 
in translation, 898-911 
tRNA-binding sites, 905, 905f 
Riboswitches, 867, 867£ 
Ribulose 1,5-bisphosphate, 591-592, 591f, 
592f, 592, 595-597, 596f, 597f 
Ribulose 5-phosphate 
in Calvin cycle, 602, 603f 
in pentose phosphate pathway, 603, 604, 
608-609 
Ricin, 915, 915b 


Ricinus communis, 915, 915f 


Rickets, 795 


Rickettsia prowazekti, 525-526 
Rieske center, 534 


Rifampicin, 869, 869f 
Ritonavir, 792 


RNA. See also Transcription 


ATP binding of, 186-187, 186f, 187f 

backbone of, 106, 106f 

base composition of, 121, 121t 

defined, 107 

double-stranded, 162, 162f 

in E. coli, 119, 119t 

flow of information from DNA to, 
119, 119f 

in gene expression, 119-120 

guide strand, 162-163 

hybridization with, 135 

as linear polymer, 105, 106 

messenger. See mRNA 

micro, 879, 879f, 956-957, 956f 

nucleotides, 107-108 

passenger strand, 162 

primer, 841 

ribosomal. See rRNA 

small interference (siRNA), 162, 162f 

small nuclear (snRNA), 882-883 

small regulatory, 879, 879f 

splicing of, 880-883, 880f, 881f, 882t 


stem-loop structure, 113, 113f 
structure of, 113-114, 113f, 114f, 
187, 187£ 
synthesis of, 120-121 
transfer. See tRNA 
NA editing, 879-880, 879f 
NA interference, 162-163, 162f 
NA polymerase holoenzyme complex, 
865, 865 
NA polymerase I 
in RNA synthesis, 872, 873, 873f 
rRNA production, 877 
RNA polymerase II 
promoter region, 874 
in RNA synthesis, 872, 873, 873f 
RNA polymerase III, 873-874, 873f, 
877, 878£ 
RNA polymerases, 120-121, 873f 
activated precursors, 121 
active site of, 861 
backtracking and, 863-864, 863f 
catalytic action of, 861-871 
components of, 121 
defined, 859, 860 
divalent metal ion, 121 
in DNA replication, 831, 831f 
DNA templates and, 121-122, 121t, 861 
in E. coli, 123, 123f, 861-862, 862t 
eukaryotic, 860, 860, 872-874, 872t, 873f 
eukaryotic promoter elements, 873, 873f 
functions of, 860-861 
illustrated, 120f 
prokaryotic, 860, 860 
promoter sites in, 864-865 
in proofreading, 863 
RNA-directed, 118 
structures of, 860, 860 
subunits of, 862, 862t, 865 
synthesis of, 121f 
in transcription, 861-871 
transcription mechanism, 121, 121f 
RNA processing 
eukaryotic pre-rRNA, 877, 877f£ 
of pre-mRNA, 877f, 878-879, 878f 
in transcription of eukaryotes, 872, 
883-884, 884f 
tRNA, 877, 878t 
RNA sequences, comparison of, 182-183, 
182f 
RNA splicing. See Splicing 
RNA-DNA hybrid 
backtracking, 863-864, 863f 
during elongation, 869 
lengths of, 863 
separation, 863, 863f 
translocation, 863, 863f 
RNA- induced silencing complex (RISC), 
162-163, 162f, 956 
Rods, 970-971, 970f 
Rofecoxib (Vioxx), 1048, 1048, 1052 
Rossmann, Michael, 465 
Rossmann fold, 465 
Rosuvastatin, 1044, 1044 
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Rotational catalysis, in ATP synthase, 
546, 546f 
Rotenone, 556 
rRNA 
in base pairing, 903 
defined, 120 
folding of, 903, 904f 
in structural scaffolding, 903 
transcription of, 870-871, 870f, 877, 877£ 
in translation, 903-905, 904f 
types of, 120 
Rubisco 
in Calvin cycle, 591-594, 591f, 592f, 
593f, 598-599 
defined, 591 
magnesium ion in, 592, 592f 
oxygenase reaction, 593-594, 593f 
structure of, 591, 591f 
Rubisco activase, 593 


S; pockets, 258, 259f 
Saccharomyces cerevisiae, 153, 941 
Sakmann, Bert, 379 
Salicin, 1043 
Salicylic acid, 1043, 1043 
Salmonella test, 852, 852£ 
Salt bridges, 203, 203f 
Salting out, 68 
Salty taste, 966, 969 
Salvage pathways. See also Nucleotide 
synthesis 
defined, 744 
for IMP and GMP synthesis, 762 
purine, 748, 752 
pyrimidine, 748 
Sample half-cell, 526-527 
Sandwich ELISA, 82-83, 83f 
Sanger, Frederick, 37, 138, 152 
Sanger dideoxy method, 138-139, 139f 
Sarcomere, 1020, 1020f 
Sarcoplasmic reticulum Ca2* ATPase. See 
SERCA 
Saturated fatty acids, 661 
Schiff bases, 451, 606, 621 
in amino acid degradation, 689-690 
in amino acid synthesis, 717 
in aminoacrylate, 729, 729 
defined, 621b 
in retinal, 971, 971f 
Scrapie, 56 
Screening libraries, 1044-1046, 1046f 
Screw sense, 40-41, 40b, 40f 
Scurvy, 821 
SDS (sodium dodecyl sulfate), 72 
SDS-PAGE, 73, 73f, 74 
SDS-polyacrylamide gel electrophoresis, 
350, 351 
Second Law of Thermodynamics, 11-12, 13 
Second messengers, 398, 399, 406, 415 
Secondary active transport, 367 
Secondary antibody, 84 
Secondary protein structure, 27, 40-46, 
48, 51t 


Secondary transporters, 376, 376f, 377 
Second-order reactions, 226 
Secretory pathway, 915-916 
Secretory proteins, 916, 916f 
Sedimentation coefficients, 76, 77f£ 
Sedimentation equilibrium, 78 
Sedoheptulose 7-phosphate, 603, 603f, 604f 
Segmental flexibility, 987, 987f 
Selectins, 335 
Selective estrogen receptor modulators 
(SERMs), 952 
Selectivity filter, 381, 381f, 382, 382f 
Self-splicing 
defined, 886 
example of, 886, 886f 
intron structure, 887f 
mechanism, 887, 887£ 
Self-tolerance, 1005 
Sensory systems, 961-979 
brain connection, 962, 962f 
hearing, 975-977 
olfaction, 962-966 
overview of, 961-962 
taste, 966-970 
touch, 977-978 
vision, 970-975 
Sequence alignment 
with conservative substitution, 176f 
with gap insertion, 172-173, 173f 
of hemoglobin, 176f 
of identities only versus Blosum-62, 176f 
of leghemoglobin, 176f 
of repeated motifs, 180, 180f 
scoring system, 176, 176f 
shuffling and, 173-174, 173f 
statistical analysis of, 171-177 
statistical comparison of, 173, 173f 
Sequence comparison methods, 169-170 
Sequence identities, 172 
Sequence template, 180 
Sequential model 
allosteric enzymes, 290-291 
of hemoglobin-oxygen binding, 199, 199f 
Sequential reactions, 232 
SERCA 
defined, 370 
P-type ATPase, 370-373 
pumping by, 372, 372f 
structure of, 372 
Serine 
in catalytic triads, 255-258 
in chymotrypsin, 253-254, 254f 
in cysteine synthesis, 722-723, 726 
defined, 32-33 
from glycine, 700 
in glycine synthesis, 722—723 
pyruvate formation from, 699f, 700 
in sphingolipid synthesis, 772, 772 
sphingosine from, 734 
structure of, 32f 
synthesis of, 722, 722f, 726 
Serine dehydratase, 692 
Serine hydroxymethyltransferase, 723 
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Serine kinases, 294-295, 295t 
Serine protease inhibitors, 302 
Serine proteases 
catalytic strategies of, 251 
convergent evolution and, 181-182 
Serotonin, synthesis of, 734, 734f 
Serpins, 302 
7TM (seven-transmembrane-helix) 
receptors 
biological functions mediated by, 400t 
defined, 399 
in drug development, 1048 
in glycogen metabolism, 627-629, 628f 
ligand binding to, 400-402 
in olfaction, 963 
oligomeric, 969 
phosphoinositide cascade activation, 
404-405, 405f 
in signal termination, 404f 
in taste, 967-969, 967£ 
in vision, 970-974 
70S initiation complex, 907, 907f 
705 subunit, of ribosomes, 903, 903f, 907 
Severe acute respiratory syndrome 
(SARS), 1050 
Severe combined immunodeficiency 
(SCID), 164, 760-761 
Shaker channel, 380, 384 
Shaker gene, 380 
Shape complementarity, 830, 830f, 831f 
Shemin, David, 735b, 736 
Shikimate, in amino acid synthesis, 727—729 
Shine, John, 906 
Shine-Dalgarno sequences, 126, 906 
Shuffled sequences, 173-174, 173f 
Sialic acids, 772 
Sickle-cell anemia, 17-18, 205-206, 
205f, 206f 
Side chains 
amino acid. See amino acid side chains 
hydrophilic, 47 
hydrophobic, 47 
polypeptide chains, 36, 36f, 47 
Sigmoid oxygen-binding curve, 196 
Sigmoidal curve, 291, 291f 
Sigmoidal kinetics, 287, 287f 
Signal transduction, 397-420 
abnormalities in cancer, 416-417, 
416f, 417£ 
abnormalities in cholera, 417—418 
abnormalities in whooping cough, 
417-418 
adaptor proteins in, 410 
B-adrenergic receptor in, 399-400, 400f 
calcium ions in, 405-407, 406f, 415 
calmodulin in, 407, 407£ 
cAMP in, 402-403 
cross talk, 398-399 
defects, 415-418 
defined, 397 
element recurrence in, 415 
epidermal growth factor in, 397, 411-414 
epinephrine in, 399-407 
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Signal transduction—(continued) 
evolution of, 415 
function of, 397, 398f 
G proteins in, 400-403, 401f, 402f 
GTPases and, 414, 414t 
insulin in, 407-411 
molecular circuits, 398-399 
monoclonal antibodies as inhibitors, 
416-417 
nucleotides in, 743 
in olfaction, 964, 964f 
overview of, 397-399 
primary messenger, 398 
principles of, 398-399, 398f 
protein kinases in, 415 
protein kinase A in, 403, 415 
Ras in, 413-414, 413f 
second messengers, 398, 406, 415 
secondary messengers, 399 
7TM receptors, 399-402, 400t, 404f 
Src homology domains in, 410, 410f, 413, 
415, 416, 416f 
in vision, 972-973, 972f 
Signal-recognition-particle (SRP), 
916-918, 917£ 
Sildenafil, 1042-1043 
Simple diffusion, 368 
Single-molecule studies, enzyme, 
242-243, 242f 
Single-stranded-binding protein (SSB), 
840, 840f 
Singulo method, 242 
Site-directed mutagenesis, 150-151, 151f 
Slack, C. Roger, 599 
Sliding DNA clamp, 839-840, 839f 
Sliding-filament model, 1020-1021, 1021f 
Small G proteins, 413 
Small interference RNA (siRNA), 162, 162f 
Small nuclear ribonucleoprotein particles 
(snRNPs), 882-883, 882t 
Smell. See Olfaction 
SNARE proteins, 361, 362f 
Social interactions, in gene expression, 934 
SOD1 gene, 159 
Sodium dodecyl sulfate (SDS) 
in gel electrophoresis, 72—73, 73f 
PAGE, 73, 73f 
Sodium ion channels 
amiloride-sensitive, 969, 969f 
inactivation of, 384-385 
paddles, 388 
purification of, 379-380 
in taste, 969 
Sodium ions, 378 
Solid-phase DNA synthesis, 139-141, 140f 
Solid-phase peptide synthesis, 93-94, 94f 
Somatic mutation, antibody diversity and, 
992-993 
Sonicating, 349 
Sos, 413, 413f 
Sour taste, 966, 969-970 
Southern, Edwin, 138 
Southern blotting, 138, 138f, 160 


Space-filling models, 22, 23f, 61, 61f, 195f 
Special pair, 570-571 
Specificity constant, 230-231, 231f 
Sphingolipids 
ceramide and, 772, 772f 
defined, 772 
diversity and, 773-774 
gangliosides, 772-773, 774 
synthesis of, 772-774, 773f 
Sphingomyelin 
defined, 345, 772 
structure of, 345 
Sphingosine, 344 
defined, 345 
structure of, 345 
synthesis of, 734, 734f 
Spina bifida, 762 
Spleen tyrosine kinase (Syk), 993 
Spliceosomes 
assembly and action, 882-883, 882f 
catalytic center, 883, 883f 
defined, 128, 881 
Splicing 
alternative, 128, 129f, 877, 885-886, 
885f, 885t 
branch points in, 880, 881f 
catalytic center, 883, 883f 
defined, 128, 880 
group I, 887, 888f 
group II, 887-888, 888f 
mechanism, 881, 881f 
mutations affecting, 880-881, 
884-885, 884f 
pathways comparison, 888f 
self-splicing and, 886-888, 886f, 887f 
sites of, 880-881, 880f 
snRNPs in, 882-883, 882t 
spliceosomes in, 881, 882-883, 882f 
transesterification in, 881-882, 882f 
Splicing factors, 882 
Split genes, 128 
Split-pool synthesis, 1045, 1045f 
Spudich, James, 1019 
Squalene 
cyclization of, 778-779, 779f 
synthesis of, 777-779, 778f 
Squalene synthase, 778 
Src 
affixation to cytoplasmic face, 294 
in cancer, 416 
Src homology 2 (SH2) domain, 410, 410f, 
415, 416, 416f 
Src homology 3 (SH3) domain, 413, 415, 
416, 416f 
SREBP (sterol regulatory element binding 
protein), 779-781, 780f, 781f, 784 
SREBP cleavage activating protein (SCAP), 
780-781, 781f 
SRP receptor, 917, 917f 
Stacking forces, 111 
Stahl, Franklin, 115, 116 
Standard free-energy change, 219-220 
Standard reference half-cell, 526, 527 


Staphylococcus aureus, 239, 239f 
Starch, 324, 597 
Starvation, metabolic adaptations in, 
818-819, 819t 
Starved-fed cycle, 816-817 
Statins, 788 
Steady-state assumption 
defined, 227 
in enzyme kinetics, 226 
Stearoyl CoA desaturase, 668, 668f 
Steatorrhea, 646 
Steitz, Tom, 829 
Stem-loop structure, 113, 113f 
Stereochemical renderings, 22 
Stereochemistry 
of cleaved DNA, 270-271, 271f 
observation of, 270 
of proton addition, 721, 721f 
Stereocilia, 975-976 
Stereoelectronic control, 691f, 692 
Stereoisomers 
carbohydrate, 317, 317f 
notation for, 29b 
Steric exclusion, 39 
Steroid hormones 
anabolic, 951 
binding and activation, 789-790 
classes of, 789 
defined, 789 
hydroxylation of, 790-791 
identification of, 790 
synthesis of, 789-791, 789f 
Sterol regulatory element binding protein 
(SREBP), 779-781, 780f, 781f, 784 
Sterol regulatory element (SRE), 779 
Sticky ends, DNA, 144, 144f 
Stoichiometry 
of gluconeogenesis, 481 
for light reactions, 581 
of palmitate synthesis, 666 
Strand invasion, 853, 853f 
Strand separation, 141, 141f 
Streptomyces coelicolor, 865 
Streptomyces lividans, 380 
Streptomycin, 914, 914t, 914 
Stroma, 567 
Stroma lamellae, 567-568 
Strominger, Jack, 1042 
Structural genes, 928 
Structure-activity relationship (SAR), 1046 
Structure-based drug development, 
1046-1048, 1047f, 1048f 
Substituted enzyme intermediates, 233 
Substitution matrix, 174-176, 175f, 176f 
Substitutions 
in amino acid sequences, 174-177, 
175£, 176f 
conservative, 174, 175f, 176f 
nonconservative, 174, 175f 
Substrate binding 
in catalysis, 225 
conformation selection, 224 
cooperative, 233 


induced-fit model, 224, 224f, 252 
lock-and-key model, 224, 224f 
Substrate channeling 
in amino acid synthesis, 730, 730f 
defined, 510 
in pyrimidine synthesis, 745-746, 746f 
Substrate cycles, 485, 485f 
Substrate-induced cleft closing, 453 
Substrate-level phosphorylation, 460 
Substrates. See also Enzyme-substrate 
complex 
accessibility of, controlling, 444 
in biochemical reactions, 231—233 
chromogenic, 254, 255f 
concentration, 226, 226f, 228 
defined, 216 
enzyme interaction, 223 
homotropic effect on allosteric 
enzymes, 290 
multiple, in reactions, 231-233 
reciprocal relation, 759 
spectroscopic characteristics of, 223 
Subtilisin, 182 
catalytic triad of, 259, 259f 
oxyanion hole of, 259, 259f 
site-directed mutagenesis of, 260, 260f 
structure of, 181-182 
Subunit vaccines, 1006 
Succinate 
in glyoxylate cycle, 517, 517f 
oxidation of, 507-508 
Succinate dehydrogenase, in citric acid cycle, 
507-508 
Succinate-Q reductase, 529-531, 530f, 
530t, 533 
Succinyl CoA 
in amino acid degradation, 698, 701, 701f 
in citric acid cycle, 505-506, 506f 
formation of, 505, 654, 654f, 655-656, 656f 
porphyrins from, 736-737, 737f 
Succinyl CoA synthetase, 507 
0282 heterodimer, 507 
in biochemical transformation, 506—507 
reaction mechanism of, 506, 506f 
structure of, 507, 507 
subunits, 507, 507£ 
Sucrase, 323, 450 
Sucrose 
defined, 324 
structure of, 323-324, 324f 
synthesis of, 598, 598f 
Sucrose 6-phosphate, 598, 598f 
Sugars 
in cyclic forms, 318-320 
DNA, 4 
five-carbon, 595, 596f, 601-607 
monosaccharides, 316-323 
N-linked, 330 
O-linked, 330 
phosphorylation, 323 
reducing, 321-322 
Suicide inhibition, 238, 238f, 757, 761 
Sulfhydryl groups, 33 


Sulfolipids, 567 
Super compensation (carbo-loading), 815 
Supercoiling, 113 
ATP hydrolysis in, 837-839, 838f 
catalyzation of, 838 
defined, 834, 835 
degree of, 835-836 
DNA condensation from, 836 
linking number and, 834-836, 834f 
negative, 836, 836f, 837-839 
positive, 836 
relaxation of, 836-837, 837f 
right-handed, 835 
topoisomerases and, 836-839, 837f, 
838f, 838 
twist in, 834f, 835 
writhe in, 834f, 835-836 
Superhelical cable, 45 
Superoxide anion, 194 
Superoxide dismutase, 539-540, 540f 
Superoxide radicals, 538-540 
Supersecondary structures 
defined, 48 
helix-turn-helix, 48f 
Suppressors of cytokine signaling (SOCS), 
806-807, 807£ 
Svedberg units, 76 
Sweet taste, 966, 968-969 
Symmetry matching, 927-928 
Symporters, 376, 376f, 377 
Synaptic cleft, 385, 386f 
Synchrotron radiation, 95 
Synonyms, 125 
Synthase, 502b 
Synthetic analog system model, 266-267, 267f 
Synthetic peptides 
as antigens, 92 
construction of, 93-94, 94f 
as drugs, 92-93, 93f 
linking of, 95 
in receptor isolation, 92 
in three-dimensional structure of 
proteins, 93 
Synthetic vaccines, 152 


T cells 
activation of, 999-1000, 999f 
defined, 984 
helper, 984, 1000-1002 
HIV infection and, 1003-1004 
killer, 984, 998-1000 
memory, 1006 
negative selection in, 1004-1005, 1004f 
positive selection in, 1004, 1004f 
T state, of hemoglobin, 198, 199f, 200, 203, 
211-212 
T1R proteins, as sweet receptors, 
968-969, 968f 
T2R proteins, as bitter receptors, 
967-968, 968f 
Tamoxifen, 952, 952f 
Tandem mass spectroscopy, 87-88, 88f 
TAP proteins, 996 
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Taste, 966-970. See also Sensory systems 
anatomic structures in, 967, 967f 
bitter, 966, 967-968 
ion channels in, 969, 969f 
salty, 966, 969 
7TM receptors in, 967-969, 967f 
sour, 966, 969-970 
sweet, 966, 968-969 
tastants in, 966-967, 966f, 966 
types of, 966 
umami, 966, 969 

Taste buds, 967, 967£ 

TATA box 
defined, 122 
in transcription of eukaryotes, 874, 874f 

TATA-box-binding protein (TBP), 

180-181, 874-875, 875f, 953 

Taxol, 1023-1024 

Tay-Sachs disease, 774 

T-cell receptors (TCRs) 

CD3 and, 999, 999f 

CD4 and, 1001, 1001f, 1004-1005 
CD8 and, 998-1000, 999f, 1004-1005 
CD28 and, 1000 

CD45 and, 1000 

defined, 982, 998 

docking mode, 985 

domains of myosin, 998, 998f 
genetic diversity of, 985, 998 

helper T cells and, 1000-1002 

killer T cells and, 998-1000 

MHC proteins and, 998-1003, 998f 
structure of, 998f 

in T-cell activation, 999-1000, 999f 

Telomerase, 845, 845f 

Telomeres 
defined, 844 
in DNA replication, 844-845, 845f 

Temperature, 11-12 

Template 
coding strands and, 862, 862f 
complementarity with mRNA, 122, 122f 
defined, 829b 
DNA polymerases, 829 
lagging strand, 841 
in replication, 828-829 
RNA polymerases and, 121-122, 

121t, 861 
transcribed regions of, 866-867 
transcription bubbles and, 866 
transcription in, 123 
Ternary complex, 232 
Tertiary protein structure, 27, 46-48, 
178-179, 179£ 

Testosterone, 789, 793, 793 

Tetrahydrobiopterin, 703, 703 

Tetrahydrofolate, 723f 
in amino acid synthesis, 723-724 
defined, 723 
one-carbon groups carried by, 723t 

Tetrahymena, 886, 887 

Tetrapyrrole, 736 

Tetraubiquitin, 684 
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Tetroses, 316 
Thalassemia, 207, 884, 884f 
Therapeutic index, 1041 
Thermodynamics 
of coupled reactions, 425-426, 427-429 
laws of, 10-12 
of metabolism, 425-426 
Thermogenin, 555 
Thiamine (vitamin B,), 438t, 515 
Thiamine pyrophosphate (TPP) 
in citric acid cycle, 499, 499f 
deficiency, 515 
defined, 515 
Thick filaments, in myosin, 1020, 1021f 
Thin filaments, in actin, 1020 
Thioester intermediate, 457-459, 458f 
Thioesterase, 664 
6-thioguanine, 1051, 1051 
Thiol groups, 33 
Thiopurine methyltransferase, 1051 
Thioredoxin, 579-580 
in Calvin cycle, 599, 599 
enzyme activation by, 599f 
enzymes regulated by, 599t 
Thioredoxin reductase, 754 
305 initiation complex, 907, 907f 
305 subunit, of ribosomes, 902-903, 
903f, 907 
-35 region, 122 
Three-dimensional structure 
conservation of, 178-179, 179f 
in sequence alignment evaluation, 
179-180 
tertiary structure and, 178-179 
in understanding evolutionary 
relationships, 177 
Threonine 
defined, 33 
pyruvate and, 700 
structure of, 32f 
synthesis of, 732, 732f 
Threonine deaminase, 731—732, 731f 
Threonine dehydratase, 692 
Threonine kinases, 294—295, 295t 
Threonine operon, 936, 936f 
Threonyl-tRNA synthetase, 899-900 
Threonyl-tRNA synthetase complex, 
901, 901 
Threshold effects 
allosteric enzymes, 291, 291f 
defined, 291 
Thrombin, 217, 217f 
antithrombin and, 307-308 
in blood clotting, 304-305, 307 
dual function of, 307 
inhibitors of, 307 
Thromboxane synthase, 669 
Thromboxanes, 669 
Thylakoid membranes 
defined, 567-568 
pH gradient across, 578 
photosynthesis in, 567-568 
photosystem II and, 574-575 


proton gradient across, 578-581 
stacked, 583 
unstacked, 583 
Thylakoid spaces, 567-568, 578 
Thylakoids, 567 
Thymidine, 107, 748 
Thymidine kinase, 748 
Thymidine monophosphate (TMP), 757 
Thymidine phosphorylase, 748 
Thymidylate, 107 
blocking synthesis of, 757—758 
defined, 755 
synthesis of, 755-756, 756f 
Thymidylate synthase, 756 
Thymine, 4, 5f, 110, 110, 748 
in DNA repair, 850-851 
synthesis of, 724 
Thymocytes, 1004 
Thyroxine, synthesis of, 734, 734f 
Time-of-flight (TOF) mass analyzer, 85 
Tip links, 976, 976f 
Tissue factor pathway inhibitor 
(TFPI), 307 
Tissue factor (TF), 304 
Tissue plasminogen activator (TPA), 129f, 
308, 308f 
Titin, 36 
Toll-like receptors (TLRs) 
defined, 982 
extracellular domain of, 983, 983f 
illustrated, 982f 
PAMPs and, 982-983, 983t, 984f 
Tonegawa, Susumu, 991 
Topoisomerases 
bacterial, 839 
defined, 836 
type I, 836-837, 837f 
type II, 836, 837-839 
Topoisomers, 835-836, 836f 
Torpedo marmorata, 386, 386f 
Torr, 196b 
Torsion angles, 39, 39b 
Tosyl-L-phenylalanine chloromethyl ketone 
(TPCK), 237-238, 237f, 237 
Touch, 977-978 
Toxicity, drug, 1040-1041, 1049f 
Toxins, 1006 
Toxoid vaccines, 1006 
Trace elements, 20 
Trans configuration, peptide bonds, 38, 
38t, 39t 
Trans unsaturated fatty acids, 661 
Transaldolase 
in pentose phosphate pathway, 602-605, 
606-607, 606f 
reactions, 606-607, 606f 
Transaminases 
in amino acid degradation, 688-691 
defined, 688 
mechanism, 690, 690f 
Transamination 
in amino acid degradation, 689-691 
amino acid synthesis by, 720-721, 720f 


mechanism of action, 690, 690f 
Schiff base intermediates in, 689 
Transcription, 859-890 
ATP hydrolysis in, 868, 868f 
in bacteria, 861-871 
of B-globin gene, 128, 128f 
chemistry of, 861 
defined, 859 
in DNA templates, 123 
elongation, 860, 862-863, 863f, 866, 866f 
in gene expression, 910 
inhibition, by antibiotics, 869-870, 869f 
mRNA processing and, 883-884, 884f 
nuclear hormone receptors in, 
950-951, 951f 
overview of, 859-860 
promoter sites for, 122, 122f 
promoters in, 864-865, 864f 
proofreading in, 863 
regulation of, 871-876 
rho in, 868, 868f 
riboswitches in, 867, 867f£ 
RNA polymerases in, 121, 121f, 861-871 
of rRNA, 870-871, 870f, 877, 877£ 
self-splicing in, 886-888, 887f, 888f 
splicing in, 860, 876, 880-883, 880f, 
881f, 882t 
stages of, 860-861 
start site, 864 
termination of, 860, 866-867, 867f 
of tRNA, 870-871, 870f 
upstream promoter elements in, 864, 873 
Transcription activator-like effector 
nucleases (TALENSs), 161-162, 
161f, 162f 
Transcription bubbles 
defined, 862 
elongation at, 866, 866f 
schematic representation, 866f 
structure of, 862 
Transcription factors 
activation domain, 945, 946 
defined, 160, 945 
DNA-binding domains of, 945-946 
in eukaryotes, 874, 875-876, 945-947 
in gene expression, 945-947 
in induced pluripotent stem (iPS) cells, 
947, 947£ 
nuclear hormone receptor, 949-951, 950f 
in prokaryotes, 945 
regulatory domains of, 946 
Transcription in eukaryotes, 871-886 
in bacteria versus, 871-872, 871f 
CAAT box in, 874, 8746 
carboxyl-terminal domain (CTD) in, 872 
downstream core promoter element 
(DPE) in, 874 
enhancers in, 876 
eukaryotic promoter elements, 873f 
GC box in, 874, 874£ 
initiation of, 872, 874-875, 875f 
initiator element (Inr) in, 874 


microRNAs in, 879, 879f 


nuclear membrane in, 871 
pre-mRNA processing and, 877, 877f, 
878-879, 878f, 884-885, 884f 
products of, 876-886 
RNA editing in, 879-880, 879f 
RNA polymerases in, 872-874, 872t, 873f 
RNA processing and, 872, 883-884, 884f 
splicing in, 876, 880-883, 880f, 881f, 
882t, 883f 
TATA box in, 874, 874£ 
transcription factors in, 874, 875-876 
translation and, 871-872, 871f 
upstream promoter elements in, 873 
Transcription initiation, 860 
in bacteria, 862-863 
de novo, 862-863 
in eukaryotes, 872, 874-875, 875f 
Transcription repressors, 946 
Transcriptome, 158 
Transducin 
rhodopsin and, 972 
a subunit of, 280 
Transferases, 245t 
Transferred DNA (T-DNA), 163 
Transferrin, 361, 954 
Transferrin receptor, 361, 361f, 954 
Transferrin-receptor mRNA, 955, 955f 
Transgenic mice, 160 
Transglutaminase, 306 
Transition state 
ATP hydrolysis, 277-278 
in catalysis, 222, 241 
catalytic stabilization of, 215 
defined, 221 
formation facilitation of, 221-225 
symbol, 221 
Transition-state analogs, 234, 240-241, 
241f, 242, 277, 277£ 
Transketolase 
in Calvin cycle, 595, 596f 
in pentose phosphate pathway, 
602-606, 605f 
reaction, 605-606, 605f 
Translation, 893-921 
accuracy of, 894-895, 894t 
activation sites, 899-900, 900f 
adenylation in, 899-902 
aminoacyl-tRNA synthetase in, 894, 
898-902 
anticodons in, 896, 898, 898f, 901 
in bacteria, 906, 907f, 911-913 
base pairing in, 896, 903 
codon-anticodon interactions in, 898 
codons in, 897-898, 898f 
defined, 893 
direction of, 910 
elongation factors in, 907-908, 907f, 913 
in eukaryotes, 871-872, 871f, 911-913 
eukaryotic initiation of, 911-912, 912f 
formylmethionyl-tRNA in, 906, 
906f, 907 
in gene expression, 910 
initiation factors, 907, 907£ 


initiation of, 905-906, 907f, 911-912 
initiation sites, 906, 906f 
mechanism of action, 909 
mRNA in, 912 
organization in, 913 
overview of, 893-894 
peptide bond formation in, 908-909, 908f 
proofreading in, 900-901 
reading frame, 907 
release factors in, 910-911, 911f 
ribosomes in, 898-911 
rRNA in, 903-905, 904f 
Shine-Dalgarno sequences in, 906 
signaling in, 915-919 
termination of, 910-911, 911f, 913 
translocation in, 909-910, 910f 
tRNA in, 893, 894, 895-898, 905, 905f, 
911-912 
wobble hypothesis and, 897, 897t 
Translesion, 846 
Translocase, 909 
Translocation 
inhibition by antibiotics, 914-915, 915f 
of proteins, 915-919 
RNA-DNA hybrid, 863, 863f 
signal sequences in, 916-918, 916f 
in translation, 909-910, 910f 
Translocon, 917 
Transmembrane helices, 354-356, 
355f, 355t 
Transmissible spongiform encephalopathies, 
56-57 
Transplant rejection, 1002-1003 
Transport vesicles, 918 
Transporters 
ABC, 367, 374-376, 374f, 375f, 1054 
antiporters, 376, 376f 
ATP-ADP, 551-552, 551f, 552f 
dicarboxylate, 552, 552f 
glucose, 368, 473-474, 474t 
glucose 6-phosphate, 638 
mitochondrial, 551-552, 551f, 552f 
in oxidative phosphorylation, 549-552 
phosphate, 552, 552f 
in photosynthesis, 568-572, 569f 
pyruvate, 552 
secondary, 376-377 
symporters, 376, 376f 
tricarboxylate, 552, 552f 
uniporters, 376, 376f 
Transverse diffusion, 357, 357f 
Tree of life, 3f 
Triacylglycerol synthesis, 768-776 
liver in, 769 
sources of intermediates in, 768, 768f 
Triacylglycerol synthetase complex, 768 
Triacylglycerols 
in adipose tissue, 818 
defined, 645 
as energy source, 645, 645f 
in fatty acid metabolism, 647-648, 647f 
Tricarboxylic acid (TCA) cycle. See Citric 
acid cycle 
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Trimethoprim, 758 
Trimethylamine (TMA), 771 
Trimethylamine-N- oxide (TMAO), 771 
Trinucleotide repeats, 850 
Triose phosphate isomerase (TPI), 238, 238 
catalytic mechanism of, 455 
in glycolysis, 455-456, 456f 
structure of, 455f 
Trioses, 316 
Triple helix, of collagen, 45 
tRNA 
acceptor stem, 896, 897 
as adaptor molecules, 123, 124f 
amino acid attachment to, 123, 123f 
amino acid-attachment site, 123 
anticodons of, 896, 897 
bases, 896 
CCA terminal region of, 896, 896f 
charged, 898 
codons in, 897-898 
common features of, 895-897 
defined, 120 
extra arm, 896 
helix stacking in, 896, 896f 
initiator, 911 
inosine in, 898 
molecule design, 895-897, 895f 
precursors of, 896 
recognition sites on, 901, 901f 
RNA processing, 877, 878t 
structure of, 896, 896f 
synthetase recognition of, 901, 901f 
template-recognition site, 123 
transcription of, 870-871, 870f 
in translation, 893, 894, 895-898, 
905, 905f 
translocation of, 909-910, 910f 
wobble hypothesis and, 897, 897t 
Trombone model, 841, 841f 
Tropomyosin, 1020, 1021 
Troponin complex, 1020 
TRP channels, 977 
Trp operon, 936 
Trypanosomes, 880 
Trypsin, 217, 217f 
catalytic triad in, 258, 258f, 259f 
chymotrypsin and, 258, 258f, 259f 
defined, 301 
generation of, 301 
S; pockets of, 259f 
structure of, 258, 258f 
Trypsin inhibitor, 302-303, 302f 
Trypsinogen, 301 
Tryptophan 
in alanine, 700 
degradation of, 704, 704f 
nicotinamide from, 734, 734f 
serotonin from, 734, 734f 
structure of, 31f, 32 
synthesis of, 728-729, 729f 
Tryptophan synthase, 729-730 
Ts elongation factor (EF-Ts), 907-908 
t-SNARE, 919 
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T-to-R equilibrium, 291-292, 292f 
T-to-R state transition, ATCase, 
290-291, 290f 
Tu elongation factor (EF-Tu), 907, 
907f, 908 
Tuberculosis, 687 
Tubulin, 1023, 1023f 
Tumor growth, ceramide metabolism as 
stimulant, 774-775 
Tumor hypoxia, 475 
‘Tumor suppressor genes, 851 
Tumor-inducing (Ti) plasmids, 163-164, 163f 
Turnover number, 230, 230f 
2', 3'-dideoxy analog, 139 
Two-dimensional electrophoresis in, 73 
Twofold rotational symmetry, 272, 272f 
Type 1 diabetes 
defined, 808 
glucagon excess in, 812 
insulin insufficiency in, 812 
metabolic derangements in, 812, 812f 
treatment for, 812 
Type 2 diabetes 
defined, 808 
metabolic syndrome and, 809-810, 810f 
treatments for, 811-812 
Type I topoisomerases, 836-837 
Type II topoisomerases, 836, 837-839 
Typing-monkey analogy, 53, 53f 
Tyrosine, 703 
defined, 32 
degradation of, 703, 704f 
structure of, 32f 
synthesis of, 727-728, 728f 
thyroxine from, 734, 734f 
Tyrosine kinases, 294 
Tyrosine phosphatase IB, 809 
Tyrosinemia, 705t 
Tyrosyl radical, 753, 753f 
Tyrosyl-tRNA synthetase, 901 


Ubiquinol, 533, 651 
Ubiquinone, 530-531, 651 
Ubiquinone reductase, 651 
Ubiquitin 
conjugation, 683-684, 684f 
defined, 681, 683 
pathway, 686 
in protein tagging, 683-685 
structure of, 683-684 
Ubiquitination, 684-686 
UDP (uridine diphosphate), 632 
UDP-galactose 4-epimerase, 467 
UDP-glucose 
defined, 466, 630, 743 
in galactose conversion, 467 
glucose transfer from, 630-631 
synthesis of, 630 
UDP-glucose pyrophosphorylase, 630 
Ultracentrifugation, 76-78 
Umami taste, 966, 969 
UMP kinase, 747 


Uncompetitive inhibition. See also Enzyme 
inhibition 

defined, 234 

double-reciprocal plot, 236, 236f 

example of, 235 

kinetics of, 235, 235f 

Uncoupling proteins, 555-556, 555f 

Unfolded protein response (UPR), 811 

Uniporters, 376, 376f 

Unsaturated fatty acids, 652-654, 652f, 
653f, 661, 668-670 

Unstructured proteins, 55-56 

Upstream promoter elements, 864, 873 

Uracil, 107, 107 

Uracil DNA glycosylase, 850, 850f 

Urate, 761 

Urea, 49, 694, 695b 

Urea cycle 

in amino acid degradation, 693-698 

carbamoyl phosphate in, 693-695 

defined, 681 

disorders of, 697-698, 697f, 698f 

enzymes, 696-697 

in gluconeogenesis, 696, 696f 

illustrated, 693f 

reotelic, 693 

reotelic organisms, 693, 698 

ric acid, 698 

ridine, 107 

ridine monophosphate synthetase, 747 

ridine triphosphate (UTP), 286, 630 

ridylate, 108, 747 

roporphyrinogen III, 736, 737, 737f 


U 
U 
U 
U 
U 
U 
U 
U 


V genes, 991-992 
Vaccines 
defined, 1006 
in disease prevention/eradication, 
1006-1007 
HIV, 1007 
killed (inactivated), 1006 
live attenuated, 1006 
subunit, 1006 
synthetic, 152 
toxoid, 1006 
Vaccinia virus, 160 
Valine 
degradation of, 702-703 
in maple syrup urine disease, 705 
structure of, 31f, 32 
van der Waals forces, 224 
van der Waals interactions 
in antigen-antibody binding, 990 
base stacking, 111, 111f 
contact distance, 8 
defined, 8 
energy of, 8, 8f 
hydrocarbon tails, 348 
minimization of, 10 
as noncovalent bond, 7 
Vanishing white matter (VWM) disease, 
913, 913£ 


Variable number of tandem repeats (VNTR) 
region, 329, 329f 
Vascular endothelial growth factor (VEGF), 
187, 476 
Vasopressin, 92-93, 93f 
VDAC, 524 
V(D)J recombination, 992, 992f 
Vectors 
cloning, 145, 146, 147f 
defined, 143 
expression, 146 
plasmid, 144-146, 145f 
viral, 159-160 
Very-low-density lipoproteins (VLDLs), 
782, 782t 
Vibrio fischeri, 934 
Vicine, 611, 611f 
Vioxx (rofecoxib), 1048, 1048, 1052 
Viral hepatitis, 691 
Viral receptors, 335, 335f 
Viral vectors, 159-160 
Virions, 146 
Viruses 
coats, 49 
in gene introduction, 159 
human immunodeficiency, 1003-1004, 
1003f 
infectious mechanisms of, 335-336 
influenza, 335-336, 335f 
progeny, 146 
retroviruses, 118-119, 119f, 1053 
RNA, 118-119 
Vision, 970-975. See also Sensory systems 
in animals, 974, 974f 
calcium ion in, 972—973 
color, 970, 973-975 
color blindness and, 974-975, 974£ 
cones in, 970, 973-974 
evolution of, 974, 974£ 
light absorption in, 971-972 
photoreceptors in, 970, 973-974, 973f 
retinal-lysine linkage in, 971, 971f 
rhodopsin in, 970-971, 971f 
rods in, 970-971, 970f 
7TM receptors in, 970-974 
signal transduction in, 972-973, 972f 
Visual pigments 
in cones, 973-974 
defined, 970 
in rods, 970-971 
Vitamin A (retinol), 439, 439f, 439t 
Vitamin B, (thiamine), 438t, 515 
Vitamin By (riboflavin), 438t 
Vitamin B3 (niacin), 438t 
Vitamin Bs; (pantothenic acid), 438t 
Vitamin Bg (pyridoxine), 438t 
Vitamin B; (biotin), 438t 
Vitamin Bo (folic acid), 438t 
Vitamin By,2, 438t 
as coenzyme, 654-655, 655f, 656 
corrin ring and cobalt atom, 654-655 
in fatty acid metabolism, 654-655 


Vitamin C (ascorbic acid) 
deficiency of, alcohol related, 821-822 
forms of, 822, 822f 
function of, 439, 439t 
Vitamin D (calcitriol) 
biochemical role, 795 
cholesterol as precursor, 794-795 
deficiency, 795 
function of, 439, 439t 
structure of, 439f 
synthesis of, 794-795, 794f 
Vitamin E (a-tocopherol) 
function of, 439-440, 439t 
structure of, 439f 
Vitamin K 
in blood clotting, 304, 306-307, 439t, 440 
deficiency, 439t 
in y-carboxyglutamate formation, 
306-307, 306f 
structure of, 306, 439f 
Vitamins 
B, 438, 438t 
coenzyme, 438-440, 438t 
defined, 438 
evolution of, 438-439 
noncoenzyme, 439, 439t 
in reducing homocysteine levels, 726 
roles of, 20 
VJ recombination, 991-992, 991f 
Vmax» 230 
Voltage-gated ion channels, 383-384, 384f 
von Gierke disease, 637, 637t, 638 
von Gierke, Edgar, 637 
VR1 (capsaicin receptor), 977-978, 
977£, 978£ 
v-SNARE, 919 


Wald, George, 971 
Wang, James, 836 
Warburg, Otto, 474 
Warburg effect, 474, 554 
Warfarin, 307 
Water 
attack, facilitating, 277, 277f 
concentration of, 14 
dissociation of, equilibrium constant, 
13-14 
as highly cohesive, 9 
hydrogen bonds, 9, 10 
as polar molecule, 8-9 
properties of, 8-9 
Water-oxidizing complex (WOC), 574, 574f 
Watson, James, 17, 109, 827 
Watson-Crick base pairs. See Bases/base pairs 
Watson-Crick DNA model, 110, 110f, 
111, 116 
Wernicke-Korsakoff syndrome, 821 
Wernicke’s encephalopathy, 515 
Western blotting, 83-84, 83f, 138 
White adipose tissue (WAT), 555 
Whooping cough, 417-418 
Wiley, Don, 996 
Wilkins, Maurice, 109 
Windows, 355 
Withering, William, 373 
Wobble hypothesis, 897, 897t 
Wyman, Jeffries, 198 


Xanthine oxidase, 761 
Xanthomas, 785 

Xanthylate (XMP), 752 
Xenobiotic compounds, 1037 
Xeroderma pigmentosum, 851 


C43 
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X-ray crystallography 
defined, 95 
diffraction patterns, 96, 96f 
electron-density map, 96-97, 96f, 97f 
experiment, 95, 95f 
reflections, 96 
resolution, 96, 97f 
synchrotron radiation and, 95 


Yamanaka, Shinya, 947 

Yanofsky, Charles, 935 

Yeast artificial chromosomes (YACs), 
147, 147£ 

Yeast chromosomes, 941, 942t 


Z line, 1020, 1020f, 1021 
Z scheme of photosynthesis, 577, 577f 
Zantac (ranitidine), 1049, 1049 
ZAP-70, 999-1000 
Zellweger syndrome, 657 
Z-form DNA, 112, 112f, 112t 
Zinc 
activation of water molecule, 265-267, 266f 
in biological systems, 265 
in carbonic anhydrase, 265, 265f 
Zinc-finger domains, 945-946, 946f 
Zinc-finger nucleases (ZFNs), 161-162, 162f 
Zonal centrifugation, 77, 77f 
Zwitterions, 29, 30f 
Zymogens, 301-303, 301f 
cascade of activations, 303-304, 303f 
conversion into proteases, 302 
defined, 286 
developmental process control by, 299 
proteolytic activation, 301, 301f 
secretion of, 300, 300f 


ACIDITY CONSTANTS 


pKa values of some acids 


Acid pK’ (at 25°C) Acid pK’ (at 25°C) 
Acetic acid 4.76 Malic acid, pK; 3.40 
Acetoacetic acid 3.58 ys) 5.11 
Ammonium ion 9.25 Phenol 9.89 
Ascorbic acid, pK; 4.10 Phosphoric acid, pK 2.12 
pK> 11.79 pK 7.21 
Benzoic acid 4.20 oo, pK; 12.07 
n-Butyric acid 4.81 Pyridinium On 225 
Cassdylicacd 6.19 Pyrophosphoric acid, a 0.85 
1.49 
Citric acid, pK; Bila OK 5.77 
P' = : Succinic acid, pK; 4.21 
Ethylammonium ion 10.81 K 5.64 
Formic acid 3.75 ; a ; 
Glvei K 2.35 Trimethylammonium ion 79) 
ee K; 9.78 Tris (hydroxymethyl) aminomethane 8.08 
Imidazolium ion 6.95 Water* 15.74 
Lactic acid 3.86 
Fumaric acid, pK; 3.03 
pK» 4.44 
*{H‘][OH ] = 10° '*; [HO] = 55.5M. 
Typical pK, values of ionizable groups in proteins 
. Typical : Typical 
Grou Acid —— Base Grou Acid ———— Base 
. pKa ad pKa 
Terminal ° ? H 
a-carboxyl c —-* C7 3.1 Cysteine et —— _s- 83 
group “~No- H ~~So 
H 
Aspartic fe) fe) Tyrosine < > oO S= < po 10.4 
acid Sey Gee ae 
Glutamic yi ikl pee + ,H 
acid g 2 Lysine —Ni, rr Ny, 10.0 
y \ ‘H —_— \ ‘H . 
H H H 
Na N H H 
Histidine — =) 6.0 i i 
\ ON \UN oe ey ue \ 4 
SH SH Arginine Vis — ioe 12.5) 
Terminal + H wr Pent 
Q-amino Nowy ——— Ne 8.0 
group H H 


Note: pK, values depend on temperature, ionic strength, and the microenvironment of the ionizable group. 


STANDARD BOND LENGTHS 


Bond Structure Length (A) 
C—H RCH) 1.07 
Aromatic 1.08 
RCH3 1.10 
C6 Hydrocarbon 1.54 
Aromatic 1.40 
C=C Ethylene 1.33 
(C=C Acetylene 1.20 
C—N RNH) 1.47 
O=C—N 1.34 
C—O Alcohol 1.43 
Ester 1.36 
C= Aldehyde 1.22 
Amide 1.24 
GS 5 R2S 1.82 
N—H Amide 0.99 
O—H Alcohol 0.97 
O—O O, 1.21 
1-0) Ester 1.56 
S—H Thiol 1.33 
SS Disulfide 2.05 


